GMO-AC: Gaussian-Based Minority Oversampling With Adaptive Outlier Filtering and Class Overlap Weightingopen access
- Authors
- Yang, Seung Jee; Cha, Kyungjoon
- Issue Date
- Dec-2024
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- GMM; imbalanced classification; oversampling
- Citation
- IEEE Access, v.12, pp 192494 - 192509
- Pages
- 16
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 12
- Start Page
- 192494
- End Page
- 192509
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206619
- DOI
- 10.1109/ACCESS.2024.3518573
- ISSN
- 2169-3536
2169-3536
- Abstract
- Imbalanced data significantly affects the performance of standard classification models. Data-level approaches primarily use oversampling methods, such as the synthetic minority oversampling technique (SMOTE), to address this problem. However, because methods such as SMOTE generate instances via linear interpolation, the synthetic data space may appear similar to a star or tree. Thus, some methods apply Gaussian weights to linear interpolation to address this issue. In this study, we propose a Gaussian-based minority oversampling with adaptive outlier filtering and class overlap weighting (GMO-AC) for imbalanced datasets. Unlike existing oversampling techniques, our method employs a Gaussian mixture model (GMM) to approximate the distribution of the minority class and generate new instances that follow this distribution. As outliers can affect the distribution approximation, GMO-AC identifies outliers by calculating the Mahalanobis distance for each instance and the covariance determinant. This process uses segmented linear regression to assess whether an instance falls outside the expected distribution. In addition, we defined the degree of class overlap to generate additional instances in the overlapping areas to improve the classification of the minority class in those areas. Experiments were conducted on synthetic and benchmark datasets, comparing the performance of GMO-AC with that of other methods, such as SMOTE. Experimental results show that GMO-AC yielded better AUROC and G-mean.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 자연과학대학 > 서울 수학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.