Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

GMO-AC: Gaussian-Based Minority Oversampling With Adaptive Outlier Filtering and Class Overlap Weightingopen access

Authors
Yang, Seung JeeCha, Kyungjoon
Issue Date
Dec-2024
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
GMM; imbalanced classification; oversampling
Citation
IEEE Access, v.12, pp 192494 - 192509
Pages
16
Indexed
SCIE
SCOPUS
Journal Title
IEEE Access
Volume
12
Start Page
192494
End Page
192509
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206619
DOI
10.1109/ACCESS.2024.3518573
ISSN
2169-3536
2169-3536
Abstract
Imbalanced data significantly affects the performance of standard classification models. Data-level approaches primarily use oversampling methods, such as the synthetic minority oversampling technique (SMOTE), to address this problem. However, because methods such as SMOTE generate instances via linear interpolation, the synthetic data space may appear similar to a star or tree. Thus, some methods apply Gaussian weights to linear interpolation to address this issue. In this study, we propose a Gaussian-based minority oversampling with adaptive outlier filtering and class overlap weighting (GMO-AC) for imbalanced datasets. Unlike existing oversampling techniques, our method employs a Gaussian mixture model (GMM) to approximate the distribution of the minority class and generate new instances that follow this distribution. As outliers can affect the distribution approximation, GMO-AC identifies outliers by calculating the Mahalanobis distance for each instance and the covariance determinant. This process uses segmented linear regression to assess whether an instance falls outside the expected distribution. In addition, we defined the degree of class overlap to generate additional instances in the overlapping areas to improve the classification of the minority class in those areas. Experiments were conducted on synthetic and benchmark datasets, comparing the performance of GMO-AC with that of other methods, such as SMOTE. Experimental results show that GMO-AC yielded better AUROC and G-mean.
Files in This Item
Go to Link
Appears in
Collections
서울 자연과학대학 > 서울 수학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE