Reducing overfitting of adaboost by clustering-based pruning of hard examples
- Authors
- Kim, Dae-Sun; Baek, Yeul-Min; Kim, Whoi-Yul
- Issue Date
- Jan-2013
- Publisher
- Association for Computing Machinary, Inc.
- Keywords
- Adaboost; Clustering; Hard-to-learn samples; Overfitting
- Citation
- Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2013, pp 1 - 3
- Pages
- 3
- Indexed
- SCOPUS
- Journal Title
- Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2013
- Start Page
- 1
- End Page
- 3
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/202726
- DOI
- 10.1145/2448556.2448646
- ISSN
- 0000-0000
- Abstract
- In order to solve the problem of overfitting in AdaBoost, we propose a novel AdaBoost algorithm using K-means clustering. AdaBoost is known as an effective method for improving the performance of base classifiers both theoretically and empirically. However, previous studies have shown that AdaBoost is prone to overfitting in overlapped classes. In order to overcome the overfitting problem of AdaBoost, the proposed method uses Kmeans clustering to remove hard-to-learn samples that exist on overlapped region. Since the proposed method does not consider hard-to-learn samples, it suffers less from the overfitting problem compared to conventional AdaBoost. Both synthetic and real world data were tested to confirm the validity of the proposed method.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.