Distinct Views Improve Generalization and Robustness: Combinations of Augmentations with Different Featuresopen access
- Authors
- Kim, Keon; Kim, Hyun Woo; Choi, Yong Suk
- Issue Date
- Mar-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Image color analysis; Robustness; Shape; Distortion; Visualization; Training; Data models; Data augmentation; Perturbation methods; Transforms; Adversarial training; corruption robustness; data augmentation; image classification
- Citation
- IEEE Access, v.13, pp 50353 - 50366
- Pages
- 14
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 13
- Start Page
- 50353
- End Page
- 50366
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207126
- DOI
- 10.1109/ACCESS.2025.3552110
- ISSN
- 2169-3536
2169-3536
- Abstract
- Data augmentation is an effective method for improving deep-learning model performance. In the vision domain, various augmentation studies have been conducted to enhance generalization ability and robustness against corruption. However, recent augmentation studies have focused on transforming data to be more diverse and challenging. This approach can prevent models from properly learning key features of objects, such as texture and shape. In response, unlike traditional methods that employ a single augmentation strategy, our method simultaneously utilizes three distinct augmentations, each with different characteristics. We transform the images into color-preserving, shape-preserving, and diversity-enhancing views. More specifically, to ensure the model still captures the key factors of visual information, we utilize two feature-preserving views, one with local color(texture) and the other with global shape information. The third view is transformed by an augmentation that enhances diversity. By utilizing those three distinct augmentations, DV (Distinct Views) helps the model effectively learn all the important features of visual information. To further improve robustness against corruption, we incorporate adversarial perturbations into the third (diversity-enhancing) view, unifying additional hardness and diversity. Experimental results show that DV considerably enhances generalization and robustness against corruption, achieving state-of-the-art performance on various image benchmark datasets. Furthermore, we confirmed that DV is quite effective even for Transformer-based models, which typically underperform on small datasets.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.