Exploring 1D Data Augmentation Techniques for Improved File Fragment Classification
- Authors
- Kim, Mincheol; Liu, Sisung; Kim, Hyeongsik; Hong, Je Hyeong
- Issue Date
- Sep-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- data augmentation; digital forensics; file fragment classification
- Citation
- 2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025, pp 1 - 6
- Pages
- 6
- Indexed
- SCOPUS
- Journal Title
- 2025 International Technical Conference on Circuits/Systems, Computers, and Communications, ITC-CSCC 2025
- Start Page
- 1
- End Page
- 6
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208923
- DOI
- 10.1109/ITC-CSCC66376.2025.11137577
- ISSN
- 2997-7401
2997-741X
- Abstract
- File Fragment Classification (FFC) is essential for digital forensics, facilitating file type identification without relying on metadata or intact headers. Despite advances in model architectures for FFC, the potential of data augmentation for improving model robustness and generalization has not been extensively explored, especially for 1D byte sequence data. This study investigates the applicability and impact of two well-known image-based augmentation techniques, Masking and CutMix, on 1D byte-sequence-based FFC models. We conducted comparative experiments applying Masking, CutMix, and the previously proposed Gaussian Bit Flip (GBFlip) augmentation across several model architectures, including FiFTy, DSCNN, ByteRCNN, ResNet1D, and XMP. Experimental results reveal that augmentation effectiveness varies depending on model architecture. Notably, both FragMask and FragMix yielded performance gains in deeper models such as XMP and ResNet1D, across both short and long fragments.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.