Cited 0 time in
XMP: A CROSS-ATTENTION MULTI-SCALE PERFORMER FOR FILE FRAGMENT CLASSIFICATION
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Park, Jeong Gyu | - |
| dc.contributor.author | Liu, Sisung | - |
| dc.contributor.author | Hong, Je Hyeong | - |
| dc.date.accessioned | 2026-03-12T00:30:21Z | - |
| dc.date.available | 2026-03-12T00:30:21Z | - |
| dc.date.issued | 2024-04 | - |
| dc.identifier.issn | 0736-7791 | - |
| dc.identifier.issn | 1520-6149 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/211203 | - |
| dc.description.abstract | File fragment classification (FFC) is the task of identifying the file type given a small fraction of binary data, and serves a crucial role in digital forensics and cybersecurity. Recent studies have adopted convolutional neural networks (CNNs) for this problem, significantly improving the accuracy over the traditional methods relying on handcrafted features. In this paper, we aim to expand on the recent performance gain by better leveraging the large amount of digital files available for training. We propose to achieve this by employing a Transformer encoder-based network known for its weak inductive bias suited for large-scale training. Our model, XMP, is inspired by the CrossViT architecture for image recognition and utilizes multi-scale self and cross-attentions between CNN features extracted from the byte n-grams of input binary data. Experimental results on the latest public dataset show XMP achieving state-of-the-art accuracies in almost all scenarios without need for additional preprocessing of binary data such as bit shifting, demonstrating the effectiveness of the Transformer-based architecture for FFC. The benefit of each proposed component is assessed through ablation study. Our code is available at github.com/pank40/xmp. | - |
| dc.format.extent | 5 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | XMP: A CROSS-ATTENTION MULTI-SCALE PERFORMER FOR FILE FRAGMENT CLASSIFICATION | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ICASSP48485.2024.10447626 | - |
| dc.identifier.scopusid | 2-s2.0-105002995136 | - |
| dc.identifier.wosid | 001285850004148 | - |
| dc.identifier.bibliographicCitation | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 4505 - 4509 | - |
| dc.citation.title | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | - |
| dc.citation.startPage | 4505 | - |
| dc.citation.endPage | 4509 | - |
| dc.type.docType | Proceedings Paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Acoustics | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Imaging Science & Photographic Technology | - |
| dc.relation.journalWebOfScienceCategory | Acoustics | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Imaging Science & Photographic Technology | - |
| dc.subject.keywordPlus | Convolutional Neural Network | - |
| dc.subject.keywordPlus | Binary Data | - |
| dc.subject.keywordPlus | Image Recognition | - |
| dc.subject.keywordPlus | Convolutional Neural Network Features | - |
| dc.subject.keywordPlus | File Type | - |
| dc.subject.keywordPlus | Inductive Bias | - |
| dc.subject.keywordPlus | Bit-shift | - |
| dc.subject.keywordPlus | Model Performance | - |
| dc.subject.keywordPlus | Training Data | - |
| dc.subject.keywordPlus | Support Vector Machine | - |
| dc.subject.keywordPlus | Data Augmentation | - |
| dc.subject.keywordPlus | Attention Mechanism | - |
| dc.subject.keywordPlus | Multilayer Perceptron | - |
| dc.subject.keywordPlus | Output Feature | - |
| dc.subject.keywordPlus | Hyperparameter Tuning | - |
| dc.subject.keywordPlus | Multi-scale Features | - |
| dc.subject.keywordPlus | Large-scale Features | - |
| dc.subject.keywordPlus | Multi-scale Feature Extraction | - |
| dc.subject.keywordPlus | Position Embedding | - |
| dc.subject.keywordPlus | Small-scale Features | - |
| dc.subject.keywordPlus | Bytes Of Data | - |
| dc.subject.keywordPlus | Attention Matrix | - |
| dc.subject.keywordPlus | Key Matrix | - |
| dc.subject.keywordAuthor | file fragment classification | - |
| dc.subject.keywordAuthor | Transformer | - |
| dc.subject.keywordAuthor | multi-scale attention | - |
| dc.subject.keywordAuthor | cross-attention | - |
| dc.subject.keywordAuthor | performer | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10447626 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
