Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Efficient Features for Function Matching in Multi-Architecture Binary Executables

Full metadata record
DC Field Value Language
dc.contributor.authorUllah, Sami-
dc.contributor.authorJin, Wenhui-
dc.contributor.authorOh, Heekuck-
dc.date.accessioned2023-12-11T08:00:22Z-
dc.date.available2023-12-11T08:00:22Z-
dc.date.issued2021-08-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/erica/handle/2021.sw.erica/116254-
dc.description.abstractBinary-binary function matching problem serves as a plinth in many reverse engineering techniques such as binary diffing, malware analysis, and code plagiarism detection. In literature, function matching is performed by first extracting function features (syntactic and semantic), and later these features are used as selection criteria to formulate an approximate 1:1 correspondence between binary functions. The accuracy of the approximation is dependent on the selection of efficient features. Although substantial research has been conducted on this topic, we have explored two major drawbacks in previous research. (i) The features are optimized only for a single architecture and their matching efficiency drops for other architectures. (ii) function matching algorithms mainly focus on the structural properties of a function, which are not inherently resilient against compiler optimizations. To resolve the architecture dependency and compiler optimizations, we benefit from the intermediate representation (IR) of function assembly and propose a set of syntactic and semantic (embedding-based) features which are efficient for multi-architectures, and sensitive to compiler-based optimizations. The proposed function matching algorithm employs one-shot encoding that is flexible to small changes and uses a KNN based approach to effectively map similar functions. We have evaluated proposed features and algorithms using various binaries, which were compiled for x86 and ARM architectures; and the prototype implementation is compared with Diaphora (an industry-standard tool), and other baseline research. Our proposed prototype has achieved a matching accuracy of approx. 96%, which is higher than the compared tools and consistent against optimizations and multi-architecture binaries.-
dc.format.extent19-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleEfficient Features for Function Matching in Multi-Architecture Binary Executables-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ACCESS.2021.3099429-
dc.identifier.scopusid2-s2.0-85111929749-
dc.identifier.wosid000679517100001-
dc.identifier.bibliographicCitationIEEE Access, v.9, pp 104950 - 104968-
dc.citation.titleIEEE Access-
dc.citation.volume9-
dc.citation.startPage104950-
dc.citation.endPage104968-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusCODE-
dc.subject.keywordAuthorFeature extraction-
dc.subject.keywordAuthorSemantics-
dc.subject.keywordAuthorOptimization-
dc.subject.keywordAuthorTools-
dc.subject.keywordAuthorSyntactics-
dc.subject.keywordAuthorMalware-
dc.subject.keywordAuthorComputer architecture-
dc.subject.keywordAuthorBinary diffing-
dc.subject.keywordAuthorefficient features-
dc.subject.keywordAuthorfunction matching-
dc.subject.keywordAuthormulti-architecture-
dc.identifier.urlhttps://www.scopus.com/record/display.uri?eid=2-s2.0-85111929749&origin=inward&txGid=5d7ef9a192a57c5a6a812bd14b94ba3f-
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF COMPUTING > ERICA 컴퓨터학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Oh, Hee kuck photo

Oh, Hee kuck
ERICA 소프트웨어융합대학 (ERICA 컴퓨터학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE