Efficient Phishing Website Detection via HTML Tag Sequence Analysis Using Encoder Models
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ahn, Jemin | - |
dc.contributor.author | Xiong, Zuobin | - |
dc.contributor.author | Cho, Homook | - |
dc.contributor.author | Kang, Kyungtae | - |
dc.contributor.author | Son, Junggab | - |
dc.date.accessioned | 2025-09-30T08:00:24Z | - |
dc.date.available | 2025-09-30T08:00:24Z | - |
dc.date.issued | 2025-08 | - |
dc.identifier.issn | 1095-2055 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/126570 | - |
dc.description.abstract | The rapid proliferation of Internet of Things (IoT) devices has led to a significant increase in the number of network users, prompting advancements in security mechanisms. Consequently, traditional attacks targeting specific vulnerabilities have become less effective due to these enhanced defense systems, leading attackers to increasingly adopt phishing strategies as a primary means of bypassing security measures. Among these, phishing websites have been increasing rapidly, exploiting the carelessness of countless users. In response, numerous phishing website detection methods have been investigated, with machine learning-based approaches emerging as a leading strategy. However, these machine learning-based classification methods require substantial computational resources, posing challenges for their direct application in the already widespread IoT environment. To address these challenges, we propose an efficient phishing website detection method based on HTML tag sequences, the core structural elements of websites, by leveraging encoder models known for their effectiveness in classifying sequential data. Our approach also incorporates a customized tokenizer and dictionary specifically tailored for HTML tags. Experiments conducted on publicly available datasets demonstrate that the proposed method achieves over 95% accuracy across key performance metrics. Furthermore, comparative analyses highlight several advantages of our method, including reduced model size and faster detection times compared to existing approaches. | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Efficient Phishing Website Detection via HTML Tag Sequence Analysis Using Encoder Models | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/ICCCN65249.2025.11133972 | - |
dc.identifier.scopusid | 2-s2.0-105016243147 | - |
dc.identifier.bibliographicCitation | Proceedings - International Conference on Computer Communications and Networks, ICCCN | - |
dc.citation.title | Proceedings - International Conference on Computer Communications and Networks, ICCCN | - |
dc.type.docType | Conference Paper | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordAuthor | Classification (of Information) | - |
dc.subject.keywordAuthor | Computer Crime | - |
dc.subject.keywordAuthor | Html | - |
dc.subject.keywordAuthor | Internet Of Things | - |
dc.subject.keywordAuthor | Learning Algorithms | - |
dc.subject.keywordAuthor | Learning Systems | - |
dc.subject.keywordAuthor | Machine Learning | - |
dc.subject.keywordAuthor | Network Security | - |
dc.subject.keywordAuthor | Phishing | - |
dc.subject.keywordAuthor | Websites | - |
dc.subject.keywordAuthor | Defence Systems | - |
dc.subject.keywordAuthor | Detection Methods | - |
dc.subject.keywordAuthor | Html Tags | - |
dc.subject.keywordAuthor | Machine-learning | - |
dc.subject.keywordAuthor | Network Users | - |
dc.subject.keywordAuthor | Phishing Websites | - |
dc.subject.keywordAuthor | Security Measure | - |
dc.subject.keywordAuthor | Security Mechanism | - |
dc.subject.keywordAuthor | Sequence Analysis | - |
dc.subject.keywordAuthor | Signal Encoding | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.