Word Vector Representation and Ngram based Malicious URL Detection using Machine Learning

황성운

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Word Vector Representation and Ngram based Malicious URL Detection using Machine Learning

Authors: 황성운

Issue Date: 2-Feb-2017

Publisher: ICGHIT

Citation: ICGHIT 2018, v.Part 1, no.Part 1, pp.93 - 95

Journal Title: ICGHIT 2018

Volume: Part 1

Number: Part 1

Start Page: 93

End Page: 95

URI: https://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/6101

Abstract: Most Intrusion Detection Systems (IDS) nowadays are signature-based. They are very useful and accurate for detecting known attacks. However, they are inefficient with unknown attacks. One of promising methods to overcome this problem is to apply machine learning detection techniques. In this paper, we present a new lexical approach to classify URLs by using machine learning techniques which patternize the URLs. Our approach is based on word vector representation and ngram models on the blacklist word. Our experimentation shows that our approach can achieve a high degree of accuracy at 97.1% in the case of SVM(Support Vector Machine). Moreover, it can maintain a high level of robustness with 0.97 precision and 0.93 recall scores.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Science and Technology > Department of Computer and Information Communications Engineering > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :2,623,089; Today View :1,710

RSS_1.0 RSS_2.0 ATOM_1.0

94, Wausan-ro, Mapo-gu, Seoul, 04066, Korea02-320-1314

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE