Detailed Information


Detection Method for Randomly Generated User IDs: Lift the Curse of Dimensionality

Full metadata record
DC Field | Value | Language
dc.contributor.author | Ro, Inwoo | -
dc.contributor.author | Kang, Boojoong | -
dc.contributor.author | Seo, Choonghyun | -
dc.contributor.author | Im, Eul Gyu | -
dc.date.accessioned | 2023-07-05T03:52:54Z | -
dc.date.available | 2023-07-05T03:52:54Z | -
dc.date.created | 2022-09-08 | -
dc.date.issued | 2022-08 | -
dc.identifier.issn | 2169-3536 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/186183 | -
dc.description.abstract | Internet services are essential to daily life, and user accounts are usually required to download or browse multimedia content from service providers such as Yahoo, Google, and YouTube. Attackers who act maliciously against these services use fake user accounts to hide their identity, or to continue their activity even after being caught by the service's detection system. Generating user identification (ID) strings with a random string generation algorithm is a common way to create large numbers of fake user accounts. To detect such IDs and defend against these attacks, researchers have proposed models that detect randomly generated IDs. Among these, the n-gram-based model using term frequency-inverse document frequency (TF-IDF) is regarded as the state of the art, but n-gram-based approaches suffer from the curse of dimensionality because the sparsity of the feature vector increases exponentially as n grows. As a result, n cannot be increased, which limits further improvement in detection accuracy. This paper proposes two methods to detect randomly generated IDs more accurately. The first avoids the curse of dimensionality by compressing the feature dimension. The second reduces false positives by using pattern matching and the Bhattacharyya distance. We tested our method with about 3 million normal user IDs collected from a real portal service, 1 million IDs generated by a random string generation algorithm, and 8,541 IDs found after being used for malicious behavior in real portal services. The experimental results showed that the proposed method improves both detection accuracy and inference performance. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | Detection Method for Randomly Generated User IDs: Lift the Curse of Dimensionality | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Im, Eul Gyu | -
dc.identifier.doi | 10.1109/ACCESS.2022.3198687 | -
dc.identifier.scopusid | 2-s2.0-85137897247 | -
dc.identifier.wosid | 000844077200001 | -
dc.identifier.bibliographicCitation | IEEE ACCESS, v.10, pp.86020 - 86028 | -
dc.relation.isPartOf | IEEE ACCESS | -
dc.citation.title | IEEE ACCESS | -
dc.citation.volume | 10 | -
dc.citation.startPage | 86020 | -
dc.citation.endPage | 86028 | -
dc.type.rims | ART | -
dc.type.docType | Article | -
dc.description.journalClass | 1 | -
dc.description.isOpenAccess | Y | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Telecommunications | -
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.relation.journalWebOfScienceCategory | Telecommunications | -
dc.subject.keywordPlus | Computer crime | -
dc.subject.keywordPlus | Inverse problems | -
dc.subject.keywordPlus | Multimedia services | -
dc.subject.keywordPlus | Pattern matching | -
dc.subject.keywordPlus | Text processing | -
dc.subject.keywordPlus | Curse of dimensionality | -
dc.subject.keywordPlus | Detection accuracy | -
dc.subject.keywordPlus | Detection methods | -
dc.subject.keywordPlus | Generation algorithm | -
dc.subject.keywordPlus | Identity management systems | -
dc.subject.keywordPlus | N-grams | -
dc.subject.keywordPlus | Portal services | -
dc.subject.keywordPlus | Random string | -
dc.subject.keywordPlus | User ID | -
dc.subject.keywordPlus | Web-sites | -
dc.subject.keywordPlus | Authentication | -
dc.subject.keywordAuthor | Authentication | -
dc.subject.keywordAuthor | computer crime | -
dc.subject.keywordAuthor | identity management systems | -
dc.subject.keywordAuthor | web sites | -
dc.identifier.url | https://ieeexplore.ieee.org/document/9856640 | -
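
The abstract names two ideas that a short sketch may make more concrete: the exponential growth of the n-gram feature space with n, and the Bhattacharyya distance between two discrete distributions. The Python below is only an illustration under stated assumptions and is not the authors' implementation; this record does not describe how the paper compresses features or where it applies the distance. The 36-character alphabet and the names `ngram_space_size`, `char_distribution`, and `bhattacharyya_distance` are hypothetical.

```python
import math
from collections import Counter

# Assumption: IDs use lowercase letters and digits only (36 symbols).
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789"


def ngram_space_size(n, alphabet=ALPHABET):
    """Number of distinct n-grams possible, i.e. the feature dimension for size n."""
    return len(alphabet) ** n


def char_distribution(ids, alphabet=ALPHABET):
    """Relative frequency of each alphabet character across a list of IDs."""
    counts = Counter(ch for s in ids for ch in s if ch in alphabet)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no characters from the alphabet found")
    return [counts[ch] / total for ch in alphabet]


def bhattacharyya_distance(p, q):
    """D_B(p, q) = -ln(sum_i sqrt(p_i * q_i)) for two discrete distributions."""
    bc = sum(math.sqrt(a * b) for a, b in zip(p, q))
    if bc == 0.0:
        return math.inf  # the distributions share no support
    # Clamp at zero to absorb floating-point rounding when bc is slightly above 1.
    return max(0.0, -math.log(bc))


if __name__ == "__main__":
    # The feature dimension grows as 36**n under the assumed alphabet.
    for n in range(1, 5):
        print(n, ngram_space_size(n))

    # Toy ID lists, invented here purely for illustration.
    normal = char_distribution(["alice", "bob2020", "charlie"])
    other = char_distribution(["x9q2zk", "7vj3pw", "q8z1xk"])
    print(bhattacharyya_distance(normal, other))
```

With the assumed alphabet, going from n = 3 to n = 4 multiplies the dimension by 36, while a single ID contributes only a handful of n-grams, which is the sparsity problem the abstract refers to. The distance is 0 for identical distributions and grows as they overlap less.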
Appears in Collections
College of Engineering (Seoul) > School of Computer Software (Seoul) > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Im, Eul Gyu
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)