Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

OCR Meets the DarkWeb: Identifying the Content Type Regarding Illegal and Cybercrime

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Donghyun-
dc.contributor.authorJeon, Seungho-
dc.contributor.authorShin, Jiho-
dc.contributor.authorSeo, Jung Taek-
dc.date.accessioned2024-06-04T06:30:25Z-
dc.date.available2024-06-04T06:30:25Z-
dc.date.issued2024-01-
dc.identifier.issn0302-9743-
dc.identifier.issn1611-3349-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/91401-
dc.description.abstractThe dark web provides features such as encryption and routing changes to ensure anonymity and make tracking difficult. Cybercrimes exploit the characteristics to gain revenue by distributing illegal and cybercrime content through the dark web and take a financial benefit as a business strategy. Illegal and cybercrime content includes drug and arms trafficking, counterfeit documents, malware, and the sale of personal information. A text crawling system in dark web has been developed and researched to counter illegal and cybercrime content distribution. However, because traditional text crawler in the dark web collects all text, identifying the exact data type can be difficult if dark web pages serve different types of illegal and cybercrime content. In this paper, we propose amethod of using the text embedded within images to accurately identify the types of illegal and cybercrime content on the dark web. We conducted the experiments with a combination of text and texts from both web page and images to accurately identify illegal and cybercrime content types. We collected keywords for the three types of illegal and cybercrime content. The distribution and types of illegal and cybercrime content were identified by calculating whether the collected keywords were included in dark web pages. Through experiments, we confirmed that using text embedded within images improves performance. Our proposed method accurately identified over 90% of dark web pages where drugs were distributed.-
dc.format.extent12-
dc.language영어-
dc.language.isoENG-
dc.publisherSPRINGER-VERLAG SINGAPORE PTE LTD-
dc.titleOCR Meets the DarkWeb: Identifying the Content Type Regarding Illegal and Cybercrime-
dc.typeArticle-
dc.identifier.wosid001206151300016-
dc.identifier.doi10.1007/978-981-99-8024-6_16-
dc.identifier.bibliographicCitationINFORMATION SECURITY APPLICATIONS, WISA 2023, v.14402, pp 201 - 212-
dc.description.isOpenAccessN-
dc.identifier.scopusid2-s2.0-85182608939-
dc.citation.endPage212-
dc.citation.startPage201-
dc.citation.titleINFORMATION SECURITY APPLICATIONS, WISA 2023-
dc.citation.volume14402-
dc.type.docTypeProceedings Paper-
dc.publisher.location싱가폴-
dc.subject.keywordAuthorDark Web-
dc.subject.keywordAuthorCrawler-
dc.subject.keywordAuthorIllegal and Cybercrime Content-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher SEO, JUNGTAEK photo

SEO, JUNGTAEK
College of IT Convergence (컴퓨터공학부(스마트보안전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE