A paragraph-inserted word salad filtering algorithm
- Authors
- Jeong, Ok-Ran; Kim, Won
- Issue Date
- 2012
- Publisher
- INDERSCIENCE ENTERPRISES LTD
- Keywords
- social spam; spam filtering; word salad
- Citation
- INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, v.8, no.1, pp.56 - 71
- Journal Title
- INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES
- Volume
- 8
- Number
- 1
- Start Page
- 56
- End Page
- 71
- URI
- https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/17558
- DOI
- 10.1504/IJWGS.2012.046730
- ISSN
- 1741-1106
- Abstract
- Social spam is one type of spam which includes spamming the members of social websites by sending or posting unwanted ads or baiting them to visit particular websites. Word salad in turn is one type of social spam which aims at baiting people to visit particular websites, such as blogs, personal profiles, third-party applications built on social networking websites, etc. A word salad is created by inserting either words or paragraphs within a normal document, where the inserted words or paragraphs have no relevance to the document. The purpose of a word salad is to fool the search engines into assigning high ranks to the document. In this paper, we discuss an algorithm that filters (detects) paragraph-inserted word salads. The algorithm is based on the Singular Value Decomposition (SVD) method and, based on experiments, shows up to 81.3% accuracy.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - IT융합대학 > 소프트웨어학과 > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/17558)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.