A robust method to extract polar words from unstructured text
- Authors
- He, G.; Lee, S.
- Issue Date
- Apr-2017
- Publisher
- Research India Publications
- Keywords
- Natural language processing; Polar word extraction; Preprocessing; Sentiment analysis
- Citation
- International Journal of Applied Engineering Research, v.12, no.7, pp.1345 - 1349
- Journal Title
- International Journal of Applied Engineering Research
- Volume
- 12
- Number
- 7
- Start Page
- 1345
- End Page
- 1349
- URI
- http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/7263
- ISSN
- 0973-4562
- Abstract
- Over the last decade, sentiment analysis has become popular as a way to quantify users' opinions. An important step in sentiment analysis is extracting polar words from the target text. This is easy when the text is clean and structured, as in news content. By contrast, when the text is noisy and unstructured, as in social data such as tweets and product reviews, polar word extraction becomes very hard. The problem can be even more serious for some Asian languages, such as Korean. To extract high-quality polar words from unstructured Korean text, this paper presents a robust method that detects and expands the variations of polar word roots. The experimental results show that the proposed method extracts more polar words than the basic extraction method while retaining very high precision. © Research India Publications.
- Appears in Collections
- College of Information Technology > School of Software > 1. Journal Articles