Intelligent Hybrid Feature Selection for Textual Sentiment Classification
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Khan, Jawad | - |
dc.contributor.author | Alam, Aftab | - |
dc.contributor.author | Lee, Youngmoon | - |
dc.date.accessioned | 2022-07-18T01:31:10Z | - |
dc.date.available | 2022-07-18T01:31:10Z | - |
dc.date.created | 2021-12-06 | - |
dc.date.issued | 2021-10 | - |
dc.identifier.issn | 2169-3536 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/108171 | - |
dc.description.abstract | Sentiment Analysis (SA) aims to extract useful information from online Unstructured User-Generated Contents (UUGC) and classify it into positive and negative classes. State-of-the-art SA techniques suffer from a high-dimensional feature space because the UUGC contains noisy and irrelevant features. Researchers have proposed feature extraction and selection techniques to reduce this high-dimensional feature space, but they fall short in extracting and selecting the most effective sentiment features for sentiment model learning. Effective feature extraction and selection matter for SA because they can boost the learning algorithm's predictive performance while reducing the high-dimensional feature space. To address these concerns, we propose an Intelligent Hybrid Feature Selection for Sentiment Analysis (IHFSSA) based on ensemble learning methods. IHFSSA first identifies sentiment features in the review text using the Penn Treebank part-of-speech tagset and integrated Wide Coverage Sentiment Lexicons (WCSL). A subset of the sentiment features is then selected with a fast and simple rank-based ensemble of multiple filter feature selection methods. The selected sentiment features are further refined by applying a wrapper-based backward feature selection method. Finally, for textual sentiment classification, the well-known classification algorithms Support Vector Machine (SVM), Naive Bayes (NB), and Generalized Linear Model (GLM) are trained in an ensemble model on the refined sentiment feature set. An in-depth evaluation using heterogeneous domain benchmark datasets demonstrates that IHFSSA outperforms existing SA techniques. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Intelligent Hybrid Feature Selection for Textual Sentiment Classification | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Lee, Youngmoon | - |
dc.identifier.doi | 10.1109/ACCESS.2021.3118982 | - |
dc.identifier.scopusid | 2-s2.0-85117133311 | - |
dc.identifier.wosid | 000709071600001 | - |
dc.identifier.bibliographicCitation | IEEE ACCESS, v.9, pp.140590 - 140608 | - |
dc.relation.isPartOf | IEEE ACCESS | - |
dc.citation.title | IEEE ACCESS | - |
dc.citation.volume | 9 | - |
dc.citation.startPage | 140590 | - |
dc.citation.endPage | 140608 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordPlus | MODEL | - |
dc.subject.keywordPlus | SVM | - |
dc.subject.keywordPlus | MACHINE | - |
dc.subject.keywordAuthor | Feature extraction | - |
dc.subject.keywordAuthor | Support vector machines | - |
dc.subject.keywordAuthor | Motion pictures | - |
dc.subject.keywordAuthor | Entropy | - |
dc.subject.keywordAuthor | Sentiment analysis | - |
dc.subject.keywordAuthor | Semantics | - |
dc.subject.keywordAuthor | Social networking (online) | - |
dc.subject.keywordAuthor | Sentiment classification | - |
dc.subject.keywordAuthor | hybrid feature selection | - |
dc.subject.keywordAuthor | ensemble learning | - |
dc.subject.keywordAuthor | linguistic semantic rules | - |
dc.subject.keywordAuthor | wide coverage sentiment lexicons | - |
dc.subject.keywordAuthor | natural language processing | - |
dc.identifier.url | https://www.scopus.com/record/display.uri?eid=2-s2.0-85117133311&origin=inward&txGid=d9eb16bef0a5fab7564c2323e4c03505 | - |
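
The abstract mentions a "rank-based ensemble of multiple filters" as the first selection stage. The following is a minimal sketch of that general idea, not the authors' implementation: the record does not say which filters IHFSSA combines, so the choice of chi-square and information gain, the mean-rank aggregation, all function names, and the toy reviews are assumptions made for illustration.

```python
import math

def contingency(docs, term):
    """Count documents by (term present?, label) for one term.

    Returns (n11, n10, n01, n00): present/positive, present/negative,
    absent/positive, absent/negative.
    """
    n11 = n10 = n01 = n00 = 0
    for tokens, label in docs:
        if term in tokens:
            if label == 1:
                n11 += 1
            else:
                n10 += 1
        elif label == 1:
            n01 += 1
        else:
            n00 += 1
    return n11, n10, n01, n00

def chi_square(n11, n10, n01, n00):
    """Chi-square statistic of the 2x2 term/class table (assumed filter 1)."""
    n = n11 + n10 + n01 + n00
    den = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    return 0.0 if den == 0 else n * (n11 * n00 - n10 * n01) ** 2 / den

def entropy(counts):
    """Shannon entropy in bits of a list of class counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum(c / total * math.log2(c / total) for c in counts if c)

def info_gain(n11, n10, n01, n00):
    """Information gain of the class given term presence (assumed filter 2)."""
    n = n11 + n10 + n01 + n00
    if n == 0:
        return 0.0
    present, absent = n11 + n10, n01 + n00
    conditional = (present / n) * entropy([n11, n10]) \
        + (absent / n) * entropy([n01, n00])
    return entropy([n11 + n01, n10 + n00]) - conditional

def ranks(scores):
    """Map each feature to its rank (1 = highest score); ties broken by name."""
    ordered = sorted(scores, key=lambda f: (-scores[f], f))
    return {f: i for i, f in enumerate(ordered, start=1)}

def ensemble_select(docs, k, filters=(chi_square, info_gain)):
    """Keep the k features with the best mean rank across all filters."""
    vocab = sorted(set().union(*(tokens for tokens, _ in docs)))
    tables = {t: contingency(docs, t) for t in vocab}
    per_filter = [ranks({t: f(*tables[t]) for t in vocab}) for f in filters]
    mean_rank = {t: sum(r[t] for r in per_filter) / len(per_filter)
                 for t in vocab}
    return sorted(vocab, key=lambda t: (mean_rank[t], t))[:k]

# Hypothetical toy reviews: (set of tokens, label), 1 = positive, 0 = negative.
TOY_REVIEWS = [
    ({"good", "movie"}, 1), ({"great", "movie"}, 1), ({"good", "plot"}, 1),
    ({"bad", "movie"}, 0), ({"awful", "plot"}, 0), ({"bad", "acting"}, 0),
]

if __name__ == "__main__":
    print(ensemble_select(TOY_REVIEWS, k=2))
```

Averaging ranks rather than raw scores keeps filters with different scales comparable, which is one plausible reason to describe such an ensemble as "fast and simple". The wrapper-based backward selection and the SVM/NB/GLM ensemble described in the abstract would operate on the subset this stage returns and are not sketched here.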