Enhancing User Experience on Q&A Platforms: Measuring Text Similarity Based on Hybrid CNN-LSTM Model for Efficient Duplicate Question Detection
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Faseeh, Muhammad | - |
dc.contributor.author | Khan, Murad Ali | - |
dc.contributor.author | Iqbal, Naeem | - |
dc.contributor.author | Qayyum, Faiza | - |
dc.contributor.author | Mehmood, Asif | - |
dc.contributor.author | Kim, Jungsuk | - |
dc.date.accessioned | 2024-04-06T06:00:19Z | - |
dc.date.available | 2024-04-06T06:00:19Z | - |
dc.date.issued | 2024-01 | - |
dc.identifier.issn | 2169-3536 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/90911 | - |
dc.description.abstract | This research introduces an innovative approach for identifying duplicate questions within the Stack Overflow community, a challenging task in NLP. Leveraging deep learning techniques, our proposed methodology combines Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks to capture both local and long-term dependencies in textual data. We employ word embeddings, specifically Google's Word2Vec and GloVe, to enhance text representation. Extensive experiments on the Stack Overflow dataset demonstrate the effectiveness of our approach, achieving an impressive accuracy of 87.09% and a recall rate of 87.%. The integration of CNN and LSTM models significantly streamlines preprocessing, making it a valuable tool for detecting duplicate questions. Future directions include extending the model to multiple languages and exploring alternative word embedding techniques. Our approach presents promising applications beyond Stack Overflow, offering solutions for identifying similar questions on various QA platforms. | - |
dc.format.extent | 15 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Enhancing User Experience on Q&A Platforms: Measuring Text Similarity Based on Hybrid CNN-LSTM Model for Efficient Duplicate Question Detection | - |
dc.type | Article | - |
dc.identifier.wosid | 001178242200001 | - |
dc.identifier.doi | 10.1109/ACCESS.2024.3358422 | - |
dc.identifier.bibliographicCitation | IEEE ACCESS, v.12, pp 34512 - 34526 | - |
dc.description.isOpenAccess | Y | - |
dc.identifier.scopusid | 2-s2.0-85183943662 | - |
dc.citation.endPage | 34526 | - |
dc.citation.startPage | 34512 | - |
dc.citation.title | IEEE ACCESS | - |
dc.citation.volume | 12 | - |
dc.type.docType | Article | - |
dc.publisher.location | 미국 | - |
dc.subject.keywordAuthor | Deep learning | - |
dc.subject.keywordAuthor | Semantics | - |
dc.subject.keywordAuthor | Brain modeling | - |
dc.subject.keywordAuthor | Task analysis | - |
dc.subject.keywordAuthor | Feature extraction | - |
dc.subject.keywordAuthor | Convolutional neural networks | - |
dc.subject.keywordAuthor | Syntactics | - |
dc.subject.keywordAuthor | Natural language processing | - |
dc.subject.keywordAuthor | Question answering (information retrieval) | - |
dc.subject.keywordAuthor | Duplicate question identification | - |
dc.subject.keywordAuthor | stack overflow | - |
dc.subject.keywordAuthor | deep learning (DL) | - |
dc.subject.keywordAuthor | word embeddings | - |
dc.subject.keywordAuthor | natural language processing (NLP) | - |
dc.subject.keywordAuthor | question-and-answer (QA) platforms | - |
dc.subject.keywordPlus | TWEETS | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114
COPYRIGHT 2020 Gachon University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.