Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Identification of Software Bugs by Analyzing Natural Language-BasedRequirements Using Optimized Deep Learning Featuresopen access

Authors
ul Haq, Qazi MazharArif, FahimAurangzeb, Khursheedul Ain, NoorKhan, Javed AliRubab, SaddafAnwar, Muhammad Shahid
Issue Date
Mar-2024
Publisher
TECH SCIENCE PRESS
Keywords
Natural language processing; software bug prediction; transfer learning; ensemble learning; feature selection
Citation
CMC-COMPUTERS MATERIALS & CONTINUA, v.78, no.3, pp 4379 - 4397
Pages
19
Journal Title
CMC-COMPUTERS MATERIALS & CONTINUA
Volume
78
Number
3
Start Page
4379
End Page
4397
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/91399
DOI
10.32604/cmc.2024.047172
ISSN
1546-2218
1546-2226
Abstract
Software project outcomes heavily depend on natural language requirements, often causing diverse interpretationsand issues like ambiguities and incomplete or faulty requirements. Researchers are exploring machine learningto predict software bugs, but a more precise and general approach is needed. Accurate bug prediction is crucialfor software evolution and user training, prompting an investigation into deep and ensemble learning methods.However, these studies are not generalized and efficient when extended to other datasets. Therefore, this paperproposed a hybrid approach combining multiple techniques to explore their effectiveness on bug identificationproblems. The methods involved feature selection, which is used to reduce the dimensionality and redundancyof features and select only the relevant ones; transfer learning is used to train and test the model on differentdatasets to analyze how much of the learning is passed to other datasets, and ensemble method is utilized toexplore the increase in performance upon combining multiple classifiers in a model. Four National Aeronauticsand Space Administration (NASA) and four Promise datasets are used in the study, showing an increase in themodel's performance by providing better Area Under the Receiver Operating Characteristic Curve (AUC-ROC)values when different classifiers were combined. It reveals that using an amalgam of techniques such as those usedin this study, feature selection, transfer learning, and ensemble methods prove helpful in optimizing the softwarebug prediction models and providing high-performing, useful end mode
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Anwar, Muhammad Shahid photo

Anwar, Muhammad Shahid
College of IT Convergence (Department of Software)
Read more

Altmetrics

Total Views & Downloads

BROWSE