Depth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction

annif.suggestionsalgorithms|machine learning|computer programmes|information technology|mutations|life cycle analysis|programming|data processing|software engineering|software technology|enen
annif.suggestions.linkshttp://www.yso.fi/onto/yso/p14524|http://www.yso.fi/onto/yso/p21846|http://www.yso.fi/onto/yso/p26592|http://www.yso.fi/onto/yso/p5462|http://www.yso.fi/onto/yso/p15346|http://www.yso.fi/onto/yso/p13611|http://www.yso.fi/onto/yso/p4887|http://www.yso.fi/onto/yso/p2407|http://www.yso.fi/onto/yso/p17097|http://www.yso.fi/onto/yso/p6297en
dc.contributor.authorNasser, Abdullah B.
dc.contributor.authorGhanem, Waheed Ali H.M.
dc.contributor.authorSaad, Abdul-Malik H.Y.
dc.contributor.authorAbdul-Qawy, Antar Shaddad Hamed
dc.contributor.authorGhaleb, Sanaa A.A.
dc.contributor.authorAlduais, Nayef Abdulwahab Mohammed
dc.contributor.authorDin, Fakhrud
dc.contributor.authorGhetas, Mohamed
dc.contributor.facultyfi=Tekniikan ja innovaatiojohtamisen yksikkö|en=School of Technology and Innovations|-
dc.contributor.orcidhttps://orcid.org/0000-0002-5377-999X-
dc.contributor.organizationfi=Vaasan yliopisto|en=University of Vaasa|
dc.date.accessioned2024-06-25T13:04:01Z
dc.date.accessioned2025-06-25T13:47:29Z
dc.date.available2024-06-25T13:04:01Z
dc.date.issued2024-05-20
dc.description.abstractSoftware Defect Prediction (SDP) plays a vital role in the software development life cycle as it helps identify and fix software defects. However, predicting software defects with irrelevant features and overlapping classes is challenging and can lead to lengthy training and low model accuracy. To address these challenges, this research introduces a novel Depth Linear Discrimination-Oriented Feature Selection Method based on Adaptive Sine Cosine Algorithm, named Depth Adaptive Sine Cosine Feature Selection (DASC-FS). DASC-FS integrates the Adaptive Sine Cosine Algorithm (ASCA) as a search algorithm to determine the relevant features and adopts Depth Linear Discriminant Analysis (D-LDA) to identify the discriminative features that maximize class separation. The paper proposes ASCA which is a metaheuristic algorithm meticulously designed to enhance the search capabilities of the standard Sine Cosine Algorithm (SCA). Combining the simplicity of the SCA with the efficiency of multiple mutation operators inspired by Genetic Algorithms (GA), ASCA enhances the diversity of the solutions and imparts remarkable adaptability to various situations. Furthermore, this study introduces a novel linear discriminant method, called Depth Linear Discriminant Analysis (D-LDA) to enhance the robustness of the original LDA. D-LDA systematically integrates the matrix depth concept into LDA, offering a systematic approach to address the challenges associated with scatter matrix estimation. As matrix depth measures how central or deep a particular matrix is within a distribution with respect to different directions, it is an efficient tool for computing a robust scatter matrix estimator that can handle outliers and complex data structures. The experimental results showed that DASC-FS consistently obtains the highest accuracy compared to most existing methods by integrating ASCA and D-LDA, thereby considering both accuracy optimization and class separation. The results also show that the use of multiple mutation operators in ASCA improves the search process capabilities. The results also show that the capacity of D-LDA to reduce data dimensionality and increase class separation yields highly competitive results compared to other LDAs. Finally, features related to code size and complexity have emerged as key factors for SDP because they consistently rank as important features across different classifiers and datasets. DASC-FS offers a valuable solution in domain knowledge for enhancing predictive accuracy and understanding factors contributing to software defects through enhanced search capabilities, robust scatter matrix estimation, and the ability to reduce data dimensionality.-
dc.description.notification© 2024 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).-
dc.description.reviewstatusfi=vertaisarvioitu|en=peerReviewed|-
dc.format.bitstreamtrue
dc.format.contentfi=kokoteksti|en=fulltext|-
dc.format.extent27-
dc.identifier.olddbid21239
dc.identifier.oldhandle10024/17874
dc.identifier.urihttps://osuva.uwasa.fi/handle/11111/2750
dc.identifier.urnURN:NBN:fi-fe2024062558165-
dc.language.isoeng-
dc.publisherElsevier-
dc.relation.doi10.1016/j.eswa.2024.124266-
dc.relation.ispartofjournalExpert Systems with Applications-
dc.relation.issn1873-6793-
dc.relation.issn0957-4174-
dc.relation.urlhttps://doi.org/10.1016/j.eswa.2024.124266-
dc.relation.volume253-
dc.rightsCC BY 4.0-
dc.source.identifierWOS:001248392600001-
dc.source.identifierScopus:85194467606-
dc.source.identifierhttps://osuva.uwasa.fi/handle/10024/17874
dc.subjectSoftware defect prediction-
dc.subjectFeature selection-
dc.subjectMetaheuristic algorithms-
dc.subjectSine cosine algorithm-
dc.subjectLinear discriminant analysis-
dc.subject.disciplinefi=Tietotekniikka|en=Computer Science|-
dc.subject.ysomachine learning-
dc.titleDepth linear discrimination-oriented feature selection method based on adaptive sine cosine algorithm for software defect prediction-
dc.type.okmfi=A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä|en=A1 Peer-reviewed original journal article|sv=A1 Originalartikel i en vetenskaplig tidskrift|-
dc.type.publicationarticle-
dc.type.versionpublishedVersion-

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Osuva_Nasser_Ghanem_Saad_Abdul-Qawy_Ghaleb_Alduais_Din_Ghetas_2024.pdf
Size:
5.66 MB
Format:
Adobe Portable Document Format
Description:
Article

Kokoelmat