Revolutionizing Urdu Sentiment Analysis : Harnessing the Power of XLM-R and GPT-2

annif.suggestionssocial media|Urdu language|data mining|Pakistan|machine learning|natural language|Twitter|deep learning|Indo-Aryan languages|text mining|enen
annif.suggestions.linkshttp://www.yso.fi/onto/yso/p20774|http://www.yso.fi/onto/yso/p17877|http://www.yso.fi/onto/yso/p5520|http://www.yso.fi/onto/yso/p105965|http://www.yso.fi/onto/yso/p21846|http://www.yso.fi/onto/yso/p26762|http://www.yso.fi/onto/yso/p24097|http://www.yso.fi/onto/yso/p39324|http://www.yso.fi/onto/yso/p24821|http://www.yso.fi/onto/yso/p27112en
dc.contributor.authorAshraf, Muhammad Rehan
dc.contributor.authorHussain, Muzammal
dc.contributor.authorJaffar, M. Arfan
dc.contributor.authorRamay, Waheed Yousuf
dc.contributor.authorFaheem, Muhammad
dc.contributor.facultyfi=Tekniikan ja innovaatiojohtamisen yksikkö|en=School of Technology and Innovations|-
dc.contributor.orcidhttps://orcid.org/0000-0003-4628-4486-
dc.contributor.organizationfi=Vaasan yliopisto|en=University of Vaasa|
dc.date.accessioned2025-06-02T14:18:10Z
dc.date.accessioned2025-06-25T14:05:16Z
dc.date.available2025-06-02T14:18:10Z
dc.date.issued2024-07-18
dc.description.abstractSentiment analysis extracts valuable insights from textual sources using computation, textual or systematic analysis, and natural language processing. It identifies and measures the attitudes, beliefs, and emotional states individuals express through text data. Recent research on sentiment analysis has largely focused on the English language; therefore, low-resource languages are getting much less attention. Conducting sentiment analysis of low-resource languages is difficult because large datasets and related repositories are unavailable. This paper creates a new dataset for low-resource language (Urdu) to address this issue. The dataset, namely LUCSA-23, consists of more than 65,000 user reviews from various genres, including food, sports, showbiz, apps, and political reviews from developing countries, i.e., Pakistan. Urdu domain experts further annotate the created dataset. This paper proposes an Urdu sentiment analysis approach leveraging the transformer model, i.e., XLM-R and GPT-2. It preprocesses the Urdu text input, generates BERT embeddings, and passes them to the proposed classifier as input for sentiment classification. The proposed classifier is compared with machine/deep/embedded classifiers to evaluate its performance. The findings show that the proposed classifiers outperform existing state-of-the-art approaches with an accuracy of 95%.-
dc.description.notification©2024 The Authors. This work is licensed under a Creative Commons Attribution (BY) 4.0 License.-
dc.description.reviewstatusfi=vertaisarvioitu|en=peerReviewed|-
dc.format.bitstreamtrue
dc.format.contentfi=kokoteksti|en=fulltext|-
dc.format.extent15-
dc.format.pagerange99779-99793-
dc.identifier.olddbid23942
dc.identifier.oldhandle10024/19674
dc.identifier.urihttps://osuva.uwasa.fi/handle/11111/3300
dc.identifier.urnURN:NBN:fi-fe2025060257996-
dc.language.isoeng-
dc.publisherIEEE-
dc.relation.doi10.1109/ACCESS.2024.3429496-
dc.relation.ispartofjournalIEEE access-
dc.relation.issn2169-3536-
dc.relation.urlhttps://doi.org/10.1109/ACCESS.2024.3429496-
dc.relation.volume12-
dc.rightsCC BY 4.0-
dc.source.identifierWOS:001276323400001-
dc.source.identifier2-s2.0-85199099391-
dc.source.identifierhttps://osuva.uwasa.fi/handle/10024/19674
dc.subjectSentiment analysis; Web sites; Analytical models; Accuracy; Video on demand; Reviews; Electronic mail; Natural language processing; Urdu; XLM-R; GPT-2; classification; BERT-
dc.subject.disciplinefi=Tietotekniikka|en=Computer Science|-
dc.subject.ysoUrdu language-
dc.subject.ysodeep learning-
dc.titleRevolutionizing Urdu Sentiment Analysis : Harnessing the Power of XLM-R and GPT-2-
dc.type.okmfi=A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä|en=A1 Peer-reviewed original journal article|sv=A1 Originalartikel i en vetenskaplig tidskrift|-
dc.type.publicationarticle-
dc.type.versionpublishedVersion-

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Osuva_Ashraf_Hussain_Jaffar_Ramay_Faheem_2024.pdf
Size:
1.77 MB
Format:
Adobe Portable Document Format

Kokoelmat