Engineers, Aware! Commercial Tools Disagree on Social Media Sentiment : Analyzing the Sentiment Bias of Four Major Tools

annif.suggestionssocial media|Twitter|data mining|users|blogs|news|media|applications (computer programmes)|online services|copyright|enen
annif.suggestions.linkshttp://www.yso.fi/onto/yso/p20774|http://www.yso.fi/onto/yso/p24097|http://www.yso.fi/onto/yso/p5520|http://www.yso.fi/onto/yso/p16550|http://www.yso.fi/onto/yso/p2719|http://www.yso.fi/onto/yso/p13915|http://www.yso.fi/onto/yso/p2445|http://www.yso.fi/onto/yso/p8456|http://www.yso.fi/onto/yso/p6624|http://www.yso.fi/onto/yso/p2346en
dc.contributor.authorJung, Soon-Gyo
dc.contributor.authorSalminen, Joni
dc.contributor.authorJansen, Bernard J.
dc.contributor.departmentfi=Ei tutkimusalustaa|en=No platform|-
dc.contributor.facultyfi=Markkinoinnin ja viestinnän yksikkö|en=School of Marketing and Communication|-
dc.contributor.orcidhttps://orcid.org/0000-0003-3230-0561-
dc.contributor.organizationfi=Vaasan yliopisto|en=University of Vaasa|
dc.date.accessioned2022-06-22T07:35:21Z
dc.date.accessioned2025-06-25T13:46:23Z
dc.date.available2022-06-22T07:35:21Z
dc.date.issued2022-06-17
dc.description.abstractLarge commercial sentiment analysis tools are often deployed in software engineering due to their ease of use. However, it is not known how accurate these tools are, and whether the sentiment ratings given by one tool agree with those given by another tool. We use two datasets - (1) NEWS consisting of 5,880 news stories and 60K comments from four social media platforms: Twitter, Instagram, YouTube, and Facebook; and (2) IMDB consisting of 7,500 positive and 7,500 negative movie reviews - to investigate the agreement and bias of four widely used sentiment analysis (SA) tools: Microsoft Azure (MS), IBM Watson, Google Cloud, and Amazon Web Services (AWS). We find that the four tools assign the same sentiment on less than half (48.1%) of the analyzed content. We also find that AWS exhibits neutrality bias in both datasets, Google exhibits bi-polarity bias in the NEWS dataset but neutrality bias in the IMDB dataset, and IBM and MS exhibit no clear bias in the NEWS dataset but have bi-polarity bias in the IMDB dataset. Overall, IBM has the highest accuracy relative to the known ground truth in the IMDB dataset. Findings indicate that psycholinguistic features - especially affect, tone, and use of adjectives - explain why the tools disagree. Engineers are urged caution when implementing SA tools for applications, as the tool selection affects the obtained sentiment labels.-
dc.description.notification© Owner/Author(s). ACM 2022. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in Proceedings of the ACM on Human-Computer Interaction, https://doi.org/10.1145/3532203.-
dc.description.reviewstatusfi=vertaisarvioitu|en=peerReviewed|-
dc.format.bitstreamtrue
dc.format.contentfi=kokoteksti|en=fulltext|-
dc.format.extent20-
dc.format.pagerange1-20-
dc.identifier.olddbid16654
dc.identifier.oldhandle10024/14421
dc.identifier.urihttps://osuva.uwasa.fi/handle/11111/2709
dc.identifier.urnURN:NBN:fi-fe2022062248556-
dc.language.isoeng-
dc.publisherACM-
dc.relation.doi10.1145/3532203-
dc.relation.ispartofjournalProceedings of the ACM on Human-Computer Interaction-
dc.relation.issn2573-0142-
dc.relation.issueEICS-
dc.relation.urlhttps://doi.org/10.1145/3532203-
dc.relation.volume6-
dc.source.identifierhttps://osuva.uwasa.fi/handle/10024/14421
dc.subjectComputing methodologies-
dc.subjectMachine learning approaches-
dc.subjectsentiment analysis-
dc.subjectevaluation-
dc.subjectbias-
dc.subjectagreement-
dc.subject.disciplinefi=Markkinointi|en=Marketing|-
dc.titleEngineers, Aware! Commercial Tools Disagree on Social Media Sentiment : Analyzing the Sentiment Bias of Four Major Tools-
dc.type.okmfi=A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä|en=A1 Peer-reviewed original journal article|sv=A1 Originalartikel i en vetenskaplig tidskrift|-
dc.type.publicationarticle-
dc.type.versionacceptedVersion-

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Osuva_Jung_Salminen_Jansen_2022.pdf
Size:
753.64 KB
Format:
Adobe Portable Document Format
Description:
Artikkeli

Kokoelmat