Passer à la navigation principale Passer à la recherche Passer au contenu principal

Language-Agnostic Method for Sentiment Analysis of Twitter

  • Amir Reza Jafari
  • , Reza Farahbakhsh
  • , Mostafa Salehi
  • , Noel Crespi
  • Telecom Sudparis
  • University of Tehran

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

With the different events and crises that we are witnessing these days, Twitter plays an essential role in sharing thoughts, opinions, and news worldwide in various languages. Understanding the sentiment of user-generated content has garnered much interest in both industrial and academic communities in recent studies. Due to the limited availability of data from low-resource languages, the focus on multilingual resources is a limiting and challenging issue of sentiment analysis task. Considering the importance of pre-processing in the implementation of a sentiment analysis system, we propose a method consisting of two steps for the pre-processing of tweets in different languages i) a language-agnostic step to replace or remove some elements in the Twitter data structure and ii) a text-normalization step based on the main high-resource language. In addition, we used machine translation techniques to translate low-resource language texts into the main language. We evaluated sentiment classification approaches based on four deep models: an RNN model and three BERT-based architectures, namely vanilla-version, a language-specific, and a large-scale pre-trained model for Twitter. The results show that our method had better accuracy when using a large-scale BERT-based pre-trained model.

langue originaleAnglais
titreProceedings of Data Analytics and Management - ICDAM 2023
rédacteurs en chefAbhishek Swaroop, Zdzislaw Polkowski, Sérgio Duarte Correia, Bal Virdee
EditeurSpringer Science and Business Media Deutschland GmbH
Pages597-606
Nombre de pages10
ISBN (imprimé)9789819965465
Les DOIs
étatPublié - 1 janv. 2024
EvénementInternational Conference on Data Analytics and Management, ICDAM 2023 - Jelenia Gora, Pologne
Durée: 23 juin 202324 juin 2023

Série de publications

NomLecture Notes in Networks and Systems
Volume786
ISSN (imprimé)2367-3370
ISSN (Electronique)2367-3389

Une conférence

Une conférenceInternational Conference on Data Analytics and Management, ICDAM 2023
Pays/TerritoirePologne
La villeJelenia Gora
période23/06/2324/06/23

Empreinte digitale

Examiner les sujets de recherche de « Language-Agnostic Method for Sentiment Analysis of Twitter ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation