Multilingual Fake News Detection: A Study on Various Models and Training Scenarios

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Amidst the surge in global online news consumption, tackling the escalating challenge of fake news requires a multilingual approach. While extensive research has explored fake news detection from various perspectives, a notable gap persists—the majority of studies concentrate on the English language. This highlights, the need for more research focusing on other languages, especially considering the scarcity of available non-English fake news datasets, particularly in low-resource settings. Focused on mBERT, XLM-RoBERTa, and LASER embeddings, this study addresses three key questions. Firstly, it evaluates the efficacy of several multilingual models across languages, highlighting the robust performance of mBERT and XLM-RoBERTa. Secondly, it examines the impact of multilingual and cross-lingual training data, demonstrating the effectiveness of multilingual training, including its potential in zero-shot and transfer learning scenarios. Thirdly, it compares multilingual models with translation-based strategies, revealing the superior performance of the former in multilingual fake news detection. Leveraging two datasets encompassing news in English, Spanish, French, Portuguese, Italian, Hindi, Indonesian, Swahili, and Vietnamese, our research underscores the effectiveness of multilingual approaches offering valuable insights for future research to combat the global problem of fake news more effectively.

Original languageEnglish
Title of host publicationIntelligent Systems and Applications - Proceedings of the 2024 Intelligent Systems Conference IntelliSys Volume 2
EditorsKohei Arai
PublisherSpringer Science and Business Media Deutschland GmbH
Pages73-89
Number of pages17
ISBN (Print)9783031664274
DOIs
Publication statusPublished - 1 Jan 2024
EventIntelligent Systems Conference, IntelliSys 2024 - Amsterdam, Netherlands
Duration: 5 Sept 20246 Sept 2024

Publication series

NameLecture Notes in Networks and Systems
Volume1066 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

ConferenceIntelligent Systems Conference, IntelliSys 2024
Country/TerritoryNetherlands
CityAmsterdam
Period5/09/246/09/24

Keywords

  • Cross-lingual
  • Fake news detection
  • Low-resource
  • Multilingual
  • Transfer-learning
  • Zero-shot

Fingerprint

Dive into the research topics of 'Multilingual Fake News Detection: A Study on Various Models and Training Scenarios'. Together they form a unique fingerprint.

Cite this