Statistical Claim Checking: StatCheck in Action

  • Oana Balalau
  • Simon Ebel
  • Théo Galizzi
  • Ioana Manolescu
  • Quentin Massonnat
  • Antoine Deiana
  • Emilie Gautreau
  • Antoine Krempf
  • Thomas Pontillon
  • Gérald Roux
  • Joanna Yakin

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

To strengthen public trust and counter disinformation, computational fact-checking, which leverages digital data sources, is attracting interest from journalists and the computer science community. A particularly interesting class of data sources is statistics, that is, numerical data compiled mostly by governments, administrations, and international organizations. Statistics are typically multidimensional datasets, in which multiple dimensions characterize one value and the dimensions may be organized in a hierarchy. We developed StatCheck, a fact-checking system specialized in French. The technical novelty of StatCheck is twofold: (i) we focus on multidimensional, complex-structure statistics, which have received little attention so far despite their practical importance; and (ii) we introduce novel statistical claim extraction modules for French, a language for which few resources exist. We will demonstrate our system on large statistical datasets (hundreds of millions of facts), including the complete INSEE (French) and Eurostat (European Union) datasets. More information about StatCheck is available online at: https://team.inria.fr/cedar/projects/statcheck/.
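
The notion of a multidimensional statistic with hierarchical dimensions can be illustrated with a minimal Python sketch. This is not StatCheck's data model: the `Fact` class, the `GEO_PARENT` hierarchy, the `rollup` helper, and all figures are hypothetical and invented for illustration, and do not come from INSEE or Eurostat.

```python
from dataclasses import dataclass

# Hypothetical child -> parent hierarchy for a geographic dimension.
GEO_PARENT = {
    "Paris": "Île-de-France",
    "Lyon": "Auvergne-Rhône-Alpes",
    "Île-de-France": "France",
    "Auvergne-Rhône-Alpes": "France",
}


@dataclass
class Fact:
    """One statistical fact: several dimensions characterize one value."""
    dims: dict    # e.g. {"geo": "Paris", "year": 2020}
    value: float


def ancestors(member, parent):
    """Return the member followed by all of its ancestors."""
    chain = [member]
    while chain[-1] in parent:
        chain.append(parent[chain[-1]])
    return chain


def rollup(facts, geo, year):
    """Sum the facts for `year` located at `geo` or anywhere below it."""
    return sum(
        f.value
        for f in facts
        if f.dims["year"] == year and geo in ancestors(f.dims["geo"], GEO_PARENT)
    )


# Invented figures, for illustration only.
FACTS = [
    Fact({"geo": "Paris", "year": 2020}, 10.0),
    Fact({"geo": "Lyon", "year": 2020}, 5.0),
    Fact({"geo": "Paris", "year": 2021}, 11.0),
]

print(rollup(FACTS, "France", 2020))          # 15.0
print(rollup(FACTS, "Île-de-France", 2020))   # 10.0
```

In this toy setting, checking a claim about "France in 2020" requires aggregating facts stored at a finer level of the hierarchy, which is one reason such statistics are harder to query than flat tables.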

Original language: English
Title of host publication: CIKM 2022 - Proceedings of the 31st ACM International Conference on Information and Knowledge Management
Publisher: Association for Computing Machinery
Pages: 4798-4802
Number of pages: 5
ISBN (Electronic): 9781450392365
DOIs
Publication status: Published - 17 Oct 2022
Event: 31st ACM International Conference on Information and Knowledge Management, CIKM 2022 - Atlanta, United States
Duration: 17 Oct 2022 to 21 Oct 2022

Publication series

Name: International Conference on Information and Knowledge Management, Proceedings
ISSN (Print): 2155-0751

Conference

Conference: 31st ACM International Conference on Information and Knowledge Management, CIKM 2022
Country/Territory: United States
City: Atlanta
Period: 17/10/22 to 21/10/22

Keywords

  • data warehouses
  • fact-checking
  • multidimensional data
  • natural language processing
