Harnessing truth discovery algorithms on the topic labelling problem

  • Ngurah Agus Sanjaya Er
  • , Talel Abdessalem
  • , Mouhamadou Lamine Ba
  • , Stéphane Bressan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Topics in topic modelling approaches are represented as a collection of weighted words. The labels for the topics, however, are not clearly defined and must be interpreted manually. Topic labelling proposes to automatically label the topics by leveraging a knowledge base or applying data mining and machine learning algorithms. We propose a naive topic labelling approach where we transform the labeling problem into selecting the best label for each word in the topic. The candidate labels are generated by querying a knowledge base using the top-N words of each topic. We construct a heterogeneous graph of topics, words, articles and candidate labels. To rank the candidate labels, we apply truth discovery algorithms on the graph. The performance evaluation using popular topic modelling datasets shows that the approach receives satisfactory accuracy.

Original languageEnglish
Title of host publication20th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2018 - Proceedings
EditorsGabriele Anderst-Kotsis, Eric Pardede, Matthias Steinbauer, Maria Indrawan-Santiago, Ivan Luiz Salvadori, Ivan Luiz Salvadori, Ismail Khalil
PublisherAssociation for Computing Machinery
Pages8-14
Number of pages7
ISBN (Electronic)9781450364799
DOIs
Publication statusPublished - 19 Nov 2018
Externally publishedYes
Event20th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2018 - Yogyakarta, Indonesia
Duration: 19 Nov 201821 Nov 2018

Publication series

NameACM International Conference Proceeding Series

Conference

Conference20th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2018
Country/TerritoryIndonesia
CityYogyakarta
Period19/11/1821/11/18

Keywords

  • Evaluation
  • Ranking
  • Topic labelling
  • Truth discovery
  • Truth finding

Fingerprint

Dive into the research topics of 'Harnessing truth discovery algorithms on the topic labelling problem'. Together they form a unique fingerprint.

Cite this