Knowledge harvesting in the big-data era

Fabian Suchanek, Gerhard Weikum

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The proliferation of knowledge-sharing Communities such as Wiki-pedia and the progress in scalable information extraction from Web and text sources have enabled the automatic construction of very large knowledge bases. Endeavors of this kind include projects such as DBpedia, Freebase, KnowItAll, ReadTheWeb, and YAGO. These projects provide automatically constructed knowledge bases of facts about named entities, their semantic classes, and their mutual relationships. They contain millions of entities and hundreds of millions of facts about them. Such world knowledge in turn enables cognitive applications and knowledge-centric services like disam-biguating natural-language text, semantic search for entities and relations in Web and enterprise data, and entity-oriented analytics over unstructured contents. Prominent examples of how knowledge bases can be harnessed include the Google Knowledge Graph and the IBM Watson question answering system. This tutorial presents state-of-the-art methods, recent advances, research opportunities, and open challenges along this avenue of knowledge harvesting and its applications. Particular emphasis will be on the twofold role of knowledge bases for big-data analytics: using scalable distributed algorithms for harvesting knowledge from Web and text sources, and leveraging entity-centric knowledge for deeper interpretation of and better intelligence with Big Data.

Original languageEnglish
Title of host publicationSIGMOD 2013 - International Conference on Management of Data
Pages933-937
Number of pages5
DOIs
Publication statusPublished - 29 Jul 2013
Externally publishedYes
Event2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013 - New York, NY, United States
Duration: 22 Jun 201327 Jun 2013

Publication series

NameProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print)0730-8078

Conference

Conference2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013
Country/TerritoryUnited States
CityNew York, NY
Period22/06/1327/06/13

Keywords

  • Big Data
  • Entity Recognition
  • Information Extraction
  • Knowledge Base
  • Ontology
  • Web Contents

Fingerprint

Dive into the research topics of 'Knowledge harvesting in the big-data era'. Together they form a unique fingerprint.

Cite this