WebContent: Efficient P2P warehousing of web data

  • S. Abiteboul
  • T. Allard
  • P. Chatalic
  • G. Gardarin
  • A. Ghitescu
  • F. Goasdoué
  • I. Manolescu
  • B. Nguyen
  • M. Ouazara
  • A. Somani
  • N. Travers
  • G. Vasile
  • S. Zoupanos

Research output: Contribution to journal › Article › peer-review

Abstract

We present the WebContent platform for managing distributed repositories of XML and semantic Web data. The platform integrates various data processing building blocks (crawling, translation, semantic annotation, full-text search, structured XML querying, and semantic querying), exposed as Web services, into a large-scale, efficient platform. Calls to these services are combined inside ActiveXML [8] documents, which are XML documents that include service calls. An ActiveXML optimizer is used to: (i) efficiently distribute computations among sites; (ii) perform XQuery-specific optimizations by leveraging an algebraic XQuery optimizer; and (iii) given an XML query, choose, among several distributed indices, the most appropriate one for answering the query.

Original language: English
Pages (from-to): 1428-1431
Number of pages: 4
Journal: Proceedings of the VLDB Endowment
Volume: 1
Issue number: 2
Publication status: Published - 1 Jan 2008
Externally published: Yes
