RDF data management in the Amazon cloud

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Cloud computing has been massively adopted recently in many applications for its elastic scaling and fault-tolerance. At the same time, given that the amount of available RDF data sources on the Web increases rapidly, there is a constant need for scalable RDF data management tools. In this paper we propose a novel architecture for the distributed management of RDF data, exploiting an existing commercial cloud infrastructure, namely Amazon Web Services (AWS). We study the problem of indexing RDF data stored within AWS, by using SimpleDB, a key-value store provided by AWS for small data items. The goal of the index is to efficiently identify the RDF datasets which may have answers for a given query, and route the query only to those. We devised and experimented with several indexing strategies; we discuss experimental results and avenues for future work.

Original languageEnglish
Title of host publicationProceedings - Joint EDBT/ICDT Workshops 2012
Pages61-72
Number of pages12
DOIs
Publication statusPublished - 27 Jul 2012
Externally publishedYes
EventJoint EDBT/ICDT Workshops 2012 - Berlin, Germany
Duration: 30 Mar 201230 Mar 2012

Publication series

NameACM International Conference Proceeding Series

Conference

ConferenceJoint EDBT/ICDT Workshops 2012
Country/TerritoryGermany
CityBerlin
Period30/03/1230/03/12

Fingerprint

Dive into the research topics of 'RDF data management in the Amazon cloud'. Together they form a unique fingerprint.

Cite this