Scalability issues in designing and implementing semantic provenance management systems

Mohamed Amin Sakka, Bruno Defude

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Provenance is key metadata for assessing the trustworthiness of electronic documents. Most applications that exchange and process documents on the web or in the cloud are becoming provenance-aware, and they produce provenance data that is heterogeneous, decentralized and not interoperable. A new type of system is emerging, called a provenance management system (PMS). These systems offer a unified way to model, collect and query provenance data from various applications. This work presents such a system based on semantic web technologies and focuses on scalability issues. Modern infrastructures such as the cloud can produce huge volumes of provenance data, which makes scalability a major issue. We describe an implementation of our PMS based on a NoSQL DBMS coupled with the map-reduce parallel model, and present experiments illustrating that it scales linearly with the size of the processed logs.

Original language: English
Title of host publication: Data Management in Cloud, Grid and P2P Systems - 5th International Conference, Globe 2012, Proceedings
Pages: 49-61
Number of pages: 13
Publication status: Published - 24 Sept 2012
Event: 5th International Conference on Data Management in Cloud, Grid, and P2P Systems, Globe 2012 - Vienna, Austria
Duration: 5 Sept 2012 to 6 Sept 2012

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7450 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 5th International Conference on Data Management in Cloud, Grid, and P2P Systems, Globe 2012
Country/Territory: Austria
City: Vienna
Period: 5/09/12 to 6/09/12
