Uncertain version control in open collaborative editing of tree-structured documents

M. Lamine Ba, Talel Abdessalem, Pierre Senellart

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In order to ease content enrichment, exchange, and sharing, web-scale collaborative platforms such as Wikipedia or Google Docs enable unbounded interactions between a large number of contributors, without prior knowledge of their level of expertise and reliability. Version control is then essential for keeping track of the evolution of the shared content and its provenance. In such environments, uncertainty is ubiquitous due to the unreliability of the sources, the incompleteness and imprecision of the contributions, the possibility of malicious editing and vandalism acts, etc. To handle this uncertainty, we use a probabilistic XML model as a basic component of our version control framework. Each version of a shared document is represented by an XML tree and the whole document, together with its different versions, is modeled as a probabilistic XML document. Uncertainty is evaluated using the probabilistic model and the reliability measure associated to each source, each contributor, or each editing event, resulting in an uncertainty measure on each version and each part of the document. We show that standard version control operations can be implemented directly as operations on the probabilistic XML model; efficiency with respect to deterministic version control systems is demonstrated on real-world datasets.

Original languageEnglish
Title of host publicationDocEng 2013 - Proceedings of the 2013 ACM Symposium on Document Engineering
PublisherAssociation for Computing Machinery
Pages27-36
Number of pages10
ISBN (Print)9781450317894
DOIs
Publication statusPublished - 1 Jan 2013
Externally publishedYes
Event2013 ACM Symposium on Document Engineering, DocEng 2013 - Florence, Italy
Duration: 10 Sept 201313 Sept 2013

Publication series

NameDocEng 2013 - Proceedings of the 2013 ACM Symposium on Document Engineering

Conference

Conference2013 ACM Symposium on Document Engineering, DocEng 2013
Country/TerritoryItaly
CityFlorence
Period10/09/1313/09/13

Keywords

  • collaborative work
  • uncertain data
  • version control
  • xml

Fingerprint

Dive into the research topics of 'Uncertain version control in open collaborative editing of tree-structured documents'. Together they form a unique fingerprint.

Cite this