Capturing data uncertainty in high-volume stream processing

  • Yanlei Diao
  • , Boduo Li
  • , Anna Liu
  • , Liping Peng
  • , Charles Sutton
  • , Thanh Tran
  • , Michael Zink

Research output: Contribution to conferencePaperpeer-review

Abstract

We present the design and development of a data stream system that captures data uncertainty from data collection to query processing to final result generation. Our system focuses on data that is naturally modeled as continuous random variables such as many types of sensor data. To provide an end-to-end solution, our system employs probabilistic modeling and inference to generate uncertainty description for raw data, and then a suite of statistical techniques to capture changes of uncertainty as data propagates through query operators. To cope with high-volume streams, we explore advanced approximation techniques for both space and time efficiency. We are currently working with a group of scientists to evaluate our system using traces collected from real-world applications for hazardous weather monitoring and for object tracking and monitoring.

Original languageEnglish
Publication statusPublished - 1 Dec 2009
Externally publishedYes
Event4th Biennal Conference on Innovative Data Systems Research, CIDR 2009 - Asilomar, CA, United States
Duration: 4 Jan 20097 Jan 2009

Conference

Conference4th Biennal Conference on Innovative Data Systems Research, CIDR 2009
Country/TerritoryUnited States
CityAsilomar, CA
Period4/01/097/01/09

Fingerprint

Dive into the research topics of 'Capturing data uncertainty in high-volume stream processing'. Together they form a unique fingerprint.

Cite this