Skip to main navigation Skip to search Skip to main content

Optimization for active learning-based interactive database exploration

  • Enhui Huang
  • , Liping Peng
  • , Luciano Di Palma
  • , Ahmed Abdelkafi
  • , Anna Liu
  • , Yanlei Diao
  • UMass Amherst

Research output: Contribution to journalConference articlepeer-review

Abstract

There is an increasing gap between fast growth of data and limited human ability to comprehend data. Consequently, there has been a growing demand of data management tools that can bridge this gap and help the user retrieve highvalue content from data more effectively. In this work, we aim to build interactive data exploration as a new database service, using an approach called “explore-by-example“. In particular, we cast the explore-by-example problem in a principled “active learning“ framework, and bring the properties of important classes of database queries to bear on the design of new algorithms and optimizations for active learning-based database exploration. These new techniques allow the database system to overcome a fundamental limitation of traditional active learning, i.e., the slow convergence problem. Evaluation results using real-world datasets and user interest patterns show that our new system significantly outperforms state-of-the-art active learning techniques and data exploration systems in accuracy while achieving desired efficiency for interactive performance.

Original languageEnglish
Pages (from-to)71-84
Number of pages14
JournalProceedings of the VLDB Endowment
Volume12
Issue number1
DOIs
Publication statusPublished - 1 Jan 2018
Event45th International Conference on Very Large Data Bases, VLDB 2019 - Los Angeles, United States
Duration: 26 Aug 201730 Aug 2017

Fingerprint

Dive into the research topics of 'Optimization for active learning-based interactive database exploration'. Together they form a unique fingerprint.

Cite this