
UDAO: A next-generation unified data analytics optimizer

  • Khaled Zaouk
  • Fei Song
  • Chenghao Lyu
  • Arnab Sinha
  • Yanlei Diao
  • Prashant Shenoy

Research output: Contribution to journal › Conference article › peer-review

Abstract

Big data analytics systems today still lack the ability to take user performance goals and budgetary constraints, collectively referred to as "objectives", and automatically configure an analytic job to achieve the objectives. This paper presents UDAO, a unified data analytics optimizer that can automatically determine the parameters of the runtime system, collectively called a job configuration, for general dataflow programs based on user objectives. UDAO embodies key techniques including in-situ modeling, which learns a model for each user objective in the same computing environment as the job is run, and multi-objective optimization, which computes a Pareto optimal set of job configurations to reveal tradeoffs between different objectives. Using benchmarks developed based on industry needs, our demonstration will allow the user to explore (1) learned models to gain insights into how various parameters affect user objectives; (2) Pareto frontiers to understand interesting tradeoffs between different objectives and how a configuration recommended by the optimizer explores these tradeoffs; (3) end-to-end benefits that UDAO can provide over default configurations or those manually tuned by engineers.
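
To make the notion of a Pareto optimal set of job configurations concrete, the following is a minimal sketch and not UDAO's actual algorithm. It assumes two hypothetical objectives to be minimized (latency and cost) and a handful of made-up candidate configurations; the names `Config`, `dominates`, and `pareto_set` are illustrative only. A configuration is kept if no other candidate is at least as good on every objective and strictly better on at least one.

```python
from typing import List, NamedTuple


class Config(NamedTuple):
    """A hypothetical job configuration with its predicted objective values."""
    name: str
    latency_s: float  # assumed objective 1: lower is better
    cost_usd: float   # assumed objective 2: lower is better


def dominates(a: Config, b: Config) -> bool:
    """True if a is no worse than b on both objectives and strictly better on one."""
    no_worse = a.latency_s <= b.latency_s and a.cost_usd <= b.cost_usd
    strictly_better = a.latency_s < b.latency_s or a.cost_usd < b.cost_usd
    return no_worse and strictly_better


def pareto_set(configs: List[Config]) -> List[Config]:
    """Brute-force filter: keep every configuration that no other one dominates."""
    return [c for c in configs if not any(dominates(o, c) for o in configs)]


# Made-up candidates purely for illustration.
candidates = [
    Config("A", 120.0, 4.0),
    Config("B", 90.0, 6.0),
    Config("C", 95.0, 7.0),   # dominated by B (slower and more expensive)
    Config("D", 60.0, 10.0),
    Config("E", 130.0, 4.5),  # dominated by A (slower and more expensive)
]

frontier = pareto_set(candidates)
print([c.name for c in frontier])  # ['A', 'B', 'D']
```

Each surviving configuration represents a different tradeoff: A is cheapest, D is fastest, and B sits between them. The pairwise filter is quadratic in the number of candidates, which is acceptable for a small illustrative set.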

Original language: English
Pages (from-to): 1934-1937
Number of pages: 4
Journal: Proceedings of the VLDB Endowment
Volume: 12
Issue number: 12
Publication status: Published - 1 Jan 2018
Event: 45th International Conference on Very Large Data Bases, VLDB 2019 - Los Angeles, United States
Duration: 26 Aug 2017 to 30 Aug 2017
