Passer à la navigation principale Passer à la recherche Passer au contenu principal

Reuse-based optimization for pig Latin

  • Hortonworks
  • Université Paris Dauphine
  • University of Suttgart
  • Fractal Analytics

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Pig Latin is a popular language which is widely used for parallel processing of massive data sets. Currently, subexpressions occurring repeatedly in Pig Latin scripts are executed as many times as they appear, and the current Pig Latin optimizer does not identify reuse opportunities. We present a novel optimization approach aiming at identifying and reusing repeated subexpressions in Pig Latin scripts. Our optimization algorithm, named PigReuse, identifies subexpression merging opportunities, selects the best ones to execute based on a cost function, and reuses their results as needed in order to compute exactly the same output as the original scripts. Our experiments demonstrate the effectiveness of our approach.

langue originaleAnglais
titreCIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management
EditeurAssociation for Computing Machinery
Pages2215-2220
Nombre de pages6
ISBN (Electronique)9781450340731
Les DOIs
étatPublié - 24 oct. 2016
Evénement25th ACM International Conference on Information and Knowledge Management, CIKM 2016 - Indianapolis, États-Unis
Durée: 24 oct. 201628 oct. 2016

Série de publications

NomInternational Conference on Information and Knowledge Management, Proceedings
Volume24-28-October-2016

Une conférence

Une conférence25th ACM International Conference on Information and Knowledge Management, CIKM 2016
Pays/TerritoireÉtats-Unis
La villeIndianapolis
période24/10/1628/10/16

Empreinte digitale

Examiner les sujets de recherche de « Reuse-based optimization for pig Latin ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation