Résumé
Big data processing at the production scale presents a highly complex environment for resource optimization (RO), a problem crucial for meeting performance goals and budgetary constraints of analytical users. The RO problem is challenging because it involves a set of decisions (the partition count, placement of parallel instances on machines, and resource allocation to each instance), requires multi-objective optimization (MOO), and is compounded by the scale and complexity of big data systems while having to meet stringent time constraints for scheduling. This paper presents a MaxCompute based integrated system to support multi-objective resource optimization via fine-grained instance-level modeling and optimization. We propose a new architecture that breaks RO into a series of simpler problems, new fine-grained predictive models, and novel optimization methods that exploit these models to make effective instance-level RO decisions well under a second. Evaluation using production workloads shows that our new RO system could reduce 37-72% latency and 43-78% cost at the same time, compared to the current optimizer and scheduler, while running in 0.02-0.23s.
| langue originale | Anglais |
|---|---|
| Pages (de - à) | 3098-3111 |
| Nombre de pages | 14 |
| journal | Proceedings of the VLDB Endowment |
| Volume | 15 |
| Numéro de publication | 11 |
| Les DOIs | |
| état | Publié - 1 janv. 2022 |
| Modification externe | Oui |
| Evénement | 48th International Conference on Very Large Data Bases, VLDB 2022 - Sydney, Australie Durée: 5 sept. 2022 → 9 sept. 2022 |
Empreinte digitale
Examiner les sujets de recherche de « Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver