TY - GEN
T1 - Forecasting Algorithms for Intelligent Resource Scaling
T2 - 15th Annual ACM Symposium on Cloud Computing, SoCC 2024
AU - Diao, Yanlei
AU - Horn, Dominik
AU - Kipf, Andreas
AU - Shchur, Oleksandr
AU - Benito, Ines
AU - Dong, Wenjian
AU - Pagano, Davide
AU - Pfeil, Pascal
AU - Nathan, Vikram
AU - Narayanaswamy, Balakrishnan
AU - Kraska, Tim
N1 - Publisher Copyright:
© 2024 Owner/Author.
PY - 2024/11/20
Y1 - 2024/11/20
N2 - There has been a growing demand for making modern cloud-based data analytics systems cost-effective and easy to use. AI-powered intelligent resource scaling is one such effort, aiming at automating scaling decisions for serverless offerings like Amazon Redshift Serverless. The foundation of intelligent resource scaling lies in the ability to forecast query workloads and their resource consumption accurately. Although the forecasting problem has been extensively studied across various domains, there is a lack of thorough analysis of existing forecasting algorithms for large-scale, real-world cloud query workloads. This paper fills this gap by providing an in-depth analysis of forecasting algorithms for real-world cloud workloads, covering the fundamental data characteristics that distinguish query workload forecasting from prior problems and evaluating the strengths and limitations of existing algorithms in this new domain. We anticipate that our findings will provide valuable insights in informing the design of an efficient and effective solution for production use, as well as in steering the forecasting community toward more effective algorithms of high real-world impact.
AB - There has been a growing demand for making modern cloud-based data analytics systems cost-effective and easy to use. AI-powered intelligent resource scaling is one such effort, aiming at automating scaling decisions for serverless offerings like Amazon Redshift Serverless. The foundation of intelligent resource scaling lies in the ability to forecast query workloads and their resource consumption accurately. Although the forecasting problem has been extensively studied across various domains, there is a lack of thorough analysis of existing forecasting algorithms for large-scale, real-world cloud query workloads. This paper fills this gap by providing an in-depth analysis of forecasting algorithms for real-world cloud workloads, covering the fundamental data characteristics that distinguish query workload forecasting from prior problems and evaluating the strengths and limitations of existing algorithms in this new domain. We anticipate that our findings will provide valuable insights in informing the design of an efficient and effective solution for production use, as well as in steering the forecasting community toward more effective algorithms of high real-world impact.
UR - https://www.scopus.com/pages/publications/85215529201
U2 - 10.1145/3698038.3698564
DO - 10.1145/3698038.3698564
M3 - Conference contribution
AN - SCOPUS:85215529201
T3 - SoCC 2024 - Proceedings of the 2024 ACM Symposium on Cloud Computing
SP - 126
EP - 143
BT - SoCC 2024 - Proceedings of the 2024 ACM Symposium on Cloud Computing
PB - Association for Computing Machinery, Inc
Y2 - 20 November 2024 through 22 November 2024
ER -