MagicScaler: Uncertainty-aware, Predictive Autoscaling

Zhicheng Pan, Yihang Wang, Yingying Zhang*, Sean Bin Yang, Yunyao Cheng, Peng Chen, Chenjuan Guo, Qingsong Wen, Xiduo Tian, Yunliang Dou, Zhiqiang Zhou, Chengcheng Yang, Aoying Zhou, Bin Yang*

*Kontaktforfatter

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

1 Citationer (Scopus)
9 Downloads (Pure)

Abstract

Predictive autoscaling is a key enabler for optimizing cloud resource allocation in Alibaba Cloud’s computing platforms, which dynamically adjust the Elastic Compute Service (ECS) instances based on predicted user demands to ensure Quality of Service (QoS). However, user demands in the cloud are often highly complex, with high uncertainty and scale-sensitive temporal dependencies, thus posing great challenges for accurate prediction of future demands. These in turn make autoscaling challenging—autoscaling needs to properly account for demand uncertainty while maintaining a reasonable trade-off between two contradictory factors, i.e., low instance running costs vs. low QoS violation risks. To address the above challenges, we propose a novel predictive autoscaling framework MagicScaler, consisting of a Multi-scale attentive Gaussian process based predictor and an uncertaintyaware scaler. First, the predictor carefully bridges the best of two successful prediction methodologies—multi-scale attention mechanisms, which are good at capturing complex, multi-scale features, and stochastic process regression, which can quantify prediction uncertainty, thus achieving accurate demand prediction with quantified uncertainty. Second, the scaler takes the quantified future demand uncertainty into a judiciously designed loss function with stochastic constraints, enabling flexible trade-off between running costs and QoS violation risks. Extensive experiments on three clusters of Alibaba Cloud in different Chinese cities demonstrate the effectiveness and efficiency of MagicScaler, which outperforms other commonly adopted scalers, thus justifying our design choices.

OriginalsprogEngelsk
TidsskriftProceedings of the VLDB Endowment
Vol/bind16
Udgave nummer12
Sider (fra-til)3808-3821
Antal sider14
ISSN2150-8097
DOI
StatusUdgivet - 2023
Begivenhed49th International Conference on Very Large Data Bases, VLDB 2023 - Vancouver, Canada
Varighed: 28 aug. 20231 sep. 2023

Konference

Konference49th International Conference on Very Large Data Bases, VLDB 2023
Land/OmrådeCanada
ByVancouver
Periode28/08/202301/09/2023

Fingeraftryk

Dyk ned i forskningsemnerne om 'MagicScaler: Uncertainty-aware, Predictive Autoscaling'. Sammen danner de et unikt fingeraftryk.

Citationsformater