Factored Reinforcement Learning for Auto-scaling in Tandem Queues

  • Thomas Tournaire
  • , Yue Jin
  • , Armen Aghasaryan
  • , Hind Castel-Taleb
  • , Emmanuel Hyon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As today's networking systems utilise more virtual-isation, efficient auto-scaling of resources becomes increasingly critical for controlling both the performance and energy consumption. In this paper, we study the techniques to learn the optimal auto-scaling policies in a distributed network when parts of the system dynamics are unknown. Reinforcement Learning methods have been applied to solve auto-scaling problems. However they can run into computational and convergence issues as the problem scale grows. On the other hand, distributed networks have relational structures with local dependencies between physical and virtual resources. We can exploit these structures to overcome the convergence issues by using a factored representation of the system.We consider a distributed network in the form of a tandem queue composed of two nodes. The objective of the auto-scaling problem is to find policies that have a good trade-off between quality of service (QoS) and operating costs. We develop a factored Reinforcement Learning algorithm, named FMDP online, to find the optimal auto-scaling policies. We evaluate our algorithm with a simulated environment. We compare it with existing Reinforcement Learning methods and show its relevance in terms of policy efficiency and convergence speed.

Original languageEnglish
Title of host publicationProceedings of the IEEE/IFIP Network Operations and Management Symposium 2022
Subtitle of host publicationNetwork and Service Management in the Era of Cloudification, Softwarization and Artificial Intelligence, NOMS 2022
EditorsPal Varga, Lisandro Zambenedetti Granville, Alex Galis, Istvan Godor, Noura Limam, Prosper Chemouil, Jerome Francois, Marc-Oliver Pahl
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665406017
DOIs
Publication statusPublished - 1 Jan 2022
Externally publishedYes
Event2022 IEEE/IFIP Network Operations and Management Symposium, NOMS 2022 - Budapest, Hungary
Duration: 25 Apr 202229 Apr 2022

Publication series

NameProceedings of the IEEE/IFIP Network Operations and Management Symposium 2022: Network and Service Management in the Era of Cloudification, Softwarization and Artificial Intelligence, NOMS 2022

Conference

Conference2022 IEEE/IFIP Network Operations and Management Symposium, NOMS 2022
Country/TerritoryHungary
CityBudapest
Period25/04/2229/04/22

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Auto-scaling
  • Factored MDP
  • N tier architecture
  • Queuing Systems
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'Factored Reinforcement Learning for Auto-scaling in Tandem Queues'. Together they form a unique fingerprint.

Cite this