A Proactive Cloud Application Auto-Scaler using Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceedingPaper in conference proceedingpeer-review

78 Downloads (Pure)

Abstract

This work explores the use of reinforcement learning to design a proactive cloud resource auto-scaler that is able to predict usage across a distributed microservice application. The focus is on serving time-sensitive workloads, e.g., industrial automation, connected XR/VR (eXtended Reality/Virtual Reality), etc., where each job has a deadline and there is some cost associated with missing a deadline. A simple workload model, as well as a microservice application model, is presented. A reinforcement learning agent is trained to identify workloads and predict needed utilization for identified service chains. The results are compared to standard purely reactive techniques.
Original languageEnglish
Title of host publicationUCC '22: Proceedings of the 15th IEEE/ACM International Conference on Utility and Cloud Computing
Pages213-220
Number of pages8
ISBN (Electronic)9781665460873
DOIs
Publication statusPublished - 2022
Event15th ACM-IEEE International Conference on Formal Methods and Models for System Design - New York, United States
Duration: 2017 Sept 292017 Oct 2
Conference number: 15th

Conference

Conference15th ACM-IEEE International Conference on Formal Methods and Models for System Design
Country/TerritoryUnited States
CityNew York
Period2017/09/292017/10/02

Subject classification (UKÄ)

  • Control Engineering

Fingerprint

Dive into the research topics of 'A Proactive Cloud Application Auto-Scaler using Reinforcement Learning'. Together they form a unique fingerprint.

Cite this