Adaptive Routing with Guaranteed Delay Bounds using Safe Reinforcement Learning

Gautham Nayak Seetanadi, Martina Maggio, Karl-Erik Årzén

Research output: Chapter in Book/Report/Conference proceeding > Preface to conference proceeding

Abstract

Time-critical networks require strict delay bounds on the transmission time of packets from source to destination. Routes for transmissions are usually statically determined, using knowledge about worst-case transmission times between nodes. This method is generally conservative: it guarantees transmission times but does not optimize for the typical case. In real networks, the typical delays vary from those considered during static route planning. The challenge in such a scenario is to minimize the total delay from a source to a destination node while adhering to the timing constraints. For known typical and worst-case delays, an algorithm was presented to (statically) determine the policy, in terms of edge choices, to be followed during packet transmission.

In this paper, we relax the assumption of knowing the typical delay and assume that only worst-case bounds are available. We present a reinforcement learning solution to obtain optimal routing paths from a source to a destination when the typical transmission time is stochastic and unknown. Our reinforcement learning policy is based on observing the state space during each packet transmission and on adapting the routing of future packets to congestion and unpredictable circumstances in the network. We ensure that our policy only makes safe routing decisions, thus never violating predetermined timing constraints. We conduct experiments to evaluate the routing in a congested network and in a network where the typical delays have a large variance. Finally, we analyze the application of the algorithm to large randomly generated networks.
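
The idea of restricting a learning policy to safe routing decisions can be illustrated with a minimal sketch. This is not the authors' algorithm; it is one possible reading of the abstract under stated assumptions: the network is a dictionary `edges` mapping `(u, v)` to a worst-case delay, observed delays are positive and never exceed that worst case, and the learner is plain epsilon-greedy Q-learning keyed on edges only. All names here (`worst_case_to_dest`, `safe_edges`, `route_packet`, `sample_delay`) are hypothetical.

```python
import heapq
import random


def worst_case_to_dest(edges, dest):
    """Smallest worst-case delay from every node to dest (Dijkstra on the reversed graph)."""
    rev = {}
    for (u, v), wc in edges.items():
        rev.setdefault(v, []).append((u, wc))
    dist = {dest: 0}
    heap = [(0, dest)]
    while heap:
        d, v = heapq.heappop(heap)
        if d > dist.get(v, float("inf")):
            continue  # stale heap entry
        for u, wc in rev.get(v, []):
            nd = d + wc
            if nd < dist.get(u, float("inf")):
                dist[u] = nd
                heapq.heappush(heap, (nd, u))
    return dist


def safe_edges(edges, wc_dist, node, budget):
    """Next hops that still meet the deadline even if every remaining edge takes its worst case."""
    return [v for (u, v), wc in edges.items()
            if u == node and v in wc_dist and wc + wc_dist[v] <= budget]


def route_packet(edges, wc_dist, q, src, dest, deadline, sample_delay,
                 alpha=0.5, eps=0.1, rng=random):
    """Route one packet, learning edge costs in q while choosing only safe edges."""
    node, budget, total = src, deadline, 0.0
    while node != dest:
        options = safe_edges(edges, wc_dist, node, budget)
        if not options:
            raise ValueError("deadline is infeasible from this node")
        if rng.random() < eps:
            nxt = rng.choice(options)  # explore, but only among safe edges
        else:
            nxt = min(options, key=lambda v: q.get((node, v), 0.0))
        delay = sample_delay(node, nxt)  # assumption: 0 < delay <= worst case
        budget -= delay
        total += delay
        # Estimated remaining delay from the next node (zero at the destination).
        nxt_options = safe_edges(edges, wc_dist, nxt, budget) if nxt != dest else []
        future = min((q.get((nxt, w), 0.0) for w in nxt_options), default=0.0)
        old = q.get((node, nxt), 0.0)
        q[(node, nxt)] = old + alpha * (delay + future - old)
        node = nxt
    return total
```

Under these assumptions, a safe edge always exists whenever the deadline is at least the worst-case distance from the source: taking a safe edge leaves a budget no smaller than the worst-case distance from the next node, so the deadline is met regardless of what the learner has or has not yet learned. Keying the Q-values on edges alone, rather than on the remaining budget as well, is a simplification made here for brevity.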
Original language: English
Title of host publication: RTNS 2020: Proceedings of the 28th International Conference on Real-Time Networks and Systems
Pages: 149–160
Publication status: Published - 2020 Jun
Event: 28th International Conference on Real-Time Networks and Systems - Paris, France
Duration: 2020 Jun 9 → 2020 Jun 11

Conference

Conference: 28th International Conference on Real-Time Networks and Systems
Abbreviated title: RTNS
Country/Territory: France
City: Paris
Period: 2020/06/09 → 2020/06/11

Subject classification (UKÄ)

  • Control Engineering
