Adaptive Routing with Guaranteed Delay Bounds using Safe Reinforcement Learning

Forskningsoutput: Kapitel i bok/rapport/Conference proceedingFörord till konferenspublikationForskning

110 Nedladdningar (Pure)

Sammanfattning

Time-critical networks require strict delay bounds on the transmission time of packets from source to destination. Routes for transmissions are usually statically determined, using knowledge about worst-case transmission times between nodes. This is generally a conservative method, that guarantees transmission times but does not provide any optimization for the typical case. In real networks, the typical delays vary from those considered during static route planning. The challenge in such a scenario is to minimize the total delay from a source to a destination node, while adhering to the timing constraints. For known typical and worst-case delays, an algorithm was presented to (statically) determine the policy to be followed during the packet transmission in terms of edge choices.

In this paper we relax the assumption of knowing the typical delay, and we assume only worst-case bounds are available. We present a reinforcement learning solution to obtain optimal routing paths from a source to a destination when the typical transmission time is stochastic and unknown. Our reinforcement learning policy is based on the observation of the state-space during each packet transmission and on adaptation for future packets to congestion and unpredictable circumstances in the network. We ensure that our policy only makes safe routing decisions, thus never violating pre-determined timing constraints. We conduct experiments to evaluate the routing in a congested network and in a network where the typical delays have a large variance. Finally, we analyze the application of the algorithm to large randomly generated networks.
Originalspråkengelska
Titel på gästpublikationRTNS 2020: Proceedings of the 28th International Conference on Real-Time Networks and Systems
Sidor149–160
DOI
StatusPublished - 2020 jun
Evenemang28th International Conference on Real-Time Networks and Systems - Paris, Frankrike
Varaktighet: 2020 jun 92020 jun 11

Konferens

Konferens28th International Conference on Real-Time Networks and Systems
Förkortad titelRTNS
Land/TerritoriumFrankrike
OrtParis
Period2020/06/092020/06/11

Ämnesklassifikation (UKÄ)

  • Reglerteknik

Fingeravtryck

Utforska forskningsämnen för ”Adaptive Routing with Guaranteed Delay Bounds using Safe Reinforcement Learning”. Tillsammans bildar de ett unikt fingeravtryck.

Citera det här