TY - GEN
T1 - Q-learning for distributed routing in LEO satellite constellations
AU - Soret, Beatriz
AU - Leyva-Mayorga, Israel
AU - Lozano-Cuadra, Federico
AU - Thorsager, Mathias D.
PY - 2024/8/15
Y1 - 2024/8/15
N2 - End-to-end routing in Low Earth Orbit (LEO) Satellite Constellations (LSatCs) is a complex and dynamic problem. The topology, of finite size, is dynamic and predictable, the traffic from/to Earth and transiting the space segment is highly imbalanced, and the delay is dominated by the propagation time in non-congested routes and by the queueing time at Inter-Satellite Links (ISLs) in congested routes. Traditional routing algorithms depend on excessive communication with ground or other satellites, and oversimplify the characterization of the path links towards the destination. We model the problem as a multi-agent Partially Observable Markov Decision Problem (POMDP) where the nodes (i.e., the satellites) interact only with nearby nodes. We propose a distributed Q-learning solution that leverages the knowledge of the neighbours and the correlation of the routing decisions of each node. We compare our results to two centralized algorithms based on the shortest path: one aiming at using the highest data rate links and a second genie algorithm that assumes that the instantaneous queueing delays are available at all satellites. The results of our proposal are positive on every front: (1) it experiences delays that are comparable to the benchmarks in steady-state conditions; (2) it increases the supported traffic load without congestion; and (3) it can be easily implemented in a LSatC as it does not depend on the ground segment and minimizes the signaling overhead among satellites.
AB - End-to-end routing in Low Earth Orbit (LEO) Satellite Constellations (LSatCs) is a complex and dynamic problem. The topology, of finite size, is dynamic and predictable, the traffic from/to Earth and transiting the space segment is highly imbalanced, and the delay is dominated by the propagation time in non-congested routes and by the queueing time at Inter-Satellite Links (ISLs) in congested routes. Traditional routing algorithms depend on excessive communication with ground or other satellites, and oversimplify the characterization of the path links towards the destination. We model the problem as a multi-agent Partially Observable Markov Decision Problem (POMDP) where the nodes (i.e., the satellites) interact only with nearby nodes. We propose a distributed Q-learning solution that leverages the knowledge of the neighbours and the correlation of the routing decisions of each node. We compare our results to two centralized algorithms based on the shortest path: one aiming at using the highest data rate links and a second genie algorithm that assumes that the instantaneous queueing delays are available at all satellites. The results of our proposal are positive on every front: (1) it experiences delays that are comparable to the benchmarks in steady-state conditions; (2) it increases the supported traffic load without congestion; and (3) it can be easily implemented in a LSatC as it does not depend on the ground segment and minimizes the signaling overhead among satellites.
KW - Satellite constellations
KW - Satellites
KW - Q-learning
KW - Correlation
KW - Low earth orbit satellites
KW - Telecommunication traffic
KW - Benchmark testing
UR - http://www.scopus.com/inward/record.url?scp=85198145709&partnerID=8YFLogxK
U2 - 10.1109/ICMLCN59089.2024.10624807
DO - 10.1109/ICMLCN59089.2024.10624807
M3 - Article in proceeding
SN - 979-8-3503-4320-5
T3 - 2024 IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2024
SP - 208
EP - 213
BT - 2024 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN)
PB - IEEE (Institute of Electrical and Electronics Engineers)
T2 - 2024 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN)
Y2 - 5 May 2024 through 8 May 2024
ER -