The fifth generation (5G) radio access technology is designed to support highly delay-sensitive applications, i.e., ultra-reliable and low-latency communications (URLLC). For dynamic time division duplex (TDD) systems, real-time optimization of the radio pattern selection becomes of vital significance for achieving a decent URLLC outage latency. In this study, a dual reinforcement machine learning (RML) approach is developed for online pattern optimization in 5G new radio TDD deployments. The proposed solution seeks to minimize the maximum URLLC tail latency, i.e., a min-max problem, by introducing nested RML instances. Directional and real-time traffic statistics are monitored and fed to the primary RML layer, which estimates the sufficient number of downlink (DL) and uplink (UL) symbols across the upcoming radio pattern. The secondary RML sub-networks then determine the DL and UL symbol structure that best minimizes the URLLC outage latency. The proposed solution is evaluated by extensive and highly detailed system-level simulations. Our results demonstrate a considerable URLLC outage latency improvement with the proposed scheme, compared to state-of-the-art dynamic-TDD proposals.
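The nested structure described above, a primary learner selecting the DL/UL symbol split of the pattern and a secondary learner selecting the symbol arrangement, can be illustrated with a minimal tabular Q-learning sketch. Everything below is an illustrative assumption rather than the paper's actual system model: the two-scenario traffic load, the latency proxy, and the reduction of "symbol structure" to a DL-first/UL-first ordering are all toy stand-ins chosen only to show the two nested decision layers sharing one reward.

```python
import random
from collections import defaultdict

# Assumption: 14 OFDM symbols per NR slot (normal cyclic prefix).
SLOT_SYMBOLS = 14

class QAgent:
    """Tabular bandit-style Q-learner; a toy stand-in for one RML instance."""
    def __init__(self, actions, eps=0.1):
        self.q = defaultdict(float)   # (state, action) -> value estimate
        self.n = defaultdict(int)     # (state, action) -> visit count
        self.actions = actions
        self.eps = eps

    def act(self, state):
        if random.random() < self.eps:
            return random.choice(self.actions)  # epsilon-greedy exploration
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward):
        # Incremental sample-average update of the action value.
        key = (state, action)
        self.n[key] += 1
        self.q[key] += (reward - self.q[key]) / self.n[key]

def tail_latency(dl_load, ul_load, n_dl, order):
    """Toy latency proxy (not the paper's metric): penalize mismatch between
    the traffic split and the DL/UL symbol split; give a small credit when the
    slot ordering matches the dominant traffic direction."""
    frac_dl = dl_load / (dl_load + ul_load)
    mismatch = abs(frac_dl - n_dl / SLOT_SYMBOLS)
    bonus = 0.01 if (order == "dl_first") == (dl_load >= ul_load) else 0.0
    return mismatch - bonus

# Primary layer: how many of the 14 symbols are DL (UL gets the remainder).
primary = QAgent(actions=list(range(1, SLOT_SYMBOLS)))
# Secondary layer: the (toy) symbol structure for the chosen split.
secondary = QAgent(actions=["dl_first", "ul_first"])

# Hypothetical directional traffic statistics: (DL load, UL load) pairs.
scenarios = {"dl_heavy": (0.8, 0.2), "ul_heavy": (0.3, 0.7)}

random.seed(0)
for step in range(3000):
    name = random.choice(list(scenarios))
    dl_load, ul_load = scenarios[name]
    n_dl = primary.act(name)                     # primary: symbol split
    order = secondary.act((name, n_dl))          # secondary: structure
    reward = -tail_latency(dl_load, ul_load, n_dl, order)
    primary.update(name, n_dl, reward)           # both layers share the
    secondary.update((name, n_dl), order, reward)  # same latency reward
```

After training, disabling exploration and querying the greedy policy shows the intended behavior: the primary agent allocates roughly the traffic-proportional number of DL symbols per scenario, and the secondary agent picks the ordering that matches the dominant direction.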