TY - GEN
T1 - Reinforcement Learning Method for Beam Management in Millimeter-Wave Networks
AU - Wang, Ruiyu
AU - Onireti, Oluwakayode
AU - Zhang, Lei
AU - Imran, Muhammad Ali
AU - Ren, Guangmei
AU - Qiu, Jing
AU - Tian, Tingjian
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/8
Y1 - 2019/8
N2 - With the rapid growth of mobile data demand, the fifth generation (5G) mobile network must exploit the large amount of spectrum in the millimeter wave (mmWave) band to increase the network capacity. Due to the limitation of propagation distance, line-of-sight (LOS) link is highly desirable for mmWave systems. However, LOS channel is not feasible all the time and mmWave is also impacted significantly by the surrounding environment. The LOS signal can be easily blocked by surrounding buildings. Based on this issue, in this paper, we propose to use reinforcement learning to manage the non line of sight (NLOS) scenario. Specifically, we build a model simulating blocked LOS signal for the user equipment (UE) with only NLOS channel available for the UE. Q-Learning is used to select the NLOS beam that meets the UE's quality of service requirements. Simulation results show that Q-Learning can be used to manage the beam selection. In particular, at initial training stage the Q-Learning explores in the environment. However, with the training process, Q-Learning learns from experience and the received power increases significantly and converges to an excellent level.
AB - With the rapid growth of mobile data demand, the fifth generation (5G) mobile network must exploit the large amount of spectrum in the millimeter wave (mmWave) band to increase the network capacity. Due to the limitation of propagation distance, line-of-sight (LOS) link is highly desirable for mmWave systems. However, LOS channel is not feasible all the time and mmWave is also impacted significantly by the surrounding environment. The LOS signal can be easily blocked by surrounding buildings. Based on this issue, in this paper, we propose to use reinforcement learning to manage the non line of sight (NLOS) scenario. Specifically, we build a model simulating blocked LOS signal for the user equipment (UE) with only NLOS channel available for the UE. Q-Learning is used to select the NLOS beam that meets the UE's quality of service requirements. Simulation results show that Q-Learning can be used to manage the beam selection. In particular, at initial training stage the Q-Learning explores in the environment. However, with the training process, Q-Learning learns from experience and the received power increases significantly and converges to an excellent level.
KW - Beam tracking
KW - millimeter wave
KW - none-line-of-sight
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/85074938120
U2 - 10.1109/UCET.2019.8881841
DO - 10.1109/UCET.2019.8881841
M3 - Conference contribution
AN - SCOPUS:85074938120
T3 - 2019 UK/China Emerging Technologies, UCET 2019
BT - 2019 UK/China Emerging Technologies, UCET 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 UK/China Emerging Technologies, UCET 2019
Y2 - 21 August 2019 through 22 August 2019
ER -