TY - JOUR
T1 - Learning to Continuously Optimize Wireless Resource in a Dynamic Environment
T2 - A Bilevel Optimization Perspective
AU - Sun, Haoran
AU - Pu, Wenqiang
AU - Fu, Xiao
AU - Chang, Tsung Hui
AU - Hong, Mingyi
N1 - Publisher Copyright:
© 1991-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - There has been a growing interest in developing data-driven, and in particular deep neural network (DNN) based methods for modern communication tasks. These methods achieve state-of-the-art performance for a few popular wireless resource allocation problems, while requiring less computational efforts, less resources for acquiring channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment. This work develops a new approach that enables data-driven methods to continuously learn and optimize wireless resource allocation in a dynamic environment. Specifically, we consider an 'episodically dynamic' setting where the environment statistics change in 'episodes,' and in each episode the environment is stationary. We propose to build the notion of continual learning (CL) into wireless system design, so that the learning model can incrementally adapt to the new episodes, without forgetting knowledge learned from the previous episodes. We demonstrate the effectiveness of the CL approach by integrating it with three popular DNN based models for power control, beamforming and multi-user MIMO, respectively, and testing using both synthetic and ray-tracing based data sets. These numerical results show that the proposed CL approach is not only able to adapt to the new scenarios quickly and seamlessly, but importantly, it also maintains high performance over the previously encountered scenarios as well.
AB - There has been a growing interest in developing data-driven, and in particular deep neural network (DNN) based methods for modern communication tasks. These methods achieve state-of-the-art performance for a few popular wireless resource allocation problems, while requiring less computational efforts, less resources for acquiring channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment. This work develops a new approach that enables data-driven methods to continuously learn and optimize wireless resource allocation in a dynamic environment. Specifically, we consider an 'episodically dynamic' setting where the environment statistics change in 'episodes,' and in each episode the environment is stationary. We propose to build the notion of continual learning (CL) into wireless system design, so that the learning model can incrementally adapt to the new episodes, without forgetting knowledge learned from the previous episodes. We demonstrate the effectiveness of the CL approach by integrating it with three popular DNN based models for power control, beamforming and multi-user MIMO, respectively, and testing using both synthetic and ray-tracing based data sets. These numerical results show that the proposed CL approach is not only able to adapt to the new scenarios quickly and seamlessly, but importantly, it also maintains high performance over the previously encountered scenarios as well.
KW - Adaptation models
KW - Data models
KW - Deep learning
KW - Resource management
KW - Task analysis
KW - Training
KW - Wireless communication
KW - bilevel optimization
KW - continual learning
KW - dynamic environments
KW - wireless communication
UR - http://www.scopus.com/inward/record.url?scp=85123315400&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123315400&partnerID=8YFLogxK
U2 - 10.1109/TSP.2022.3143372
DO - 10.1109/TSP.2022.3143372
M3 - Article
AN - SCOPUS:85123315400
SN - 1053-587X
VL - 70
SP - 1900
EP - 1917
JO - IRE Transactions on Audio
JF - IRE Transactions on Audio
ER -