Goto

Collaborating Authors

 ac-dqn


Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning

arXiv.org Machine Learning

Multicasting in wireless systems is a natural way to exploit the redundancy in user requests in a Content Centric Network. Power control and optimal scheduling can significantly improve the wireless multicast network's performance under fading. However, the model based approaches for power control and scheduling studied earlier are not scalable to large state space or changing system dynamics. In this paper, we use deep reinforcement learning where we use function approximation of the Q-function via a deep neural network to obtain a power control policy that matches the optimal policy for a small network. We show that power control policy can be learnt for reasonably large systems via this approach. Further we use multi-timescale stochastic optimization to maintain the average power constraint. We demonstrate that a slight modification of the learning algorithm allows tracking of time varying system statistics. Finally, we extend the multi-timescale approach to simultaneously learn the optimal queueing strategy along with power control. We demonstrate scalability, tracking and cross layer optimization capabilities of our algorithms via simulations. The proposed multi-timescale approach can be used in general large state space dynamical systems with multiple objectives and constraints, and may be of independent interest.


Deep Reinforcement Learning Based Power control for Wireless Multicast Systems

arXiv.org Machine Learning

Deep Reinforcement Learning Based Power control for Wireless Multicast Systems Ramkumar Raghu 1, Pratheek Upadhyaya 1, Mahadesh Panju 1, V aneet Aggarwal 1,2, and Vinod Sharma 1 1 Indian Institute of Science, Bangalore, INDIA. Abstract -- We consider a multicast scheme recently proposed for a wireless downlink in [1]. It was shown earlier that power control can significantly improve its performance. However for this system, obtaining optimal power control is intractable because of a very large state space. Therefore in this paper we use deep reinforcement learning where we use function approximation of the Q-function via a deep neural network. We show that optimal power control can be learnt for reasonably large systems via this approach. The average power constraint is ensured via a Lagrange multiplier, which is also learnt. Finally, we demonstrate that a slight modification of the learning algorithm allows the optimal control to track the time varying system statistics. I NTRODUCTION Wireless networks are being constantly refined to cater for seamless delivery of huge amount of data to the end users. With increased user generated contents and proliferation of social networking sites, almost 78% of mobile data traffic is expected to be due to mobile videos [2]. Also, the requested traffic for these contents is ridden with redundant requests [3]. Thus, multicasting is a natural way to address these requests. A multicast queue with network coding is studied in [4], [5] with infinite library of files.