Article (Scientific journals)
Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization
Yuan, Yaxiong; Lei, Lei; Vu, Thang Xuan et al.
2021In IEEE Transactions on Vehicular Technology
Peer Reviewed verified by ORBi
 

Files


Full Text
09416816.pdf
Publisher postprint (5.3 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
UAV; deep reinforcement learning; user scheduling; hovering time allocation; energy optimization; actor-critic
Abstract :
[en] In unmanned aerial vehicle (UAV) applications, the UAV's limited energy supply and storage have triggered the development of intelligent energy-conserving scheduling solutions. In this paper, we investigate energy minimization for UAV-aided communication networks by jointly optimizing data-transmission scheduling and UAV hovering time. The formulated problem is combinatorial and non-convex with bilinear constraints. To tackle the problem, firstly, we provide an optimal relax-and-approximate solution and develop a near-optimal algorithm. Both the proposed solutions are served as offline performance benchmarks but might not be suitable for online operation. To this end, we develop a solution from a deep reinforcement learning (DRL) aspect. The conventional RL/DRL, e.g., deep Q-learning, however, is limited in dealing with two main issues in constrained combinatorial optimization, i.e., exponentially increasing action space and infeasible actions. The novelty of solution development lies in handling these two issues. To address the former, we propose an actor-critic-based deep stochastic online scheduling (AC-DSOS) algorithm and develop a set of approaches to confine the action space. For the latter, we design a tailored reward function to guarantee the solution feasibility. Numerical results show that, by consuming equal magnitude of time, AC-DSOS is able to provide feasible solutions and saves 29.94% energy compared with a conventional deep actor-critic method. Compared to the developed near-optimal algorithm, AC-DSOS consumes around 10% higher energy but reduces the computational time from minute-level to millisecond-level.
Disciplines :
Electrical & electronics engineering
Author, co-author :
Yuan, Yaxiong ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom
Lei, Lei ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom
Vu, Thang Xuan  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom
Chatzinotas, Symeon  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SigCom
Sun, Sumei;  Institute for Inforcom, Agency for Science, Technology, and Reseach, Singapore
Ottersten, Björn ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
External co-authors :
yes
Language :
English
Title :
Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization
Publication date :
27 April 2021
Journal title :
IEEE Transactions on Vehicular Technology
ISSN :
0018-9545
Publisher :
Institute of Electrical and Electronics Engineers, United States
Peer reviewed :
Peer Reviewed verified by ORBi
FnR Project :
FNR12173206 - Learning-assisted Optimization For Resource And Security Management In Slicing-based 5g Networks - LARGOS, 2017 (15/03/2018-14/03/2022) - Srikanth Bommaraven
Available on ORBilu :
since 18 June 2021

Statistics


Number of views
174 (54 by Unilu)
Number of downloads
105 (24 by Unilu)

Scopus citations®
 
27
Scopus citations®
without self-citations
20
WoS citations
 
21

Bibliography


Similar publications



Contact ORBilu