TY - JOUR
T1 - 3D UAV Trajectory and Data Collection Optimisation Via Deep Reinforcement Learning
AU - Nguyen, Khoi Khac
AU - Duong, Trung Q.
AU - Do-Duy, Tan
AU - Claussen, Holger
AU - Hanzo, Lajos
N1 - Publisher Copyright:
© 1972-2012 IEEE.
PY - 2022/4/1
Y1 - 2022/4/1
N2 - Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on- board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV-assisted IoT system relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.
AB - Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on- board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV-assisted IoT system relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.
KW - and deep reinforcement learning
KW - data collection
KW - trajectory
KW - UAV-assisted wireless network
UR - https://www.scopus.com/pages/publications/85124183715
U2 - 10.1109/TCOMM.2022.3148364
DO - 10.1109/TCOMM.2022.3148364
M3 - Article
AN - SCOPUS:85124183715
SN - 0090-6778
VL - 70
SP - 2358
EP - 2371
JO - IEEE Transactions on Communications
JF - IEEE Transactions on Communications
IS - 4
ER -