3D UAV Trajectory and Data Collection Optimisation Via Deep Reinforcement Learning

Research output: Contribution to journalArticlepeer-review

Abstract

Unmanned aerial vehicles (UAVs) are now beginning to be deployed for enhancing the network performance and coverage in wireless communication. However, due to the limitation of their on- board power and flight time, it is challenging to obtain an optimal resource allocation scheme for the UAV-assisted Internet of Things (IoT). In this paper, we design a new UAV-assisted IoT system relying on the shortest flight path of the UAVs while maximising the amount of data collected from IoT devices. Then, a deep reinforcement learning-based technique is conceived for finding the optimal trajectory and throughput in a specific coverage area. After training, the UAV has the ability to autonomously collect all the data from user nodes at a significant total sum-rate improvement while minimising the associated resources used. Numerical results are provided to highlight how our techniques strike a balance between the throughput attained, trajectory, and the time spent. More explicitly, we characterise the attainable performance in terms of the UAV trajectory, the expected reward and the total sum-rate.

Original languageEnglish
Pages (from-to)2358-2371
Number of pages14
JournalIEEE Transactions on Communications
Volume70
Issue number4
DOIs
Publication statusPublished - 1 Apr 2022

Keywords

  • and deep reinforcement learning
  • data collection
  • trajectory
  • UAV-assisted wireless network

Fingerprint

Dive into the research topics of '3D UAV Trajectory and Data Collection Optimisation Via Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this