TY - GEN
T1 - Enhancing UAV Path Planning Efficiency Through Accelerated Learning
AU - Viana, Joseanne
AU - Galkin, Boris
AU - Ho, Lester
AU - Claussen, Holger
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Unmanned Aerial Vehicles (UAVs) are increasingly essential in various fields such as surveillance, reconnaissance, and telecommunications. This study aims to develop a learning algorithm for the path planning of UAV wireless communication relays, which can reduce storage requirements and accelerate Deep Reinforcement Learning (DRL) convergence. Assuming the system possesses terrain maps of the area and can estimate user locations using localization algorithms or direct GPS reporting, it can input these parameters into the learning algorithms to achieve optimized path planning performance. However, higher resolution terrain maps are necessary to extract topological information such as terrain height, object distances, and signal blockages. This requirement increases memory and storage demands on UAVs while also lengthening convergence times in DRL algorithms. Similarly, defining the telecommunication coverage map in UAV wireless communication relays using these terrain maps and user position estimations demands higher memory and storage utilization for the learning path planning algorithms. Our approach reduces path planning training time by applying a dimensionality reduction technique based on Principal Component Analysis (PCA), sample combination, Prioritized Experience Replay (PER), and the combination of Mean Squared Error (MSE) and Mean Absolute Error (MAE) loss calculations in the coverage map estimates, thereby enhancing a Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm. The proposed solution reduces the convergence episodes needed for basic training by approximately four times compared to the traditional TD3.
AB - Unmanned Aerial Vehicles (UAVs) are increasingly essential in various fields such as surveillance, reconnaissance, and telecommunications. This study aims to develop a learning algorithm for the path planning of UAV wireless communication relays, which can reduce storage requirements and accelerate Deep Reinforcement Learning (DRL) convergence. Assuming the system possesses terrain maps of the area and can estimate user locations using localization algorithms or direct GPS reporting, it can input these parameters into the learning algorithms to achieve optimized path planning performance. However, higher resolution terrain maps are necessary to extract topological information such as terrain height, object distances, and signal blockages. This requirement increases memory and storage demands on UAVs while also lengthening convergence times in DRL algorithms. Similarly, defining the telecommunication coverage map in UAV wireless communication relays using these terrain maps and user position estimations demands higher memory and storage utilization for the learning path planning algorithms. Our approach reduces path planning training time by applying a dimensionality reduction technique based on Principal Component Analysis (PCA), sample combination, Prioritized Experience Replay (PER), and the combination of Mean Squared Error (MSE) and Mean Absolute Error (MAE) loss calculations in the coverage map estimates, thereby enhancing a Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm. The proposed solution reduces the convergence episodes needed for basic training by approximately four times compared to the traditional TD3.
KW - 5G
KW - 6G
KW - Commu-nications
KW - Dimensionality Reduction
KW - Memory
KW - Reinforcement Learning
KW - Relays
KW - Storage
KW - TD3
KW - UAVs
UR - https://www.scopus.com/pages/publications/105002869240
U2 - 10.1109/CAMAD62243.2024.10942806
DO - 10.1109/CAMAD62243.2024.10942806
M3 - Conference proceeding
AN - SCOPUS:105002869240
T3 - IEEE International Workshop on Computer Aided Modeling and Design of Communication Links and Networks, CAMAD
BT - 2024 IEEE 29th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks, CAMAD 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 29th IEEE International Workshop on Computer Aided Modeling and Design of Communication Links and Networks, CAMAD 2024
Y2 - 21 October 2024 through 23 October 2024
ER -