TY - JOUR
T1 - Variations in Multi-Agent Actor–Critic Frameworks for Joint Optimizations in UAV Swarm Networks
T2 - Recent Evolution, Challenges, and Directions
AU - Alam, Muhammad Morshed
AU - Trina, Sayma Akter
AU - Hossain, Tamim
AU - Mahmood, Shafin
AU - Ahmed, Md Sanim
AU - Arafat, Muhammad Yeasir
N1 - Publisher Copyright:
© 2025 by the authors.
PY - 2025/2
Y1 - 2025/2
N2 - Autonomous unmanned aerial vehicle (UAV) swarm networks (UAVSNs) can efficiently perform surveillance, connectivity, computing, and energy transfer services for ground users (GUs). These missions require trajectory planning, UAV-GUs association, task offloading, next-hop selection, and resource allocation, including transmit power, bandwidth, timeslots, caching, and computing resources, to enhance network performance. Owing to the highly dynamic topology, limited resources, stringent quality of service requirements, and lack of global knowledge, optimizing network performance in UAVSNs is very intricate. To address this, an adaptive joint optimization framework is required to handle both discrete and continuous decision variables, ensuring optimal performance under various dynamic constraints. A multi-agent deep reinforcement learning-based adaptive actor–critic framework offers an effective solution by leveraging its ability to extract hidden features through agent interactions, generate hybrid actions under uncertainty, and adaptively learn with scalable generalization in dynamic conditions. This paper explores the recent evolutions of actor–critic frameworks to deal with joint optimization problems in UAVSNs by proposing a novel taxonomy based on the modifications in the internal actor–critic neural network structure. Additionally, key open research challenges are identified, and potential solutions are suggested as directions for future research in UAVSNs.
AB - Autonomous unmanned aerial vehicle (UAV) swarm networks (UAVSNs) can efficiently perform surveillance, connectivity, computing, and energy transfer services for ground users (GUs). These missions require trajectory planning, UAV-GUs association, task offloading, next-hop selection, and resource allocation, including transmit power, bandwidth, timeslots, caching, and computing resources, to enhance network performance. Owing to the highly dynamic topology, limited resources, stringent quality of service requirements, and lack of global knowledge, optimizing network performance in UAVSNs is very intricate. To address this, an adaptive joint optimization framework is required to handle both discrete and continuous decision variables, ensuring optimal performance under various dynamic constraints. A multi-agent deep reinforcement learning-based adaptive actor–critic framework offers an effective solution by leveraging its ability to extract hidden features through agent interactions, generate hybrid actions under uncertainty, and adaptively learn with scalable generalization in dynamic conditions. This paper explores the recent evolutions of actor–critic frameworks to deal with joint optimization problems in UAVSNs by proposing a novel taxonomy based on the modifications in the internal actor–critic neural network structure. Additionally, key open research challenges are identified, and potential solutions are suggested as directions for future research in UAVSNs.
KW - actor–critic frameworks
KW - joint optimizations
KW - multi-agent deep reinforcement learning
KW - resource allocation
KW - task offloading
KW - trajectory control
KW - UAV swarm networks
UR - https://www.scopus.com/pages/publications/85218626571
U2 - 10.3390/drones9020153
DO - 10.3390/drones9020153
M3 - Review article
AN - SCOPUS:85218626571
SN - 2504-446X
VL - 9
JO - Drones
JF - Drones
IS - 2
M1 - 153
ER -