TY - GEN
T1 - Deadline-Aware TDMA Scheduling for Multihop Networks Using Reinforcement Learning
AU - Chilukuri, Shanti
AU - Piao, Guangyuan
AU - Lugones, DIego
AU - Pesch, DIrk
N1 - Publisher Copyright:
© 2021 IFIP.
PY - 2021/6/21
Y1 - 2021/6/21
N2 - Time division multiple access (TDMA) is the medium access control strategy of choice for multihop networks with deterministic delay guarantee requirements. As such, many Internet of Things applications use protocols based on time division multiple access. Optimal slot assignment in such networks is NP-hard when there are strict deadline requirements and is generally done using heuristics that give suboptimal transmission schedules in linear time. However, existing heuristics make a scheduling decision at each time slot based on the same criterion without considering its effect on subsequent network states or scheduling actions. Here, we first identify a set of node features that capture the information necessary for network state representation to aid building schedules using Reinforcement Learning (RL). We then propose three different centralized approaches to RL-based TDMA scheduling that vary in training and network representation methods. Using RL allows applying diverse criteria at different time slots while considering the effect of a scheduling action on meeting the scheduling objective for the entire TDMA frame, resulting in better schedules. We compare the three proposed schemes in terms of how well they meet the scheduling objectives and their applicability to networks with memory and time constraints. One of the schemes proposed is RLSchedule, which is particularly suited to constrained networks. Simulation results for a variety of network scenarios show that RLSchedule reduces the percentage of packets missing deadlines by up to 60% compared to the best available baseline heuristic.
AB - Time division multiple access (TDMA) is the medium access control strategy of choice for multihop networks with deterministic delay guarantee requirements. As such, many Internet of Things applications use protocols based on time division multiple access. Optimal slot assignment in such networks is NP-hard when there are strict deadline requirements and is generally done using heuristics that give suboptimal transmission schedules in linear time. However, existing heuristics make a scheduling decision at each time slot based on the same criterion without considering its effect on subsequent network states or scheduling actions. Here, we first identify a set of node features that capture the information necessary for network state representation to aid building schedules using Reinforcement Learning (RL). We then propose three different centralized approaches to RL-based TDMA scheduling that vary in training and network representation methods. Using RL allows applying diverse criteria at different time slots while considering the effect of a scheduling action on meeting the scheduling objective for the entire TDMA frame, resulting in better schedules. We compare the three proposed schemes in terms of how well they meet the scheduling objectives and their applicability to networks with memory and time constraints. One of the schemes proposed is RLSchedule, which is particularly suited to constrained networks. Simulation results for a variety of network scenarios show that RLSchedule reduces the percentage of packets missing deadlines by up to 60% compared to the best available baseline heuristic.
UR - https://www.scopus.com/pages/publications/85112791273
U2 - 10.23919/IFIPNetworking52078.2021.9472801
DO - 10.23919/IFIPNetworking52078.2021.9472801
M3 - Conference proceeding
AN - SCOPUS:85112791273
T3 - 2021 IFIP Networking Conference, IFIP Networking 2021
BT - 2021 IFIP Networking Conference, IFIP Networking 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 20th Annual IFIP Networking Conference, IFIP Networking 2021
Y2 - 21 June 2021 through 24 June 2021
ER -