TY - GEN
T1 - Solving Mixed Influence Diagrams by Reinforcement Learning
AU - Prestwich, S. D.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - While efficient optimisation methods exist for problems with special properties (linear, continuous, differentiable, unconstrained), real-world problems often involve inconvenient complications (constrained, discrete, multi-stage, multi-level, multi-objective). Each of these complications has spawned research areas in Artificial Intelligence and Operations Research, but few methods are available for hybrid problems. We describe a reinforcement learning-based solver for a broad class of discrete problems that we call Mixed Influence Diagrams, which may have multiple stages, multiple agents, multiple non-linear objectives, correlated chance variables, exogenous and endogenous uncertainty, constraints (hard, soft and chance) and partially observed variables. We apply the solver to problems taken from stochastic programming, chance-constrained programming, limited-memory influence diagrams, multi-level and multi-objective optimisation. We expect the approach to be useful on new hybrid problems for which no specialised solution methods exist.
AB - While efficient optimisation methods exist for problems with special properties (linear, continuous, differentiable, unconstrained), real-world problems often involve inconvenient complications (constrained, discrete, multi-stage, multi-level, multi-objective). Each of these complications has spawned research areas in Artificial Intelligence and Operations Research, but few methods are available for hybrid problems. We describe a reinforcement learning-based solver for a broad class of discrete problems that we call Mixed Influence Diagrams, which may have multiple stages, multiple agents, multiple non-linear objectives, correlated chance variables, exogenous and endogenous uncertainty, constraints (hard, soft and chance) and partially observed variables. We apply the solver to problems taken from stochastic programming, chance-constrained programming, limited-memory influence diagrams, multi-level and multi-objective optimisation. We expect the approach to be useful on new hybrid problems for which no specialised solution methods exist.
UR - https://www.scopus.com/pages/publications/85186269900
U2 - 10.1007/978-3-031-53966-4_19
DO - 10.1007/978-3-031-53966-4_19
M3 - Conference proceeding
AN - SCOPUS:85186269900
SN - 9783031539657
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 255
EP - 269
BT - Machine Learning, Optimization, and Data Science - 9th International Conference, LOD 2023, Revised Selected Papers
A2 - Nicosia, Giuseppe
A2 - Ojha, Varun
A2 - La Malfa, Emanuele
A2 - La Malfa, Gabriele
A2 - Pardalos, Panos M.
A2 - Umeton, Renato
PB - Springer Science and Business Media Deutschland GmbH
T2 - 9th International Conference on Machine Learning, Optimization, and Data Science, LOD 2023
Y2 - 22 September 2023 through 26 September 2023
ER -