This paper introduces a novel algorithm for Reinforcement Learning (RL) in Regular Decision Processes (RDPs), a model of non-Markovian decision processes where dynamics and rewards depend on regular properties of the history. Our algorithm is inspired by Monte Carlo tree search (MCTS), yet it is improved with state merging capabilities. Performing merges allows us to evolve the tree model into a graph over time, as we periodically perform similarity tests borrowed from automata learning theory to learn states that are equivalent to one another. This results in improved efficiency and scalability over standard MCTS. We present empirical results that demonstrate orders of magnitude performance improvement over the state-of-the-art RL algorithms for RDPs.

Monte Carlo Tree Search with State Merging for Reinforcement Learning in Regular Decision Processes / Licks, G. P.; Patrizi, F.; De Giacomo, G.. - 392:(2024), pp. 1373-1380. ( 27th European Conference on Artificial Intelligence, ECAI 2024 Santiago de Compostela; Spain ) [10.3233/FAIA240637].

Monte Carlo Tree Search with State Merging for Reinforcement Learning in Regular Decision Processes

Licks G. P.;Patrizi F.;De Giacomo G.
2024

Abstract

This paper introduces a novel algorithm for Reinforcement Learning (RL) in Regular Decision Processes (RDPs), a model of non-Markovian decision processes where dynamics and rewards depend on regular properties of the history. Our algorithm is inspired by Monte Carlo tree search (MCTS), yet it is improved with state merging capabilities. Performing merges allows us to evolve the tree model into a graph over time, as we periodically perform similarity tests borrowed from automata learning theory to learn states that are equivalent to one another. This results in improved efficiency and scalability over standard MCTS. We present empirical results that demonstrate orders of magnitude performance improvement over the state-of-the-art RL algorithms for RDPs.
2024
27th European Conference on Artificial Intelligence, ECAI 2024
Regular Decision Processes; Monte Carlo Tree Search; Reinforcement Learning
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Monte Carlo Tree Search with State Merging for Reinforcement Learning in Regular Decision Processes / Licks, G. P.; Patrizi, F.; De Giacomo, G.. - 392:(2024), pp. 1373-1380. ( 27th European Conference on Artificial Intelligence, ECAI 2024 Santiago de Compostela; Spain ) [10.3233/FAIA240637].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1738695
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact