Recently regular decision processes have been pro- posed as a well-behaved form of non-Markov decision process. Regular decision processes are characterised by a transition function and a reward function that depend on the whole history, though regularly (as in regular languages). In practice both the transition and the reward functions can be seen as finite transducers. We study reinforcement learning in regular decision processes. Our main contribution is to show that a near-optimal policy can be PAC-learned in polynomial time in a set of parameters that describe the underlying decision process. We argue that the identified set of parameters is minimal and it reasonably captures the difficulty of a regular decision process.

Efficient PAC Reinforcement Learning in Regular Decision Processes / Ronca, Alessandro; DE GIACOMO, Giuseppe. - In: IJCAI. - ISSN 1045-0823. - (2021), pp. 2026-2032. (Intervento presentato al convegno 30th International Joint Conference on Artificial Intelligence, IJCAI 2021 tenutosi a Montreal, Canada) [10.24963/ijcai.2021/279].

Efficient PAC Reinforcement Learning in Regular Decision Processes

Alessandro Ronca
;
Giuseppe De Giacomo
2021

Abstract

Recently regular decision processes have been pro- posed as a well-behaved form of non-Markov decision process. Regular decision processes are characterised by a transition function and a reward function that depend on the whole history, though regularly (as in regular languages). In practice both the transition and the reward functions can be seen as finite transducers. We study reinforcement learning in regular decision processes. Our main contribution is to show that a near-optimal policy can be PAC-learned in polynomial time in a set of parameters that describe the underlying decision process. We argue that the identified set of parameters is minimal and it reasonably captures the difficulty of a regular decision process.
2021
30th International Joint Conference on Artificial Intelligence, IJCAI 2021
MDP; Reinforcement Learning; non-Markov decision process
04 Pubblicazione in atti di convegno::04c Atto di convegno in rivista
Efficient PAC Reinforcement Learning in Regular Decision Processes / Ronca, Alessandro; DE GIACOMO, Giuseppe. - In: IJCAI. - ISSN 1045-0823. - (2021), pp. 2026-2032. (Intervento presentato al convegno 30th International Joint Conference on Artificial Intelligence, IJCAI 2021 tenutosi a Montreal, Canada) [10.24963/ijcai.2021/279].
File allegati a questo prodotto
File Dimensione Formato  
Ronca_Efficient-PAC_2021.pdf.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 165.59 kB
Formato Adobe PDF
165.59 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1575264
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 2
social impact