High-level programming of robotic agents requires a representation formalism able to cope with several sources of complexity (e.g. parallel execution, partial observability, exogenous events) and a designer able to model the domain precisely. Reinforcement Learning has proved promising in improving the performance, adaptability and robustness of plans in under-specified domains, although it does not scale well with the complexity of common robotic applications. In this paper we propose to combine an extremely expressive plan representation formalism (Petri Net Plans) with Reinforcement Learning over a stochastic process derived directly from such a plan. The derived process has a significantly reduced search space; the framework therefore scales well with the complexity of the domain and makes it possible to improve the performance of complex behaviors from experience. To demonstrate the effectiveness of the system, we show how to model and learn the behavior of robotic agents in the context of Keepaway Soccer (a widely accepted benchmark for RL) and the RoboCup Standard Platform League. © 2011 Springer-Verlag.

LearnPNP: A tool for learning agent behaviors / Leonetti, Matteo; Iocchi, Luca. - 6556 LNAI:(2011), pp. 418-429. (Paper presented at the 14th Annual RoboCup International Symposium, held in Singapore) [10.1007/978-3-642-20217-9_36].

LearnPNP: A tool for learning agent behaviors

IOCCHI, Luca
2011

14th Annual RoboCup International Symposium
agent programming; plan representation; reinforcement learning
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this record
File: VE_2011_11573-378667.pdf
Access: repository administrators only
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 364.31 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/378667
