Policy gradient learning for quadruped soccer robots

Cherubini, A.; Giannone, F.; Iocchi, Luca; Nardi, Daniele; Palamara, P. F.

doi:10.1016/j.robot.2010.03.008

In real-world robotic applications, many factors, both at low level (e.g., vision, motion control and behaviors) and at high level (e.g., plans and strategies) determine the quality of the robot performance. Consequently, fine tuning of the parameters, in the implementation of the basic functionalities, as well as in the strategic decisions, is a key issue in robot software development. In recent years, machine learning techniques have been successfully used to find optimal parameters for typical robotic functionalities. However, one major drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters using policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate. (C) 2010 Elsevier B.V. All rights reserved.

Policy gradient learning for quadruped soccer robots / A., C., F., G., Iocchi, L., Nardi, D., P. F., P.. - In: ROBOTICS AND AUTONOMOUS SYSTEMS. - ISSN 0921-8890. - 58:7(2010), pp. 872-878. [10.1016/j.robot.2010.03.008]

Policy gradient learning for quadruped soccer robots

A. Cherubini;F. Giannone;IOCCHI, Luca;NARDI, Daniele;P. F. Palamara

2010

Abstract

In real-world robotic applications, many factors, both at low level (e.g., vision, motion control and behaviors) and at high level (e.g., plans and strategies) determine the quality of the robot performance. Consequently, fine tuning of the parameters, in the implementation of the basic functionalities, as well as in the strategic decisions, is a key issue in robot software development. In recent years, machine learning techniques have been successfully used to find optimal parameters for typical robotic functionalities. However, one major drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters using policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate. (C) 2010 Elsevier B.V. All rights reserved.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2010
			
	Parole chiave
	
				reinforcement learning; soccer robots
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				Policy gradient learning for quadruped soccer robots / A., C., F., G., Iocchi, L., Nardi, D., P. F., P.. - In: ROBOTICS AND AUTONOMOUS SYSTEMS. - ISSN 0921-8890. - 58:7(2010), pp. 872-878. [10.1016/j.robot.2010.03.008]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
VE_2010_11573-75943.pdf solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.24 MB Formato Adobe PDF Contatta l'autore	1.24 MB	Adobe PDF	Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/75943

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

10

5

Catalogo dei prodotti della ricerca