An extended policy gradient algorithm for robot task learning

Cherubini, A; Giannone, F; Iocchi, Luca; Palamara, P. F.

doi:10.1109/IROS.2007.4399219

In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control parameters) and at high-level (e.g., the behaviors) determine the quality of the robot performance. Thus, for many tasks, robots require fine tuning of the parameters, in the implementation of behaviors and basic control actions, as well as in strategic decisional processes. In recent years, machine learning techniques have been used to find optimal parameter sets for different behaviors. However, a drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters, by extending the policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate. ©2007 IEEE.

An extended policy gradient algorithm for robot task learning / Cherubini, A., Giannone, F., Iocchi, L., Palamara, P.F. - (2007), pp. 4121-4126. (2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007, San Diego, United States, 29 Oct - 2 Nov 2007) [10.1109/IROS.2007.4399219].

Publication year	2007
Conference name	2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
Keywords	International conferences; Optimal parameters; Robot performance
Type	04 Publication in conference proceedings::04b Conference paper in volume

Files attached to this product

File	Size	Format
VE_2007_11573-358629.pdf (archive managers only; type: publisher's version, published with the publisher's layout; license: all rights reserved)	1.35 MB	Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/358629

Warning: the data shown have not been validated by the university.
