On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning

Ferigo, D.; Camoriano, R.; Viceconte, P. M.; Calandriello, D.; Traversaro, S.; Rosasco, L.; Pucci, D.

doi:10.1109/LRA.2021.3076955

Balancing and push-recovery are essential capabilities enabling humanoid robots to solve complex locomotion tasks. In this context, classical control systems tend to be based on simplified physical models and hard-coded strategies. Although successful in specific scenarios, this approach requires demanding tuning of parameters and switching logic between specifically-designed controllers for handling more general perturbations. We apply model-free Deep Reinforcement Learning for training a general and robust humanoid push-recovery policy in a simulation environment. Our method targets high-dimensional whole-body humanoid control and is validated on the iCub humanoid. Reward components incorporating expert knowledge on humanoid control enable fast learning of several robust behaviors by the same policy, spanning the entire body. We validate our method with extensive quantitative analyses in simulation, including out-of-sample tasks which demonstrate policy robustness and generalization, both key requirements towards real-world robot deployment.

On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning / Ferigo, D.; Camoriano, R.; Viceconte, P. M.; Calandriello, D.; Traversaro, S.; Rosasco, L.; Pucci, D.. - In: IEEE ROBOTICS AND AUTOMATION LETTERS. - ISSN 2377-3766. - 6:4(2021), pp. 8561-8568. [10.1109/LRA.2021.3076955]

On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning

Camoriano R.^Co-primo;Viceconte P. M.;Calandriello D.;Traversaro S.;Rosasco L.;Pucci D.

2021

Abstract

Balancing and push-recovery are essential capabilities enabling humanoid robots to solve complex locomotion tasks. In this context, classical control systems tend to be based on simplified physical models and hard-coded strategies. Although successful in specific scenarios, this approach requires demanding tuning of parameters and switching logic between specifically-designed controllers for handling more general perturbations. We apply model-free Deep Reinforcement Learning for training a general and robust humanoid push-recovery policy in a simulation environment. Our method targets high-dimensional whole-body humanoid control and is validated on the iCub humanoid. Reward components incorporating expert knowledge on humanoid control enable fast learning of several robust behaviors by the same policy, spanning the entire body. We validate our method with extensive quantitative analyses in simulation, including out-of-sample tasks which demonstrate policy robustness and generalization, both key requirements towards real-world robot deployment.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2021
			
	Parole chiave
	
				Robotics; Humanoids; Reinforcement Learning; Whole-body Control
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning / Ferigo, D.; Camoriano, R.; Viceconte, P. M.; Calandriello, D.; Traversaro, S.; Rosasco, L.; Pucci, D.. - In: IEEE ROBOTICS AND AUTOMATION LETTERS. - ISSN 2377-3766. - 6:4(2021), pp. 8561-8568. [10.1109/LRA.2021.3076955]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Ferigo_On-the-Emergence_2021.pdf accesso aperto Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 1.12 MB Formato Adobe PDF	1.12 MB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1583072

Citazioni

ND

15

7

Catalogo dei prodotti della ricerca