In this letter, we present a human-in-the-loop learning framework that lets mobile robots generate effective local policies for recovering from navigation failures in long-term autonomy. We analyze failure and recovery cases derived from the long-term autonomous operation of a mobile robot, and propose a two-layer learning framework that detects and recovers from such navigation failures. Employing a learning-by-demonstration approach, our framework incrementally learns to recover autonomously from situations in which it initially needs human help. The learning framework supports both real-time failure detection and regression using Gaussian processes. Our empirical results on two different failure scenarios indicate that, given 40 failure state observations, the true positive rate of the failure detection model exceeds 90%, and recovery actions succeed in more than 90% of all detected cases.

Do Not Make the Same Mistakes Again and Again: Learning Local Recovery Policies for Navigation from Human Demonstrations / Del Duchetto, F.; Kucukyilmaz, A.; Iocchi, L.; Hanheide, M.. - In: IEEE ROBOTICS AND AUTOMATION LETTERS. - ISSN 2377-3766. - 3:4(2018), pp. 4084-4091. [10.1109/LRA.2018.2861080]

Do Not Make the Same Mistakes Again and Again: Learning Local Recovery Policies for Navigation from Human Demonstrations

Del Duchetto, Francesco; Kucukyilmaz, A.; Iocchi, L.; Hanheide, M.
2018

Keywords: failure detection and recovery; learning from demonstration; service robots
01 Journal publication::01a Journal article
Files attached to this item
File: DelDuchetto_Do-Not-Make_2018.pdf
Access: archive managers only
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 2.61 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1328456
Citations
  • PMC: not available
  • Scopus: 7
  • ISI: 5