Task planning for robots in real-life settings presents significant challenges. These challenges stem from three primary issues: the difficulty in identifying grounded sequences of steps to achieve a goal; the lack of a standardized mapping between high-level actions and low-level commands; and the challenge of maintaining low computational overhead given the limited resources of robotic hardware. We introduce EMPOWER, a framework designed for open-vocabulary online grounding and planning for embodied agents aimed at addressing these issues. By leveraging efficient pre-trained foundation models and a multi-role mechanism, EMPOWER demonstrates notable improvements in grounded planning and execution. Quantitative results highlight the effectiveness of our approach, achieving an average success rate of 0.73 across six different real-life scenarios using a TIAGo robot.

EMPOWER: embodied multi-role open-vocabulary planning with online grounding and execution / Argenziano, Francesco; Brienza, Michele; Suriani, Vincenzo; Nardi, Daniele; Bloisi, Domenico D.. - (2024), pp. 12040-12047. (Intervento presentato al convegno IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) tenutosi a Abu Dhabi; United Arab Emirates) [10.1109/IROS58592.2024.10802251].

EMPOWER: embodied multi-role open-vocabulary planning with online grounding and execution

Francesco Argenziano
Primo
;
Michele Brienza
Secondo
;
Daniele Nardi
Penultimo
;
2024

Abstract

Task planning for robots in real-life settings presents significant challenges. These challenges stem from three primary issues: the difficulty in identifying grounded sequences of steps to achieve a goal; the lack of a standardized mapping between high-level actions and low-level commands; and the challenge of maintaining low computational overhead given the limited resources of robotic hardware. We introduce EMPOWER, a framework designed for open-vocabulary online grounding and planning for embodied agents aimed at addressing these issues. By leveraging efficient pre-trained foundation models and a multi-role mechanism, EMPOWER demonstrates notable improvements in grounded planning and execution. Quantitative results highlight the effectiveness of our approach, achieving an average success rate of 0.73 across six different real-life scenarios using a TIAGo robot.
2024
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
grounding; foundation models; semantics; real-time systems; planning; recycling; robots; testing; sorting; periodic structures
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
EMPOWER: embodied multi-role open-vocabulary planning with online grounding and execution / Argenziano, Francesco; Brienza, Michele; Suriani, Vincenzo; Nardi, Daniele; Bloisi, Domenico D.. - (2024), pp. 12040-12047. (Intervento presentato al convegno IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) tenutosi a Abu Dhabi; United Arab Emirates) [10.1109/IROS58592.2024.10802251].
File allegati a questo prodotto
File Dimensione Formato  
Argenziano_EMPOWER_2024.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 5.62 MB
Formato Adobe PDF
5.62 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1730230
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact