We present an improvement in visual object tracking and navigation for mobile robots, implementing the advantage actor-critic (A2C) reinforcement learning architecture on top of the Gym-Gazebo framework. This work provides an easier way to integrate reinforcement learning algorithms for navigation and object tracking tasks in the robotics field. We train the convolutional-recurrent model used for policy estimation in an end-to-end manner. The robot is able to follow a simulated human walking in an indoor environment using the sequence of images provided by the robot's camera. The input to the algorithm is acquired and processed directly in the ROS-Gazebo environment. The learned policy also generalized well to an environment whose size and shape differ from those of the training environment. Moreover, the policy allows the robot to avoid obstacles while following the target. Thanks to these improvements, the tracking system can be applied straightforwardly to a real-world robot for a person-following task in indoor environments.
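For readers unfamiliar with A2C, the following is a minimal sketch of how the actor and critic losses are typically formed from one short rollout. It is not the authors' implementation: the function names, the discount factor, and the equal weighting of steps are assumptions made for illustration, and this record does not state the paper's network, reward function, or hyperparameters.

```python
from typing import List, Tuple


def discounted_returns(rewards: List[float],
                       bootstrap_value: float,
                       gamma: float = 0.99) -> List[float]:
    """Return R_t = r_t + gamma * R_{t+1} for each step of a rollout.

    bootstrap_value is the critic's estimate for the state reached after
    the last step (use 0.0 if the episode ended there).
    """
    returns = []
    running = bootstrap_value
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def a2c_losses(log_probs: List[float],
               values: List[float],
               rewards: List[float],
               bootstrap_value: float,
               gamma: float = 0.99) -> Tuple[float, float]:
    """Compute (actor_loss, critic_loss) averaged over the rollout.

    log_probs[t] is log pi(a_t | s_t) and values[t] is the critic's V(s_t).
    The advantage is A_t = R_t - V(s_t). In an autodiff framework the
    advantage would be treated as a constant inside the actor loss.
    """
    returns = discounted_returns(rewards, bootstrap_value, gamma)
    advantages = [ret - val for ret, val in zip(returns, values)]
    n = len(rewards)
    actor_loss = -sum(lp * adv for lp, adv in zip(log_probs, advantages)) / n
    critic_loss = sum(adv ** 2 for adv in advantages) / n
    return actor_loss, critic_loss
```

In a setup like the one the abstract describes, `log_probs` and `values` would come from the two heads of the convolutional-recurrent model evaluated on the camera image sequence; here they are plain lists so that the arithmetic stays visible.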

Supporting impaired people with a following robotic assistant by means of end-to-end visual target navigation and reinforcement learning approaches / Ngoc Dat, Nguyen; Ponzi, Valerio; Russo, Samuele; Vincelli, Francesco. - 3118:(2021), pp. 51-63. (Paper presented at the 2021 International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2021, held virtually online).

Supporting impaired people with a following robotic assistant by means of end-to-end visual target navigation and reinforcement learning approaches

Valerio Ponzi (co-first author; Investigation)
Samuele Russo (co-first author; Conceptualization)
Francesco Vincelli (second author; Validation)
2021

2021 International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2021
visual object tracking; human robot interaction; reinforcement learning; visual navigation
04 Publication in conference proceedings::04b Conference paper in volume
Files attached to this record
File: Dat_Supporting_2021.pdf
Access: open access
Type: Publisher's version (published version with the publisher's layout)
License: Creative Commons
Size: 4.52 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1684741
Citations
  • Scopus: 20