This paper focuses on the use of meta-reinforcement learning for the autonomous guidance of a spacecraft with low thrust during the terminal phase of an impact mission towards a binary asteroid system. The control policy is replaced by a convolutional-recurrent neural network, which is used to map optical observations collected by the on-board camera to the optimal control thrust and thrusting times. The network is trained by Proximal Policy Optimization, a state-of-the-art policy-gradient reinforcement learning algorithm. The final phase of the DART mission is used as test case. The objective is to maneuver the spacecraft to impact on the smaller object, Dimorphos, in the 65803 Didymos binary system. The spacecraft dynamics are described within the bi-elliptic restricted four-body problem with an additional solar radiation pressure term. The initial conditions are randomly scattered according to actual specifications of the DART mission. A random error on the orbital position of Dimorphos is also considered to reflect an uncertainty on the binary system’s characteristics and dynamics. The control system aims at minimizing the error on the final spacecraft position. Numerical results show that the guidance system is able to correctly drive the spacecraft towards the final impact point in almost all test scenarios.

Image-based Meta-Reinforcement Learning for Autonomous Terminal Guidance of an Impactor in a Binary Asteroid System / Federici, L.; Scorsoglio, A.; Ghilardi, L.; D'Ambrosio, A.; Benedikter, B.; Zavoli, A.; Furfaro, R.. - (2022). (Intervento presentato al convegno AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 tenutosi a San Diego; CA (USA)) [10.2514/6.2022-2270].

Image-based Meta-Reinforcement Learning for Autonomous Terminal Guidance of an Impactor in a Binary Asteroid System

Federici L.;D'ambrosio A.;Benedikter B.;Zavoli A.;
2022

Abstract

This paper focuses on the use of meta-reinforcement learning for the autonomous guidance of a spacecraft with low thrust during the terminal phase of an impact mission towards a binary asteroid system. The control policy is replaced by a convolutional-recurrent neural network, which is used to map optical observations collected by the on-board camera to the optimal control thrust and thrusting times. The network is trained by Proximal Policy Optimization, a state-of-the-art policy-gradient reinforcement learning algorithm. The final phase of the DART mission is used as test case. The objective is to maneuver the spacecraft to impact on the smaller object, Dimorphos, in the 65803 Didymos binary system. The spacecraft dynamics are described within the bi-elliptic restricted four-body problem with an additional solar radiation pressure term. The initial conditions are randomly scattered according to actual specifications of the DART mission. A random error on the orbital position of Dimorphos is also considered to reflect an uncertainty on the binary system’s characteristics and dynamics. The control system aims at minimizing the error on the final spacecraft position. Numerical results show that the guidance system is able to correctly drive the spacecraft towards the final impact point in almost all test scenarios.
2022
AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022
meta-reinforcement learning; recurrent neural network; image-based guidance; kinetic impactor; binary asteroid system
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Image-based Meta-Reinforcement Learning for Autonomous Terminal Guidance of an Impactor in a Binary Asteroid System / Federici, L.; Scorsoglio, A.; Ghilardi, L.; D'Ambrosio, A.; Benedikter, B.; Zavoli, A.; Furfaro, R.. - (2022). (Intervento presentato al convegno AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 tenutosi a San Diego; CA (USA)) [10.2514/6.2022-2270].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1615402
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? ND
social impact