This paper focuses on the use of meta-reinforcement learning for the autonomous guidance of a spacecraft during the terminal phase of an impact mission toward a binary asteroid system. The control policy is replaced by a convolutional-recurrent neural network, which is used to map optical observations collected by the onboard camera to the control thrust and thrusting times. The network is trained by a proximal policy optimization algorithm, a family of reinforcement learning methods. The final phase of NASA's Double Asteroid Redirection Test (DART) mission is used as a test case. The objective is to maneuver the spacecraft to impact the smaller object, Dimorphos, in the Didymos binary system. The spacecraft dynamics are described using the bi-elliptic restricted four-body problem with solar radiation pressure. The initial conditions are randomly scattered according to the actual specifications of the DART mission. A random error on the orbital position of Dimorphos is also considered to reflect uncertainty on the binary system's characteristics and dynamics. The control system aims at minimizing the error on the final spacecraft position. Numerical results show that the guidance system can correctly drive the spacecraft toward the final impact point in more than 98% of the 500 test scenarios.
Image-based meta-reinforcement learning for autonomous guidance of an asteroid impactor / Federici, Lorenzo; Scorsoglio, Andrea; Ghilardi, Luca; D'Ambrosio, Andrea; Benedikter, Boris; Zavoli, Alessandro; Furfaro, Roberto. - In: JOURNAL OF GUIDANCE CONTROL AND DYNAMICS. - ISSN 0731-5090. - 45:11(2022), pp. 2013-2028. [10.2514/1.g006832]
Image-based meta-reinforcement learning for autonomous guidance of an asteroid impactor
Federici, Lorenzo;D'Ambrosio, Andrea;Benedikter, Boris;Zavoli, Alessandro;
2022
Abstract
This paper focuses on the use of meta-reinforcement learning for the autonomous guidance of a spacecraft during the terminal phase of an impact mission toward a binary asteroid system. The control policy is replaced by a convolutional-recurrent neural network, which is used to map optical observations collected by the onboard camera to the control thrust and thrusting times. The network is trained by a proximal policy optimization algorithm, a family of reinforcement learning methods. The final phase of NASA's Double Asteroid Redirection Test (DART) mission is used as a test case. The objective is to maneuver the spacecraft to impact the smaller object, Dimorphos, in the Didymos binary system. The spacecraft dynamics are described using the bi-elliptic restricted four-body problem with solar radiation pressure. The initial conditions are randomly scattered according to the actual specifications of the DART mission. A random error on the orbital position of Dimorphos is also considered to reflect uncertainty on the binary system's characteristics and dynamics. The control system aims at minimizing the error on the final spacecraft position. Numerical results show that the guidance system can correctly drive the spacecraft toward the final impact point in more than 98% of the 500 test scenarios.File | Dimensione | Formato | |
---|---|---|---|
Federici_postprint_Image_2022.pdf
accesso aperto
Note: https://arc.aiaa.org/doi/epdf/10.2514/1.G006832?src=getftr
Tipologia:
Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.19 MB
Formato
Adobe PDF
|
2.19 MB | Adobe PDF | |
Federici_Image_2022.pdf
solo gestori archivio
Note: https://arc.aiaa.org/doi/10.2514/1.G006832
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
1.87 MB
Formato
Adobe PDF
|
1.87 MB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.