This paper investigates the use of reinforcement learning for the fuel-optimal guidance of a spacecraft during a time-free low-thrust transfer between two libration point orbits in the cislunar environment. To this aim, a deep neural network is trained via proximal policy optimization to map any spacecraft state to the optimal control action. A general-purpose reward is used to guide the network toward a fuel-optimal control law, regardless of the specific pair of libration orbits considered and without the use of any ad hoc reward shaping technique. Eventually, the learned control policies are compared with the optimal solutions provided by a direct method in two different mission scenarios, and Monte Carlo simulations are used to assess the policies' robustness to navigation uncertainties.

Autonomous guidance between quasiperiodic orbits in cislunar space via deep reinforcement learning / Federici, Lorenzo; Scorsoglio, Andrea; Zavoli, Alessandro; Furfaro, Roberto. - In: JOURNAL OF SPACECRAFT AND ROCKETS. - ISSN 0022-4650. - 60:6(2023), pp. 1954-1965. [10.2514/1.a35747]

Autonomous guidance between quasiperiodic orbits in cislunar space via deep reinforcement learning

Federici, Lorenzo;Zavoli, Alessandro;
2023

Abstract

This paper investigates the use of reinforcement learning for the fuel-optimal guidance of a spacecraft during a time-free low-thrust transfer between two libration point orbits in the cislunar environment. To this aim, a deep neural network is trained via proximal policy optimization to map any spacecraft state to the optimal control action. A general-purpose reward is used to guide the network toward a fuel-optimal control law, regardless of the specific pair of libration orbits considered and without the use of any ad hoc reward shaping technique. Eventually, the learned control policies are compared with the optimal solutions provided by a direct method in two different mission scenarios, and Monte Carlo simulations are used to assess the policies' robustness to navigation uncertainties.
2023
reinforcement learning; space exploration and technology; pontryagin's maximum principle; artificial neural network; interplanetary trajectories; guidance; navigation; and control systems; cislunar orbit
01 Pubblicazione su rivista::01a Articolo in rivista
Autonomous guidance between quasiperiodic orbits in cislunar space via deep reinforcement learning / Federici, Lorenzo; Scorsoglio, Andrea; Zavoli, Alessandro; Furfaro, Roberto. - In: JOURNAL OF SPACECRAFT AND ROCKETS. - ISSN 0022-4650. - 60:6(2023), pp. 1954-1965. [10.2514/1.a35747]
File allegati a questo prodotto
File Dimensione Formato  
Federici_postprint_Autonomous_2023.pdf

accesso aperto

Note: Publisher version: https://doi.org/10.2514/1.A35747
Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 3.95 MB
Formato Adobe PDF
3.95 MB Adobe PDF
Federici_Autonomous_2023.pdf

solo gestori archivio

Note: https://arc.aiaa.org/doi/10.2514/1.A35747
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.46 MB
Formato Adobe PDF
2.46 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1713986
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact