Safe ARPOD for under-actuated CubeSat via reinforcement learning

POLITesi - Digital archive of degree and doctoral theses

This Master's thesis investigates the use of the reinforcement learning algorithm Proximal Policy Optimization (PPO) for achieving a planar Autonomous Rendezvous, Proximity Operation, and Docking (ARPOD) manoeuvre with an under-actuated CubeSat. Together with the safety considerations, the different control objectives throughout the three phases reflect the complexity necessary for safe and efficient operations.

PARIS, MATTHIEU VINH

2020/2021

Supervisor	DI LIZIA, PIERLUIGI
Co-supervisor(s)	MAESTRINI, MICHELE
School / Dept.	ING - School of Industrial and Information Engineering
Date	7 Oct 2021
Academic year	2020/2021
Document type	Master's thesis (Tesi di laurea Magistrale)

Attached files

File	Size	Format
2021_10_Paris.pdf (openly accessible online)	2.17 MB	Adobe PDF

Documents in POLITesi are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10589/179340