Article (Scientific journals)
Fuel-aware autonomous docking using RL-augmented MPC rewards for on-orbit refueling
RAMEZANI, Mahya; ALANDIHALLAJ, Mohammadamin; YALCIN, Baris Can et al.
2026In Acta Astronautica, 238, p. 690 - 705
Peer Reviewed verified by ORBi
 

Files


Full Text
1-s2.0-S009457652500623X-main.pdf
Author postprint (5.98 MB)
Request a copy

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Collision avoidance; Fuel sloshing; On-orbit refueling; Proximal policy optimization; Satellite docking; Soft actor-critic; Actor critic; Collisions avoidance; Model-predictive control; Policy optimization; Reinforcement learnings; Sloshing effects; Aerospace Engineering
Abstract :
[en] The operational lifespan of satellites is constrained by finite fuel reserves, limiting their maneuverability and mission duration. On-orbit refueling offers a transformative solution, extending satellite functionality, reducing costs, and enhancing sustainability. However, the precise execution of docking maneuvers remains a critical challenge, exacerbated by fuel sloshing effects in microgravity, which introduce unpredictable disturbances. This study proposes an integrated control framework combining Model Predictive Control (MPC) and Reinforcement Learning (RL) to ensure safe and efficient docking under these dynamic conditions. Initially, a Proximal Policy Optimization (PPO)-based RL control strategy is introduced, leveraging MPC for trajectory optimization. To further enhance adaptability in highly dynamic environments, Soft Actor-Critic (SAC) is incorporated, offering superior sample efficiency and robustness against stochastic disturbances. The proposed SAC-MPC framework effectively mitigates fuel sloshing effects by balancing computational efficiency with predictive accuracy. Experimental validation is conducted in the Zero-G Lab, emulating control scenarios with 3-DoF floating platforms, while high-fidelity numerical simulations extend the study to 6-DoF dynamics with realistic sloshing behavior modeled using OpenFOAM. Comparative results demonstrate that SAC-MPC outperforms conventional RL and MPC-based methods in docking success rate, precision, and control effort. This research establishes a robust foundation for autonomous satellite docking, contributing to the viability of on-orbit refueling missions and the future of sustainable space operations.
Disciplines :
Aerospace & aeronautics engineering
Author, co-author :
RAMEZANI, Mahya  ;  University of Luxembourg
ALANDIHALLAJ, Mohammadamin  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SPASYS
YALCIN, Baris Can  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > Space Robotics > Team Miguel Angel OLIVARES MENDEZ
OLIVARES MENDEZ, Miguel Angel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Space Robotics
HEIN, Andreas  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SPASYS
External co-authors :
no
Language :
English
Title :
Fuel-aware autonomous docking using RL-augmented MPC rewards for on-orbit refueling
Publication date :
2026
Journal title :
Acta Astronautica
ISSN :
0094-5765
eISSN :
1879-2030
Publisher :
Elsevier Ltd
Volume :
238
Pages :
690 - 705
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBilu :
since 09 December 2025

Statistics


Number of views
5 (1 by Unilu)
Number of downloads
0 (0 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
0
WoS citations
 
0

Bibliography


Similar publications



Contact ORBilu