Reinforcement Learning for Attitude Control of a Spacecraft with Flexible Appendages

MAHFOUZ, Ahmed; Valiullin, Ayrat; Lukashevichus, Alexey; Pritykin, Dmitry

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

Reinforcement Learning for Attitude Control of a Spacecraft with Flexible Appendages

MAHFOUZ, Ahmed; Valiullin, Ayrat; Lukashevichus, Alexey et al.

2022 • In IAC 2022 congress proceedings, 73rd International Astronautical Congress (IAC)

Editorial reviewed

Permalink
https://hdl.handle.net/10993/53599

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

IAC_2022_RL_for_Flex_AttitudeControl-4-1.pdf

Publisher postprint (1.65 MB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

satellite; flexible appendages; attitude control; reinforcement learning; proximal policy optimization

Abstract :

[en] This study explores the reinforcement learning (RL) approach to constructing attitude control strategies for a LEOsatellite with flexible appendages. Attitude control system actuated by a set of three reaction wheels is considered.The satellite is assumed to move in a circular low Earth orbit under the action of gravity-gradient torque, randomdisturbance torque, and oscillations excited in flexible appendages. The control policy for rest-to-rest slew maneuversis learned via the Proximal Policy Optimization (PPO) technique. The robustness of the obtained control policy isanalyzed and compared to that of conventional controllers. The first part of the study is focused on problem formulationin terms of Markov Decision Processes, analysis of different reward-shaping techniques, and finally training the RL-agent and comparing the obtained results with the state-of-the-art RL-controllers as well as with the performance ofa commonly used quaternion feedback regulator (Lyapunov-based PD controller). We then proceed to consider thesame spacecraft with flexible appendages added to its structure. Equations of excitable oscillations are appended tothe system and coupling terms are added describing the interactions between the main rigid body and the flexiblestructures. The dynamics of the rigid spacecraft thus becomes coupled with that of its flexible appendages and thecontrol strategy should change accordingly in order to prevent actions that entail excitation of oscillation modes.Again PPO is used to learn the control policy for rest-to-rest slew maneuvers in the extended system. All in all,the proposed reinforcement learning strategy is shown to converge to a policy that matches the performance of thequaternion feedback regulator for a rigid spacecraft. It is also shown that a policy can be trained to take into accountthe highly nonlinear dynamics caused by the presence of flexible elements that need to be brought to rest in the requiredattitude. We also discuss the advantages of the reinforcement learning approach such as robustness and ability of onlinelearning pertaining to the systems that require a high level of autonomy

Research center :

Interdisciplinary Centre for Security, Reliability and Trust (SnT) > ARG - Automation & Robotics

Disciplines :

Aerospace & aeronautics engineering

Author, co-author :

MAHFOUZ, Ahmed ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation

Valiullin, Ayrat

Lukashevichus, Alexey

Pritykin, Dmitry

External co-authors :

yes

Language :

English

Title :

Reinforcement Learning for Attitude Control of a Spacecraft with Flexible Appendages

Publication date :

September 2022

Event name :

73rd International Astronautical Congress

Event organizer :

International Astronautical Federation

Event place :

Paris, France

Event date :

from 18-09-2022 to 22-09-2022

Main work title :

IAC 2022 congress proceedings, 73rd International Astronautical Congress (IAC)

Publisher :

International Astronautical Federation, Paris, France

Edition :

73rd

Peer reviewed :

Editorial reviewed

FnR Project :

FNR14302465 - Development Tool For Autonomous Constellation And Formation Control Of Microsatellites, 2019 (01/09/2020-31/08/2023) - Holger Voos

Available on ORBilu :

since 11 January 2023

Statistics

Number of views

192 (4 by Unilu)

Number of downloads

360 (5 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

Bibliography

Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. The MIT Press, second edition, 2018.
D. Cellucci, Nick B. Cramer, and Jeremy D. Frank. Distributed Spacecraft Autonomy.
Seongin Na, Tomáš Krajník, Barry Lennox, and Far-shad Arvin. Federated reinforcement learning for collective navigation of robotic swarms, 2022.
F. Vedant, J.T. Allison, M. West, and A. Ghosh. Reinforcement learning for spacecraft attitude control. In Proceedings of the International Astronautical Congress, IAC, volume 2019-October, 2019.
Vanessa Tan, John Leur Labrador, and Marc Caesar Talampas. Mata-rl: Continuous reaction wheel attitude control using the mata simulation software and reinforcement learning. In Proccedings of 35th Annual Small Satellite Conference, 2021.
Jacob G. Elkins, Rohan Sood, and Clemens Rumpf. Bridging reinforcement learning and online learning for spacecraft attitude control. Journal of Aerospace Information Systems, 19(1):62-69, 2022.
Daniel Alazard, Christelle Cumer, and Khalid Tantawi. Linear dynamic modeling of spacecraft with various flexible appendages and on-board angular momentums. In 7th International ESA Conference on Guidance, Navigation & Control Systems (GNC 2008), pages 1-14, Tralee, IE, 2008.
John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. Proximal policy optimization algorithms. ArXiv, abs/1707.06347, 2017.
Bong Wie and Peter M. Barba. Quaternion feedback for spacecraft large angle maneuvers. Journal of Guidance, Control, and Dynamics, 8(3):360-365, 1985.
I A Courie, Francesco Sanfedino, and Daniel Alazard. Worst-case pointing performance analysis for large flexible spacecraft. ArXiv, abs/2106.01893, 2021.
Bong Wie, Haim Weiss, and Ari Arapostathis. Quarternion feedback regulator for spacecraft eigenaxis rotations. Journal of Guidance, Control, and Dynamics, 12(3):375-380, 1989.