National Aeronautics and Space Administration (NASA), "Artemis Plan: NASA's Lunar Exploration Program Overview," 2020.
Rognant, M., Cumer, C., Biannic, J.-M., Roa, M. A., Verhaeghe, A., and Bissonnette, V, "Autonomous Assembly of Large Structures in Space: A Technology Review," European Conference for Aeronautics and Aerospace Sciences, 2019.
Moses, R. W., and Bushneil, D. M., "Frontier In-Situ Resource Utilization for Enabling Sustained Human Presence on Mars," NASA, 2016.
Zhang, T., Xu, K., Yao, Z., Ding, X., Zhao, Z., Hou, X., Pang, Y., Lai, X., Zhang, W., Liu, S., and Deng, J., "The Progress of Extraterrestrial Regolith-Sampling Robots," NatureAstronomy, Vol. 3,2019, pp. 487-497.
Sutton, R. S., and Barto, A. G., "Reinforcement Learning: An Introduction," A Bradford Book, Cambridge, MA, USA, 2018.
Doyle, R., Kubota, T., Picard, M., Sommer, B., Ueno, H., Visentin, G., and Volpe, R., "Recent Research and Development Activities on Space Robotics and AI," Advanced Robotics, Vol. 35, Nos. 21-22, 2021, pp. 1244-1264.
Furfaro, R., Wibben, D. R., Gaudet, B., and Simo, J., "Terminal Multiple Surface Sliding Guidance for Planetary Landing: Development, Tuning and Optimization via Reinforcement Learning," The Journal of the AstronauticalSciences, Vol. 62, No. 1,2015, pp. 73-99.
Jin, X., Lan, W., Wang, T., and Yu, P., "Value Iteration Networks with Double Estimator for Planetary Rover Path Planning," Sensors, Vol.21, No. 24,2021.
Hughes, S. P., Qureshi, R. H, Cooley, S. D., and Parker, J. J., "Verification and Validation of the General Mission Analysis Tool (GMAT)," AIAA/AAS Astrodynamics Specialist Conference, 2014.
Kenneally, P. W., Piggott, S., and Schaub, H., "Basilisk: A Flexible, Scalable and Modular Astrodynamics Simulation Framework Journal ofaerospace information systems, Vol. 17, No. 9, 2020, pp. 496-507.
Koenig, N, and Howard, A., "Design and Use Paradigms for Gazebo, an Open-Source Multi-Robot Simulator," IEEE/RSJInternational Conference on IntelligentRobots and Systems, Vol. 3, 2004, pp. 2149-2154.
Todorov, E., Erez, T., and Tassa, Y., "MuJoCo: A Physics Engine for Model-Based Control," IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012, pp. 5026-5033.
Richard, A., Kamohara, J., Uno, K., Santra, S., Meer, D. van der, Olivares-Mendez, M., and Yoshida, K., "OmniLRS: A Photorealistic Simulator for Lunar Robotics," IEEE International Conference on Robotics and Automation, 2024, pp. 16901-16907.
Mortensen, A. B., and Begh, S., "RLRoverLAB: An Advanced Reinforcement Learning Suite for Planetary Rover Simulation and Training "International Conference on Space Robotics, 2024, pp. 273-277.
El-Hariry, M., Richard, A., Muralidharan, V., Geist, M., and Olivares-Mendez, M., "DRIFT: Deep Reinforcement Learning for Intelligent Floating Platforms Trajectories," IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024, pp. 14034-14041.
Wang, S., Cao, Y., Zheng, X., and Zhang, T., "Collision-Free Trajectory Planning for a 6-Dof Free-Floating Space Robot via Hierarchical Decoupling Optimization," IEEE Robotics and Automation Letters, Vol. 7, No. 2,2022, pp. 4953-4960.
Orsula, A., Begh, S., Olivares-Mendez, M., and Martinez, C., "Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning," IEEE/RSJ International Conference on IntelligentRobots and Systems, 2022, pp. 4112-4119.
Towers, M., Kwiatkowski,A., Terry, J., Balis, J. U.,De Cola, G., Deleu, T., Gouläo, M., Kallinteris, A., Krimmel, M., KG, A., and others, "Gymnasium: A Standard Interface for Reinforcement Learning Environments," arXivpreprint arXiv:2407.17032, 2024.
Zhu, Y., Wong, J., Mandlekar, A., Martin-Martin, R., Joshi, A., Lin, K., Nasiriany, S., and Zhu, Y., "robosuite: A Modular Simulation Framework and Benchmark for Robot Learning," arXiv preprint arXiv:2009.12293, 2020.
James, S., Ma, Z., Arrojo, D. R., and Davison, A. J., "RLBench: The Robot Learning Benchmark & Learning Environment," IEEE Robotics and Automation Letters, Vol. 5, No. 2, 2020, pp. 3019-3026.
Kumar, V, Shah, R., Zhou, G., Moens, V., Caggiano, V., Gupta, A., and Rajeswaran, A., "RoboHive: A Unified Framework for Robot Learning "Advances in Neural Information Processing Systems, Vol. 36, 2023, pp. 44323-44340.
Tao, S., Xiang, F., Shukla, A., Qin, Y., Hinrichsen, X., Yuan, X., Bao, C., Lin, X., Liu, Y, Chan, T.-k., Gao, Y., Li, X., Mu, T., Xiao, N, Gurha, A., Huang, Z., Calandra, R, Chen, R., Luo, S., and Su, H., "ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI," arXiv preprint arXiv.2410.00425, 2024.
Heo, M., Lee, Y., Lee, D., and Lim, J. J., "FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation," Robotics: Science and Systems, 2023.
Sferrazza, C., Huang, D.-M., Lin, X., Lee, Y., and Abbeel, P., "HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation," arXiv Preprint arxiv:2403.10506, 2024.
Zakka, K, Wu, P., Smith, L., Gileadi, N, Howell, T., Peng, X. B., Singh, S., Tassa, Y., Florence, P., Zeng, A., and Abbeel, P., "RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning," Conference on RobotLearning, Vol. 229,2023, pp. 2975-2994.
Ishigami, G., Miwa, A., Nagatani, K, and Yoshida, K, "Terramechanics-Based Model for Steering Maneuver of Planetary Exploration Rovers on Loose Soil," Journal of Field Robotics, Vol. 24, No. 3, 2007, pp. 233-250.
Orsula, A., Richard, A., Geist, M., Olivares-Mendez, M., and Martinez, C., "Towards Benchmarking Robotic Manipulation in Space," Conference on Robot Learning Workshop on Mastering Robot Manipulation in a WorldofAbundantData, 2024.
Tobin, J., Fong, R., Ray, A., Schneider, J., Zaremba, W., and Abbeel, P., "Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World," IEEE/RSJ International Conference on IntelligentRobots and Systems, 2017, pp. 23-30.
Koutras, D. I., Kapoutsis, A. C., Amanatiadis, A. A., and Kosmatopoulos, E. B., "MarsExplorer: Exploration of Unknown Terrains via Deep Reinforcement Learning and Procedurally Generated Environments," Electronics, Vol. 10, No. 22,2021, p. 2751.
Cobbe, K, Hesse, C., Hilton, J., and Schulman, J., "Leveraging Procedural Generation to Benchmark Reinforcement Learning," International Conference on Machine Learning, 2020, pp. 2048-2056.
Orsula, A., Geist, M., Olivares-Mendez, M., and Martinez, C., "Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space," International Conference on Space Robotics, 2024, pp. 357-364.
Orsula, A., Geist, M., Olivares-Mendez, M., and Martinez, C., "Learning Tool-Aware Adaptive Compliant Control for Autonomous Regolith Excavation," Symposium on Advanced Space Technologies in Robotics and Automation, 2025.
Blender Online Community, "Blender - 3D Modelling and Rendering Package," 2025.. https://blender.org/
Narvekar, S., Peng, B., Leonetti, M., Sinapov, J., Taylor, M., and Stone, P., "Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey," 2020.
Mittal, M., Yu, C., Yu, Q., Liu, J., Rudin, N, Hoeller, D., Yuan, J. L., Singh, R., Guo, Y., Mazhar, H., Mandlekar, A., Babich, B., State, G., Hutter, M., and Garg, A., "Orbit: A Unified Simulation Framework for Interactive Robot Learning Environments," IEEERobotics and Automation Letters, Vol. 8, No. 6,2023, pp. 3740-3747.
Macenski, S., Foote, T., Gerkey, B., Lalancette, C., and Woodall, W., "Robot Operating System 2: Design, Architecture, And Uses in the Wild," Science Robotics, Vol. 7, No. 66,2022.
Probe, A., Oyake, A., Chambers, S. W., Deans, M., Brat, G., Cramer, N. B., Kempa, B., Roberts, B., and Hambuchen, K., "Space ROS: An open-source framework for space robotics and flight software," AIAA SCITECH 2023 Forum, 2023, p. 2709.
Raffin, A., Hill, A., Gleave, A., Kanervisto, A., Ernestus, M., and Dormann, N, "Stable-Baselines3: Reliable Reinforcement Learning Implementations," Journal of Machine Learning Research, Vol. 22, No. 268,2021, pp. 1-8.
Serrano-Muñoz, A., Chrysostomou, D., Begh, S., and Arana-Arexolaleiba, N, "Skrl: Modular and Flexible Library for Reinforcement Learning," J.Mach. Learn. Res., Vol. 24, No. 1,2023.
Ludivig, P., Calzada-Diaz, A., Olivares-Mendez, M., Voos, H., and Lamamy, J., "Building a Piece of the Moon: Construction of Two Indoor Lunar Analogue Environments," International Astronautical Congress, 2020.
Hafner, D., Pasukonis, J., Ba, J., and Lillicrap, T., "Mastering diverse control tasks through world models," Nature, Vol. 640,2025, pp. 647-653.