3D reconstruction; Dynamic environments; Gaussians; Low fidelities; Prior-knowledge; Space robotics; Splatting; Structure and motions; Target object; Unknown objects; Artificial Intelligence; Aerospace Engineering; Automotive Engineering; Control and Optimization; Computer Science - Robotics
Abstract :
[en] Generalizable perception is one of the pillars of high-level autonomy in space robotics. Estimating the structure and motion of unknown objects in dynamic environments is fundamental for such autonomous systems. Traditionally, the solutions have relied on prior knowledge of target objects, multiple disparate representations, or low-fidelity outputs unsuitable for robotic operations. This work proposes a novel approach to incrementally reconstruct and track a dynamic unknown object using a unified representation-a set of 3D Gaussian blobs that describe its geometry and appearance. The differentiable 3DGS framework is adapted to a dynamic object-centric setting. The input to the pipeline is a sequential set of RGB-D images. 3D reconstruction and 6-DoF pose tracking tasks are tackled using first-order gradient-based optimization. The formulation is simple, requires no pre-training, assumes no prior knowledge of the object or its motion, and is suitable for online applications. The proposed approach is validated on a dataset of 10 unknown spacecraft of diverse geometry and texture under arbitrary relative motion. The experiments demonstrate successful 3D reconstruction and accurate 6-DoF tracking of the target object in proximity operations over a short to medium duration. The causes of tracking drift are discussed and potential solutions are outlined.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SpaceR – Space Robotics
BARAD, Kuldeep Rambhai ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Space Robotics ; Redwire Space Europe, Luxembourg
RICHARD, Antoine ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Space Robotics
DENTLER, Jan Eric ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > Automation ; Redwire Space Europe, Luxembourg
OLIVARES MENDEZ, Miguel Angel ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Space Robotics
MARTINEZ LUNA, Carol ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Space Robotics
External co-authors :
no
Language :
English
Title :
Object-centric Reconstruction and Tracking of Dynamic Unknown Objects Using 3D Gaussian Splatting
Publication date :
2024
Event name :
2024 International Conference on Space Robotics (iSpaRo)
Event place :
Luxembourg, Luxembourg
Event date :
24-06-2024 - 27-06-2024
Audience :
International
Main work title :
2024 International Conference on Space Robotics, iSpaRo 2024
Publisher :
Institute of Electrical and Electronics Engineers Inc.
B. Mildenhall et al., "Nerf: Representing scenes as neural radiance fields for view synthesis," Communications of the ACM, vol. 65, no. 1, pp. 99-106, 2021.
Y. Labbe et al., "Megapose: 6d pose estimation of novel objects via render & compare," arXiv preprint arXiv:2212.06870, 2022.
L. Pauly et al., "A survey on deep learning-based monocular spacecraft pose estimation: Current state, limitations and prospects," Acta Astronautica, 2023.
L. P. Cassinis et al., "Evaluation of tightly-and loosely-coupled approaches in cnn-based pose estimation systems for uncooperative spacecraft," Acta Astronautica, vol. 182, pp. 189-202, 2021.
T. H. Park and S. D'Amico, "Rapid abstraction of spacecraft 3d structure from single 2d image," in AIAA SCITECH 2024 Forum, 2024, p. 2768.
B. Kerbl et al., "3d gaussian splatting for real-time radiance field rendering," ACM Transactions on Graphics, vol. 42, no. 4, 2023.
R. Opromolla et al., "A review of cooperative and uncooperative spacecraft pose determination techniques for close-proximity operations," Progress in Aerospace Sciences, vol. 93, pp. 53-72, 2017.
B. J. Naasz et al., "The hst sm4 relative navigation sensor system: overview and preliminary testing results from the flight robotics lab," The Journal of the Astronautical Sciences, vol. 57, no. 1-2, pp. 457- 483, 2009.
S. D'Amico, M. Benn, and J. L. Jørgensen, "Pose estimation of an uncooperative spacecraft from actual space imagery," International Journal of Space Science and Engineering 5, vol. 2, no. 2, pp. 171- 189, 2014.
J. Canny, "A computational approach to edge detection," IEEE Transactions on pattern analysis and machine intelligence, no. 6, pp. 679- 698, 1986.
C. Harris, M. Stephens et al., "A combined corner and edge detector," in Alvey vision conference, vol. 15, no. 50. Citeseer, 1988, pp. 10- 5244.
D. G. Lowe, "Object recognition from local scale-invariant features," in Proceedings of the seventh IEEE international conference on computer vision, vol. 2. Ieee, 1999, pp. 1150-1157.
S. Sharma and S. D'Amico, "Neural network-based pose estimation for noncooperative spacecraft rendezvous," IEEE Transactions on Aerospace and Electronic Systems, vol. 56, no. 6, pp. 4638-4658, 2020.
T. H. Park et al., "Speed+: Next-generation dataset for spacecraft pose estimation across domain gap," in 2022 IEEE Aerospace Conference (AERO). IEEE, 2022, pp. 1-15.
K. Black et al., "Real-time, flight-ready, non-cooperative spacecraft pose estimation using monocular imagery," arXiv preprint arXiv:2101.09553, 2021.
J. I. B. Perez-Villar, A. Garcia-Martin, and J. Bescos, "Spacecraft pose estimation based on unsupervised domain adaptation and on a 3d-guided loss combination," in European Conference on Computer Vision. Springer, 2022, pp. 37-52.
T. H. Park and S. D'Amico, "Online supervised training of spaceborne vision during proximity operations using adaptive kalman filtering," arXiv preprint arXiv:2309.11645, 2023.
T. H. Park and S. D'Amico, "Adaptive neural-network-based unscented kalman filter for robust pose tracking of noncooperative spacecraft," Journal of Guidance, Control, and Dynamics, vol. 46, no. 9, pp. 1671- 1688, 2023.
J. Engel, V. Koltun, and D. Cremers, "Direct sparse odometry," IEEE transactions on pattern analysis and machine intelligence, vol. 40, no. 3, pp. 611-625, 2017.
C. Forster et al., "Svo: Semidirect visual odometry for monocular and multicamera systems," IEEE Transactions on Robotics, vol. 33, no. 2, pp. 249-265, 2016.
Z. Teed and J. Deng, "Droid-slam: Deep visual slam for monocular, stereo, and rgb-d cameras," Advances in neural information processing systems, vol. 34, pp. 16 558-16 569, 2021.
S. Izadi et al., "Kinectfusion: real-time 3d reconstruction and interaction using a moving depth camera," in Proceedings of the 24th annual ACM symposium on User interface software and technology, 2011, pp. 559-568.
L. Ma et al., "Simultaneous localization, mapping, and manipulation for unsupervised object discovery," in 2015 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2015, pp. 1344- 1351.
P. J. Besl and N. D. McKay, "Method for registration of 3-d shapes," in Sensor fusion IV: control paradigms and data structures, vol. 1611. Spie, 1992, pp. 586-606.
E. Sucar et al., "imap: Implicit mapping and positioning in realtime," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 6229-6238.
M. M. Johari, C. Carta, and F. Fleuret, "Eslam: Efficient dense slam system based on hybrid representation of signed distance fields," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 17 408-17 419.
B. Wen et al., "Bundlesdf: Neural 6-dof tracking and 3d reconstruction of unknown objects," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 606-617.
H. Huang et al., "Photo-slam: Real-time simultaneous localization and photorealistic mapping for monocular, stereo, and rgb-d cameras," arXiv preprint arXiv:2311.16728, 2023.
H. Matsuki et al., "Gaussian splatting slam," arXiv preprint arXiv:2312.06741, 2023.
N. Keetha et al., "Splatam: Splat, track & map 3d gaussians for dense rgb-d slam," arXiv preprint arXiv:2312.02126, 2023.
M. Zwicker et al., "Ewa splatting," IEEE Transactions on Visualization and Computer Graphics, vol. 8, no. 3, pp. 223-238, 2002.
D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," arXiv preprint arXiv:1412.6980, 2014.
Z. Wang et al., "Image quality assessment: from error visibility to structural similarity," IEEE transactions on image processing, vol. 13, no. 4, pp. 600-612, 2004.