In 3D object mapping, category-level priors enable efficient object reconstruction and canonical pose estimation, requiring only a single prior per semantic category (e.g., chair, book, laptop). DeepSDF has been used predominantly as a category-level shape prior, but it struggles to reconstruct sharp geometry and is computationally expensive. In contrast, NeRFs capture fine details but have yet to be effectively integrated with category-level priors in a real-time multi-object mapping framework. To bridge this gap, we introduce PRENOM, a Prior-based Efficient Neural Object Mapper that integrates category-level priors with object-level NeRFs to enhance reconstruction efficiency and enable canonical object pose estimation. PRENOM gets to know objects on a first-name basis by meta-learning on synthetic reconstruction tasks generated from open-source shape datasets. To account for object category variations, it employs a multi-objective genetic algorithm to optimize the NeRF architecture for each category, balancing reconstruction quality and training time. Additionally, prior-based probabilistic ray sampling directs sampling toward expected object regions, accelerating convergence and improving reconstruction quality under constrained resources. Experimental results highlight the ability of PRENOM to achieve high-quality reconstructions while maintaining computational feasibility. Specifically, comparisons with prior-free NeRF-based approaches on a synthetic dataset show a 21% lower Chamfer distance. Furthermore, evaluations against other approaches using shape priors on a noisy real-world dataset indicate a 13% improvement averaged across all reconstruction metrics, and comparable pose and size estimation accuracy, while being trained for 5× less time. Code is available at: https://github.com/snt-arg/PRENOM
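The abstract reports reconstruction quality as Chamfer distance. For readers unfamiliar with the metric, the following is a minimal sketch of one common symmetric variant (mean nearest-neighbour Euclidean distance in both directions, summed). The exact variant used in the paper is not stated in this record, so the definition below, the function names `chamfer_distance` and `relative_reduction`, and the toy point sets are all illustrative assumptions and not the authors' evaluation code.

```python
import numpy as np


def chamfer_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Symmetric Chamfer distance between point sets a (N, 3) and b (M, 3).

    Assumed variant: mean nearest-neighbour Euclidean distance from a to b
    plus the same quantity from b to a. Other variants (squared distances,
    averaging the two directions) differ only by scaling or squaring.
    """
    # Pairwise Euclidean distances via broadcasting, shape (N, M).
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    # Nearest neighbour in b for each point of a, and vice versa.
    return float(d.min(axis=1).mean() + d.min(axis=0).mean())


def relative_reduction(baseline: float, ours: float) -> float:
    """Fractional reduction of a lower-is-better metric relative to a baseline."""
    return (baseline - ours) / baseline


if __name__ == "__main__":
    # Toy point sets, purely illustrative.
    reconstruction = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
    ground_truth = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 1.0]])
    print(chamfer_distance(reconstruction, ground_truth))
```

Under this definition, a statement such as "21% lower Chamfer distance" corresponds to `relative_reduction(baseline, ours)` being about 0.21. The brute-force pairwise matrix is fine for small point sets; for dense reconstructions a spatial index such as a k-d tree would be the usual choice.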
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > ARG - Automation & Robotics
Disciplines :
Computer science
Author, co-author :
EJAZ, Saad ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
BAVLE, Hriday ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > Automation > Team Holger VOOS
RIBEIRO, Laura ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
VOOS, Holger ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
SANCHEZ LOPEZ, Jose Luis ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
External co-authors :
no
Language :
English
Title :
Category-level Meta-learned NeRF Priors for Efficient Object Mapping
Publication date :
2025
Event name :
2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Event place :
Hangzhou, China
Event date :
19/10/2025 - 25/10/2025
Audience :
International
Journal title :
IEEE International Conference on Intelligent Robots and Systems
ISSN :
2153-0858
Publisher :
Institute of Electrical and Electronics Engineers, New York, United States - New York
FNR17387634 - DEUS - Deep Understanding Of The Situation For Robots, 2022 (01/09/2023-31/08/2026) - Jose-Luis Sanchez-Lopez
Funding text :
This research was funded, in whole or in part, by the Luxembourg National Research Fund (FNR) under the DEUS Project (Ref. C22/IS/17387634/DEUS) and the MR-Cobot Project (Ref. 18883697/MR-Cobot).