Paper published on a website (Scientific congresses, symposiums and conference proceedings)
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data
TOURANI, Ali; EJAZ, Saad; BAVLE, Hriday et al.
20242024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'24)
Peer reviewed
 

Files


Full Text
6_Tourani_paper_iros24.pdf
Publisher postprint (5.07 MB) Creative Commons License - Attribution
Link: https://github.com/TUM-AVS/IROS24_Workshop/blob/main/paper/6_Tourani_paper_iros24.pdf
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Robotics; Computer Vision; Visual SLAM; SLAM; Scene Graph
Abstract :
[en] RGB-D cameras supply rich and dense visual and spatial information for various robotics tasks such as scene understanding, map reconstruction, and localization. Integrating depth and visual information can aid robots in localization and element mapping, advancing applications like 3D scene graph generation and Visual Simultaneous Localization and Mapping (VSLAM). While point cloud data containing such information is primarily used for enhanced scene understanding, exploiting their potential to capture and represent rich semantic information has yet to be adequately targeted. This paper presents a real-time pipeline for localizing building components, including wall and ground surfaces, by integrating geometric calculations for pure 3D plane detection followed by validating their semantic category using point cloud data from RGB-D cameras. It has a parallel multi-thread architecture to precisely estimate poses and equations of all the planes detected in the environment, filters the ones forming the map structure using a panoptic segmentation validation, and keeps only the validated building components. Incorporating the proposed method into a VSLAM framework confirmed that constraining the map with the detected environment-driven semantic elements can improve scene understanding and map reconstruction accuracy. It can also ensure (re-)association of these detected components into a unified 3D scene graph, bridging the gap between geometric accuracy and semantic understanding. Additionally, the pipeline allows for the detection of potential higher-level structural entities, such as rooms, by identifying the relationships between building components based on their layout.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > ARG - Automation & Robotics
Disciplines :
Computer science
Author, co-author :
TOURANI, Ali  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
EJAZ, Saad  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
BAVLE, Hriday  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
SANCHEZ LOPEZ, Jose Luis  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
VOOS, Holger  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Automation
External co-authors :
yes
Language :
English
Title :
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data
Publication date :
15 October 2024
Event name :
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'24)
Event place :
Abu Dhabi, United Arab Emirates
Event date :
October 14– 18, 2024
Audience :
International
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Development Goals :
9. Industry, innovation and infrastructure
FnR Project :
22/IS/17387634/DEUS
Funding text :
This research was funded, in whole or part, by the Luxembourg National Research Fund (FNR), DEUS Project, ref. C22/IS/17387634/DEUS. In addition, it was partially funded by the Institute of Advanced Studies (IAS) of the University of Luxembourg through an “Audacity” grant (project TRANSCEND - 2021).
Available on ORBilu :
since 18 October 2024

Statistics


Number of views
163 (28 by Unilu)
Number of downloads
51 (5 by Unilu)

Bibliography


Similar publications



Contact ORBilu