Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Resampling methods for a reliable validation set in deep learning based point cloud classification
Nurunnabi, Abdul Awal Md; Teferle, Felix Norman
2022In Resampling methods for a reliable validation set in deep learning based point cloud classification
Peer reviewed
 

Files


Full Text
RESAMPLING_METHODS_FOR_A_RELIABLE_VALIDATION_SET_I.pdf
Publisher postprint (1.32 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Bootstrap; Supervised Method; Cross-Validation; Machine Learning; Monte Carlo; Semantic Segmentation; PointNet
Abstract :
[en] A validation data set plays a pivotal role in tweaking a machine learning model trained in a supervised manner. Many existing algorithms select a part of available data by using random sampling to produce a validation set. However, this approach can be prone to overfitting. One should follow careful data splitting to have reliable training and validation sets that can produce a generalized model with a good performance for the unseen (test) data. Data splitting based on resampling techniques involves repeatedly drawing samples from the available data. Hence, resampling methods can give better generalization power to a model, because they can produce and use many training and/or validation sets. These techniques are computationally expensive, but with increasingly available high-performance computing facilities, one can exploit them. Though a multitude of resampling methods exist, investigation of their influence on the generality of deep learning (DL) algorithms is limited due to its non-linear black-box nature. This paper contributes by: (1) investigating the generalization capability of the four most popular resampling methods: k-fold cross-validation (k-CV), repeated k-CV (Rk-CV), Monte Carlo CV (MC-CV) and bootstrap for creating training and validation data sets used for developing, training and validating DL based point cloud classifiers (e.g., PointNet; Qi et al., 2017a), (2) justifying Mean Square Error (MSE) as a statistically consistent estimator, and (3) exploring the use of MSE as a reliable performance metric for supervised DL. Experiments in this paper are performed on both synthetic and real-world aerial laser scanning (ALS) point clouds.
Disciplines :
Engineering, computing & technology: Multidisciplinary, general & others
Author, co-author :
Nurunnabi, Abdul Awal Md ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE)
Teferle, Felix Norman ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE)
External co-authors :
no
Language :
English
Title :
Resampling methods for a reliable validation set in deep learning based point cloud classification
Publication date :
June 2022
Event name :
ISPRS Congress, 2022
Event organizer :
ISPRS
Event place :
Nice, France
Event date :
from 06-06-2022 to 11-06-2022
Audience :
International
Main work title :
Resampling methods for a reliable validation set in deep learning based point cloud classification
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 29 September 2022

Statistics


Number of views
84 (5 by Unilu)
Number of downloads
55 (2 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
0

Bibliography


Similar publications



Contact ORBilu