Reference : Performance Analysis and Benchmarking of a Temperature Downscaling Deep Learning Model
Scientific congresses, symposiums and conference proceedings : Paper published in a book
Engineering, computing & technology : Computer science
Security, Reliability and Trust
http://hdl.handle.net/10993/54334
Performance Analysis and Benchmarking of a Temperature Downscaling Deep Learning Model
English
Panner Selvam, Karthick mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN >]
Brorsson, Mats Hakan mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN >]
Mar-2023
31st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Naples, Italy 1-3 March 2023
Yes
International
31st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
01-03-2023 to 03-03-2023
Naples
Italy
[en] Performance Analysis ; Roofline Model ; Weather Forecast ; Deep Learning Benchmark
[en] We are presenting here a detailed analysis and performance characterization of a statistical temperature downscaling application used in the MAELSTROM EuroHPC project. This application uses a deep learning methodology to convert low-resolution atmospheric temperature states into high-resolution. We have performed in-depth profiling and roofline analysis at different levels (Operators, Training, Distributed Training, Inference) of the downscaling model on different hardware architectures (Nvidia V100 & A100 GPUs). Finally, we compare the training and inference cost of the downscaling model with various cloud providers. Our results identify the model bottlenecks which can be used to enhance the model architecture and determine hardware configuration for efficiently utilizing the HPC. Furthermore, we provide a comprehensive methodology for in-depth profiling and benchmarking of the deep learning models.
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN - Service and Data Management in Distributed Systems
Fonds National de la Recherche - FnR
MAELSTROM
Researchers ; Professionals ; Students ; General public
http://hdl.handle.net/10993/54334
FnR ; FNR15092355 > Mats Brorsson > MAELSTROM > Machine Learning For Scalable Meteorology And Climate > 01/04/2021 > 31/03/2023 > 2020

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Open access
PDP_2023_Performance_Analysis.pdfPublisher postprint1.04 MBView/Open

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.