Performance Analysis and Benchmarking of a Temperature Downscaling Deep Learning Model
English
Panner Selvam, Karthick[University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN >]
Brorsson, Mats Hakan[University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN >]
Mar-2023
31st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Naples, Italy 1-3 March 2023
Yes
International
31st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
01-03-2023 to 03-03-2023
Naples
Italy
[en] Performance Analysis ; Roofline Model ; Weather Forecast ; Deep Learning Benchmark
[en] We are presenting here a detailed analysis and performance characterization of a statistical temperature downscaling application used in the MAELSTROM EuroHPC project. This application uses a deep learning methodology to convert low-resolution atmospheric temperature states into high-resolution. We have performed in-depth profiling and roofline analysis at different levels (Operators, Training, Distributed Training, Inference) of the downscaling model on different hardware architectures (Nvidia V100 & A100 GPUs). Finally, we compare the training and inference cost of the downscaling model with various cloud providers. Our results identify the model bottlenecks which can be used to enhance the model architecture and determine hardware configuration for efficiently utilizing the HPC. Furthermore, we provide a comprehensive methodology for in-depth profiling and benchmarking of the deep learning models.
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN - Service and Data Management in Distributed Systems
Fonds National de la Recherche - FnR
MAELSTROM
Researchers ; Professionals ; Students ; General public