Reference : OpenMP optimisation of the eXtended Discrete Element Method (XDEM)
Reports : External report
Engineering, computing & technology : Computer science
Computational Sciences
http://hdl.handle.net/10993/46716
OpenMP optimisation of the eXtended Discrete Element Method (XDEM)
English
Ojeda-May, Pedro [Umeå Universitet > MIT Huset > High Performance Computing Center North (HPC2N)]
Eriksson, Jerry [Umeå Universitet > MIT Huset > High Performance Computing Center North (HPC2N)]
Rousset, Alban mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE) >]
Besseron, Xavier mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE) >]
Mainassara Chekaraou, Abdoul Wahid mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE) >]
Peters, Bernhard mailto [University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Engineering (DoE) >]
2-Mar-2021
Partnership for Advanced Computing in Europe
[en] The eXtended Discrete Element Method (XDEM) is an extension of the regular Discrete Element Method (DEM) which is a software for simulating the dynamics of granular material. XDEM extends the regular DEM method by adding features where both micro and macroscopic observables can be computed simultaneously by coupling different time and length scales. In this sense XDEM belongs the category of multi-scale/multi-physics applications which can be used in realistic simulations. In this whitepaper, we detail the different optimisations done during the preparatory PRACE project to overcome known bottlenecks in the OpenMP implementation of XDEM. We analysed the Conversion, Dynamic, and the combined Dynamics-Conversion modules with Extrae/Paraver and Intel VTune profiling tools in order to find the most expensive functions. The proposed code modifications improved the performance of XDEM by ~17% for the computational expensive Dynamics-Conversion combined modules (with 48 cores, full node). Our analysis was performed in the Marenostrum 4 (MN4) PRACE infrastructure at Barcelona Supercomputing Center (BSC).
http://hdl.handle.net/10993/46716
https://prace-ri.eu/wp-content/uploads/WP298.pdf

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Open access
WP298.pdfAuthor preprint912.68 kBView/Open

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.