Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Performance and Portability in Multi-GPU Branch-and-Bound: Chapel Versus CUDA and HIP for Tree-Based Optimization
TAGLIAFERRO DE OLIVEIRA TEZÔTO, Ivan; HELBECQUE, Guillaume; KRISHNASAMY, Ezhilmathi et al.
2025. In Performance and Portability in Multi-GPU Branch-and-Bound: Chapel versus CUDA and HIP for Tree-Based Optimization
Peer reviewed Dataset
 

Files


Full Text
preprint_IPDPS2025.pdf
Author preprint (233.04 kB)
This is a preprint version of the article. IEEE does not allow the full final article to be shared without payment.




Details



Keywords :
branch-and-bound; chapel; cuda; flow-shop; hip; Tree search algorithm; Hardware and Architecture; Distributed processing; Graphics processing units; Programming; Performance gain; Heterogeneous networks; Optimization
Abstract :
[en] Tree search algorithms, such as the Branch-and-Bound (B&B) method, are essential tools in exact combinatorial optimization. Parallel B&B is difficult to scale because of its irregular, fine-grained workload and the hardware heterogeneity of modern supercomputers. This work focuses on leveraging GPU heterogeneity in HPC environments for exact tree search optimization algorithms, through a comparison between a proposed CUDA-based low-level implementation and a high-level PGAS-based one. We revisit the design and implementation of a pool-based GPU-accelerated parallel B&B algorithm tailored for heterogeneous CPU/GPU systems and originally developed in Chapel. A low-level counterpart implementation, suited for Nvidia and AMD GPU architectures, is proposed for performance gain, alleviating some of the portability issues usually associated with low-level implementations. This comparison provides insights into the trade-offs between different programming paradigms in large-scale, heterogeneous computing environments.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > PCOG - Parallel Computing & Optimization Group
Disciplines :
Computer science
Author, co-author :
TAGLIAFERRO DE OLIVEIRA TEZÔTO, Ivan ;  University of Luxembourg ; ULille - Université de Lille > Informatique
HELBECQUE, Guillaume ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust > PCOG > Team Grégoire DANOY ; Université de Lille, CNRS/CRIStAL UMR 9189, Centre Inria de l'Université de Lille, Lille, France
KRISHNASAMY, Ezhilmathi ;  University of Luxembourg > Faculty of Science, Technology and Medicine > HPC Platform > High Level Support Team
Melab, Nouredine;  Université de Lille, CNRS/CRIStAL UMR 9189, Centre Inria de l'Université de Lille, Lille, France
Danoy, Grégoire;  University of Luxembourg, DCS-FSTM/SnT, Luxembourg
External co-authors :
yes
Language :
English
Title :
Performance and Portability in Multi-GPU Branch-and-Bound: Chapel Versus CUDA and HIP for Tree-Based Optimization
Publication date :
13 August 2025
Event name :
2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
Event place :
Milan, Italy
Event date :
3 June 2025 to 7 June 2025
Audience :
International
Main work title :
Performance and Portability in Multi-GPU Branch-and-Bound: Chapel versus CUDA and HIP for Tree-Based Optimization
Publisher :
Institute of Electrical and Electronics Engineers Inc.
ISBN/EAN :
9798331526436
Collection name :
2025 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
Collection ISSN :
2639-3867
Pages :
1293-1295
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Development Goals :
9. Industry, innovation and infrastructure
FnR Project :
FNR17133848 - UltraBO - Ultra-scale Computing For Solving Big Optimization Problems, 2022 (01/03/2023-31/08/2026) - Gregoire Danoy
FNR17395419 - SERENITY - Space Data Brokering Optimization System, 2022 (01/01/2023-31/12/2025) - Pascal Bouvry
Name of the research project :
U-AGR-7207 - INTER/ANR/22/17133848/UltraBO - DANOY Grégoire
U-AGR-7213 - C22/IS/17395419/Serenity - DANOY Grégoire
Funding text :
This work was partially supported by the French government through the program "France 2030" (SFRI project GRAEL/ ANR-21-SFRI-005, PI: Université de Lille) managed by the National Research Agency; by the Agence Nationale de la Recherche (ref. ANR-22-CE46-0011) and the Luxembourg National Research Fund (FNR) (ref. INTER/ANR/22/17133848), under the UltraBO project; and by the FNR POLLUX program under the SERENITY project (ref. C22/IS/17395419).
Data Set :
Distributed GPU accelerated tree search

Repository containing the version of the code used for this publication.

Available on ORBilu :
since 14 October 2025

