Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Management of an Academic HPC Cluster: The UL Experience
VARRETTE, Sébastien; BOUVRY, Pascal; CARTIAUX, Hyacinthe et al.
2014In Proc. of the 2014 Intl. Conf. on High Performance Computing Simulation (HPCS 2014)
Peer reviewed
 

Files


Full Text
hpcs2014.pdf
Author preprint (3.42 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] The intensive growth of processing power, data storage and transmission capabilities has revolutionized many aspects of science. These resources are essential to achieve high-quality results in many application areas. In this context, the University of Luxembourg (UL) operates since 2007 an High Performance Computing (HPC) facility and the related storage. The aspect of bridging computing and storage is a requirement of UL service – the reasons are both legal (certain data may not move) and performance related. Nowa- days, people from the three faculties and/or the two Interdisciplinary centers within the UL, are users of this facility. More specifically, key research priorities such as Systems Bio-medicine (by LCSB) and Security, Reliability & Trust (by SnT) require access to such HPC facilities in order to function in an adequate environment. The management of HPC solutions is a complex enterprise and a constant area for discussion and improvement. The UL HPC facility and the derived deployed services is a complex computing system to manage by its scale: at the moment of writing, it consists of 150 servers, 368 nodes (3880 computing cores) and 1996 TB of shared raw storage which are all configured, monitored and operated by three per- sons using advanced IT automation solutions based on Puppet [1], FAI [2] and Capistrano [3]. This paper covers all the aspects in relation to the management of such a complex infrastructure, whether technical or administrative. Most design choices or implemented approaches have been motivated by several years of experience in addressing research needs, mainly in the HPC area but also in complementary services (typically Web-based). In this context, we tried to answer in a flexible and convenient way many technological issues. This experience report may be of interest for other research centers belonging either to the public or the private sector looking for good if not best practices in cluster architecture and management.
Research center :
ULHPC - University of Luxembourg: High Performance Computing
Disciplines :
Computer science
Author, co-author :
VARRETTE, Sébastien ;  University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)
BOUVRY, Pascal ;  University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)
CARTIAUX, Hyacinthe ;  University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)
GEORGATOS, Fotis ;  University of Luxembourg > Luxembourg Centre for Systems Biomedicine (LCSB)
Language :
English
Title :
Management of an Academic HPC Cluster: The UL Experience
Publication date :
July 2014
Event name :
2014 Intl. Conf. on High Performance Computing & Simulation (HPCS 2014)
Event date :
May 2014
Audience :
International
Main work title :
Proc. of the 2014 Intl. Conf. on High Performance Computing Simulation (HPCS 2014)
Publisher :
IEEE, Bologna, Italy
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 11 May 2014

Statistics


Number of views
442 (92 by Unilu)
Number of downloads
1166 (64 by Unilu)

Scopus citations®
 
254
Scopus citations®
without self-citations
199

Bibliography


Similar publications



Contact ORBilu