Scientific journals : Article
Engineering, computing & technology : Computer science
http://hdl.handle.net/10993/22779
Resilience within Ultrascale Computing System: Challenges and Opportunities from Nesus Project
English
Bouvry, Pascal [University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)]
Mayer, R.
Muszynski, Jakub [University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)]
Petcu, D.
Rauber, A.
Tempesti, G.
Trinh
Varrette, Sébastien [University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)]
2015
Supercomputing Frontiers and Innovations
2
2
46--63
Yes
International
[en] Ultrascale computing is a new computing paradigm that arises naturally from the need for computing systems able to handle massive data in potentially very large-scale distributed systems, enabling new forms of applications that can serve a very large number of users with a timeliness never experienced before. However, ultrascale computing systems do not come without challenges, one of which is resilience. Although resilience is already an established field in system science, with many methodologies and approaches available to deal with it, the unprecedented scale of computing and of the massive data to be managed, new network technologies, and drastically new forms of massive-scale applications bring new challenges that need to be addressed. This paper reviews the challenges and approaches of resilience in ultrascale computing systems from multiple perspectives, addressing the resilience aspects of hardware-software co-design for ultrascale systems, resilience against (security) attacks, new approaches and methodologies for resilience in ultrascale systems, and applications and case studies.
University of Luxembourg: High Performance Computing - ULHPC
10.14529/jsfi1502

File(s) associated to this reference

Fulltext file(s):

File | Commentary | Version | Size | Access
46-303-3-PB.pdf | | Publisher postprint | 484.02 kB | Open access

