Fault-Tolerance Techniques for High-Performance Computing


Author : Thomas Herault
Publisher : Springer
Total Pages : 325
Release : 2015-07-01
ISBN-10 : 3319209434
ISBN-13 : 9783319209432

Book Synopsis: Fault-Tolerance Techniques for High-Performance Computing, by Thomas Herault

Fault-Tolerance Techniques for High-Performance Computing, written by Thomas Herault, was published by Springer on 2015-07-01 and runs to 325 pages. Book excerpt: This timely text presents a comprehensive overview of fault-tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, and silent error detection and correction, together with some application-specific techniques such as algorithm-based fault tolerance (ABFT). Emphasis is placed on analytical performance models. This is followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources of errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.


Fault-Tolerance Techniques for High-Performance Computing Related Books

Scalable Techniques for Fault Tolerant High Performance Computing
Language: en
Pages: 174
Type: BOOK - Published: 2006

As the number of processors in today's parallel systems continues to grow, the mean-time-to-failure of these systems is becoming significantly shorter than the ...

New Software-based Fault Tolerance Methods for High Performance Computing
Language: en
Authors: Robert D. Hunt
Type: BOOK - Published: 2015

Transparent Fault Tolerance for Job Healing in HPC Environments
Language: en
Type: BOOK - Published: 2004

As the number of nodes in high-performance computing environments keeps increasing, faults are becoming commonplace, causing losses in intermediate results of H...

A Proactive Fault Tolerance Framework for High Performance Computing (HPC) Systems in the Cloud
Language: en
Authors: Ifeanyi Paulinus Egwutuoha
Categories: Cloud computing
Type: BOOK - Published: 2014