Fault Tolerance Techniques For High Performance Computing

Download Fault Tolerance Techniques For High Performance Computing full books in PDF, epub, and Kindle. Read online free Fault Tolerance Techniques For High Performance Computing ebook anywhere anytime directly on your device. Fast Download speed and no annoying ads. We cannot guarantee that every ebooks is available!

Fault-Tolerance Techniques for High-Performance Computing

Fault-Tolerance Techniques for High-Performance Computing
Author :
Publisher : Springer
Total Pages : 325
Release :
ISBN-10 : 9783319209432
ISBN-13 : 3319209434
Rating : 4/5 (434 Downloads)

Book Synopsis Fault-Tolerance Techniques for High-Performance Computing by : Thomas Herault

Download or read book Fault-Tolerance Techniques for High-Performance Computing written by Thomas Herault and published by Springer. This book was released on 2015-07-01 with total page 325 pages. Available in PDF, EPUB and Kindle. Book excerpt: This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.


Fault-Tolerance Techniques for High-Performance Computing Related Books

Fault-Tolerance Techniques for High-Performance Computing
Language: en
Pages: 325
Authors: Thomas Herault
Categories: Computers
Type: BOOK - Published: 2015-07-01 - Publisher: Springer

DOWNLOAD EBOOK

This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introducti
Fault-Tolerant Systems
Language: en
Pages: 399
Authors: Israel Koren
Categories: Computers
Type: BOOK - Published: 2010-07-19 - Publisher: Elsevier

DOWNLOAD EBOOK

Fault-Tolerant Systems is the first book on fault tolerance design with a systems approach to both hardware and software. No other text on the market takes this
Software-Implemented Hardware Fault Tolerance
Language: en
Pages: 238
Authors: Olga Goloubeva
Categories: Technology & Engineering
Type: BOOK - Published: 2006-09-19 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

This book presents the theory behind software-implemented hardware fault tolerance, as well as the practical aspects needed to put it to work on real examples.
Design And Analysis Of Reliable And Fault-tolerant Computer Systems
Language: en
Pages: 463
Authors: Mostafa I Abd-el-barr
Categories: Computers
Type: BOOK - Published: 2006-12-15 - Publisher: World Scientific

DOWNLOAD EBOOK

Covering both the theoretical and practical aspects of fault-tolerant mobile systems, and fault tolerance and analysis, this book tackles the current issues of
High Performance Computing in Clouds
Language: en
Pages: 337
Authors: Edson Borin
Categories: Computers
Type: BOOK - Published: 2023-07-05 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book brings a thorough explanation on the path needed to use cloud computing technologies to run High-Performance Computing (HPC) applications. Besides pre