Fault Tolerance in Multicore Clusters. Techniques to Balance Performance and Dependability
In High Performance Computing (HPC), the demand for more performance is satisfied by increasing the number of components. With the growing scale of HPC applications has come an increase in the number of interruptions as a consequence of hardware failures. The remarkable decrease of Mean Times Between...
Main Author: | Hugo Meyer |
---|---|
Format: | Article |
Language: | English |
Published: | Postgraduate Office, School of Computer Science, Universidad Nacional de La Plata, 2016-04-01 |
Series: | Journal of Computer Science and Technology |
Online Access: | https://journal.info.unlp.edu.ar/JCST/article/view/511 |
Similar Items
- Peak-Power-Aware Primary-Backup Technique for Efficient Fault-Tolerance in Multicore Embedded Systems
  by: Mohsen Ansari, et al.
  Published: (2020-01-01)
- SMCV: a Methodology for Detecting Transient Faults in Multicore Clusters
  by: Diego Montezanti, et al.
  Published: (2012-12-01)
- Performance analysis and optimization of parallel Best-First Search algorithms on multicore and cluster of multicore
  by: Victoria María Sanz
  Published: (2016-04-01)
- A tool for detecting transient faults in execution of parallel scientific applications on multicore clusters
  by: Diego Miguel Montezanti, et al.
  Published: (2014-04-01)
- On Designing a Fault-Tolerant and Load-Balancing Clustered Video System
  by: Shyu, Ing-Jye, et al.
  Published: (1998)