Abstract
We report on the design and simulation of novel algorithms which ensure that application software runs correctly on a MIMD system in which processing units (PUs) can fail. The effect of these algorithms is evaluated for random task graphs using simulation. The simulation results are also compared to approximate analytical results. A specific application is finally studied: the Fast Fourier Transform. We give the corresponding task graph and then simulate its execution under various failure rates.
| Original language | English |
|---|---|
| Pages (from-to) | 1-16 |
| Number of pages | 16 |
| Journal | Simulation Practice and Theory |
| Volume | 3 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 17 Jul 1995 |
| Externally published | Yes |
Keywords
- Dependability
- Parallel computing
- Software-based failure detection