Fault-Tolerant DesignThis textbook serves as an introduction to fault-tolerance, intended for upper-division undergraduate students, graduate-level students and practicing engineers in need of an overview of the field. Readers will develop skills in modeling and evaluating fault-tolerant architectures in terms of reliability, availability and safety. They will gain a thorough understanding of fault tolerant computers, including both the theory of how to design and evaluate them and the practical knowledge of achieving fault-tolerance in electronic, communication and software systems. Coverage includes fault-tolerance techniques through hardware, software, information and time redundancy. The content is designed to be highly accessible, including numerous examples and exercises. Solutions and powerpoint slides are available for instructors. |
Contents
1 | |
5 | |
3 Dependability Evaluation Techniques | 21 |
4 Hardware Redundancy | 55 |
5 Information Redundancy | 87 |
Other editions - View all
Common terms and phrases
acceptance test applications arithmetic codes Assume Berger code binary bit slice check bits checkpoint code distance codeword coefficients columns components computation configuration correct cyclic code decoding defined dependability diagram Draw a Markov duplication with comparison encoding equations error has occurred example failed failure rate fault detection fault tolerance fault-free Fault-Tolerant Design faulty module FD unit first Hamming code Hamming distance implemented independent events information rate input k-bit LFSR linear code logic circuit majority voter Markov chain MTTF MTTR multiple number of 1’s output parity check matrix parity code permanent faults polynomial polynomial g(x probability processor Q x Q Q x3 recomputing with shifted recovery block reliability evaluation repair rate repairs are allowed result self-purging redundancy shifted operands shown in Fig software fault spare specification standby redundancy steady-state availability stuck-at fault sufficient Suppose swapped operands switch Table techniques TMR system transient faults transition vector versions