Fault-Tolerant SystemsFault-Tolerant Systems is the first book on fault tolerance design with a systems approach to both hardware and software. No other text on the market takes this approach, nor offers the comprehensive and up-to-date treatment that Koren and Krishna provide. This book incorporates case studies that highlight six different computer systems with fault-tolerance techniques implemented in their design. A complete ancillary package is available to lecturers, including online solutions manual for instructors and PowerPoint slides. Students, designers, and architects of high performance processors will value this comprehensive overview of the field. - The first book on fault tolerance design with a systems approach - Comprehensive coverage of both hardware and software fault tolerance, as well as information and time redundancy - Incorporated case studies highlight six different computer systems with fault-tolerance techniques implemented in their design - Available to lecturers is a complete ancillary package including online solutions manual for instructors and PowerPoint slides |
Contents
1 | |
11 | |
3 Information Redundancy | 55 |
4 FaultTolerant Networks | 109 |
5 Software Fault Tolerance | 147 |
6 Checkpointing | 193 |
Other editions - View all
Common terms and phrases
acceptance test algorithm application approach assume backup block bugs byte cache calculated check bits checkpoint checksum chip circuit cluster codeword component connected consists correct cyclic code data bits decryption Defect Tolerant denoted density function disk Distributed Systems duplex encoding encryption Equation erroneous error-correcting codes estimate event example execution exponentially distributed fail failure rate fault injection fault tolerance fault-free fault-tolerant systems floorplan Hamming code hardware hypercube IEEE IEEE Transactions implemented input interval key ciphers Koren Markov chain matrix modules multiple N-version programming node occur operation output overhead parameter parity bit path Poisson process polynomial primary probability random numbers random variable received redundancy reliability repair result roll back rows and columns scheme Section shown in Figure simulation spare rows Suppose switchbox techniques tion Transactions on Computers unit URNG versions VLSI watchdog processor write quorum yield
Popular passages
Page xix - From 1976 to 1985 he was a member of the faculty of the Department of Electrical and Computer Engineering at the University of Massachusetts, Amherst.
References to this book
New Methods of Concurrent Checking Michael Gössel,Vitaly Ocheretny,Egor Sogomonyan,Daniel Marienfeld Limited preview - 2008 |
Zuverlässigkeit Mechatronischer Systeme Bernd Bertsche,Peter Göhner,Uwe Jensen,Wolfgang Schinköthe,Hans-Joachim Wunderlich No preview available - 2009 |