Details

Reliability of Computer Systems and Networks


Reliability of Computer Systems and Networks

Fault Tolerance, Analysis, and Design
1. Aufl.

von: Martin L. Shooman

186,99 €

Verlag: Wiley
Format: PDF
Veröffentl.: 08.04.2003
ISBN/EAN: 9780471464068
Sprache: englisch
Anzahl Seiten: 552

DRM-geschütztes eBook, Sie benötigen z.B. Adobe Digital Editions und eine Adobe ID zum Lesen.

Beschreibungen

With computers becoming embedded as controllers in everything from network servers to the routing of subway schedules to NASA missions, there is a critical need to ensure that systems continue to function even when a component fails. In this book, bestselling author Martin Shooman draws on his expertise in reliability engineering and software engineering to provide a complete and authoritative look at fault tolerant computing. He clearly explains all fundamentals, including how to use redundant elements in system design to ensure the reliability of computer systems and networks.<br /> Market: Systems and Networking Engineers, Computer Programmers, IT Professionals.
<b>Preface.</b> <p><b>1 Introduction.</b></p> <p>1.1 What is Fault-Tolerant Computing?</p> <p>1.2 The Rise of Microelectronics and the Computer.</p> <p>1.3 Reliability and Availability.</p> <p>1.4 Organization of the Book.</p> <p><b>2 Coding Techniques.</b></p> <p>2.1 Introduction.</p> <p>2.2 Basic Principles.</p> <p>2.3 Parity-Bit Codes.</p> <p>2.4 Hamming Codes.</p> <p>2.5 Error-Detection and Retransmission Codes.</p> <p>2.6 Burst Error-Correction Codes.</p> <p>2.7 Reed–Solomon Codes.</p> <p>2.8 Other Codes.</p> <p><b>3 Redundancy, Spares, and Repairs.</b></p> <p>3.1 Introduction.</p> <p>3.2 Apportionment.</p> <p>3.3 System Versus Component Redundancy.</p> <p>3.4 Approximate Reliability Functions.</p> <p>3.5 Parallel Redundancy.</p> <p>3.6 An <i>r</i>-out-of-<i>n</i> Structure.</p> <p>3.7 Standby Systems.</p> <p>3.8 Repairable Systems.</p> <p>3.9 RAID Systems Reliability.</p> <p>3.10 Typical Commercial Fault-Tolerant Systems: Tandem and Stratus.</p> <p><b>4 <i>N</i>-Modular Redundancy.</b></p> <p>4.1 Introduction.</p> <p>4.2 The History of <i>N</i>-Modular Redundancy.</p> <p>4.3 Triple Modular Redundancy.</p> <p>4.4 <i>N</i>-Modular Redundancy.</p> <p>4.5 Imperfect Voters.</p> <p>4.6 Voter Logic.</p> <p>4.7 <i>N</i>-Modular Redundancy with Repair.</p> <p>4.8 <i>N</i>-Modular Redundancy with Repair and Imperfect Voters.</p> <p>4.9 Availability of <i>N</i>-Modular Redundancy with Repair and Imperfect Voters.</p> <p>4.10 Microcode-Level Redundancy.</p> <p>4.11 Advanced Voting Techniques.</p> <p><b>5 Software Reliability and Recovery Techniques.</b></p> <p>5.1 Introduction.</p> <p>5.2 The Magnitude of the Problem.</p> <p>5.3 Software Development Life Cycle.</p> <p>5.4 Reliability Theory.</p> <p>5.5 Software Error Models.</p> <p>5.6 Reliability Models.</p> <p>5.7 Estimating the Model Constants.</p> <p>5.8 Other Software Reliability Models.</p> <p>5.9 Software Redundancy.</p> <p>5.10 Rollback and Recovery.</p> <p><b>6 Networked Systems Reliability.</b></p> <p>6.1 Introduction.</p> <p>6.2 Graph Models.</p> <p>6.3 Definition of Network Reliability.</p> <p>6.4 Two-Terminal Reliability.</p> <p>6.5 Node Pair Resilience.</p> <p>6.6 All-Terminal Reliability.</p> <p>6.7 Design Approaches.</p> <p><b>7 Reliability Optimization.</b></p> <p>7.1 Introduction.</p> <p>7.2 Optimum Versus Good Solutions.</p> <p>7.3 A Mathematical Statement of the Optimization Problem.</p> <p>7.4 Parallel and Standby Redundancy.</p> <p>7.5 Hierarchical Decomposition.</p> <p>7.6 Apportionment.</p> <p>7.7 Optimization at the Subsystem Level via Enumeration.</p> <p>7.8 Bounded Enumeration Approach.</p> <p>7.9 Apportionment as an Approximate Optimization Technique.</p> <p>7.10 Standby System Optimization.</p> <p>7.11 Optimization Using a Greedy Algorithm.</p> <p>7.12 Dynamic Programming.</p> <p>7.13 Conclusion.</p> <p><b>Appendix A Summary of Probability Theory.</b></p> <p>A1 Introduction.</p> <p>A2 Probability Theory.</p> <p>A3 Set Theory.</p> <p>A4 Combinatorial Properties.</p> <p>A5 Discrete Random Variables.</p> <p>A6 Continuous Random Variables.</p> <p>A7 Moments.</p> <p>A8 Markov Variables.</p> <p><b>Appendix B Summary of Reliability Theory.</b></p> <p>B1 Introduction.</p> <p>B2 Combinatorial Reliability.</p> <p>B3 Failure-Rate Models.</p> <p>B4 System Reliability.</p> <p>B5 Illustrative Example of Simplified Auto Drum Brakes.</p> <p>B5.6 Summary.</p> <p>B6 Markov Reliability and Availability Models.</p> <p>B7 Repairable Systems.</p> <p>B8 Laplace Transform Solutions of Markov Models.</p> <p><b>Appendix C Review of Architecture Fundamentals.</b></p> <p>C1 Introduction to Computer Architecture.</p> <p>C2 Logic Gates, Symbols, and Integrated Circuits.</p> <p>C3 Boolean Algebra and Switching Functions.</p> <p>C4 Switching Function Simplification.</p> <p>C5 Combinatorial Circuits.</p> <p>C6 Common Circuits: Parity-Bit Generators and Decoders.</p> <p>C7 Flip-Flops.</p> <p>C8 Storage Registers.</p> <p><b>Appendix D Programs for Reliability Modeling and Analysis.</b></p> <p>D1 Introduction.</p> <p>D2 Various Types of Reliability and Availability Programs.</p> <p>D3 Testing Programs.</p> <p>D4 Partial List of Reliability and Availability Programs.</p> <p>D5 An Example of Computer Analysis.</p> <p><b>Name Index.</b></p> <p><b>Subject Index.</b></p>
"...develops the principles of reliability and availability analysis for computer networks, culminating in a n introduction network design principles." (<i>SciTech Book News</i>, Vol. 26, No. 2, June 2002) <p>"...a useful reference." (<i>IEEE Computer-Review</i>, August 2002)</p> <p>"The author has created a wonderful toolbox for systems engineers. So much is right here in one place, and organized effectively. I recommend this book to anyone working on networks or systems where reliability is a concern." (<i>IIE Transactions on Quality and Reliability Engineering</i>)</p> <p>"...very good practical hints...recommended for everyone who wants to learn either reliability fundamentals or know about the computer applications of reliability..." (Comsoc.org, April 2003)</p>
<p><strong>MARTIN L. SHOOMAN</strong>, PhD, served for many years as a Professor of Electrical Engineering and Computer Science at Polytechnic University in Brooklyn, New York. Dr. Shooman has been a Visiting Professor at MIT and Hunter College, and a consultant to Bell Laboratories, NASA, IBM, the US Army, and many other government and commercial organizations. A fellow of the IEEE, he has received five best paper awards from their Reliability and Computer Societies. Dr. Shooman has contributed to over 100 papers and reports to the research literature and has given special courses in Britain, Canada, France, Israel, and throughout the US. The author of <em>Probabilistic Reliability: An Engineering Approach</em> and <em>Software Engineering: Design, Reliability, and Management</em>, he is currently President of the consulting firm Martin L. Shooman & Associates.
<p><b>A comprehensive introduction to reliability and availability modeling, analysis, and design at the system, hardware, and software levels</b></p> <p><i>Reliability of Computer Systems and Networks</i> presents the fundamentals of reliability and availability analysis for various computer hardware, software, and networked systems. Reliability and availability as major objectives in system design are the focus. Various redundancy and fault-tolerant techniques, as well as error-correcting coding techniques are treated.</p> <p>The author proposes a high-level design approach based on apportioning the reliability and availability goals to subsystems and provides various techniques for achieving these subsystem goals. The next step is an efficient, exact optimization approach based on upper and lower bounds to minimize the number of feasible candidates. The most readily applied methods for analysis are utilized and design techniques are derived from basic principles. Analytical simplifications and approximations are developed to validate the results of computer models used for large-scale complex problems.</p> <p>Coverage includes:<br /> </p> <ul> <li>Coding and decoding schemes for error detection and correction including chip reliability</li> <li>Comparison of the reliability and availability of parallel, standby, and majority voting architectures</li> <li>Formulation, solution, and interpretation of Markov models for repairable systems</li> <li>Introduction and comparison of various RAID memory systems</li> <li>The architecture and fault-tolerant principles of TANDEM and STRATUS non-stop computer systems</li> <li>Practical and tutorial examples and numerous practice problems</li> <li>Appendices which cover the necessary background material on probability, reliability, and architecture</li> </ul> <p><i>Reliability of Computer Systems and Networks</i> offers in-depth and up-to-date coverage of reliability and availability for students with a focus on important applications areas, computer systems, and networks. Professionals in systems and reliability design, as well as computer architecture, will find it a highly useful reference.</p>

Diese Produkte könnten Sie auch interessieren:

Bandwidth Efficient Coding
Bandwidth Efficient Coding
von: John B. Anderson
EPUB ebook
114,99 €
Digital Communications with Emphasis on Data Modems
Digital Communications with Emphasis on Data Modems
von: Richard W. Middlestead
PDF ebook
171,99 €
Bandwidth Efficient Coding
Bandwidth Efficient Coding
von: John B. Anderson
PDF ebook
114,99 €