High-availability computer systems
Computer1991Vol. 24(9), pp. 39–48
Citations Over TimeTop 10% of 1991 papers
Abstract
The techniques used to build highly available computer systems are sketched. Historical background is provided, and terminology is defined. Empirical experience with computer failure is briefly discussed. Device improvements that have greatly increased the reliability of digital electronics are identified. Fault-tolerant design concepts and approaches to fault-tolerant hardware are outlined. The role of repair and maintenance and of design-fault tolerance is discussed. Software repair is considered. The use of pairs of computer systems at separate locations to guard against unscheduled outages due to outside sources (communication or power failures, earthquakes, etc.) is addressed.>
Related Papers
- → Fault-Tolerance in the Scope of Software-Defined Networking (SDN)(2019)69 cited
- → A Classification-Based Approach to Fault-Tolerance Support in Parallel Programs(2009)4 cited
- Concepts of Fault Tolerant Computing(2011)
- Towards an Integrated Approach to Fault Tolerance in Delta-4(1991)
- Service fault tolerance for highly reliable service-oriented systems: an overview(2015)