Highly available cluster: A case study

Alain Azagury*, Danny Dolev, Gera Goft, John Marberg, Julian Satran

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

8 Scopus citations

Abstract

The methodology and design of a system that provides highly available data in a cluster is presented. A Highly Available Cluster consists of multiple machines interconnected by a common bus. Data is replicated at a primary and one or more backup machines. Data is accessed at the primary, using a location independent mechanism that ensures data integrity. If the primary copy of the data fails, access is recovered by switching to a backup copy. Switchover is transparent to the application, hence called seamless switchover. The fault model is fail-stop. The entire cluster is resilient to at least single failures. Designating data as highly available is selective in scope, and the overhead of replication and recovery is incurred only by applications that access highly available data. An experimental prototype was implemented using IBM AS/400 machines and a high-speed bus with fiber-optic links.
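The primary/backup scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' AS/400 implementation: the names `Replica`, `ReplicaFailed`, and `HighlyAvailableData` are hypothetical, replicas are modeled as in-memory dictionaries, and a fail-stop failure is simulated by a flag that makes every later access raise an exception.

```python
class ReplicaFailed(Exception):
    """Raised when a halted (fail-stop) replica is accessed."""


class Replica:
    """Hypothetical stand-in for one machine holding a copy of the data."""

    def __init__(self, name):
        self.name = name
        self.data = {}
        self.alive = True

    def fail(self):
        # Fail-stop assumption: the replica halts and never answers again.
        self.alive = False

    def write(self, key, value):
        if not self.alive:
            raise ReplicaFailed(self.name)
        self.data[key] = value

    def read(self, key):
        if not self.alive:
            raise ReplicaFailed(self.name)
        return self.data[key]


class HighlyAvailableData:
    """Location-independent handle: callers never name a specific replica."""

    def __init__(self, replicas):
        # By convention in this sketch, replicas[0] is the primary.
        self.replicas = list(replicas)

    def _switch_over(self):
        # Drop failed replicas; the first surviving backup becomes primary.
        self.replicas = [r for r in self.replicas if r.alive]
        if not self.replicas:
            raise RuntimeError("all replicas have failed")

    def write(self, key, value):
        # Write at the primary, retrying on a promoted backup if it failed.
        while True:
            try:
                self.replicas[0].write(key, value)
                break
            except ReplicaFailed:
                self._switch_over()
        # Propagate the update to every live backup.
        for backup in self.replicas[1:]:
            if backup.alive:
                backup.write(key, value)

    def read(self, key):
        # Read at the primary; switchover is hidden from the caller.
        while True:
            try:
                return self.replicas[0].read(key)
            except ReplicaFailed:
                self._switch_over()
```

In this sketch the caller only ever holds a `HighlyAvailableData` handle, so a primary failure surfaces as nothing more than a retried operation, which is the sense in which the switchover is seamless. With one primary and one backup the handle survives any single failure; it raises an error only once every replica has failed.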

Original language: English
Title of host publication: Digest of Papers - International Symposium on Fault-Tolerant Computing
Publisher: Publ by IEEE
Pages: 404-413
Number of pages: 10
ISBN (Print): 0818655224
State: Published - 1994
Externally published: Yes
Event: Proceedings of the 24th International Symposium on Fault-Tolerant Computing - Austin, TX, USA
Duration: 15 Jun 1994 to 17 Jun 1994

Publication series

Name: Digest of Papers - International Symposium on Fault-Tolerant Computing
ISSN (Print): 0731-3071

Conference

Conference: Proceedings of the 24th International Symposium on Fault-Tolerant Computing
City: Austin, TX, USA
Period: 15/06/94 to 17/06/94
