Results 1 - 10
of
161,283
Understanding Fault-Tolerant Distributed Systems
- COMMUNICATIONS OF THE ACM
, 1993
"... We propose a small number of basic concepts that can be used to explain the architecture of fault-tolerant distributed systems and we discuss a list of architectural issues that we find useful to consider when designing or examining such systems. For each issue we present known solutions and design ..."
Abstract
-
Cited by 377 (23 self)
- Add to MetaCart
We propose a small number of basic concepts that can be used to explain the architecture of fault-tolerant distributed systems and we discuss a list of architectural issues that we find useful to consider when designing or examining such systems. For each issue we present known solutions and design
Synthesis of Fault-Tolerant Distributed Systems ⋆
"... Abstract. A distributed system is fault-tolerant if it continues to perform correctly even when a subset of the processes becomes faulty. Faulttolerance is highly desirable but often difficult to implement. In this paper, we investigate fault-tolerant synthesis, i.e., the problem of determining whet ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Abstract. A distributed system is fault-tolerant if it continues to perform correctly even when a subset of the processes becomes faulty. Faulttolerance is highly desirable but often difficult to implement. In this paper, we investigate fault-tolerant synthesis, i.e., the problem of determining
Abstractions for Fault-Tolerant Distributed System Verification
- In Proceedings of Theorem-Proving in Higher-Order Logics (TPHOLs). Theorem Proving in Higher-Order Logics (TPHOLs
, 2004
"... Four kinds of abstraction for the design and analysis of fault-- tolerant distributed systems are discussed. These abstractions concern system messages, faults, fault--masking voting, and communication. The abstractions are formalized in higher--order logic, and are intended to facilitate specifying ..."
Abstract
-
Cited by 5 (1 self)
- Add to MetaCart
Four kinds of abstraction for the design and analysis of fault-- tolerant distributed systems are discussed. These abstractions concern system messages, faults, fault--masking voting, and communication. The abstractions are formalized in higher--order logic, and are intended to facilitate
Fault-tolerant Distributed Systems in Hardware
"... Very large-scale integrated (VLSI) hardware designs can be seen as distributed systems at several levels of abstraction: from the cores in a multicore architecture down to the Boolean gates in its circuit implementation, hardware designs com-prise of interacting computing nodes with non-negligible c ..."
Abstract
- Add to MetaCart
-negligible communication delays. The resulting similarities to classic large-scale distributed systems become even more accented in mission critical hardware designs that are required to operate correctly in the presence of component failures. We advocate to act on this observation and treat fault-tolerant hardware de
Space in Fault-Tolerant Distributed Systems
- IN PROCEEDINGS OF THE 6TH IEEE INTERNATIONAL WORKSHOP ON OBJECT-ORIENTED REAL-TIME DEPENDABLE SYSTEMS (WORDS’01
, 2001
"... Algorithms for solving agreement problems can be classified in two categories: (1) those relying on failure detectors that we call FD-based, and (2) those that rely on a Group Membership Service that we call GMS-based. The paper discusses the advantages and limitations of these two approaches, and p ..."
Abstract
- Add to MetaCart
Algorithms for solving agreement problems can be classified in two categories: (1) those relying on failure detectors that we call FD-based, and (2) those that rely on a Group Membership Service that we call GMS-based. The paper discusses the advantages and limitations of these two approaches, and proposes an extension to the GMS-approach that combines the advantages of both approaches, without their drawbacks. This extension leads us to distinguish between timetriggered suspicions of processes and space-triggered exclusions.
Reliability Optimization Models for Fault-Tolerant Distributed Systems
- Proc. Ann. Reliability & Maintainability Symp., J an 2001
"... This paper presents four models to demonstrate our techniques for optimizing software and hardware reliability for fault-tolerant distributed systems. The models help us find the optimal system structure while considering basic information on reliability and cost of the available software and hardwa ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
This paper presents four models to demonstrate our techniques for optimizing software and hardware reliability for fault-tolerant distributed systems. The models help us find the optimal system structure while considering basic information on reliability and cost of the available software
Notes in Computer Science, 2004. Abstractions for Fault-Tolerant Distributed System Verification
"... Abstract. Four kinds of abstraction for the design and analysis of fault– tolerant distributed systems are discussed. These abstractions concern system messages, faults, fault–masking voting, and communication. The abstractions are formalized in higher–order logic, and are intended to facilitate spe ..."
Abstract
- Add to MetaCart
Abstract. Four kinds of abstraction for the design and analysis of fault– tolerant distributed systems are discussed. These abstractions concern system messages, faults, fault–masking voting, and communication. The abstractions are formalized in higher–order logic, and are intended to facilitate
An Approach to Manage Reconfiguration in Fault-Tolerant Distributed Systems
- PROC. OF THE ICSE 2003 WORKSHOP ON SOFTWARE ARCHITECTURES FOR DEPENDABLE SYSTEMS
, 2003
"... This paper deals with dynamic resource management for real--time dependability--critical distributed systems. Requirements for such kind of systems span many domains such as time, survivability, and scalability and point out formidable challenges in terms of their fulfillment. An architecture is pro ..."
Abstract
-
Cited by 5 (1 self)
- Add to MetaCart
is proposed, based on the agent distributed infrastructure Lira, and enriched with statistical models for decision-making capabilities. The aim of the proposed architecture is to provide adaptive system reconfiguration, resorting to a hierarchy of resource managers to cope with fault tolerance and scalability
Time vs. Space in Fault-Tolerant Distributed Systems
- In Proceedings of the 6th IEEE International Workshop on Object-oriented Real-time Dependable Systems (WORDS’01
, 2001
"... Algorithms for solving agreement problems can be classified in two categories: (1) those relying on failure detectors that we call FD-based, and (2) those that rely on a Group Membership Service that we call GMS-based. The paper discusses the advantages and limitations of these two approaches, and p ..."
Abstract
-
Cited by 9 (4 self)
- Add to MetaCart
Algorithms for solving agreement problems can be classified in two categories: (1) those relying on failure detectors that we call FD-based, and (2) those that rely on a Group Membership Service that we call GMS-based. The paper discusses the advantages and limitations of these two approaches, and proposes an extension to the GMS-approach that combines the advantages of both approaches, without their drawbacks. This extension leads us to distinguish between timetriggered suspicions of processes and space-triggered exclusions.
Framework for Replica Selection in Fault-Tolerant Distributed Systems
, 2007
"... Abstract. This paper describes my term project, which I developed in the course CS 589 Software Engineering for Embedded Systems. The term project should be a design and an implementation of a novel application or development tool that exploits one or more existing approaches to software engineering ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
engineering in the context of embedded systems, demonstrates a novel idea in this domain, or overcomes a known significant challenge posed by embedded systems. In my project I examined how to select replica components in fault-tolerant systems to increase the overall reliability of the system considering
Results 1 - 10
of
161,283