
A metaobject architecture for fault-tolerant distributed systems : the FRIENDS approach

Author: Fabre Jean-Charles
Pérennou Tanguy
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1998

The FRIENDS system developed at LAAS-CNRS is a metalevel architecture providing libraries of metaobjects for fault tolerance, secure communication, and group-based distributed applications. The use of metaobjects provides a clean separation of concerns between mechanisms and applications. Metaobjects can be used transparently by applications and can be composed according to the needs of a given application, a given architecture, and its underlying properties. In FRIENDS, metaobjects are used recursively to add new properties to applications. They are designed using an object-oriented design method and implemented on top of basic system services. This paper describes the FRIENDS software-based architecture, the object-oriented development of metaobjects, and the experiments that we have done, and summarizes the advantages and drawbacks of a metaobject approach for building fault-tolerant systems.

CiteSeerX

Crossref

Open Archive Toulouse Archive Ouverte

FRIENDS - A flexible architecture for implementing fault tolerant and secure distributed applications

Author: Fabre Jean-Charles
Pérennou Tanguy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1996

FRIENDS is a software-based architecture for implementing fault-tolerant and, to some extent, secure applications. This architecture is composed of sub-systems and libraries of metaobjects. Transparency and separation of concerns are provided not only to the application programmer but also to the programmers implementing metaobjects for fault tolerance, secure communication and distribution. Common services required for implementing metaobjects are provided by the sub-systems. Metaobjects are implemented using object-oriented techniques and can be reused and customised according to the application needs, the operational environment and its related fault assumptions. Flexibility is increased by a recursive use of metaobjects. Examples and experiments are also described.

CiteSeerX

Open Archive Toulouse Archive Ouverte

Implementing fault tolerant applications using reflective object-oriented programming

Author: Fabre Jean-Charles
Nicomette Vincent
Pérennou Tanguy
Stroud Robert
Wu Zhixue
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/1995

This paper shows how reflection and object-oriented programming can be used to ease the implementation of classical fault tolerance mechanisms in distributed applications. When the underlying runtime system does not provide fault tolerance transparently, classical approaches to implementing fault tolerance mechanisms often imply mixing functional programming with non-functional programming (e.g. error processing mechanisms). The use of reflection improves the transparency of fault tolerance mechanisms to the programmer and more generally provides a clearer separation between functional and non-functional programming. The implementations of some classical replication techniques using a reflective approach are presented in detail and illustrated by several examples, which have been prototyped on a network of Unix workstations. Lessons learnt from our experiments are drawn and future work is discussed.

Open Archive Toulouse Archive Ouverte

Programming with process groups: Group and multicast semantics

Author: Birman Kenneth P.
Cooper Robert
Gleeson Barry
Publication date: 29/01/1991

Process groups are a natural tool for distributed programming and are increasingly important in distributed computing environments. Discussed here is a new architecture that arose from an effort to simplify Isis process group semantics. The findings include a refined notion of how the clients of a group should be treated, what the properties of a multicast primitive should be when systems contain large numbers of overlapping groups, and a new construct called the causality domain. A system based on this architecture is now being implemented in collaboration with the Chorus and Mach projects.

NASA Technical Reports Server

eCommons@Cornell

The ISIS Project: Real Experience with a Fault Tolerant Programming System

Author: Birman Kenneth
Cooper Robert
Publication date: 01/07/1990

The ISIS project has developed a distributed programming toolkit and a collection of higher level applications based on these tools. ISIS is now in use at more than 300 locations worldwide. The lessons (and surprises) gained from this experience with the real world are discussed.

NASA Technical Reports Server

eCommons@Cornell

Hyperswitch communication network

Author: Peterson J.
Pniel M.
Upchurch E.

The Hyperswitch Communication Network (HCN) is a large-scale parallel computer prototype being developed at JPL. Commercial versions of the HCN computer are planned. The HCN computer being designed is a message-passing multiple instruction, multiple data (MIMD) computer, and offers many advantages in price-performance ratio, reliability and availability, and manufacturing over traditional uniprocessors and bus-based multiprocessors. The HCN operating system is designed as a uniquely flexible environment that combines both parallel processing and distributed processing. This programming paradigm can achieve a balance among the following competing factors: performance in processing and communications, user friendliness, and fault tolerance. The prototype is being designed to accommodate a maximum of 64 state-of-the-art microprocessors. The HCN is classified as a distributed supercomputer. The HCN system is described, and the performance/cost analysis and other competing factors within the system design are reviewed.

NASA Technical Reports Server

Designing application software in wide area network settings

Author: Birman Ken
Makpangou Mesaac
Publication date: 01/01/1990

Progress in methodologies for developing robust local area network software has not been matched by similar results for wide area settings. The design of application software spanning multiple local area environments is examined. For important classes of applications, simple design techniques are presented that yield fault-tolerant wide area programs. An implementation of these techniques as a set of tools for use within the ISIS system is described.

INRIA a CCSD electronic archive server

NASA Technical Reports Server

eCommons@Cornell

ISIS and META projects

Author: Birman Kenneth
Cooper Robert
Marzullo Keith

The ISIS project has developed a new methodology, virtual synchrony, for writing robust distributed software. High performance multicast, large scale applications, and wide area networks are the focus of interest. Several interesting applications that exploit the strengths of ISIS, including an NFS-compatible replicated file system, are being developed. The META project concerns distributed control in a soft real-time environment incorporating feedback. This domain encompasses examples as diverse as monitoring inventory and consumption on a factory floor, and performing load-balancing on a distributed computing system. One of the first uses of META is for distributed application management: the tasks of configuring a distributed program, dynamically adapting to failures, and monitoring its performance. Recent progress and current plans are reported.

NASA Technical Reports Server

Tools for distributed application management

Author: Birman Kenneth P.
Cooper Robert
Marzullo Keith
Wood Mark

Distributed application management consists of monitoring and controlling an application as it executes in a distributed environment. It encompasses such activities as configuration, initialization, performance monitoring, resource scheduling, and failure response. The Meta system is described: a collection of tools for constructing distributed application management software. Meta provides the mechanism, while the programmer specifies the policy for application management. The policy is manifested as a control program, which is a soft real-time reactive program. The underlying application is instrumented with a variety of built-in and user-defined sensors and actuators. These define the interface between the control program and the application. The control program also has access to a database describing the structure of the application and the characteristics of its environment. Some of the more difficult problems for application management occur when pre-existing, nondistributed programs are integrated into a distributed application for which they may not have been intended. Meta allows management functions to be retrofitted to such programs with a minimum of effort.

NASA Technical Reports Server

Integrating security in a group oriented distributed system

Author: Birman Kenneth
Gong Li
Reiter Michael
Publication date: 01/02/1992

A distributed security architecture is proposed for incorporation into group-oriented distributed systems, and in particular, into the Isis distributed programming toolkit. The primary goal of the architecture is to make common group-oriented abstractions robust in hostile settings, in order to facilitate the construction of high performance distributed applications that can tolerate both component failures and malicious attacks. These abstractions include process groups and causal group multicast. Moreover, a delegation and access control scheme is proposed for use in group-oriented systems. The focus is the security architecture; particular cryptosystems and key exchange protocols are not emphasized.

NASA Technical Reports Server

eCommons@Cornell