1,508 research outputs found

Exploiting replication in distributed systems

Author: Birman Kenneth P.
Joseph T. A.

Techniques are examined for replicating data and execution in directly distributed systems: systems in which multiple processes interact directly with one another while continuously respecting constraints on their joint behavior. Directly distributed systems are often required to solve difficult problems, ranging from management of replicated data to dynamic reconfiguration in response to failures. It is shown that these problems reduce to more primitive, order-based consistency problems, which can be solved using primitives such as the reliable broadcast protocols. Moreover, given a system that implements reliable broadcast primitives, a flexible set of high-level tools can be provided for building a wide variety of directly distributed application programs.

NASA Technical Reports Server

Rigorous Design of Fault-Tolerant Transactions for Replicated Database Systems using Event B

Author: Butler Michael
Yadav Divakar
Publication venue: Lecture Notes in Computer Science, Springer, 2006
Publication date: 01/01/2006

System availability is improved by the replication of data objects in a distributed database system. However, during updates, the complexity of keeping replicas identical arises due to failures of sites and race conditions among conflicting transactions. Fault tolerance and reliability are key issues to be addressed in the design and architecture of these systems. Event B is a formal technique which provides a framework for developing mathematical models of distributed systems by rigorous description of the problem, gradually introducing solutions in refinement steps, and verification of solutions by discharge of proof obligations. In this paper, we present a formal development of a distributed system using Event B that ensures atomic commitment of distributed transactions consisting of communicating transaction components at participating sites. This formal approach carries the development of the system from an initial abstract specification of transactional updates on a one-copy database to a detailed design containing replicated databases in refinement. Through refinement we verify that the design of the replicated database conforms to the one-copy database abstraction.

Southampton (e-Prints Soton)

Parallel Deferred Update Replication

Author: Pacheco Leandro
Pedone Fernando
Sciascia Daniele
Publication date: 03/12/2013

Deferred update replication (DUR) is an established approach to implementing highly efficient and available storage. While the throughput of read-only transactions scales linearly with the number of deployed replicas in DUR, the throughput of update transactions experiences limited improvements as replicas are added. This paper presents Parallel Deferred Update Replication (P-DUR), a variation of classical DUR that scales both read-only and update transactions with the number of cores available in a replica. In addition to introducing the new approach, we describe its full implementation and compare its performance to classical DUR and to Berkeley DB, a well-known standalone database.
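
The abstract does not spell out how classical DUR terminates update transactions. As a rough illustration only, the sketch below assumes the commonly described scheme in which every replica receives update transactions in the same order and certifies each one against its own committed state. The names `Transaction`, `Replica`, and `certify_and_apply` are hypothetical and are not taken from the paper; the ordered delivery that a real system would get from atomic broadcast is replaced here by a plain list.

```python
from dataclasses import dataclass, field


@dataclass
class Transaction:
    snapshot: int    # database version the transaction read from
    read_set: set    # keys the transaction read
    write_set: dict  # key -> value the transaction wants to install


@dataclass
class Replica:
    version: int = 0
    values: dict = field(default_factory=dict)
    last_written: dict = field(default_factory=dict)  # key -> version of last committed write

    def certify_and_apply(self, txn):
        """Commit txn only if nothing it read was overwritten after its snapshot."""
        for key in txn.read_set:
            if self.last_written.get(key, 0) > txn.snapshot:
                return False  # stale read: abort
        self.version += 1
        for key, value in txn.write_set.items():
            self.values[key] = value
            self.last_written[key] = self.version
        return True


# Assumption: all replicas see the same delivery order.
delivered = [
    Transaction(snapshot=0, read_set={"x"}, write_set={"x": 1}),
    Transaction(snapshot=0, read_set={"x"}, write_set={"x": 2}),  # conflicts with the first
    Transaction(snapshot=0, read_set={"y"}, write_set={"y": 3}),
]

replicas = [Replica(), Replica()]
outcomes = [[r.certify_and_apply(t) for t in delivered] for r in replicas]
```

Because certification depends only on the delivery order and the replica's committed state, every replica reaches the same commit or abort decision without further coordination. Certification is also sequential in this sketch, which is one plausible reading of why update throughput improves little as replicas are added.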

arXiv.org e-Print Archive

Crossref

Programming your way out of the past: ISIS and the META Project

Author: Birman Kenneth P.
Marzullo Keith

The ISIS distributed programming system and the META Project are described. The ISIS programming toolkit is an aid to low-level programming that makes it easy to build fault-tolerant distributed applications that exploit replication and concurrent execution. The META Project is reexamining high-level mechanisms such as the filesystem, shell language, and administration tools in distributed systems.

NASA Technical Reports Server

Optimistic Parallel State-Machine Replication

Author: Marandi Parisa Jalili
Pedone Fernando
Publication date: 27/04/2014

State-machine replication, a fundamental approach to fault tolerance, requires replicas to execute commands deterministically, which usually results in sequential execution of commands. Sequential execution limits performance and underuses servers, which are increasingly parallel (i.e., multicore). To narrow the gap between state-machine replication requirements and the characteristics of modern servers, researchers have recently come up with alternative execution models. This paper surveys existing approaches to parallel state-machine replication and proposes a novel optimistic protocol that inherits the scalable features of previous techniques. Using a replicated B+-tree service, we demonstrate in the paper that our protocol outperforms the most efficient techniques by a factor of 2.4.
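
The determinism requirement in the first sentence can be illustrated with a toy example. This is a minimal sketch of classical sequential state-machine replication, not the optimistic protocol the paper proposes; the key-value service, the `Replica` class, and the command format are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """A deterministic key-value state machine (hypothetical example service)."""
    state: dict = field(default_factory=dict)

    def apply(self, command):
        op, key, value = command
        if op == "put":
            self.state[key] = value
        elif op == "add":
            self.state[key] = self.state.get(key, 0) + value
        else:
            raise ValueError(f"unknown operation: {op}")


def run(log, n_replicas=3):
    """Have each replica execute the same command log sequentially, in log order."""
    replicas = [Replica() for _ in range(n_replicas)]
    for replica in replicas:
        for command in log:
            replica.apply(command)
    return replicas


log = [("put", "x", 1), ("add", "x", 4), ("put", "y", 7), ("add", "y", -2)]
states = [r.state for r in run(log)]

# A replica that executed the same commands in a different order diverges.
reordered = run(list(reversed(log)), n_replicas=1)[0].state
```

All replicas that apply the log in order end in the same state, while the reordered replica does not. This is why naive parallel execution of dependent commands is unsafe and why the execution models surveyed in the paper need some way to handle command dependencies.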

arXiv.org e-Print Archive

Crossref

Supporting multiple isolation levels in replicated environments

Author: Bernabé-Gisbert Josep M.
Muñoz-Escoí Francesc D.
Publication venue: Elsevier BV
Publication date: 01/10/2012

Replication is used by databases to implement reliability and provide scalability. However, achieving transparent replication is not an easy task. A replicated database is transparent if it can seamlessly replace a standard stand-alone database without requiring any changes to the components of the system. Database replication transparency can be achieved if: (a) replication protocols remain hidden from all other components of the system; and (b) the functionality of a stand-alone database is provided. The ability to simultaneously execute transactions under different isolation levels is a functionality offered by all stand-alone databases but not by their replicated counterparts. Allowing different isolation levels may improve overall system performance. For example, the TPC-C benchmark specification tolerates execution of some transactions at weaker isolation levels in order to increase throughput of committed transactions. In this paper, we show how replication protocols can be extended to enable transactions to be executed under different isolation levels. © 2012 Elsevier B.V. All rights reserved.

This work has been supported by the Spanish Ministerio de Ciencia e Innovation (MICINN) and the European Regional Development Fund (ERDF/FEDER) under research grants TIN2009-14460-C03-01 and TIN2010-17193. The translation of this paper was funded by the Universitat Politecnica de Valencia, Spain.

Bernabe Gisbert, JM.; Muñoz Escoí, FD. (2012). Supporting multiple isolation levels in replicated environments. Data and Knowledge Engineering. 79-80:1-16. doi:10.1016/j.datak.2012.05.001

Crossref

RiuNet

Evaluating certification protocols in the partial database state machine

Author: Correia Júnior Alfrânio Tavares
Moura Francisco Coelho Soares
Oliveira Rui Carlos Mendes de
Pereira José
Sousa António Luís Pinto Ferreira de
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2006

Partial replication is an alluring technique to ensure the reliability of very large and geographically distributed databases while, at the same time, offering good performance. By correctly exploiting access locality, most transactions become confined to a small subset of the database replicas, thus reducing the processing, storage access, and communication overhead associated with replication. The advantages of partial replication must, however, be weighed against the added complexity that is required to manage it. In fact, if the chosen replica configuration prevents the local execution of transactions, or if the overhead of consistency protocols offsets the savings of locality, potential gains cannot be realized. These issues are heavily dependent on the application used for evaluation and render simplistic benchmarks useless. In this paper, we present a detailed analysis of Partial Database State Machine (PDBSM) replication by comparing alternative partial replication protocols with full replication. This is done using a realistic scenario based on a detailed network simulator and access patterns from an industry-standard database benchmark. The results obtained allow us to identify the best configuration for typical on-line transaction processing applications.

European Union - GORDA Project (FP6-IST/004758)

Universidade do Minho: RepositoriUM

Programming with process groups: Group and multicast semantics

Author: Birman Kenneth P.
Cooper Robert
Gleeson Barry
Publication date: 29/01/1991

Process groups are a natural tool for distributed programming and are increasingly important in distributed computing environments. Discussed here is a new architecture that arose from an effort to simplify Isis process group semantics. The findings include a refined notion of how the clients of a group should be treated, what the properties of a multicast primitive should be when systems contain large numbers of overlapping groups, and a new construct called the causality domain. A system based on this architecture is now being implemented in collaboration with the Chorus and Mach projects.

NASA Technical Reports Server

eCommons@Cornell

How robust are distributed systems

Author: Birman Kenneth P.
Publication date: 01/06/1989

A distributed system is made up of large numbers of components operating asynchronously from one another and hence with incomplete and inaccurate views of one another's state. Load fluctuations are common as new tasks arrive and active tasks terminate. Jointly, these aspects make it nearly impossible to arrive at detailed predictions for a system's behavior. It is important to the successful use of distributed systems in situations in which humans cannot provide the sorts of predictable realtime responsiveness of a computer, that the system be robust. The technology of today can too easily be affected by worm programs or by seemingly trivial mechanisms that, for example, can trigger stock market disasters. Inventors of a technology have an obligation to overcome flaws that can exact a human cost. A set of principles for guiding solutions to distributed computing problems is presented.

NASA Technical Reports Server

eCommons@Cornell