Search CORE

65 research outputs found

The Architecture of Complexity Revisited: Design Primitives for Ultra-Large-Scale Systems

Author: Chen Hong-Mei
Kazman Rick
Publication venue
Publication date: 03/01/2023
Field of study

As software-intensive systems continue to grow in scale and complexity the techniques that we have used to design and analyze them in the past no longer suffice. In this paper we look at examples of existing ultra-large-scale systems—systems of enormous size and complexity. We examine instances of such systems that have arisen spontaneously in nature and those that have been human-constructed. We distill from these example systems the design primitives that underlie them. We capture these design primitives as a set of tactics— fundamental architectural building-blocks—and argue that to efficiently build and analyze such systems in the future we should strongly consider employing such building-blocks

ScholarSpace at University of Hawai'i at Manoa

Self-Healing Protocols for Connectivity Maintenance in Unstructured Overlays

Author: Ferretti Stefano
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/06/2015
Field of study

In this paper, we discuss on the use of self-organizing protocols to improve the reliability of dynamic Peer-to-Peer (P2P) overlay networks. Two similar approaches are studied, which are based on local knowledge of the nodes' 2nd neighborhood. The first scheme is a simple protocol requiring interactions among nodes and their direct neighbors. The second scheme adds a check on the Edge Clustering Coefficient (ECC), a local measure that allows determining edges connecting different clusters in the network. The performed simulation assessment evaluates these protocols over uniform networks, clustered networks and scale-free networks. Different failure modes are considered. Results demonstrate the effectiveness of the proposal.Comment: The paper has been accepted to the journal Peer-to-Peer Networking and Applications. The final publication is available at Springer via http://dx.doi.org/10.1007/s12083-015-0384-

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Urbino

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Resource discovery for distributed computing systems: A comprehensive survey

Author: Abdullah
Aberer
Abraham
Aguiar
Aguilera
Ahmed
Akay
Alam
Albrecht
Albrecht
Anderson
Antonopoulos
Aspnes
Atif
Awerbuch
Awerbuch
Baldoni
Ballani
Bandara
Banerjee
Bangyong
Baranwal
Barjini
Basu
Battre
Berman
Bharambe
Bharambe
Bimson
Birman
Bisnik
Bisnik
Bo
Brocco
Brocco
Brogi
Brown
Brunner
Buccafurri
Burstein
Butt
Buyya
Byrom
Byrom
Cai
Caminero
Campo
Candan
Cao
Carra
Carzaniga
Castro
Chang
Chang-Yen
Chatziantoniou
Chaudhuri
Chawathe
Chen
Chen
Chen
Chen
Cheng
Chien
Chung
Cidon
Costa
Crainiceanu
Crainiceanu
Crespo
Czajkowski
Datta
Datta
Davtyan
Deng
Deng
Dhurandher
Di
Di
Di
Diaz
Dimakopoulos
Dimakopoulos
Dissanayaka
Di Martino
Dorigo
Dorigo
Duarte
D’Angelo
Elijorde
Erdil
Erdil
Falchi
Fensel
Ferretti
Forestiero
Forestiero
Foster
Foster
Foster
Foster
Foster
Frey
Fugkeaw
Gaeta
Ganesan
Ganesan
Ganesh
Ganguly
Gao
Gentzsch
Georgiou
Germain
Ghafarian
Ghamri-Doudane
Ghamri-Doudane
Gill
Glover
Goel
González-Beltrán
Guo
Hameurlain
Hameurlain
Harchol-Balter
Harvey
Haykin
Henderson
Hidalgo
Horrocks
Horrocks
Hussin
Iamnitchi
Ionescu
Javad Zarrin
Jelasity
Jesi
Jin
Joung
Joung
Joung
João Paulo Barraca
Kalogeraki
Kannan
Ke
Keller
Kermarrec
Keung
Khanli
Khoobkar
Kim
Klusch
Kniesburges
Ko
Korf
Korf
Kostoulas
Krauter
Krynicki
Kumar
Kutten
Kutten
Kutten
Lazaro
Lee
Lee
Li
Li
Li
Li
Li
Li
Liben-Nowell
Lima
Lu
Ludwig
Lv
Makki
Manvi
March
Martino
Massie
Mastroianni
Mateescu
McGuinness
Medrano-Chávez
Melliar-Smith
Meng
Meshkova
Michlmayr
Milojicic
Montebello
Murugan
Nagarajan
Naseer
Navimipour
Newcomer
Nurmi
Oikonomou
Pan
Pande
Passarella
Pastore
Pathan
Pipan
Pittaras
Prajapati
Raack
Raicu
Raman
Ratnasamy
Reed
Reynolds
Rhea
Rhea
Rhee
Risson
Rochwerger
Rochwerger
Rowstron
Rui L. Aguiar
Russell
Sander
Sathish
Schopf
Schubert
Schubert
Seo
Shaikh
Shaikh
Shang
Shen
Shenvi
Siddiqui
Sotiriadis
Sotomayor
Staples
Steiner
Stevens
Stevens
Stoica
Stützle
Sun
Sun
Sun
Taheri
Talia
Talia
Talia
Talia
Tang
Tang
Tannenbaum
Tao
Tate
Tereshko
Tigelaar
Torkestani
Trunfio
Valdez
Vanthournout
Vanthournout
Van Renesse
Ververidis
Wang
Watkins
Welch
Wolinsky
Wright
Xiao
Xu
Xu
Xu
Yang
Yao
Yin
Ying
Yoo
Yousefipour
Yu
Yusta
Zaharia
Zarrin
Zarrin
Zarrin
Zarrin
Zhang
Zhang
Zhang
Zhang
Zhao
Zhao
Zhou
Zhou
Zhu
Publication venue: 'Elsevier BV'
Publication date: 01/03/2018
Field of study

Large-scale distributed computing environments provide a vast amount of heterogeneous computing resources from different sources for resource sharing and distributed computing. Discovering appropriate resources in such environments is a challenge which involves several different subjects. In this paper, we provide an investigation on the current state of resource discovery protocols, mechanisms, and platforms for large-scale distributed environments, focusing on the design aspects. We classify all related aspects, general steps, and requirements to construct a novel resource discovery solution in three categories consisting of structures, methods, and issues. Accordingly, we review the literature, analyzing various aspects for each category

Crossref

Repositório Institucional da Universidade de Aveiro

Anglia Ruskin Research

A survey of distributed data aggregation algorithms

Author: Almeida Paulo Sérgio
Baquero Carlos
Jesus Paulo Alexandre Marques
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.info:eu-repo/semantics/publishedVersio

Universidade do Minho: RepositoriUM

Crossref

Self-organising agent communities for autonomic resource management

Author: Bullock Seth
Geard Nicholas
Jacyno Mariusz
Luck Michael
Payne Terry R.
Publication venue: 'SAGE Publications'
Publication date: 01/02/2013
Field of study

The autonomic computing paradigm addresses the operational challenges presented by increasingly complex software systems by proposing that they be composed of many autonomous components, each responsible for the run-time reconfiguration of its own dedicated hardware and software components. Consequently, regulation of the whole software system becomes an emergent property of local adaptation and learning carried out by these autonomous system elements. Designing appropriate local adaptation policies for the components of such systems remains a major challenge. This is particularly true where the system’s scale and dynamism compromise the efficiency of a central executive and/or prevent components from pooling information to achieve a shared, accurate evidence base for their negotiations and decisions.In this paper, we investigate how a self-regulatory system response may arise spontaneously from local interactions between autonomic system elements tasked with adaptively consuming/providing computational resources or services when the demand for such resources is continually changing. We demonstrate that system performance is not maximised when all system components are able to freely share information with one another. Rather, maximum efficiency is achieved when individual components have only limited knowledge of their peers. Under these conditions, the system self-organises into appropriate community structures. By maintaining information flow at the level of communities, the system is able to remain stable enough to efficiently satisfy service demand in resource-limited environments, and thus minimise any unnecessary reconfiguration whilst remaining sufficiently adaptive to be able to reconfigure when service demand changes

Southampton (e-Prints Soton)

King's Research Portal

Explore Bristol Research

SoS: self-organizing substrates

Author: Datta Anwitaman
Publication venue: Lausanne, EPFL
Publication date: 21/06/2006
Field of study

Large-scale networked systems often, both by design or chance exhibit self-organizing properties. Understanding self-organization using tools from cybernetics, particularly modeling them as Markov processes is a first step towards a formal framework which can be used in (decentralized) systems research and design.Interesting aspects to look for include the time evolution of a system and to investigate if and when a system converges to some absorbing states or stabilizes into a dynamic (and stable) equilibrium and how it performs under such an equilibrium state. Such a formal framework brings in objectivity in systems research, helping discern facts from artefacts as well as providing tools for quantitative evaluation of such systems. This thesis introduces such formalism in analyzing and evaluating peer-to-peer (P2P) systems in order to better understand the dynamics of such systems which in turn helps in better designs. In particular this thesis develops and studies the fundamental building blocks for a P2P storage system. In the process the design and evaluation methodology we pursue illustrate the typical methodological approaches in studying and designing self-organizing systems, and how the analysis methodology influences the design of the algorithms themselves to meet system design goals (preferably with quantifiable guarantees). These goals include efficiency, availability and durability, load-balance, high fault-tolerance and self-maintenance even in adversarial conditions like arbitrarily skewed and dynamic load and high membership dynamics (churn), apart of-course the specific functionalities that the system is supposed to provide. The functionalities we study here are some of the fundamental building blocks for various P2P applications and systems including P2P storage systems, and hence we call them substrates or base infrastructure. These elemental functionalities include: (i) Reliable and efficient discovery of resources distributed over the network in a decentralized manner; (ii) Communication among participants in an address independent manner, i.e., even when peers change their physical addresses; (iii) Availability and persistence of stored objects in the network, irrespective of availability or departure of individual participants from the system at any time; and (iv) Freshness of the objects/resources' (up-to-date replicas). Internet-scale distributed index structures (often termed as structured overlays) are used for discovery and access of resources in a decentralized setting. We propose a rapid construction from scratch and maintenance of the P-Grid overlay network in a self-organized manner so as to provide efficient search of both individual keys as well as a whole range of keys, doing so providing good load-balancing characteristics for diverse kind of arbitrarily skewed loads - storage and replication, query forwarding and query answering loads. For fast overlay construction we employ recursive partitioning of the key-space so that the resulting partitions are balanced with respect to storage load and replication. The proper algorithmic parameters for such partitioning is derived from a transient analysis of the partitioning process which has Markov property. Preservation of ordering information in P-Grid such that queries other than exact queries, like range queries can be efficiently and rather trivially handled makes P-Grid suitable for data-oriented applications. Fast overlay construction is analogous to building an index on a new set of keys making P-Grid suitable as the underlying indexing mechanism for peer-to-peer information retrieval applications among other potential applications which may require frequent indexing of new attributes apart regular updates to an existing index. In order to deal with membership dynamics, in particular changing physical address of peers across sessions, the overlay itself is used as a (self-referential) directory service for maintaining the participating peers' physical addresses across sessions. Exploiting this self-referential directory, a family of overlay maintenance scheme has been designed with lower communication overhead than other overlay maintenance strategies. The notion of dynamic equilibrium study for overlays under continuous churn and repairs, modeled as a Markov process, was introduced in order to evaluate and compare the overlay maintenance schemes. While the self-referential directory was originally invented to realize overlay maintenance schemes with lower overheads than existing overlay maintenance schemes, the self-referential directory is generic in nature and can be used for various other purposes, e.g., as a decentralized public key infrastructure. Persistence of peer identity across sessions, in spite of changes in physical address, provides a logical independence of the overlay network from the underlying physical network. This has many other potential usages, for example, efficient maintenance mechanisms for P2P storage systems and P2P trust and reputation management. We specifically look into the dynamics of maintaining redundancy for storage systems and design a novel lazy maintenance strategy. This strategy is algorithmically a simple variant of existing maintenance strategies which adapts to the system dynamics. This randomized lazy maintenance strategy thus explores the cost-performance trade-offs of the storage maintenance operations in a self-organizing manner. We model the storage system (redundancy), under churn and maintenance, as a Markov process. We perform an equilibrium study to show that the system operates in a more stable dynamic equilibrium with our strategy than for the existing maintenance scheme for comparable overheads. Particularly, we show that our maintenance scheme provides substantial performance gains in terms of maintenance overhead and system's resilience in presence of churn and correlated failures. Finally, we propose a gossip mechanism which works with lower communication overhead than existing approaches for communication among a relatively large set of unreliable peers without assuming any specific structure for their mutual connectivity. We use such a communication primitive for propagating replica updates in P2P systems, facilitating management of mutable content in P2P systems. The peer population affected by a gossip can be modeled as a Markov process. Studying the transient spread of gossips help in choosing proper algorithm parameters to reduce communication overhead while guaranteeing coverage of online peers. Each of these substrates in themselves were developed to find practical solutions for real problems. Put together, these can be used in other applications, including a P2P storage system with support for efficient lookup and inserts, membership dynamics, content mutation and updates, persistence and availability. Many of the ideas have already been implemented in real systems and several others are in the way to be integrated into the implementations. There are two principal contributions of this dissertation. It provides design of the P2P systems which are useful for end-users as well as other application developers who can build upon these existing systems. Secondly, it adapts and introduces the methodology of analysis of a system's time-evolution (tools typically used in diverse domains including physics and cybernetics) to study the long run behavior of P2P systems, and uses this methodology to (re-)design appropriate algorithms and evaluate them. We observed that studying P2P systems from the perspective of complex systems reveals their inner dynamics and hence ways to exploit such dynamics for suitable or better algorithms. In other words, the analysis methodology in itself strongly influences and inspires the way we design such systems. We believe that such an approach of orchestrating self-organization in internet-scale systems, where the algorithms and the analysis methodology have strong mutual influence will significantly change the way future such systems are developed and evaluated. We envision that such an approach will particularly serve as an important tool for the nascent but fast moving P2P systems research and development community

Infoscience - École polytechnique fédérale de Lausanne