8 research outputs found

    Virtual Multicast

    Get PDF

    A Content-Addressable Network for Similarity Search in Metric Spaces

    Get PDF
    Because of the ongoing digital data explosion, more advanced search paradigms than the traditional exact match are needed for contentbased retrieval in huge and ever growing collections of data produced in application areas such as multimedia, molecular biology, marketing, computer-aided design and purchasing assistance. As the variety of data types is fast going towards creating a database utilized by people, the computer systems must be able to model human fundamental reasoning paradigms, which are naturally based on similarity. The ability to perceive similarities is crucial for recognition, classification, and learning, and it plays an important role in scientific discovery and creativity. Recently, the mathematical notion of metric space has become a useful abstraction of similarity and many similarity search indexes have been developed. In this thesis, we accept the metric space similarity paradigm and concentrate on the scalability issues. By exploiting computer networks and applying the Peer-to-Peer communication paradigms, we build a structured network of computers able to process similarity queries in parallel. Since no centralized entities are used, such architectures are fully scalable. Specifically, we propose a Peer-to-Peer system for similarity search in metric spaces called Metric Content-Addressable Network (MCAN) which is an extension of the well known Content-Addressable Network (CAN) used for hash lookup. A prototype implementation of MCAN was tested on real-life datasets of image features, protein symbols, and text — observed results are reported. We also compared the performance of MCAN with three other, recently proposed, distributed data structures for similarity search in metric spaces

    Incentive-driven QoS in peer-to-peer overlays

    Get PDF
    A well known problem in peer-to-peer overlays is that no single entity has control over the software, hardware and configuration of peers. Thus, each peer can selfishly adapt its behaviour to maximise its benefit from the overlay. This thesis is concerned with the modelling and design of incentive mechanisms for QoS-overlays: resource allocation protocols that provide strategic peers with participation incentives, while at the same time optimising the performance of the peer-to-peer distribution overlay. The contributions of this thesis are as follows. First, we present PledgeRoute, a novel contribution accounting system that can be used, along with a set of reciprocity policies, as an incentive mechanism to encourage peers to contribute resources even when users are not actively consuming overlay services. This mechanism uses a decentralised credit network, is resilient to sybil attacks, and allows peers to achieve time and space deferred contribution reciprocity. Then, we present a novel, QoS-aware resource allocation model based on Vickrey auctions that uses PledgeRoute as a substrate. It acts as an incentive mechanism by providing efficient overlay construction, while at the same time allocating increasing service quality to those peers that contribute more to the network. The model is then applied to lagsensitive chunk swarming, and some of its properties are explored for different peer delay distributions. When considering QoS overlays deployed over the best-effort Internet, the quality received by a client cannot be adjudicated completely to either its serving peer or the intervening network between them. By drawing parallels between this situation and well-known hidden action situations in microeconomics, we propose a novel scheme to ensure adherence to advertised QoS levels. We then apply it to delay-sensitive chunk distribution overlays and present the optimal contract payments required, along with a method for QoS contract enforcement through reciprocative strategies. We also present a probabilistic model for application-layer delay as a function of the prevailing network conditions. Finally, we address the incentives of managed overlays, and the prediction of their behaviour. We propose two novel models of multihoming managed overlay incentives in which overlays can freely allocate their traffic flows between different ISPs. One is obtained by optimising an overlay utility function with desired properties, while the other is designed for data-driven least-squares fitting of the cross elasticity of demand. This last model is then used to solve for ISP profit maximisation

    Progettazione e implementazione di una DHT sicura integrata con servizi Cloud

    Get PDF
    La presente tesi fa parte di un progetto con l'ambizione di creare una piattaforma unificata e sicura per servizi cloud. L'obiettivo del presente lavoro è quello di progettare e implementare una rete peer-to-peer sicura da integrare con l'architettura di cui sopra. In particolare è stata svolto uno studio per analizzare le vulnerabilità delle attuali reti ed è stato progettato un protocollo che per essere sicuro, unisce i vantaggi dell'architettura client/server e quelli delle reti P2P. Infine il progetto è stato implementato come libreria di comunicazione, come caso d'uso, è stata implementata una chat sicura

    Algorithms for continuous queries: A geometric approach

    Get PDF
    <p>There has been an unprecedented growth in both the amount of data and the number of users interested in different types of data. Users often want to keep track of the data that match their interests over a period of time. A continuous query, once issued by a user, maintains the matching results for the user as new data (as well as updates to the existing data) continue to arrive in a stream. However, supporting potentially millions of continuous queries is a huge challenge. This dissertation addresses the problem of scalably processing a large number of continuous queries over a wide-area network. </p><p>Conceptually, the task of supporting distributed continuous queries can be divided into two components--event processing (computing the set of affected users for each data update) and notification dissemination (notifying the set of affected users). The first part of this dissertation focuses on event processing. Since interacting with large-scale data can easily frustrate and overwhelm the users, top-k queries have attracted considerable interest from the database community as they allow users to focus on the top-ranked results only. However, it is nearly impossible to find a set of common top-ranked data that everyone is interested in, therefore, users are allowed to specify their interest in different forms of preferences, such as personalized ranking function and range selection. This dissertation presents geometric frameworks, data structures, and algorithms for answering several types of preference queries efficiently. Experimental evaluations show that our approaches outperform the previous ones by orders of magnitude.</p><p>The second part of the dissertation presents comprehensive solutions to the problem of processing and notifying a large number of continuous range top-k queries across a wide-area network. Simple solutions include using a content-driven network to notify all continuous queries whose ranges contain the update (ignoring top-k), or using a server to compute only the affected continuous queries and notifying them individually. The former solution generates too much network traffic, while the latter overwhelms the server. This dissertation presents a geometric framework which allows the set of affected continuous queries to be described succinctly with messages that can be efficiently disseminated using content-driven networks. Fast algorithms are also developed to reformulate each update into a set of messages whose number is provably optimal, with or without knowing all continuous queries. </p><p>The final component of this dissertation is the design of a wide-area dissemination network for continuous range queries. In particular, this dissertation addresses the problem of assigning users to servers in a wide-area content-based publish/subscribe system. A good assignment should consider both users' interests and locations, and balance multiple performance criteria including bandwidth, delay, and load balance. This dissertation presents a Monte Carlo approximation algorithm as well as a simple greedy algorithm. The Monte Carlo algorithm jointly considers multiple performance criteria to find a broker-subscriber assignment and provides theoretical performance guarantees. Using this algorithm as a yardstick, the greedy algorithm is also concluded to work well across a wide range of workloads.</p>Dissertatio

    Gears4Net - Ein Programmiermodell fĂĽr asynchrone Kommunikation in P2P-Systemen

    Get PDF
    Die vorliegende Arbeit betrachtet die Herausforderungen bei der Entwicklung massiv paralleler, verteilter Systeme. Am Beispiel des ortsbezogenen Peer-to-Peer-Systems Symstry werden geeignete Modelle zur Komplexitätsreduktion des Softwareentwicklungsprozesses abgeleitet sowie Verfahren zur Simulation derartiger Systeme etabliert. Die Arbeit ist in drei Themenschwerpunkte gegliedert. Der erste Schwerpunkt betrachtet die aktuell bestehenden Softwareentwicklungsmethoden im Hinblick auf den zunehmenden Parallelisierungsgrad und die daraus entstehenden Implikationen auf den Softwareentwicklungsprozess. Abgeschlossen wird dieses Kapitel mit dem Entwurf und der Implementierung des Programmiermodells Gears4Net. Dieses erlaubt die Betrachtung von Parallelität als Designkriterium und forciert sowohl die synchronisationsfreie Anwendungsimplementierung als auch die Zustandsmaschinen-ähnliche Quellcodestrukturierung. Im Fokus des zweiten Schwerpunktes steht die Konzeption und Implementierung des Peer-to-Peer-Systems Symstry. Symstry, das auf Basis des Gears4Net-Modells entwickelt und simuliert wurde, verfolgt den Ansatz symbolischer Adressierung zur Spezifikation von Endgeräten, Orten und Regionen. Der besondere Schwerpunkt dieses Themenkomplexes liegt bei der Konzeption effizienter Routing- beziehungsweise Geo-Cast- oder Range-Query-Verfahren. Abgeschlossen wird die Arbeit mit dem dritten Schwerpunkt, der Simulationsumgebung Inspector. Diese ermöglicht die komponentenweise Simulation beziehungsweise Emulation der auf Basis des Gears4Net-Modells entwickelten Softwarekomponenten und bildet die Grundlage für die Evaluation des Peer-to-Peer-Systems Symstry
    corecore