197 research outputs found
Optimising Structured P2P Networks for Complex Queries
With network-enabled consumer devices becoming increasingly popular, the number of connected devices and available services is growing considerably, with the number of connected devices estimated to surpass 15 billion by 2015. In this increasingly large and dynamic environment it is important that users have a comprehensive, yet efficient, mechanism to discover services.
Many existing wide-area service discovery mechanisms are centralised and do not scale to large numbers of users. Additionally, centralised services suffer from issues such as a single point of failure, high maintenance costs, and difficulty of management. As such, this Thesis takes a Peer-to-Peer (P2P) approach.
Distributed Hash Tables (DHTs) are well known for their high scalability, low financial barrier to entry, and ability to self-manage. They can be used not only to provide a platform on which peers can offer and consume services, but also as a means for users to discover such services.
Traditionally, DHTs provide a distributed key-value store with no search functionality. In recent years many P2P systems have been proposed that support a subset of complex query types, such as keyword search, range queries, and semantic search.
This Thesis presents a novel algorithm for performing any type of complex query, from keyword search, to complex regular expressions, to full-text search, over any structured P2P overlay. This is achieved by efficiently broadcasting the search query, allowing each peer to process the query locally, and then efficiently routing responses back to the originating peer. Through experimentation, this technique is shown to be successful when the network is stable; however, performance degrades under high levels of network churn.
To address the issue of network churn, this Thesis proposes a number of enhancements which can be made to existing P2P overlays in order to improve the performance of both the existing DHT and the proposed algorithm. Through two case studies these enhancements are shown to improve not only the performance of the proposed algorithm under churn, but also the performance of traditional lookup operations in these networks.
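The broadcast step the abstract describes can be sketched with a common recursive-partitioning technique for DHT broadcast; this is an illustrative reconstruction under assumptions, not the thesis's exact algorithm, and all names (`Peer`, `broadcast`, `find_peer`) are hypothetical. Each peer processes the query locally, then delegates disjoint halves of the identifier space, so every peer is reached exactly once.

```python
# Sketch of query broadcast over a structured overlay: each peer handles
# the query locally, then hands the upper half of its id range (lo, hi]
# to a peer inside that half, recursing on the lower half itself.

class Peer:
    def __init__(self, peer_id, network, store=()):
        self.peer_id = peer_id
        self.network = network        # id -> Peer; stands in for routing tables
        self.store = list(store)      # locally stored items
        self.received = []            # queries this peer has processed

    def process(self, query):
        # placeholder for arbitrary local matching (keywords, regex, full-text)
        return [item for item in self.store if query in item]

    def find_peer(self, lo, hi):
        # stand-in for a DHT lookup of a peer responsible for [lo, hi]
        ids = [p for p in self.network if lo <= p <= hi and p != self.peer_id]
        return self.network[min(ids)] if ids else None

    def broadcast(self, query, lo, hi):
        """Process `query` locally, then delegate sub-ranges of (lo, hi]."""
        self.received.append(query)
        results = [(self.peer_id, self.process(query))]
        while lo < hi:
            mid = (lo + hi) // 2
            delegate = self.find_peer(mid + 1, hi)  # a peer in the upper half
            if delegate is not None:
                results += delegate.broadcast(query, mid, hi)
            hi = mid                                # recurse on the lower half
        return results
```

Because the delegated ranges are disjoint, each peer receives the query exactly once, giving the responses-routed-back behaviour the abstract describes without flooding.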
PariMulo: Kad
With the advent of broadband connections and the computing power now available in every kind of digital equipment, there is a need to share resources, such as information, among people. To fulfill this need, recent years have seen remarkable growth in distributed systems: cloud computing services, web applications, peer-to-peer systems, etc.
In this context PariPari was born, a project which aims to build a modern peer-to-peer network of computers that runs various services, among which is an eMule-compatible client called PariMulo. As is well known even to less computer-savvy people, the centralized server-based structure of the original eDonkey network has had its problems, and these helped drive the development of a new network, Kad, based upon the Kademlia protocol.
This work focuses on the implementation of Kad in PariMulo, first describing the protocol and how the network works, and then providing an in-depth view of the implementation, considering security and performance issues. Finally, we make some observations about future possibilities for development.
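As a quick refresher on the Kademlia protocol underlying Kad (this is the textbook metric, not code from PariMulo): node and key IDs share one space, and the distance between two IDs is their bitwise XOR, interpreted as an integer. Lookups repeatedly query the k closest known nodes to a target, converging in O(log N) steps.

```python
# Kademlia's XOR distance metric and closest-node selection.

def xor_distance(a: int, b: int) -> int:
    # Distance between two IDs is their bitwise XOR as an integer.
    return a ^ b

def k_closest(candidates, target, k=3):
    """Return the k candidate IDs closest to `target` under XOR."""
    return sorted(candidates, key=lambda n: xor_distance(n, target))[:k]
```

For example, with 4-bit IDs, `k_closest([0b0001, 0b0100, 0b0101, 0b1100], 0b0111)` returns `[0b0101, 0b0100, 0b0001]`, since their XOR distances to the target are 2, 3, and 6, versus 11 for `0b1100`.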
A Brief History of Web Crawlers
Web crawlers visit internet applications, collect data, and learn about new web pages from visited pages. Web crawlers have a long and interesting history. Early web crawlers collected statistics about the web. In addition to collecting statistics about the web and indexing applications for search engines, modern crawlers can be used to perform accessibility and vulnerability checks on an application. The quick expansion of the web, and the complexity added to web applications, have made crawling a very challenging process. Throughout the history of web crawling, many researchers and industrial groups have addressed the different issues and challenges that web crawlers face, and different solutions have been proposed to reduce the time and cost of crawling. Performing an exhaustive crawl remains a challenging problem; automatically capturing the model of a modern web application and extracting data from it is another open question. What follows is a brief history of the different techniques and algorithms used from the early days of crawling up to the present day. We introduce criteria to evaluate the relative performance of web crawlers, and based on these criteria we plot the evolution of web crawlers and compare their performance.
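The basic crawl loop that all of these systems elaborate on is a traversal of a frontier of unvisited URLs. A minimal sketch, with an in-memory link graph standing in for real HTTP fetching (a real crawler would also respect robots.txt, politeness delays, and URL canonicalization):

```python
from collections import deque

# Minimal breadth-first crawl loop: a frontier of URLs to visit and a
# visited set to avoid re-fetching. `links` maps each page to the pages
# it links to, standing in for "fetch the page and extract its links".

def crawl(links, seed):
    visited, frontier, order = set(), deque([seed]), []
    while frontier:
        url = frontier.popleft()
        if url in visited:
            continue
        visited.add(url)
        order.append(url)                  # "collect data" from this page
        for out in links.get(url, []):     # "learn about new pages"
            if out not in visited:
                frontier.append(out)
    return order
```

Swapping the deque for a priority queue ordered by page importance turns this into the focused and prioritized crawling strategies that later sections of the history discuss.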
Resource discovery for distributed computing systems: A comprehensive survey
Large-scale distributed computing environments provide a vast amount of heterogeneous computing resources from different sources for resource sharing and distributed computing. Discovering appropriate resources in such environments is a challenge which spans several different subjects. In this paper, we survey the current state of resource discovery protocols, mechanisms, and platforms for large-scale distributed environments, focusing on design aspects. We classify all related aspects, general steps, and requirements for constructing a novel resource discovery solution into three categories: structures, methods, and issues. Accordingly, we review the literature, analyzing the various aspects of each category.
CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines
Based on information provided by European projects and national initiatives related to multimedia search, as well as by domain experts who participated in the CHORUS Think-Tanks and workshops, this document reports on the state of the art in multimedia content search from both a technical and a socio-economic perspective.
The technical perspective includes an up-to-date view of content-based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark initiatives that measure the performance of multimedia search engines.
From a socio-economic perspective, we take stock of the impact and legal consequences of these technical advances and point out future directions of research.
HARD: Hybrid Adaptive Resource Discovery for Jungle Computing
In recent years, Jungle Computing has emerged as a distributed computing paradigm based on the simultaneous combination of various hierarchical and distributed computing environments composed of large numbers of heterogeneous resources. In such an environment, the resources and the underlying computation and communication infrastructures are highly hierarchical and heterogeneous, which makes it difficult to precisely find the proper resources for running a particular job on the system efficiently. This paper proposes Hybrid Adaptive Resource Discovery (HARD), a novel, efficient, and highly scalable resource-discovery approach built upon a virtual hierarchical overlay based on self-organization and self-adaptation of processing resources, where the computing resources are organized into distributed hierarchies according to a proposed hierarchical multi-layered resource description model. The approach supports distributed query processing within and across hierarchical layers by deploying various distributed resource discovery services and functionalities, implemented using different adapted algorithms and mechanisms at each level of the hierarchy. It addresses the requirements of resource discovery in Jungle Computing environments: deep hierarchy, high heterogeneity, high scalability, and dynamicity. Simulation results show the significant scalability and efficiency of the proposed approach over highly heterogeneous, hierarchical, and dynamic computing environments.
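The within-and-across-layers query processing can be illustrated with a toy model; this is a sketch under assumptions, not HARD's actual algorithms, and the class and attribute names (`Layer`, `discover`, `cores`) are invented. A query is first resolved within a layer by matching resource descriptions, and escalated to the layer above only when no local match exists.

```python
# Toy model of hierarchical resource discovery: resources register in a
# layer (e.g. node -> cluster -> site); queries match locally first and
# escalate up the hierarchy only on a miss.

class Layer:
    def __init__(self, name, parent=None):
        self.name, self.parent = name, parent
        self.resources = []                 # (resource_id, attribute dict)

    def register(self, rid, attrs):
        self.resources.append((rid, attrs))

    def discover(self, **required):
        # intra-layer query: exact matching on attribute values
        local = [rid for rid, attrs in self.resources
                 if all(attrs.get(k) == v for k, v in required.items())]
        if local or self.parent is None:
            return local
        return self.parent.discover(**required)   # cross-layer escalation
```

Keeping queries local whenever possible is what lets a hierarchical overlay like this scale: most lookups never leave the lowest layer that can satisfy them.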
Content Distribution in P2P Systems
This report provides a literature review of the state of the art in content distribution. The report's contributions are threefold. First, it gives insight into traditional Content Distribution Networks (CDNs), their requirements, and open issues. Second, it discusses Peer-to-Peer (P2P) systems as a cheap and scalable alternative to CDNs and extracts their design challenges. Finally, it evaluates the existing P2P systems dedicated to content distribution against the identified requirements and challenges.
Resource discovery for arbitrary-scale systems
Doctoral thesis in Informatics. Large scale distributed computing technologies such as Cloud, Grid, Cluster
and HPC supercomputers are progressing along with the revolutionary emergence of many-core designs (e.g. GPUs, CPUs on a single die, supercomputers on chip, etc.) and significant advances in networking and interconnect solutions. In future, computing nodes with thousands of cores may be connected together to form a single transparent computing unit which hides from applications the complexity and distributed nature of these many-core systems. In order to efficiently benefit from all the potential resources in such large-scale many-core-enabled computing environments, resource discovery is the vital building block for maximally exploiting the capabilities of all distributed heterogeneous resources, by precisely recognizing and locating those resources in the system. Efficient and scalable resource discovery is challenging for such future systems, where the resources and the underlying computation and communication infrastructures are highly dynamic, highly hierarchical, and highly heterogeneous. In this thesis, we investigate the problem of resource discovery with respect to the general requirements of arbitrary-scale future many-core-enabled computing environments. The main contribution of this thesis is Hybrid Adaptive Resource Discovery (HARD), a novel, efficient, and highly scalable resource-discovery approach built upon a virtual hierarchical overlay based on self-organization and self-adaptation of processing resources in the system, where the computing resources are organized into distributed hierarchies according to a proposed hierarchical multi-layered resource description model. Operationally, each layer consists of a peer-to-peer architecture of modules that, by interacting with each other, provide a global view of resource availability in a large, dynamic, and heterogeneous distributed environment. The proposed resource discovery model provides the adaptability and flexibility to perform complex querying by supporting a set of significant querying features (such as multi-dimensional, range, and aggregate querying) together with exact and partial matching, both for static and dynamic object contents. Simulation shows that HARD can be applied at arbitrary scales of dynamicity, both in terms of complexity and of scale, positioning this proposal as a proper architecture for future many-core systems. We also contribute a novel resource management scheme for future systems which can utilize distributed resources efficiently and in a fully decentralized fashion. Moreover, leveraging the discovery components (RR-RPs) enables our resource management platform to dynamically find and allocate available resources that guarantee the requested QoS parameters on demand.
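The querying features listed here, multi-dimensional and range predicates with an aggregate over the matches, can be illustrated with a toy matcher; the resource attributes and bounds below are invented for illustration and are not from the thesis.

```python
# Toy illustration of multi-dimensional range querying over resource
# descriptions, plus an aggregate over the matches. Attributes invented.

resources = [
    {"id": "n1", "cores": 8,  "mem_gb": 32,  "arch": "x86_64"},
    {"id": "n2", "cores": 64, "mem_gb": 256, "arch": "x86_64"},
    {"id": "n3", "cores": 16, "mem_gb": 64,  "arch": "arm64"},
]

def range_query(resources, **bounds):
    """Multi-dimensional range query: bounds maps attribute -> (lo, hi)."""
    return [r for r in resources
            if all(lo <= r[attr] <= hi for attr, (lo, hi) in bounds.items())]

# every dimension's predicate must hold for a resource to match
matches = range_query(resources, cores=(8, 32), mem_gb=(32, 128))
total_cores = sum(r["cores"] for r in matches)   # aggregate over matches
```

Here `matches` selects `n1` and `n3` (both satisfy every range), and the aggregate `total_cores` comes to 24; exact matching is the degenerate case where each range's lower and upper bound coincide.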