Search CORE

1,170 research outputs found

A Content-Addressable Network for Similarity Search in Metric Spaces

Author: Falchi Fabrizio
Publication venue: 'Pisa University Press'
Publication date: 17/07/2007
Field of study

Because of the ongoing digital data explosion, more advanced search paradigms than the traditional exact match are needed for contentbased retrieval in huge and ever growing collections of data produced in application areas such as multimedia, molecular biology, marketing, computer-aided design and purchasing assistance. As the variety of data types is fast going towards creating a database utilized by people, the computer systems must be able to model human fundamental reasoning paradigms, which are naturally based on similarity. The ability to perceive similarities is crucial for recognition, classification, and learning, and it plays an important role in scientific discovery and creativity. Recently, the mathematical notion of metric space has become a useful abstraction of similarity and many similarity search indexes have been developed. In this thesis, we accept the metric space similarity paradigm and concentrate on the scalability issues. By exploiting computer networks and applying the Peer-to-Peer communication paradigms, we build a structured network of computers able to process similarity queries in parallel. Since no centralized entities are used, such architectures are fully scalable. Specifically, we propose a Peer-to-Peer system for similarity search in metric spaces called Metric Content-Addressable Network (MCAN) which is an extension of the well known Content-Addressable Network (CAN) used for hash lookup. A prototype implementation of MCAN was tested on real-life datasets of image features, protein symbols, and text — observed results are reported. We also compared the performance of MCAN with three other, recently proposed, distributed data structures for similarity search in metric spaces

Electronic Thesis and Dissertation Archive - Università di Pisa

Approximate Matching for Peer-to-Peer Overlays with Cubit

Author: Sirer Emin Gun
Slivkins Aleksandrs
Wong Bernard
Publication venue
Publication date: 01/01/2008
Field of study

Keyword search is a critical component in most content retrieval systems. Despite the emergence of completely decentralized and efficient peer-to-peer techniques for content distribution, there have not been similarly efficient, accurate, and decentralized mechanisms for content discovery based on approximate search keys. In this paper, we present a scalable and efficient peer-to-peer system called Cubit with a new search primitive that can efficiently find the k data items with keys most similar to a given search key. The system works by creating a keyword metric space that encompasses both the nodes and the objects in the system, where the distance between two points is a measure of the similarity between the strings that the points represent. It provides a loosely-structured overlay that can efficiently navigate this space. We evaluate Cubit through both a real deployment as a search plugin for a popular BitTorrent client and a large-scale simulation and show that it provides an efficient, accurate and robust method to handle imprecise string search in filesharing applications.This work was supported in part by NSF-TRUST 0424422 and NSF-CAREER 0546568 grants

CiteSeerX

eCommons@Cornell

A scalable content-addressable network

Author: A. D.
Clarke I.
Mark Handley
Paul Francis
Postel J. B.
Rekhter Y.
Richard Karp
Scott Shenker
Sylvia Ratnasamy
Welsh M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

The state of peer-to-peer network simulators

Author: Agosti M.
Anirban Basu
Annapureddy S.
Barcellos M.
Baumgart I.
Boufkhad Y.
Cheng B.
Clarke I.
Dabek F.
de Vogeleer K.
Doulkeridis C.
Ghinita G.
Haridasan M.
Huebsch R. J.
Ian Wakeman
Iliofotou M.
James Stanier
Johansson B.
Leonini L.
Likert R.
Mavlankar A.
Maymounkov P.
Naicken S.
Naicken S.
Ren D.
Rosenberg J.
Rowstron A. I. T.
Simon Fleming
Stephen Naicken
Stingl D.
Urban P.
Vijay K. Gurbani
Wang S.
Webb S.
Zantout B.
Zhang D.
Zhao B.
Zhou Y.
Zhu W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/08/2013
Field of study

Networking research often relies on simulation in order to test and evaluate new ideas. An important requirement of this process is that results must be reproducible so that other researchers can replicate, validate and extend existing work. We look at the landscape of simulators for research in peer-to-peer (P2P) networks by conducting a survey of a combined total of over 280 papers from before and after 2007 (the year of the last survey in this area), and comment on the large quantity of research using bespoke, closed-source simulators. We propose a set of criteria that P2P simulators should meet, and poll the P2P research community for their agreement. We aim to drive the community towards performing their experiments on simulators that allow for others to validate their results

Crossref

Sussex Research Online