Search CORE

38,370 research outputs found

Data mining and fusion

Author: Addis M. J.
Choi F.
Taylor S. J.
Upstill C.
Watkins E. R.
Publication venue: s.n.
Publication date: 01/04/2006
Field of study

Pando: Personal Volunteer Computing in Browsers

Author: Anderson David P.
Balouek Daniel
Berry Kevin
Cherniack Mitch
Chorazyk Pawel
Dias David
Duda Jerzy
Jangda Abhinav
Lavoie Erick
Martınez Gonzalo J
Nakamoto Satoshi
Reginald Cushing
Ryza Sandy
Smolka Gert
Werner M. J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 06/09/2019
Field of study

The large penetration and continued growth in ownership of personal electronic devices represents a freely available and largely untapped source of computing power. To leverage those, we present Pando, a new volunteer computing tool based on a declarative concurrent programming model and implemented using JavaScript, WebRTC, and WebSockets. This tool enables a dynamically varying number of failure-prone personal devices contributed by volunteers to parallelize the application of a function on a stream of values, by using the devices' browsers. We show that Pando can provide throughput improvements compared to a single personal device, on a variety of compute-bound applications including animation rendering and image processing. We also show the flexibility of our approach by deploying Pando on personal devices connected over a local network, on Grid5000, a French-wide computing grid in a virtual private network, and seven PlanetLab nodes distributed in a wide area network over Europe.Comment: 14 pages, 12 figures, 2 table

arXiv.org e-Print Archive

Crossref

A customizable multi-agent system for distributed data mining

Author: Di Fatta Giuseppe
Fortino Giancarlo
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2007
Field of study

We present a general Multi-Agent System framework for distributed data mining based on a Peer-to-Peer model. Agent protocols are implemented through message-based asynchronous communication. The framework adopts a dynamic load balancing policy that is particularly suitable for irregular search algorithms. A modular design allows a separation of the general-purpose system protocols and software components from the specific data mining algorithm. The experimental evaluation has been carried out on a parallel frequent subgraph mining algorithm, which has shown good scalability performances

Central Archive at the University of Reading

CiteSeerX

Crossref

Distributed Management of Massive Data: an Efficient Fine-Grain Data Access Scheme

Author: A. Bassi
A. Thomasian
B. Allcock
B.S. White
G. Antoniu
G. Antoniu
G. Antoniu
K. Douglas
M. Nicola
M.A. Casey
O. Tatebe
P. Honeyman
P.Z. Kunszt
R. Bolze
R. Jin
R.C. Merkle
S. Rhea
Publication venue
Publication date: 01/01/2008
Field of study

This paper addresses the problem of efficiently storing and accessing massive data blocks in a large-scale distributed environment, while providing efficient fine-grain access to data subsets. This issue is crucial in the context of applications in the field of databases, data mining and multimedia. We propose a data sharing service based on distributed, RAM-based storage of data, while leveraging a DHT-based, natively parallel metadata management scheme. As opposed to the most commonly used grid storage infrastructures that provide mechanisms for explicit data localization and transfer, we provide a transparent access model, where data are accessed through global identifiers. Our proposal has been validated through a prototype implementation whose preliminary evaluation provides promising results

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1