Search CORE

4,258 research outputs found

A Discussion on Parallelization Schemes for Stochastic Vector Quantization Algorithms

Author: Durut Matthieu
Patra Benoît
Rossi Fabrice
Publication venue
Publication date: 01/01/2012
Field of study

This paper studies parallelization schemes for stochastic Vector Quantization algorithms in order to obtain time speed-ups using distributed resources. We show that the most intuitive parallelization scheme does not lead to better performances than the sequential algorithm. Another distributed scheme is therefore introduced which obtains the expected speed-ups. Then, it is improved to fit implementation on distributed architectures where communications are slow and inter-machines synchronization too costly. The schemes are tested with simulated distributed architectures and, for the last one, with Microsoft Windows Azure platform obtaining speed-ups up to 32 Virtual Machines

arXiv.org e-Print Archive

CiteSeerX

HAL-Paris1

Analysis and Optimization of Mixed-Criticality Applications on Partitioned Distributed Architectures

Author: Marinescu S. O.
Pop Paul
Tamas-Selicean Domitian
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/01/2012
Field of study

Crossref

Online Research Database In Technology

Using problem frames with distributed architectures: a case for cardinality on interfaces

Author: Haley Charles B.
Publication venue
Publication date: 01/01/2003
Field of study

Certain classes of problems amenable to description using Problem Frames, in particular ones intended to be implemented using a distributed architecture, can benefit by the addition of a cardinality specification on the domain interfaces. This paper presents an example of such a problem, demonstrates the need for relationship cardinality, and proposes a notation to represent cardinality on domain interfaces

CiteSeerX

Open Research Online (The Open University)

Distributed data cache designs for clustered VLIW processors

Author: Gibert Codina Enric
González Colás Antonio María
Sánchez Jesús
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2005
Field of study

Wire delays are a major concern for current and forthcoming processors. One approach to deal with this problem is to divide the processor into semi-independent units referred to as clusters. A cluster usually consists of a local register file and a subset of the functional units, while the L1 data cache typically remains centralized in What we call partially distributed architectures. However, as technology evolves, the relative latency of such a centralized cache will increase, leading to an important impact on performance. In this paper, we propose partitioning the L1 data cache among clusters for clustered VLIW processors. We refer to this kind of design as fully distributed processors. In particular; we propose and evaluate three different configurations: a snoop-based cache coherence scheme, a word-interleaved cache, and flexible LO-buffers managed by the compiler. For each alternative, instruction scheduling techniques targeted to cyclic code are developed. Results for the Mediabench suite'show that the performance of such fully distributed architectures is always better than the performance of a partially distributed one with the same amount of resources. In addition, the key aspects of each fully distributed configuration are explored.Peer ReviewedPostprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Swarm shape manipulation through connection control

Author: Bennet Derek J.
Macdonald M.
Punzo Giuliano
Publication venue
Publication date: 31/08/2010
Field of study

The control of a large swarm of distributed agents is a well known challenge within the study of unmanned autonomous systems. However, it also presents many new opportunities. The advantages of operating a swarm through distributed means has been assessed in the literature for efficiency from both operational and economical aspects; practically as the number of agents increases, distributed control is favoured over centralised control, as it can reduce agent computational costs and increase robustness on the swarm. Distributed architectures, however, can present the drawback of requiring knowledge of the whole swarm state, therefore limiting the scalability of the swarm. In this paper a strategy is presented to address the challenges of distributed architectures, changing the way in which the swarm shape is controlled and providing a step towards verifiable swarm behaviour, achieving new configurations, while saving communication and computation resources. Instead of applying change at agent level (e.g. modify its guidance law), the sensing of the agents is addressed to a portion of agents, differentially driving their behaviour. This strategy is applied for swarms controlled by artificial potential functions which would ordinarily require global knowledge and all-to-all interactions. Limiting the agents' knowledge is proposed for the first time in this work as a methodology rather than obstacle to obtain desired swarm behaviour

University of Strathclyde Institutional Repository

Distributed Bayesian Probabilistic Matrix Factorization

Author: Aa Tom Vander
Chakroun Imen
Haber Tom
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/05/2017
Field of study

Matrix factorization is a common machine learning technique for recommender systems. Despite its high prediction accuracy, the Bayesian Probabilistic Matrix Factorization algorithm (BPMF) has not been widely used on large scale data because of its high computational cost. In this paper we propose a distributed high-performance parallel implementation of BPMF on shared memory and distributed architectures. We show by using efficient load balancing using work stealing on a single node, and by using asynchronous communication in the distributed version we beat state of the art implementations

arXiv.org e-Print Archive

Crossref

Secure Sparse Gradient Aggregation in Distributed Architectures

Author: Bouma H.
Pimentel A.
van Rooij M.
van Rooij S.
Publication venue
Publication date: 01/01/2022
Field of study

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Membrane Dissolution in Distributed Architectures of P-Systems

Author: Bravo García Ginés
Mingo López Luis Fernando de
Peña Camacho Miguel Ángel
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2011
Field of study

The goal of this paper is twofold. Firstly, to survey in a systematic and uniform way the main results regarding the way membranes can be placed on processors in order to get a software/hardware simulation of P-Systems in a distributed environment. Secondly, we improve some results about the membrane dissolution problem, prove that it is connected, and discuss the possibility of simulating this property in the distributed model. All this yields an improvement in the system parallelism implementation since it gets an increment of the parallelism of the external communication among processors. Also, the number of processors grows in such a way that is notorious the increment of the parallelism in the application of the evolution rules and the internal communica-tionsstudy because it gets an increment of the parallelism in the application of the evolution rules and the internal communications. Proposed ideas improve previous architectures to tackle the communication bottleneck problem, such as reduction of the total time of an evolution step, increase of the number of membranes that could run on a processor and reduction of the number of processor

Archivo Digital UPM