A decentralized motion coordination strategy for dynamic target tracking
This paper presents a decentralized motion planning
algorithm for the distributed sensing of a noisy dynamical
process by multiple cooperating mobile sensor agents. This
problem is motivated by localization and tracking tasks of
dynamic targets. Our gradient-descent method is based on a
cost function that measures the overall quality of sensing. We
also investigate the role of imperfect communication between
sensor agents in this framework, and examine the trade-offs in
performance between sensing and communication. Simulations
illustrate the basic characteristics of the algorithms.
Efficient Generation of Parallel Spin-images Using Dynamic Loop Scheduling
High performance computing (HPC) systems have undergone a significant increase in
their processing capabilities. Modern HPC systems combine large numbers of
homogeneous and heterogeneous computing resources. Scalability is, therefore,
an essential aspect of scientific applications to efficiently exploit the
massive parallelism of modern HPC systems. This work introduces an efficient
version of the parallel spin-image algorithm (PSIA), called EPSIA. The PSIA is
a parallel version of the spin-image algorithm (SIA). The (P)SIA is used in
various domains, such as 3D object recognition, categorization, and 3D face
recognition. EPSIA refers to the extended version of the PSIA that integrates
various well-known dynamic loop scheduling (DLS) techniques. The present work:
(1) Proposes EPSIA, a novel flexible version of PSIA; (2) Showcases the
benefits of applying DLS techniques for optimizing the performance of the PSIA;
(3) Assesses the performance of the proposed EPSIA by conducting several
scalability experiments. The performance results are promising and show that,
with well-known DLS techniques, EPSIA outperforms the PSIA by factors of 1.2
and 2 on homogeneous and heterogeneous computing resources, respectively.
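The abstract above does not say which DLS techniques EPSIA integrates, so the following is only a minimal sketch of three classic ones chosen as an assumption: static chunking, guided self-scheduling (GSS), and factoring (FAC2). The function names, the worker speeds, and the toy `simulate` dispatcher are hypothetical and are not taken from the paper. It is meant to illustrate why smaller, decreasing chunks can help when workers differ in speed.

```python
import heapq
import math


def static_chunks(n_iters, n_workers):
    """STATIC: one near-equal chunk per worker, fixed before execution."""
    base, extra = divmod(n_iters, n_workers)
    sizes = [base + (1 if w < extra else 0) for w in range(n_workers)]
    return [s for s in sizes if s > 0]


def guided_chunks(n_iters, n_workers):
    """GSS: every request receives ceil(remaining / n_workers) iterations."""
    chunks, remaining = [], n_iters
    while remaining > 0:
        size = math.ceil(remaining / n_workers)
        chunks.append(size)
        remaining -= size
    return chunks


def factoring_chunks(n_iters, n_workers):
    """FAC2: batches of n_workers equal chunks; each batch covers about
    half of the iterations that are still unscheduled."""
    chunks, remaining = [], n_iters
    while remaining > 0:
        size = math.ceil(remaining / (2 * n_workers))
        for _ in range(n_workers):
            if remaining == 0:
                break
            take = min(size, remaining)
            chunks.append(take)
            remaining -= take
    return chunks


def simulate(chunks, speeds):
    """Toy model: each chunk goes to the worker that becomes free first;
    a chunk of `size` iterations takes size / speed time units.
    Returns the makespan (time at which the last worker finishes)."""
    free_at = [(0.0, w) for w in range(len(speeds))]
    heapq.heapify(free_at)
    for size in chunks:
        t, w = heapq.heappop(free_at)
        heapq.heappush(free_at, (t + size / speeds[w], w))
    return max(t for t, _ in free_at)


if __name__ == "__main__":
    speeds = [1, 1, 1, 2]  # hypothetical: one worker is twice as fast
    for name, fn in [("static", static_chunks),
                     ("guided", guided_chunks),
                     ("factoring", factoring_chunks)]:
        chunks = fn(100, len(speeds))
        print(name, chunks, simulate(chunks, speeds))
```

In this toy setting the static schedule is limited by the slowest worker's fixed share, while factoring leaves enough small chunks at the end for the fast worker to absorb the imbalance. Real DLS implementations also have to account for scheduling overhead, which this sketch ignores.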
A scalable parallel finite element framework for growing geometries. Application to metal additive manufacturing
This work introduces an innovative parallel, fully-distributed finite element
framework for growing geometries and its application to metal additive
manufacturing. It is well known that virtual part design and qualification in
additive manufacturing require highly accurate multiscale and multiphysics
analyses. Only high performance computing tools are able to handle such
complexity in time frames compatible with time-to-market. However, efficiency,
without loss of accuracy, has rarely held the centre stage in the numerical
community. Here, in contrast, the framework is designed to adequately exploit
the resources of high-end distributed-memory machines. It is grounded on three
building blocks: (1) Hierarchical adaptive mesh refinement with octree-based
meshes; (2) a parallel strategy to model the growth of the geometry; (3)
state-of-the-art parallel iterative linear solvers. Computational experiments
consider the heat transfer analysis at the part scale of the printing process
by powder-bed technologies. After verification against a 3D benchmark, a
strong-scaling analysis assesses performance and identifies major sources of
parallel overhead. A third numerical example examines the efficiency and
robustness of (2) in a curved 3D shape. Unprecedented parallelism and
scalability were achieved in this work. Hence, this framework makes it possible
to take on higher complexity and/or accuracy, not only in part-scale simulations
of metal or polymer additive manufacturing, but also in welding, sedimentation,
atherosclerosis, or any other physical problem where the physical domain of
interest grows in time.
Feature detection using spikes: the greedy approach
A goal of low-level neural processes is to build an efficient code extracting
the relevant information from the sensory input. It is believed that this is
implemented in cortical areas by elementary inferential computations
dynamically extracting the most likely parameters corresponding to the sensory
signal. We explore here a neuro-mimetic feed-forward model of the primary
visual area (V1) solving this problem in the case where the signal may be
described by a robust linear generative model. This model uses an over-complete
dictionary of primitives which provides a distributed probabilistic
representation of input features. Relying on an efficiency criterion, we derive
an algorithm as an approximate solution which uses incremental greedy inference
processes. This algorithm is similar to 'Matching Pursuit' and mimics the
parallel architecture of neural computations. We propose here a simple
implementation using a network of spiking integrate-and-fire neurons which
communicate using lateral interactions. Numerical simulations show that this
Sparse Spike Coding strategy provides an efficient model for representing
visual data from a set of natural images. Even though it is simplistic, this
transformation of spatial data into a spatio-temporal pattern of binary events
provides an accurate description of some complex neural patterns observed in
the spiking activity of biological neural networks.
Comment: This work links Matching Pursuit with Bayesian inference by providing
the underlying hypotheses (linear model, uniform prior, Gaussian noise
model). An analogy with the parallel and event-based nature of neural
computations is explored, and we show an application to modelling the Primary
Visual Cortex / image processing.
http://incm.cnrs-mrs.fr/perrinet/dynn/LaurentPerrinet/Publications/Perrinet04tau
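The abstract above describes its algorithm as similar to Matching Pursuit. As a rough illustration, here is a minimal sketch of the plain Matching Pursuit loop on a tiny over-complete dictionary. It is not the authors' spiking integrate-and-fire network: the function names, the 2D dictionary, and the test signal are hypothetical, and atoms are assumed to have unit norm.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a non-zero vector to unit Euclidean norm."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def matching_pursuit(signal, dictionary, n_steps):
    """Greedy sparse decomposition of `signal` over unit-norm atoms.

    At each step the atom most correlated with the residual is selected,
    its contribution is subtracted, and the (atom index, coefficient)
    pair is recorded as one event, in order of selection.
    """
    residual = list(signal)
    events = []
    for _ in range(n_steps):
        correlations = [dot(residual, atom) for atom in dictionary]
        best = max(range(len(dictionary)), key=lambda i: abs(correlations[i]))
        coeff = correlations[best]
        if coeff == 0.0:
            break  # nothing left that any atom can explain
        residual = [r - coeff * a for r, a in zip(residual, dictionary[best])]
        events.append((best, coeff))
    return events, residual


if __name__ == "__main__":
    # Three atoms in a 2D space: more atoms than dimensions (over-complete).
    dictionary = [[1.0, 0.0], [0.0, 1.0], normalize([1.0, 1.0])]
    events, residual = matching_pursuit([3.0, 1.0], dictionary, n_steps=5)
    print(events, residual)
```

The ordered list of events is the part that loosely resembles a spike train: the atoms that match the input best are selected first, and subtracting each selected atom from the residual plays a role comparable to the lateral interactions mentioned in the abstract.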
Distributed and parallel Ada and the Ada 9X recommendations
Recently, the DoD has sponsored work towards a new version of Ada, intended to support the construction of distributed systems. The revised version, often called Ada 9X, will become the new standard sometime in the 1990s. It is intended that Ada 9X should provide language features giving limited support for distributed system construction. The requirements for such features are given. Many of the most advanced computer applications involve embedded systems that are composed of parallel processors or networks of distributed computers. If Ada is to become the widely adopted language envisioned by many, it is essential that suitable compilers and tools be available to facilitate the creation of distributed and parallel Ada programs for these applications. The major language issues affecting distributed and parallel programming are reviewed, and some principles upon which distributed/parallel language systems should be built are suggested. Based upon these, alternative language concepts for distributed/parallel programming are analyzed.
An efficient parallel tree-code for the simulation of self-gravitating systems
We describe a parallel version of our tree-code for the simulation of
self-gravitating systems in Astrophysics. It is based on a dynamic and adaptive
method for the domain decomposition, which exploits the hierarchical data
arrangement used by the tree-code. The parallelization overhead is low (less
than 4% of the total CPU time in the tests done) because the domain
decomposition is performed 'on the fly' during the tree setting, and the
portion of the tree that is local to each processor is 'enriched' with remote
data only when they are actually needed.
The performance of an implementation of the parallel code on a Cray T3E is
presented and discussed. It exhibits a very good speedup (=15 with 16
processors and 10^5 particles) and a rather low load imbalance (< 10% using up
to 16 processors), achieving a high computation speed in the force evaluation
(>10^4 particles/sec with 8 processors).
Comment: 10 pages, 8 figures, LaTeX2e, A&A class file needed (included),
submitted to A&A; corrected abstract word wrapping
Considering skills evolutions in multi-skilled workforce allocation with flexible working hours
The growing need for responsiveness among manufacturing companies facing market volatility raises a strong demand for flexibility in their organisation. Since company personnel are increasingly considered the core of organisational structures, a strong and forward-looking management of human resources and skills is crucial to performance in many industries. These organisations must develop strategies for the short, medium and long terms in order to preserve and develop skills. In response, this work presents an original model for the line-up of multi-period projects, considering the problem of staff allocation with two degrees of flexibility. The first results from the annualising of working time, and relies on policies of changing schedules, individually as well as collectively. The second degree of flexibility is the versatility of the operators, which induces a dynamic view of their skills and the need to predict changes in individual performance as a result of successive assignments. The context is therefore one in which the expected durations of activities are no longer predefined, but result from the performance of the operators selected for their execution. We present a mathematical model of this problem, which is solved by a genetic algorithm. An illustrative example is presented and analysed, and the robustness of the solving approach is investigated using a sample of 400 projects with different characteristics.