75,631 research outputs found

OpenCAL++: An object-oriented architecture for transparent parallel execution of cellular automata models

Author: D'Ambrosio Donato
Gil Marisa
Giordano Andrea
Macri Davide
Rongo Rocco
Spataro William
Utrera Iglesias Gladys Miriam
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2023

Cellular Automata (CA) models, initially studied by John von Neumann, have been developed by numerous researchers and applied in both academic and scientific fields. Thanks to their local and independent rules, simulations of complex systems can be easily implemented based on CA modelling on parallel machines. However, due to the heterogeneity of the components, from the hardware to the software perspective, the various possible scenarios for running parallelism in today’s architectures can pose a challenge in such implementations, making that parallelism difficult to exploit. This paper presents OpenCAL++, a transparent and efficient object-oriented platform for the parallel execution of cellular automata models. The architecture of OpenCAL++ ensures the modeller a fully transparent parallel execution and a strong “separation of concerns” between the execution parallelism issues and the model implementation. The code implementing the Cellular Automata model remains the same whether the execution is performed in a shared-memory, distributed-memory, or GPGPU context, irrespective of the optimizations adopted. To this aim, the object-oriented paradigm has been intensely exploited. As well as the OpenCAL++ architecture, we present the description of a simple Cellular Automata model implementation for illustrative purposes. This research was funded by the Italian “ICSC National Center for HPC, Big Data and Quantum Computing” Project, CN00000013 (approved under the Call M42C – Investment 1.4 – Avvisto “Centri Nazionali” – D.D. n. 3138 of 16.12.2021, admitted to financing with MUR Decree n. 1031 of 06.17.2022). Peer Reviewed. Postprint (author's final draft)
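The "separation of concerns" described in this abstract can be illustrated with a minimal Python sketch. It is not the OpenCAL++ API: the names `life_rule`, `step_serial`, and `step_threaded` are hypothetical, and the Game of Life rule is chosen only as a familiar example. The point is that the model code (the local rule) stays unchanged while the execution strategy is swapped. Python threads are used only to show the structure, not to obtain a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical model code: Conway's Game of Life on a torus.
# Only the local transition rule lives here; it knows nothing about parallelism.
def life_rule(grid, r, c):
    rows, cols = len(grid), len(grid[0])
    alive = sum(
        grid[(r + dr) % rows][(c + dc) % cols]
        for dr in (-1, 0, 1)
        for dc in (-1, 0, 1)
        if (dr, dc) != (0, 0)
    )
    return 1 if alive == 3 or (grid[r][c] == 1 and alive == 2) else 0

# Hypothetical execution strategy 1: plain serial sweep.
def step_serial(grid, rule):
    return [[rule(grid, r, c) for c in range(len(grid[0]))]
            for r in range(len(grid))]

# Hypothetical execution strategy 2: one task per row on a thread pool.
# Every task reads only the old grid, so rows can be computed independently.
def step_threaded(grid, rule, workers=4):
    def new_row(r):
        return [rule(grid, r, c) for c in range(len(grid[0]))]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(new_row, range(len(grid))))

glider = [
    [0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0],
    [1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

# The same model code runs under both strategies and yields the same grid.
same_result = step_serial(glider, life_rule) == step_threaded(glider, life_rule)
```

Because each cell of the new grid depends only on the previous grid, the two strategies are interchangeable, which is the property that makes this kind of transparent parallel execution possible.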

UPCommons. Portal del coneixement obert de la UPC

Developing Efficient Discrete Simulations on Multicore and GPU Architectures

Author: Cagigas Muñiz Daniel
Díaz del Río Fernando
Guisado Lízar José Luís
Jiménez-Morales Francisco de Paula
López-Torres Manuel Ramón
Publication venue: MDPI AG
Publication date: 01/01/2020

In this paper we show how to efficiently implement parallel discrete simulations on multicore and GPU architectures through a real example of an application: a cellular automata model of laser dynamics. We describe the techniques employed to build and optimize the implementations using OpenMP and CUDA frameworks. We have evaluated the performance on two different hardware platforms that represent different target market segments: high-end platforms for scientific computing, using an Intel Xeon Platinum 8259CL server with 48 cores, and also an NVIDIA Tesla V100 GPU, both running on Amazon Web Server (AWS) Cloud; and on a consumer-oriented platform, using an Intel Core i9 9900k CPU and an NVIDIA GeForce GTX 1050 TI GPU. Performance results were compared and analyzed in detail. We show that excellent performance and scalability can be obtained in both platforms, and we extract some important issues that imply a performance degradation for them. We also found that current multicore CPUs with large core numbers can bring a performance very near to that of GPUs, and even identical in some cases. Ministerio de Economía, Industria y Competitividad, Gobierno de España (MINECO), and the Agencia Estatal de Investigación (AEI) of Spain, cofinanced by FEDER funds (EU) TIN2017-89842

idUS. Depósito de Investigación Universidad de Sevilla

A guided tour of asynchronous cellular automata

Author: A. Dennunzio
A. Muscholl
A. Sarkar
A. Sharifulina
A. Spicher
B. Schönfisch
B.A. Huberman
C. Grilo
C.L. Nehaniv
D. Cornforth
D. Kuske
D. Newth
D. Regnault
E.D.L. Lumer
F. Peper
F. Radicchi
F. Silva
G. Abramson
G. Pighizzini
G. Ruxton
H.J. Blok
J. Aracena
J. Lee
J. Rolf
J.B. Rouquier
J.M. Baetens
K. Nakamura
L. Manzoni
L. Priese
L. Vanneschi
M. Droste
M. Macauley
M. Mamei
M. Tomassini
M.A. Saif
M.S. Capcarrère
N. Fatès
N. Rajewsky
O. Bandman
O. Bouré
O. Schneider
P.T. Tošić
R. Cori
R. Gharavi
R.L. Buvel
S. Adachi
S. Bandini
S. Belgacem
S. Das
S.A.H. Minoofam
S.M. Messinger
T. Suzudo
T. Toffoli
T. Worsch
W. Taouali
W.R. Stark
Y. Takada
Publication date: 01/01/2013

Research on asynchronous cellular automata has received a great amount of attention in recent years and has turned into a thriving field. We survey the recent research that has been carried out on this topic and present a wide state of the art where computing and modelling issues are both represented. Comment: To appear in the Journal of Cellular Automata
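As background for this entry, the difference between synchronous and asynchronous updating can be sketched in a few lines of Python. The abstract does not name specific schemes; the two shown here (updating each cell with probability `alpha`, and updating a single randomly chosen cell) are assumptions picked as common examples from this literature, and all function names are hypothetical.

```python
import random

def eca_rule(rule_number):
    """Local rule f(left, center, right) of an elementary CA (Wolfram numbering)."""
    return lambda l, c, r: (rule_number >> (l << 2 | c << 1 | r)) & 1

def sync_step(cells, f):
    """Classical synchronous update: every cell applies the rule at once."""
    n = len(cells)
    return [f(cells[i - 1], cells[i], cells[(i + 1) % n]) for i in range(n)]

def alpha_async_step(cells, f, alpha, rng):
    """Each cell applies the rule with probability alpha, else keeps its state.
    Updated cells all read the old configuration (periodic boundary)."""
    n = len(cells)
    return [
        f(cells[i - 1], cells[i], cells[(i + 1) % n])
        if rng.random() < alpha else cells[i]
        for i in range(n)
    ]

def fully_async_step(cells, f, rng):
    """Exactly one uniformly chosen cell applies the rule."""
    n = len(cells)
    i = rng.randrange(n)
    out = list(cells)
    out[i] = f(cells[i - 1], cells[i], cells[(i + 1) % n])
    return out

rng = random.Random(0)          # fixed seed so runs are repeatable
rule110 = eca_rule(110)
cells = [0, 0, 0, 1, 0, 0, 1, 1]
half_updated = alpha_async_step(cells, rule110, 0.5, rng)
one_updated = fully_async_step(cells, rule110, rng)
```

With `alpha = 1.0` the first scheme reduces to the synchronous update, and with `alpha = 0.0` the configuration never changes, so `alpha` interpolates between the two regimes.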

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Nature-Inspired Interconnects for Self-Assembled Large-Scale Network-on-Chip Designs

Author: Christof Teuscher
Das R.
de Micheli G.
Di Caro G.
Lawniczak A. T.
Lawson J.
Levitan S. L.
Moore G. E.
Tomassini M.
Publication venue: AIP Publishing
Publication date: 21/04/2007

Future nano-scale electronics built up from an Avogadro number of components needs efficient, highly scalable, and robust means of communication in order to be competitive with traditional silicon approaches. In recent years, the Networks-on-Chip (NoC) paradigm emerged as a promising solution to interconnect challenges in silicon-based electronics. Current NoC architectures are either highly regular or fully customized, both of which represent implausible assumptions for emerging bottom-up self-assembled molecular electronics that are generally assumed to have a high degree of irregularity and imperfection. Here, we pragmatically and experimentally investigate important design trade-offs and properties of an irregular, abstract, yet physically plausible 3D small-world interconnect fabric that is inspired by modern network-on-chip paradigms. We vary the framework's key parameters, such as the connectivity, the number of switch nodes, the distribution of long- versus short-range connections, and measure the network's relevant communication characteristics. We further explore the robustness against link failures and the ability and efficiency to solve a simple toy problem, the synchronization task. The results confirm that (1) computation in irregular assemblies is a promising and disruptive computing paradigm for self-assembled nano-scale electronics and (2) that 3D small-world interconnect fabrics with a power-law decaying distribution of shortcut lengths are physically plausible and have major advantages over local 2D and 3D regular topologies
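The idea of shortcuts whose lengths follow a power-law decaying distribution can be sketched as follows. This is a heavily simplified illustration, not the paper's framework: it uses a 1D ring instead of a 3D fabric, gives every node exactly one shortcut, and the names `build_fabric` and `ring_distance`, the exponent `alpha`, and the node count are all assumptions made for the example.

```python
import random

def ring_distance(i, j, n):
    """Shortest distance between nodes i and j on a ring of n nodes."""
    d = abs(i - j) % n
    return min(d, n - d)

def build_fabric(n, alpha, rng):
    """Ring of n nodes (n >= 4) plus one shortcut per node.
    Shortcut length l is drawn with probability proportional to l ** -alpha,
    so short shortcuts are common and long ones are rare."""
    edges = set()
    # Local (short-range) links between ring neighbours.
    for i in range(n):
        j = (i + 1) % n
        edges.add((min(i, j), max(i, j)))
    # Long-range shortcuts with power-law decaying lengths.
    lengths = list(range(2, n // 2 + 1))
    weights = [length ** -alpha for length in lengths]
    for i in range(n):
        length = rng.choices(lengths, weights=weights, k=1)[0]
        j = (i + length) % n
        edges.add((min(i, j), max(i, j)))
    return edges

fabric = build_fabric(64, 2.0, random.Random(0))
```

Raising `alpha` concentrates the shortcuts on nearby nodes, while lowering it toward zero makes all shortcut lengths roughly equally likely; this is the kind of parameter the abstract describes varying.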

arXiv.org e-Print Archive

Crossref

PDXScholar (Portland State University)