Search CORE

26,859 research outputs found

Future scaling of processor-memory interfaces

Author
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2009
Field of study

Crossref

Roughening of the (1+1) interfaces in two-component surface growth with an admixture of random deposition

Author: A. Kolakowska
A.-L. Barabasi
B. D. Lubachevsky
B. J. Overeiner
D. A. Jefferson
D. O’Connor
G. Korniss
G. Korniss
H. N. Yang
J. S. Steinman
K. M. Chandy
M. A. Novotny
M. A. Novotny
M. den Nijs
M. Kardar
M. Kotrla
M. Kotrla
P. M. Dickens
P. S. Verma
R. M. Fujimoto
Publication venue: 'American Physical Society (APS)'
Publication date: 09/07/2004
Field of study

We simulate competitive two-component growth on a one dimensional substrate of

L

sites. One component is a Poisson-type deposition that generates Kardar-Parisi-Zhang (KPZ) correlations. The other is random deposition (RD). We derive the universal scaling function of the interface width for this model and show that the RD admixture acts as a dilatation mechanism to the fundamental time and height scales, but leaves the KPZ correlations intact. This observation is generalized to other growth models. It is shown that the flat-substrate initial condition is responsible for the existence of an early non-scaling phase in the interface evolution. The length of this initial phase is a non-universal parameter, but its presence is universal. In application to parallel and distributed computations, the important consequence of the derived scaling is the existence of the upper bound for the desynchronization in a conservative update algorithm for parallel discrete-event simulations. It is shown that such algorithms are generally scalable in a ring communication topology.Comment: 16 pages, 16 figures, 77 reference

arXiv.org e-Print Archive

Crossref

Overview of Swallow --- A Scalable 480-core System for Investigating the Performance and Energy Efficiency of Many-core Applications and Operating Systems

Author: Hollis Simon J.
Kerrison Steve
Publication venue
Publication date: 23/04/2015
Field of study

We present Swallow, a scalable many-core architecture, with a current configuration of 480 x 32-bit processors. Swallow is an open-source architecture, designed from the ground up to deliver scalable increases in usable computational power to allow experimentation with many-core applications and the operating systems that support them. Scalability is enabled by the creation of a tile-able system with a low-latency interconnect, featuring an attractive communication-to-computation ratio and the use of a distributed memory configuration. We analyse the energy and computational and communication performances of Swallow. The system provides 240GIPS with each core consuming 71--193mW, dependent on workload. Power consumption per instruction is lower than almost all systems of comparable scale. We also show how the use of a distributed operating system (nOS) allows the easy creation of scalable software to exploit Swallow's potential. Finally, we show two use case studies: modelling neurons and the overlay of shared memory on a distributed memory system.Comment: An open source release of the Swallow system design and code will follow and references to these will be added at a later dat

arXiv.org e-Print Archive

Explore Bristol Research

QPACE 2 and Domain Decomposition on the Intel Xeon Phi

Author: Arts Paul
Bloch Jacques
Georg Peter
Glaessle Benjamin
Heybrock Simon
Komatsubara Yu
Lohmayer Robert
Mages Simon
Mendl Bernhard
Meyer Nils
Parcianello Alessio
Pleiter Dirk
Rappl Florian
Rossi Mauro
Solbrig Stefan
Tecchiolli Giampietro
Wettig Tilo
Zanier Gianpaolo
Publication venue
Publication date: 01/01/2015
Field of study

We give an overview of QPACE 2, which is a custom-designed supercomputer based on Intel Xeon Phi processors, developed in a collaboration of Regensburg University and Eurotech. We give some general recommendations for how to write high-performance code for the Xeon Phi and then discuss our implementation of a domain-decomposition-based solver and present a number of benchmarks.Comment: plenary talk at Lattice 2014, to appear in the conference proceedings PoS(LATTICE2014), 15 pages, 9 figure

arXiv.org e-Print Archive

Juelich Shared Electronic Resources