Search CORE

337,435 research outputs found

Critical Behavior of the Three-Dimensional Ising Spin Glass

We have simulated, using parallel tempering, the three dimensional Ising spin glass model with binary couplings in a helicoidal geometry. The largest lattice (L=20) has been studied using a dedicated computer (the SUE machine). We have obtained, measuring the correlation length in the critical region, a strong evidence for a second-order finite temperature phase transition ruling out other possible scenarios like a Kosterlitz-Thouless phase transition. Precise values for the

\nu

and

\eta

critical exponents are also presented.Comment: RevTex; 12 pages plus 5 ps figures. Final version to be published in PR

arXiv.org e-Print Archive

Docta Complutense

Crossref

apeNEXT: A multi-TFlops Computer for Simulations in Lattice Gauge Theory

Author: apeNEXT Collaboration
Bodin F.
Boucaud Philippe
Cabibbo N.
De Luca S.
De Pietri R.
Di Carlo F.
Di Renzo F.
Kaldass H.
Lonardo A.
Lukyanov M.
Micheli J.
Morenas V.
Paschedag N.
Pene O.
Pleiter D.
Rapuano F.
Rossetti D.
Sartori L.
Schifano F.
Simma H.
Tripiccione R.
Vicini P.
Publication venue
Publication date: 01/01/2003
Field of study

We present the APE (Array Processor Experiment) project for the development of dedicated parallel computers for numerical simulations in lattice gauge theories. While APEmille is a production machine in today's physics simulations at various sites in Europe, a new machine, apeNEXT, is currently being developed to provide multi-Tflops computing performance. Like previous APE machines, the new supercomputer is largely custom designed and specifically optimized for simulations of Lattice QCD.Comment: Poster at the XXIII Physics in Collisions Conference (PIC03), Zeuthen, Germany, June 2003, 3 pages, Latex. PSN FRAP15. Replaced for adding forgotten autho

arXiv.org e-Print Archive

CiteSeerX

DESY Publication Database

HAL-IN2P3

Archivio istituzionale della Ricerca - Università degli Studi di Parma

HAL Clermont Université

DESY

Archivio istituzionale della ricerca - Università di Ferrara

CERN Document Server

Measuring NUMA effects with the STREAM benchmark

Author: Bergstrom Lars
Publication venue
Publication date: 01/01/2011
Field of study

Modern high-end machines feature multiple processor packages, each of which contains multiple independent cores and integrated memory controllers connected directly to dedicated physical RAM. These packages are connected via a shared bus, creating a system with a heterogeneous memory hierarchy. Since this shared bus has less bandwidth than the sum of the links to memory, aggregate memory bandwidth is higher when parallel threads all access memory local to their processor package than when they access memory attached to a remote package. But, the impact of this heterogeneous memory architecture is not easily understood from vendor benchmarks. Even where these measurements are available, they provide only best-case memory throughput. This work presents a series of modifications to the well-known STREAM benchmark to measure the effects of NUMA on both a 48-core AMD Opteron machine and a 32-core Intel Xeon machine

arXiv.org e-Print Archive

CiteSeerX

C-NNAP - A parallel processing architecture for binary neural networks

Author: Austin J.
Cass B.
Kennedy J.V.
Pack R.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1995
Field of study

This paper describes the CNNAP machine, a MIMD implementation of an array of ADAM binary neural networks, primarily designed for image processing. CNNAP comprises an array of VME cards each containing a DSP, SCSI controller, and a new design of the SAT peripheral processor. The SAT processor is a dedicated hardware implemention that performs binary neural network computations. The SAT processor yields a potential speed-up of between 108 times to 182 times that of the current DSP with its dedicated coprocessor. CNNAP in association with the SAT provides a fast, parallel environment for performing binary neural network operations

CiteSeerX

Crossref

White Rose Research Online

Ianus: an Adpative FPGA Computer

Author: Belletti F.
Campos I.
Cruz A.
Fernandez L. A.
Gaviro S. Perez
Ianus Collaboration
Jimenez S.
Maiorano A.
Mantovani F.
Marinari E.
Martin-Mayor V.
Munoz-Sudupe A.
Navarro D.
Poli G.
Ruiz-Lorenzo J. J.
Schifano F.
Sciretti D.
Tarancon A.
Tellez P.
Tripiccione R.
Velasco J. L.
Publication venue
Publication date: 12/07/2005
Field of study

Dedicated machines designed for specific computational algorithms can outperform conventional computers by several orders of magnitude. In this note we describe {\it Ianus}, a new generation FPGA based machine and its basic features: hardware integration and wide reprogrammability. Our goal is to build a machine that can fully exploit the performance potential of new generation FPGA devices. We also plan a software platform which simplifies its programming, in order to extend its intended range of application to a wide class of interesting and computationally demanding problems. The decision to develop a dedicated processor is a complex one, involving careful assessment of its performance lead, during its expected lifetime, over traditional computers, taking into account their performance increase, as predicted by Moore's law. We discuss this point in detail

arXiv.org e-Print Archive

Archivio della Ricerca - Università degli Studi di Siena

Archivio istituzionale della ricerca - Università di Ferrara

Archivio della ricerca- Università di Roma La Sapienza

A low-cost parallel implementation of direct numerical simulation of wall turbulence

Author: Bertolotti
del Álamo
Dmitruk
Günther
Iovieno
Jiménez
Kim
Kim
Kwok
Lele
Mahesh
Maurizio Quadrio
Moin
Moser
Na
Paolo Luchini
Pelz
Pozzi
Quadrio
Quadrio
Spotz
Thomas
Publication venue: 'Elsevier BV'
Publication date: 18/06/2005
Field of study

A numerical method for the direct numerical simulation of incompressible wall turbulence in rectangular and cylindrical geometries is presented. The distinctive feature resides in its design being targeted towards an efficient distributed-memory parallel computing on commodity hardware. The adopted discretization is spectral in the two homogeneous directions; fourth-order accurate, compact finite-difference schemes over a variable-spacing mesh in the wall-normal direction are key to our parallel implementation. The parallel algorithm is designed in such a way as to minimize data exchange among the computing machines, and in particular to avoid taking a global transpose of the data during the pseudo-spectral evaluation of the non-linear terms. The computing machines can then be connected to each other through low-cost network devices. The code is optimized for memory requirements, which can moreover be subdivided among the computing nodes. The layout of a simple, dedicated and optimized computing system based on commodity hardware is described. The performance of the numerical method on this computing system is evaluated and compared with that of other codes described in the literature, as well as with that of the same code implementing a commonly employed strategy for the pseudo-spectral calculation.Comment: To be published in J. Comp. Physic

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

Archivio della Ricerca - Università di Salerno

CERN Document Server

Janus II: a new generation application-driven computer for spin-system simulations

This paper describes the architecture, the development and the implementation of Janus II, a new generation application-driven number cruncher optimized for Monte Carlo simulations of spin systems (mainly spin glasses). This domain of computational physics is a recognized grand challenge of high-performance computing: the resources necessary to study in detail theoretical models that can make contact with experimental data are by far beyond those available using commodity computer systems. On the other hand, several specific features of the associated algorithms suggest that unconventional computer architectures, which can be implemented with available electronics technologies, may lead to order of magnitude increases in performance, reducing to acceptable values on human scales the time needed to carry out simulation campaigns that would take centuries on commercially available machines. Janus II is one such machine, recently developed and commissioned, that builds upon and improves on the successful JANUS machine, which has been used for physics since 2008 and is still in operation today. This paper describes in detail the motivations behind the project, the computational requirements, the architecture and the implementation of this new machine and compares its expected performances with those of currently available commercial systems.Comment: 28 pages, 6 figure

arXiv.org e-Print Archive

Docta Complutense

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

La Colmena

Turismo y patrimonio (E-Journal)

Archivio istituzionale della ricerca - Università di Ferrara

DIALNET

Archivio della ricerca- Università di Roma La Sapienza