Search CORE

801 research outputs found

CLEX: Yet Another Supercomputer Architecture?

Author: Lenzen Christoph
Wattenhofer Roger
Publication venue
Publication date: 01/01/2016
Field of study

We propose the CLEX supercomputer topology and routing scheme. We prove that CLEX can utilize a constant fraction of the total bandwidth for point-to-point communication, at delays proportional to the sum of the number of intermediate hops and the maximum physical distance between any two nodes. Moreover, % applying an asymmetric bandwidth assignment to the links, all-to-all communication can be realized

(1+o(1))

-optimally both with regard to bandwidth and delays. This is achieved at node degrees of

n^{\varepsilon}

, for an arbitrary small constant

\varepsilon\in (0,1]

. In contrast, these results are impossible in any network featuring constant or polylogarithmic node degrees. Through simulation, we assess the benefits of an implementation of the proposed communication strategy. Our results indicate that, for a million processors, CLEX can increase bandwidth utilization and reduce average routing path length by at least factors

10

respectively

5

in comparison to a torus network. Furthermore, the CLEX communication scheme features several other properties, such as deadlock-freedom, inherent fault-tolerance, and canonical partition into smaller subsystems

arXiv.org e-Print Archive

MPG.PuRe

Computers working at the speed of light

Author: Selviah D.R.
Publication venue
Publication date: 19/12/2008
Field of study

UCL Discovery

The Brain on Low Power Architectures - Efficient Simulation of Cortical Slow Waves and Asynchronous States

Author: Ammendola Roberto
Biagioni Andrea
Capuani Fabrizio
Cicero Francesca Lo
Cretaro Paolo
De Bonis Giulia
Lonardo Alessandro
Martinelli Michele
Paolucci Pier Stanislao
Pastorelli Elena
Pontisso Luca
Simula Francesco
Vicini Piero
Publication venue: 'IOS Press'
Publication date: 01/01/2018
Field of study

Efficient brain simulation is a scientific grand challenge, a parallel/distributed coding challenge and a source of requirements and suggestions for future computing architectures. Indeed, the human brain includes about 10^15 synapses and 10^11 neurons activated at a mean rate of several Hz. Full brain simulation poses Exascale challenges even if simulated at the highest abstraction level. The WaveScalES experiment in the Human Brain Project (HBP) has the goal of matching experimental measures and simulations of slow waves during deep-sleep and anesthesia and the transition to other brain states. The focus is the development of dedicated large-scale parallel/distributed simulation technologies. The ExaNeSt project designs an ARM-based, low-power HPC architecture scalable to million of cores, developing a dedicated scalable interconnect system, and SWA/AW simulations are included among the driving benchmarks. At the joint between both projects is the INFN proprietary Distributed and Plastic Spiking Neural Networks (DPSNN) simulation engine. DPSNN can be configured to stress either the networking or the computation features available on the execution platforms. The simulation stresses the networking component when the neural net - composed by a relatively low number of neurons, each one projecting thousands of synapses - is distributed over a large number of hardware cores. When growing the number of neurons per core, the computation starts to be the dominating component for short range connections. This paper reports about preliminary performance results obtained on an ARM-based HPC prototype developed in the framework of the ExaNeSt project. Furthermore, a comparison is given of instantaneous power, total energy consumption, execution time and energetic cost per synaptic event of SWA/AW DPSNN simulations when executed on either ARM- or Intel-based server platforms

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza

Genome dynamics of the human embryonic kidney 293 lineage in response to cell biology manipulations

Author: Boone Morgane
Callewaert Nico
Chen Jason
Drmanac Radoje
Lambrechts Diether
Lemmens Irma
Lin Yao-Cheng
Meuris Leander
Moisse Matthieu
Plaisance Stéphane
Reumers Joke
Soete Arne
Speleman Franki
Tavernier Jan
Van de Peer Yves
Van Roy Nadine
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

The HEK293 human cell lineage is widely used in cell biology and biotechnology. Here we use whole-genome resequencing of six 293 cell lines to study the dynamics of this aneuploid genome in response to the manipulations used to generate common 293 cell derivatives, such as transformation and stable clone generation (293T); suspension growth adaptation (293S); and cytotoxic lectin selection (293SG). Remarkably, we observe that copy number alteration detection could identify the genomic region that enabled cell survival under selective conditions (i.c. ricin selection). Furthermore, we present methods to detect human/vector genome breakpoints and a user-friendly visualization tool for the 293 genome data. We also establish that the genome structure composition is in steady state for most of these cell lines when standard cell culturing conditions are used. This resource enables novel and more informed studies with 293 cells, and we will distribute the sequenced cell lines to this effect

Crossref

Ghent University Academic Bibliography

PubMed Central

UPSpace at the University of Pretoria

Symmetric Interconnection Networks from Cubic Crystal Lattices

Author: Beivide Ramón
Camarero Cristóbal
Martínez Carmen
Publication venue
Publication date: 08/11/2013
Field of study

Torus networks of moderate degree have been widely used in the supercomputer industry. Tori are superb when used for executing applications that require near-neighbor communications. Nevertheless, they are not so good when dealing with global communications. Hence, typical 3D implementations have evolved to 5D networks, among other reasons, to reduce network distances. Most of these big systems are mixed-radix tori which are not the best option for minimizing distances and efficiently using network resources. This paper is focused on improving the topological properties of these networks. By using integral matrices to deal with Cayley graphs over Abelian groups, we have been able to propose and analyze a family of high-dimensional grid-based interconnection networks. As they are built over

n

-dimensional grids that induce a regular tiling of the space, these topologies have been denoted \textsl{lattice graphs}. We will focus on cubic crystal lattices for modeling symmetric 3D networks. Other higher dimensional networks can be composed over these graphs, as illustrated in this research. Easy network partitioning can also take advantage of this network composition operation. Minimal routing algorithms are also provided for these new topologies. Finally, some practical issues such as implementability and preliminary performance evaluations have been addressed

arXiv.org e-Print Archive

CiteSeerX

Recommended from our members

A network architecture for Petaflops supercomputers.

Author: DeBenedictis Erik P.
Publication venue: Sandia National Laboratories
Publication date: 01/09/2003
Field of study

If we are to build a supercomputer with a speed of 10{sup 15} floating operations per second (1 PetaFLOPS), interconnect technology will need to be improved considerably over what it is today. In this report, we explore one possible interconnect design for such a network. The guiding principle in this design is the optimization of all components for the finiteness of the speed of light. To achieve a linear speedup in time over well-tested supercomputers of todays' designs will require scaling up of processor power and bandwidth and scaling down of latency. Latency scaling is the most challenging: it requires a 100 ns user-to-user latency for messages traveling the full diameter of the machine. To meet this constraint requires simultaneously minimizing wire length through 3D packaging, new low-latency electrical signaling mechanisms, extremely fast routers, and new network interfaces. In this report, we outline approaches and implementations that will meet the requirements when implemented as a system. No technology breakthroughs are required

UNT Digital Library