Search CORE

2,516 research outputs found

Overview of Swallow --- A Scalable 480-core System for Investigating the Performance and Energy Efficiency of Many-core Applications and Operating Systems

Author: Hollis Simon J.
Kerrison Steve
Publication venue
Publication date: 23/04/2015
Field of study

We present Swallow, a scalable many-core architecture, with a current configuration of 480 x 32-bit processors. Swallow is an open-source architecture, designed from the ground up to deliver scalable increases in usable computational power to allow experimentation with many-core applications and the operating systems that support them. Scalability is enabled by the creation of a tile-able system with a low-latency interconnect, featuring an attractive communication-to-computation ratio and the use of a distributed memory configuration. We analyse the energy and computational and communication performances of Swallow. The system provides 240GIPS with each core consuming 71--193mW, dependent on workload. Power consumption per instruction is lower than almost all systems of comparable scale. We also show how the use of a distributed operating system (nOS) allows the easy creation of scalable software to exploit Swallow's potential. Finally, we show two use case studies: modelling neurons and the overlay of shared memory on a distributed memory system.Comment: An open source release of the Swallow system design and code will follow and references to these will be added at a later dat

arXiv.org e-Print Archive

Parallel Simulation of Neural Networks on SpiNNaker Universal Neuromorphic Hardware

Author: Jin Xin
Publication venue
Publication date: 31/12/2010
Field of study

Minimizing Peak Temperature for Pipelined Hard Real-time Systems

Author
Publication venue: 'Research Publishing Services'
Publication date: 01/01/2016
Field of study

Guarded atomic actions and refinement in a system-on-chip development flow: bridging the specification gap with Event-B

Author: Colley John
Publication venue
Publication date: 01/01/2010
Field of study

Modern System-on-chip (SoC) hardware design puts considerable pressure on existing design and verification flows, languages and tools. The Register Transfer Level (RTL)description, which forms the input for synchronous, logic synthesis-driven design is at too low a level of abstraction for efficient architectural exploration and re-use. The existing methods for taking a high-level paper specification and refining this specification to an implementation that meets its performance criteria is largely manual and error-prone and as RTL descriptions get larger, a systematic design method is necessary to address explicitly the timing issues that arise when applying logic synthesis to such large blocks.Guarded Atomic Actions have been shown to offer a convenient notation for describing microarchitectures that is amenable to formal reasoning and high-level synthesis. Event-B is a language and method that supports the development of specifications with automatic proof and refinement, based on guarded atomic actions. Latency-insensitive design ensures that a design composed of functionally correct components will be independent of communication latency. A method has been developed which uses Event-B for latency-insensitive SoC component and sub-system design which can be combined with high-level, component synthesis to enable architectural exploration and re-use at the specification level and to close the specification gap in the SoC hardware flow

Southampton (e-Prints Soton)

Nanofabric Power Analysis: Biosequence Alignment Case of Study

Author: Amaru’ L.G.
Frache Stefano
Graziano Mariagrazia
Zamboni Maurizio
Publication venue: IEEE/ACM
Publication date: 01/01/2011
Field of study

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino