Search CORE

2,002 research outputs found

Runtime Scheduling, Allocation, and Execution of Real-Time Hardware Tasks onto Xilinx FPGAs Subject to Fault Occurrence

Author: Arslan Tughrul
Benkrid Khaled
Ebrahim Ali
Hong Chuan
Iturbe Xabier
Martinez Imanol
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2013
Field of study

This paper describes a novel way to exploit the computation capabilities delivered by modern Field-Programmable Gate Arrays (FPGAs), not only towards a higher performance, but also towards an improved reliability. Computation-specific pieces of circuitry are dynamically scheduled and allocated to different resources on the chip based on a set of novel algorithms which are described in detail in this article. These algorithms consider most of the technological constraints existing in modern partially reconfigurable FPGAs as well as spontaneously occurring faults and emerging permanent damage in the silicon substrate of the chip. In addition, the algorithms target other important aspects such as communications and synchronization among the different computations that are carried out, either concurrently or at different times. The effectiveness of the proposed algorithms is tested by means of a wide range of synthetic simulations, and, notably, a proof-of-concept implementation of them using real FPGA hardware is outlined

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer

A Massively Parallel 2D Rectangle Placement Method

Author: . Xinlian Liu
Moorman Michael
Publication venue: GSTF Journal on Computing (JoC)
Publication date: 16/09/2014
Field of study

Layout design is a frequently occurring process that oftencombines human and computer reasoning. Because of the combinatorialnature of the problem, solving even a small size input involves searchinga prohibitively large state space. An algorithm PEMS (Pseudo-exhaustiveEdge Minimizing Search) is proposed for approximating a 2D rectanglepacking variant of the problem. The proposed method is inspiredby MERA (Minimum Enclosing of Rectangle Area) [1] and MEGA(Minimum Enclosing Under Gravitational Attraction) [2], yet produceshigher quality solutions, in terms of final space utilization. To addressthe performance cost, a CUDA based acceleration algorithm is developedwith significant speedup

GSTF Digital Library (GSTF-DL): Open Journal Systems (Global Science and Technology Forum)

Task scheduling and placement for reconfigurable devices

Author: Elbidweihy Mostafa
Publication venue: LSU Digital Commons
Publication date: 01/01/2009
Field of study

Partially reconfigurable devices allow the execution of different tasks at the same time, removing tasks when they finish and inserting new tasks when they arrive. This dissertation investigates scheduling and placing real-time tasks (tasks with deadline) on reconfigurable devices. One basic scheduler is the First-Fit scheduler. By allowing the First-Fit scheduler to retry tasks while they can satisfy their deadlines, we found that its performance can be enhanced to be better than other schedulers. We also proposed a placement idea based on partitioning the reconfigurable area into regions of various widths, assigning a task to a region based on its width. This idea has a similar rejection rate to a First-Fit scheduler that retries placing tasks and performs better than the First-Fit that does not retry tasks. Also, this regions-based scheduling method has a better running time. Managing how the space will be shared among tasks is a problems of interest. The main function of the free-space manager is to maintain information about the free space (areas not used by active tasks) after any placement or deletion of a task. Speed and efficiency of the free-space data structure are important as well as its effect on scheduler performance. We introduce the use of maximal horizontal strips and maximal vertical strips to represent free space. This resulted in a faster free space manager compared to what has been used in the area. Most researchers in the area of scheduling on reconfigurable devices assumed a homogeneous FPGA with only CLBs in the reconfigurable area. Most reconfigurable devices offered in the market, however, are not homogeneous but heterogeneous with other components between CLBs. We studied the effect of heterogeneity on the performance of schedulers designed for a homogeneous structure. We found that current schedulers result in worse performance when applied to a heterogeneous structure, but by simple modifications, we can apply them to a heterogeneous structure and achieve good performance. Consequently, the approach of studying homogeneous FPGAs is a valid one, as the scheduling ideas discovered there do carry over to heterogeneous FPGAs

Louisiana State University

Evolvable hardware system for automatic optical inspection

Author: Evans Jonathan
Publication venue: The University of Edinburgh
Publication date: 01/01/2004
Field of study

Edinburgh Research Archive

Memory system design for application-specific hardware

Author: Stramondo G.
Publication venue
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications

Optimal fault-tolerant placement of relay nodes in a mission critical wireless network

Author: Di Pompeo Silvia
Finzi Alberto
Guarneri Riccardo
Lanciotti Filiberto
Mancini Toni
Scialanca Agostino
Tronci Enrico
Publication venue
Publication date: 01/01/2018
Field of study

The operations of many critical infrastructures (e.g., airports) heavily depend on proper functioning of the radio communication network supporting operations. As a result, such a communication network is indeed a mission-critical communication network that needs adequate protection from external electromagnetic interferences. This is usually done through radiogoniometers. Basically, by using at least three suitably deployed radiogoniometers and a gateway gathering information from them, sources of electromagnetic emissions that are not supposed to be present in the monitored area can be localised. Typically, relay nodes are used to connect radiogoniometers to the gateway. As a result, some degree of fault-tolerance for the network of relay nodes is essential in order to offer a reliable monitoring. On the other hand, deployment of relay nodes is typically quite expensive. As a result, we have two conflicting requirements: minimise costs while guaranteeing a given fault-tolerance. In this paper address the problem of computing a deployment for relay nodes that minimises the relay node network cost while at the same time guaranteeing proper working of the network even when some of the relay nodes (up to a given maximum number) become faulty (fault-tolerance). We show that the above problem can be formulated as a Mixed Integer Linear Programming (MILP) as well as a Pseudo-Boolean Satisfiability (PB-SAT) optimisation problem and present experimental results com- paring the two approaches on realistic scenarios

Archivio della ricerca - Università degli studi di Napoli Federico II

Archivio della ricerca- Università di Roma La Sapienza

3D IC optimal layout design. A parallel and distributed topological approach

Author: AB Kahng
C Ababei
J Funke
K Grzesiak-Kopeć
K Grzesiak-Kopeć
K Grzesiak-Kopeć
S Boettcher
T Lengauer
Z Li
Publication venue
Publication date: 26/11/2019
Field of study

The task of 3D ICs layout design involves the assembly of millions of components taking into account many different requirements and constraints such as topological, wiring or manufacturability ones. It is a NP-hard problem that requires new non-deterministic and heuristic algorithms. Considering the time complexity, the commonly applied Fiduccia-Mattheyses partitioning algorithm is superior to any other local search method. Nevertheless, it can often miss to reach a quasi-optimal solution in 3D spaces. The presented approach uses an original 3D layout graph partitioning heuristics implemented with use of the extremal optimization method. The goal is to minimize the total wire-length in the chip. In order to improve the time complexity a parallel and distributed Java implementation is applied. Inside one Java Virtual Machine separate optimization algorithms are executed by independent threads. The work may also be shared among different machines by means of The Java Remote Method Invocation system.Comment: 26 pages, 9 figure

arXiv.org e-Print Archive

Crossref

Jagiellonian Univeristy Repository