796 research outputs found
An Implementation of a Dual-Processor System on FPGA
In recent years, Field-Programmable Gate Arrays (FPGA) have evolved rapidly
paving the way for a whole new range of computing paradigms. On the other hand,
computer applications are evolving. There is a rising demand for a system that
is general-purpose and yet has the processing abilities to accommodate current
trends in application processing. This work proposes a design and
implementation of a tightly-coupled FPGA-based dual-processor platform. We
architect a platform that optimizes the utilization of FPGA resources and
allows for the investigation of practical implementation issues such as cache
design. The performance of the proposed prototype is then evaluated, as
different configurations of a uniprocessor and a dual-processor system are
studied and compared against each other and against published results for
common industry-standard CPU platforms. The proposed implementation utilizes
the Nios II 32-bit embedded soft-core processor architecture designed for the
Altera Cyclone III family of FPGAs
The AXIOM software layers
AXIOM project aims at developing a heterogeneous computing board (SMP-FPGA).The Software Layers developed at the AXIOM project are explained.OmpSs provides an easy way to execute heterogeneous codes in multiple cores. People and objects will soon share the same digital network for information exchange in a world named as the age of the cyber-physical systems. The general expectation is that people and systems will interact in real-time. This poses pressure onto systems design to support increasing demands on computational power, while keeping a low power envelop. Additionally, modular scaling and easy programmability are also important to ensure these systems to become widespread. The whole set of expectations impose scientific and technological challenges that need to be properly addressed.The AXIOM project (Agile, eXtensible, fast I/O Module) will research new hardware/software architectures for cyber-physical systems to meet such expectations. The technical approach aims at solving fundamental problems to enable easy programmability of heterogeneous multi-core multi-board systems. AXIOM proposes the use of the task-based OmpSs programming model, leveraging low-level communication interfaces provided by the hardware. Modular scalability will be possible thanks to a fast interconnect embedded into each module. To this aim, an innovative ARM and FPGA-based board will be designed, with enhanced capabilities for interfacing with the physical world. Its effectiveness will be demonstrated with key scenarios such as Smart Video-Surveillance and Smart Living/Home (domotics).Peer ReviewedPostprint (author's final draft
Parallel Modelling Paradigm in Multimedia Applications: Mapping and Scheduling onto a Multi-Processor System-on-Chip Platform
Multi-processor systems have appeared as a promising alternative to face the difficulties of creating even faster uni-processor systems using latest technologies. Emerg-ing design paradigms such as Multiprocessor System-on-a-Chip (MpSoC) offer high levels of performance and flex-ibility and at the same time promise low-cost, reliable and power-efficient implementations. However, the design com-plexity of such systems have increased tremendously. One source of the complexity stems from highly parallel het-erogeneous nature of the underlying hardware architecture, which poses many challenges for mapping of an applica-tion to the architecture. This motivates the development of a unified programming paradigm that facilitates the map-ping by hiding the architectural complexity and exposing the parallel resources of the architecture. To enable de-sign reuse, such a programming paradigm has to support a smooth translation of sequentially-coded software algo-rithms into their parallel implementations. In this paper we address the parallelization of sequential multimedia appli-cations written in C/C++ for their mapping and schedul-ing onto a flexible MpSoC platform. We show that using our approach an architecture-independent multi-threaded model of a MPEGâ2 video decoder algorithm can be ob-tained with only few modifications to an existing sequential implementation of the algorithm. 1
Multiprocessor System-on-Chips based Wireless Sensor Network Energy Optimization
Wireless Sensor Network (WSN) is an integrated part of the Internet-of-Things (IoT) used to monitor the physical or environmental conditions without human intervention. In WSN one of the major challenges is energy consumption reduction both at the sensor nodes and network levels. High energy consumption not only causes an increased carbon footprint but also limits the lifetime (LT) of the network. Network-on-Chip (NoC) based Multiprocessor System-on-Chips (MPSoCs) are becoming the de-facto computing platform for computationally extensive real-time applications in IoT due to their high performance and exceptional quality-of-service. In this thesis a task scheduling problem is investigated using MPSoCs architecture for tasks with precedence and deadline constraints in order to minimize the processing energy consumption while guaranteeing the timing constraints. Moreover, energy-aware nodes clustering is also performed to reduce the transmission energy consumption of the sensor nodes. Three distinct problems for energy optimization are investigated given as follows:
First, a contention-aware energy-efficient static scheduling using NoC based heterogeneous MPSoC is performed for real-time tasks with an individual deadline and precedence constraints. An offline meta-heuristic based contention-aware energy-efficient task scheduling is developed that performs task ordering, mapping, and voltage assignment in an integrated manner. Compared to state-of-the-art scheduling our proposed algorithm significantly improves the energy-efficiency.
Second, an energy-aware scheduling is investigated for a set of tasks with precedence constraints deploying Voltage Frequency Island (VFI) based heterogeneous NoC-MPSoCs. A novel population based algorithm called ARSH-FATI is developed that can dynamically switch between explorative and exploitative search modes at run-time. ARSH-FATI performance is superior to the existing task schedulers developed for homogeneous VFI-NoC-MPSoCs.
Third, the transmission energy consumption of the sensor nodes in WSN is reduced by developing ARSH-FATI based Cluster Head Selection (ARSH-FATI-CHS) algorithm integrated with a heuristic called Novel Ranked Based Clustering (NRC). In cluster formation parameters such as residual energy, distance parameters, and workload on CHs are considered to improve LT of the network. The results prove that ARSH-FATI-CHS outperforms other state-of-the-art clustering algorithms in terms of LT.University of Derby, Derby, U
Optimisation des mémoires dans le flot de conception des systÚmes multiprocesseurs sur puces pour des applications de type multimédia
RĂSUMĂ
Les systĂšmes multiprocesseurs sur puce (MPSoC) constituent l'un des principaux moteurs de
la rĂ©volution industrielle des semi-conducteurs. Les MPSoCs jouissent dâune popularitĂ©
grandissante dans le domaine des systĂšmes embarquĂ©s. Leur grande capacitĂ© de parallĂ©lisation Ă
un trÚs haut niveau d'intégration, en font de bons candidats pour les systÚmes et les applications
telles que les applications multimĂ©dia. La consommation dâĂ©nergie, la capacitĂ© de calcul et
lâespace de conception sont les Ă©lĂ©ments dont dĂ©pendent les performances de ce type
dâapplications. La mĂ©moire est le facteur clĂ© permettant dâamĂ©liorer de façon substantielle leurs
performances. Avec lâarrivĂ©e des applications multimĂ©dias embarquĂ©es dans lâindustrie, le
problÚme des gains de performances est vital. La masse de données traitées par ces applications
requiert une grande capacité de calcul et de mémoire. DerniÚrement, de nouveaux modÚles de
programmation ont fait leur apparition. Ces modĂšles offrent une programmation de plus haut
niveau pour rĂ©pondre aux besoins croissants des MPSoCs, dâoĂč la nĂ©cessitĂ© de nouvelles
approches d'optimisation et de placement pour les systÚmes embarqués et leurs modÚles de
programmation.
La conception niveau systĂšme des architectures MPSoCs pour les applications de type
multimĂ©dia constitue un vĂ©ritable dĂ©fi technique. Lâobjectif gĂ©nĂ©ral de cette thĂšse est de relever
ce dĂ©fi en trouvant des solutions. Plus spĂ©cifiquement, cette thĂšse se propose dâintroduire le
concept dâoptimisation mĂ©moire dans le flot de conception niveau systĂšme et dâobserver leur
impact sur différents modÚles de programmation utilisés lors de la conception de MPSoCs. Il
sâagit, autrement dit, de rĂ©aliser lâunification du domaine de la compilation avec celui de la
conception niveau systĂšme pour une meilleure conception globale.
La contribution de cette thĂšse est de proposer de nouvelles approches pour les techniques
d'optimisation mémoire pour la conception MPSoCs avec différents modÚles de programmation.
Nos travaux de recherche concernent l'intĂ©gration des techniques dâoptimisation mĂ©moire dans le
flot de conception de MPSoCs pour différents types de modÚle de programmation. Ces travaux
ont été exécutés en collaboration avec STMicroelectronics.----------ABSTRACT
Multiprocessor systems-on-chip (MPSoC) are defined as one of the main drivers of the
industrial semiconductors revolution. MPSoCs are gaining popularity in the field of embedded
systems. Pursuant to their great ability to parallelize at a very high integration level, they are
good candidates for systems and applications such as multimedia. Memory is becoming a key
player for significant improvements in these applications (i.e. power, performance and area).
With the emergence of more embedded multimedia applications in the industry, this issue
becomes increasingly vital. The large amount of data manipulated by these applications requires
high-capacity calculation and memory. Lately, new programming models have been introduced.
These programming models offer a higher programming level to answer the increasing needs of
MPSoCs. This leads to the need of new optimization and mapping approaches suitable for
embedded systems and their programming models.
The overall objective of this research is to find solutions to the challenges of system level
design of applications such as multimedia. This entails the development of new approaches and
new optimization techniques. The specific objective of this research is to introduce the concept
of memory optimization in the system level conception flow and study its impact on different
programming models used for MPSoCsâ design. In other words, it is the unification of the
compilation and system level design domains.
The contribution of this research is to propose new approaches for memory optimization
techniques for MPSoCsâ design in different programming models. This thesis relates to the
integration of memory optimization to varying programming model types in the MPSoCs
conception flow. Our research was done in collaboration with STMicroelectronics
The AXIOM platform for next-generation cyber physical systems
Cyber-Physical Systems (CPSs) are widely used in many applications that require interactions between humans and their physical environment. These systems usually integrate a set of hardware-software components for optimal application execution in terms of performance and energy consumption. The AXIOM project (Agile, eXtensible, fast I/O Module), presented in this paper, proposes a hardware-software platform for CPS coupled with an easy parallel programming model and sufficient connectivity so that the performance can scale-up by adding multiple boards. AXIOM supports a task-based programming model based on OmpSs and leverages a high-speed, inexpensive communication interface called AXIOM-Link. The board also tightly couples the CPU with reconfigurable resources to accelerate portions of the applications. As case studies, AXIOM uses smart video surveillance, and smart home living applicationsThis work is partially supported by the European Union H2020 program through the AXIOM project (grant ICT-01-2014 GA
645496) and HiPEAC (GA 687698), by the Spanish Government through Programa Severo Ochoa (SEV-2015-0493), by the Spanish Ministry of Science and Technology through TIN2015-65316-P project, and by the Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272). We also thank the Xilinx University Program for its hardware and software donations.Peer ReviewedPostprint (author's final draft
- âŠ