22 research outputs found
Challenges for the Parallelization of Loosely Timed SystemC Programs
International audienceSystemC/TLM models are commonly used in the industry to provide an early SoC simulation environment. The open source implementation of the SystemC simulator is sequential. The standard doesn't impose sequential executions, but makes this choice the easiest by imposing coroutine semantics. With the increasing size and complexity of models, and the multiplication of computation cores on recent machines, the parallelization of SystemC simulations is a major research concern. There have been several proposals for SystemC parallelization, but most of them are limited to cycle-accurate models. In this paper we give an overview of the practices in one industrial context. We explain why loosely timed models are the only viable option in this context. We also show that unfortunately, most of the existing approaches for SystemC parallelization can fundamentally not apply to these models. We support this claim with a set of measurements performed on a platform used in production at STMicroelectronics. This paper both surveys existing techniques and identifies unsolved challenges in the parallelization of SystemC/TLM models
SysRT: A Modular Multiprocessor RTOS Simulator for Early Design Space Exploration
International audienc
Recommended from our members
Dynamic time management for improved accuracy and speed in host-compiled multi-core platform models
textWith increasing complexity and software content, modern embedded platforms employ a heterogeneous mix of multi-core processors along with hardware accelerators in order to provide high performance in limited power budgets. Due to complex interactions and highly dynamic behavior, static analysis of real-time performance and other constraints is challenging. As an alternative, full-system simulations have been widely accepted by designers. With traditional approaches being either slow or inaccurate, so-called host-compiled simulators have recently emerged as a solution for rapid evaluation of complete systems at early design stages. In such approaches, a faster simulation is achieved by natively executing application code at the source level, abstracting execution behavior of target platforms, and thus increasing simulation granularity. However, most existing host-compiled simulators often focus on application behavior only while neglecting effects of hardware/software interactions and associated speed and accuracy tradeoffs in platform modeling. In this dissertation, we focus on host-compiled operating system (OS) and processor modeling techniques, and we introduce novel dynamic timing model management approaches that efficiently improve both accuracy and speed of such models via automatically calibrating the simulation granularity. The contributions of this dissertation are twofold: We first establish an infrastructure for efficient host-compiled multi-core platform simulation by developing (a) abstract models of both real-time OSs and processors that replicate timing-accurate hardware/software interactions and enable full-system co-simulation, and (b) quantitative and analytical studies of host-compiled simulation principles to analyze error bounds and investigate possible improvements. Building on this infrastructure, we further propose specific techniques for improving accuracy and speed tradeoffs in host-compiled simulation by developing (c) an automatic timing granularity adjustment technique based on dynamically observing system state to control the simulation, (d) an out-of-order cache hierarchy modeling approach to efficiently reorder memory access behavior in the presence of temporal decoupling, and (e) a synchronized timing model to align platform threads to run efficiently in parallel simulation. Results as applied to industrial-strength platforms confirm that by providing careful abstractions and dynamic timing management, our models can achieve full-system simulations at equivalent speeds of more than a thousand MIPS with less than 3% timing error. Coupled with the capability to easily adjust simulation parameters and configurations, this demonstrates the benefits of our platform models for early application development and exploration.Electrical and Computer Engineerin
A Contribution to Resource-Aware Architectures for Humanoid Robots
The goal of this work is to provide building blocks for resource-aware robot architectures. The topic of these blocks are data-driven generation of context-sensitive resource models, prediction of future resource utilizations, and resource-aware computer vision and motion planning algorithms. The implementation of these algorithms is based on resource-aware concepts and methodologies originating from the Transregional Collaborative Research Center "Invasive Computing" (SFB/TR 89)
system-level modeling of programmable packet processing systems
Computer networks are experiencing explosive growth which is reinforced by the recent
exhaustion of the global IPv4 addresses space in 2011 and the tenfold increase in users from 1999
to 2013. The advent of cloud, mobile and IoT is only going to accelerate this growth. This accedes
the need for flexible and scalable networks that process packets faster. Programmable packet
processing systems have emerged as a solution which aim to find balance between flexibility of
supporting different processing functions while maintaining a high processing capability.
Designing architectures that support such paradigms is fairly complicated as decisions need to be
made for evaluating trade-offs between flexibility and efficiency. Questions like what
programmatic interfaces, services, applications and protocols are required need to be answered
before synthesis of actual hardware. To evaluate such requirements modelling techniques are
required to evaluate architecture decisions accurately early enough in the design phase.
In this thesis, we propose a flexible system level modelling methodology for early
validation, design and analysis of packet processing applications for programmable forwarding
plane architectures. The hardware and software architecture is described in a high level language
which can be used to describe forwarding planes from many core network processors to
reconfigurable processing pipelines. Device architects can use this for design space exploration,
prototyping and validation; where application developers can start pre-silicon application design,
development and debugging to evaluate different hardware and software decisions in an industry
with ever shrinking market windows
Probablistic approaches for intelligent AUV localisation
This thesis studies the problem of intelligent localisation for an autonomous underwater
vehicle (AUV). After an introduction about robot localisation and specific
issues in the underwater domain, the thesis will focus on passive techniques for AUV
localisation, highlighting experimental results and comparison among different techniques.
Then, it will develop active techniques, which require intelligent decisions
about the steps to undertake in order for the AUV to localise itself. The undertaken
methodology consisted in three stages: theoretical analysis of the problem, tests with
a simulation environment, integration in the robot architecture and field trials. The
conclusions highlight applications and scenarios where the developed techniques have
been successfully used or can be potentially used to enhance the results given by current
techniques. The main contribution of this thesis is in the proposal of an active
localisation module, which is able to determine the best set of action to be executed,
in order to maximise the localisation results, in terms of time and efficiency
On the modelization of optical devices: from dielectric cavities to radiating structures
Premièrement, nous allons explorer la modélisation des cavités diélectriques bidimensionnelles. Plus spécifiquement, nous allons développer différentes méthodes de modélisation valides pour des cavités diélectriques à géométrie et profil d’indice de réfraction arbitraires. Ce degré de liberté supplémentaire pourra être utilisé dans le design de microcavités pour des applications spécifiques. Un formalisme de diffusion permettra de définir les modes caractéristiques de ce type de structure et d’en calculer les résonances. Une analyse numérique des équations résultantes montrera que les méthodes intégrales sont possiblement meilleures que les méthodes différentielles. Deuxièmement, nous discuterons de la modélisation de structures radiatives. Nous utiliserons les méthodes développées dans la section précédente pour modéliser les propriétés lasers des microcavités bidimensionnelles prédites par la théorie SALT. Nous aborderons aussi la modélisation de fibres-antennes RF, plus particulièrement les câbles coaxiaux à perte radiative, dans le but d’intégrer des fonctionnalités radio dans un textile de manière transparente à l’utilisateur.In this essay, we will develop different modelization techniques valid for bidimensional dielectric cavities having arbitrary geometries and refractive index profiles and provide a way to accurately compute the resonances of such structures. The refractive index thus becomes an additional design variable for dielectric cavities. A numerical analysis of of the underlying equations of the theory will reveal that perhaps it is best to forego differential equations in favour of integral ones for the scattering problem. In the second part, we will discuss the modelization of radiating structures. Using the formalism developed in the previous section, we will study the lasing properties of bidimensional cavities using the newly developed self-consistent ab initio laser theory (SALT). We will also touch on the modelization of the class of antenna known as leaky coa
Systematische Transaction-Level-Kommunikations-Modellierung mit SystemC
An emerging approach to embedded system design is to assemble them from a library of hardware and software component models (IP, intellectual property) using a system description language, such as SystemC. SystemC allows describing the communication among IPs in terms of abstract operations (transactions). The promise is that with transaction-level modeling (TLM), future systems-on-chip with one billion transistors and more can be composed out of IPs as simply as playing with LEGO bricks. However, reality is far out. In fact, each IP vendor promotes another proprietary interface standard and the provided design tools lack compatibility, such that heterogeneous IPs cannot be integrated efficiently. A novel generic interconnect fabric for TLM is presented which aims at enabling inter-operation between models of different levels of abstraction (mixed-mode) and models with different interfaces (heterogeneous components), with as little overhead as possible. A generic, protocol independent representation of transactions is developed, among with an abstraction level formalism. This approach is shown to support systematic simulation of state-of-the-art buses and networks-on-chip such as IBM CoreConnect and PCI Express over several levels of TLM abstraction. A layered simulation framework for SystemC, GreenBus, is developed to examine the proposed concepts. The thesis discusses new implementation techniques for communication modeling with SystemC which outperform the existing approaches in terms of flexibility, simulation accuracy, and performance. Based on these techniques, advanced concepts for TLM-based hardware/software co-design and FPGA prototyping are examined. Several experiments and a video processor case study highlight the efficiency of the approach and show its applicability in a TLM design flow.Eingebettete Systeme werden zunehmend auf Basis vorgefertigter Hard- und Softwarebausteine entwickelt, die in Form von Modellen (IP, Intellectual Property) vorliegen. Hierzu werden Systembeschreibungssprachen wie SystemC eingesetzt. SystemC ermöglicht, die Kommunikation zwischen IPs durch abstrakte Operationen, sog. Transaktionen zu beschreiben. Mit dieser Transaction-Level-Modellierung (TLM) sollen auch zukünftige Systeme mit 1 Milliarde Transistoren und mehr effizient entwickelt werden können. Idealerweise sollte das Hantieren mit IPs dabei so einfach sein wie das Spielen mit LEGO-Steinen. In der Realität sind jedoch IPs unterschiedlicher Hersteller nicht ohne weiteres integrierbar, und auch die Entwurfswerkzeuge sind nicht kompatibel. In dieser Doktorarbeit wird ein neuer, generischer Ansatz für die Transaction-Level-Modellierung mit SystemC vorgestellt, der Kommunikation zwischen Modellen auf unterschiedlichen Abstraktionsebenen (Mixed-Mode) und mit unterschiedlichen Schnittstellen (heterogene Komponenten) möglich macht. Der zusätzlich benötigte Simulations- und Code-Aufwand ist minimal. Ein protokollunabhängiges Transaktionsmodell und ein formaler Ansatz zur Beschreibung von Abstraktionsebenen werden vorgestellt, mit denen verschiedenartige Busse und Networks-on-Chip wie IBM CoreConnect und PCI Express auf verschiedenen TLM-Abstraktionsebenen simuliert werden können. Ein modulares Simulationsframework für SystemC wird entwickelt (GreenBus), um die vorgeschlagenen Konzepte zu untersuchen. Anhand von GreenBus werden neue Implementierungstechniken diskutiert, die den existierenden Ansätzen in Flexibilität, Simulationsgenauigkeit und -geschwindigkeit überlegen sind. Die Vor- und Nachteile der entwickelten Techniken werden mit Experimenten belegt, und eine Videoprozessor-Fallstudie demonstriert die Effizienz des Ansatzes in einem TLM-basierten Entwurfsfluss
Multi-level simulation of nano-electronic digital circuits on GPUs
Simulation of circuits and faults is an essential part in design and test validation tasks of contemporary nano-electronic digital integrated CMOS circuits.
Shrinking technology processes with smaller feature sizes and strict performance and reliability requirements demand not only detailed validation of the functional properties of a design, but also accurate validation of non-functional aspects including the timing behavior. However, due to the rising complexity of the circuit behavior and the steady growth of the designs with respect to the transistor count, timing-accurate simulation of current designs requires a lot of computational effort which can only be handled by proper abstraction and a high degree of parallelization.
This work presents a simulation model for scalable and accurate timing simulation of digital circuits on data-parallel graphics processing unit (GPU) accelerators.
By providing compact modeling and data-structures as well as through exploiting multiple dimensions of parallelism, the simulation model enables not only fast and timing-accurate simulation at logic level, but also massively-parallel simulation with switch level accuracy.
The model facilitates extensions for fast and efficient fault simulation of small delay faults at logic level, as well as first-order parametric and parasitic faults at switch level.
With the parallelization on GPUs, detailed and scalable simulation is enabled that is applicable even to multi-million gate designs.
This way, comprehensive analyses of realistic timing-related faults in presence of process- and parameter variations are enabled for the first time.
Additional simulation efficiency is achieved by merging the presented methods in a unified simulation model, that allows to combine the unique advantages of the different levels of abstraction in a mixed-abstraction multi-level simulation flow to reach even higher speedups.
Experimental results show that the implemented parallel approach achieves unprecedented simulation throughput as well as high speedup compared to conventional timing simulators.
The underlying model scales for multi-million gate designs and gives detailed insights into the timing behavior of digital CMOS circuits, thereby enabling large-scale applications to aid even highly complex design and test validation tasks