86 research outputs found
Formal and Informal Methods for Multi-Core Design Space Exploration
We propose a tool-supported methodology for design-space exploration for
embedded systems. It provides means to define high-level models of applications
and multi-processor architectures and evaluate the performance of different
deployment (mapping, scheduling) strategies while taking uncertainty into
account. We argue that this extension of the scope of formal verification is
important for the viability of the domain.Comment: In Proceedings QAPL 2014, arXiv:1406.156
A Survey of Techniques For Improving Energy Efficiency in Embedded Computing Systems
Recent technological advances have greatly improved the performance and
features of embedded systems. With the number of just mobile devices now
reaching nearly equal to the population of earth, embedded systems have truly
become ubiquitous. These trends, however, have also made the task of managing
their power consumption extremely challenging. In recent years, several
techniques have been proposed to address this issue. In this paper, we survey
the techniques for managing power consumption of embedded systems. We discuss
the need of power management and provide a classification of the techniques on
several important parameters to highlight their similarities and differences.
This paper is intended to help the researchers and application-developers in
gaining insights into the working of power management techniques and designing
even more efficient high-performance embedded systems of tomorrow
An Early Real-Time Checker for Retargetable Compile-Time Analysis
ABSTRACT With the demand for energy-efficient embedded computing and the rise of heterogeneous architectures, automatically retargetable techniques are likely to grow in importance. On the one hand, retargetable compilers do not handle realtime constraints properly. On the other hand, conventional worst-case execution time (WCET) approaches are not automatically retargetable: measurement-based methods require time-consuming dynamic characterization of target processors, whereas static program analysis and abstract interpretation are performed in a post-compiling phase, being therefore restricted to the set of supported targets. This paper proposes a retargetable technique to grant early realtime checking (ERTC) capabilities for design space exploration. The technique provides a general (minimum, maximum and exact-delay) timing analysis at compile time. It allows the early detection of inconsistent time-constraint combinations prior to the generation of binary executables, thereby promising higher design productivity. ERTC is a complement to state-of-the-art design flows, which could benefit from early infeasiblity detection and exploration of alternative target processors, before the binary executables are submitted to tight-bound BCET and WCET analyses for the selected target processor
New Trends in Photonic Switching and Optical Network Architecture for Data Centre and Computing Systems
AI/ML for data centres and data centres for AI/ML are defining new trends in
cloud computing. Disaggregated heterogeneous reconfigurable computing systems
realized by photonic interconnects and photonic switching expect greatly
enhanced throughput and energy-efficiency for AI/ML workloads, especially when
aided by an AI/ML control plane
Automatic Demand Draft Withdrawal Machine Using Micro controller
At present, intelligent automation has stepped its presence in every field all over the world. This paper is an intelligent automation for issuing the demand drafts in automated way. In present scenario, demand draft was taken manually. Without the help of human source, it is highly impossible to take DD. To take DD the customer has to go for bank and wait for an hour. Initially the customer will be provided DD form, after filling the complete details it will be forwarded to the next level, this process is followed in existing system and DD will be taken only on bank working hours. In order to overcome such kind of difficulties an idea is introduced which allows the customer to take DD automatically. To make the work easier, faster and automated a domain called embedded system is used. In this automated system, the customer has to insert their currency in the rupee slot and have to wait for few seconds to accept it. Then within next few seconds they have to feed the required details in the PC instead of writing in a form. This is then generated and the sum of amount which has been inserted will be added in the softcopy. Then they have to verify once and have to give print .Thus the Demand Draft will be generated in few seconds instead of standing for hours. Thus, this system eliminates the drawbacks of the existing set-up. It is placed in the bank branches similar to that of ATM. In proposed system, image processing is used to count the currency. With this automation, issuing of demand draft is made easier by feeding the inputs in the input module. In our project we designed the hardware for taking the currency notes from the input slot and software is designed to generate the demand draft. The objective of this project is to design a simple, easy to install, microcontroller-based circuit to control and PC or laptop interface for generating the demand draft
A 0.80pJ/flop, 1.24Tflop/sW 8-to-64 bit Transprecision Floating-Point Unit for a 64 bit RISC-V Processor in 22nm FD-SOI
The crisis of Moore's law and new dominant Machine Learning workloads require a paradigm shift towards finely tunable-precision (a.k.a. transprecision) computing. More specifically, we need floating-point circuits that are capable to operate on many formats with high flexibility. We present the first silicon implementation of a 64-bit transprecision floating-point unit. It fully supports the standard double, single, and half precision, alongside custom bfloat and 8 bit formats. Operations occur on scalars or 2, 4, or 8-way SIMD vectors. We have integrated the 247 kGE unit into a 64 bit application-class RISC-V processor core, where the added transprecision support accounts for an energy and area overhead of merely 11 and 9, respectively; yet achieving speedups and per-datum energy gains of 7.3x and 7.94x. We implemented the design in a 22 nm FD-SOI technology. The unit achieves energy efficiencies between 75 Gflop/sW and 1.24 Tflop/sW, and a performance between 1.85 Gflop/s and 14.83 Gflop/s, across formats
Design and Evaluation of SmallFloat SIMD extensions to the RISC-V ISA
RISC-V is an open-source instruction set architecture (ISA) with a modular design consisting of a mandatory base part plus optional extensions. The RISC-V 32IMFC ISA configuration has been widely adopted for the design of new-generation, low-power processors. Motivated by the important energy savings that smaller-than-32-bit FP types have enabled in several application domains and related compute platforms, some recent studies have published encouraging early results for their adoption in RISC-V processors. In this paper we introduce a set of ISA extensions for RISC-V 32IMFC, supporting scalar and SIMD operations (fitting the 32-bit register size) for 8-bit and two 16-bit FP types. The proposed extensions are enabled by exposing the new FP types to the standard C/C++ type system and an implementation for the RISC-V GCC compiler is presented. As a further, novel contribution, we extensively characterize the performance and energy savings achievable with the proposed extensions. On average, experimental results show that their adoption provide benefits in terms of performance (1.64
7 speedup for 16-bit and 2.18
7 for 8-bit types) and energy consumption (30% saving for 16-bit and 50% for 8-bit types). We also illustrate an approach based on automatic precision tuning to make effective use of the new FP types
- …