2,898 research outputs found

    XNOR Neural Engine: a Hardware Accelerator IP for 21.6 fJ/op Binary Neural Network Inference

    Binary Neural Networks (BNNs) are promising to deliver accuracy comparable to conventional deep neural networks at a fraction of the cost in terms of memory and energy. In this paper, we introduce the XNOR Neural Engine (XNE), a fully digital configurable hardware accelerator IP for BNNs, integrated within a microcontroller unit (MCU) equipped with an autonomous I/O subsystem and hybrid SRAM / standard cell memory. The XNE is able to fully compute convolutional and dense layers autonomously or in cooperation with the core in the MCU to realize more complex behaviors. We show post-synthesis results in 65nm and 22nm technology for the XNE IP and post-layout results in 22nm for the full MCU indicating that this system can drop the energy cost per binary operation to 21.6fJ per operation at 0.4V, and at the same time is flexible and performant enough to execute state-of-the-art BNN topologies such as ResNet-34 in less than 2.2mJ per frame at 8.9 fps.
    Comment: 11 pages, 8 figures, 2 tables, 3 listings. Accepted for presentation at CODES'18 and for publication in IEEE Transactions on Computer-Aided Design of Circuits and Systems (TCAD) as part of the ESWEEK-TCAD special issue
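
The "binary operation" counted in the abstract above is commonly realized in BNNs as an XNOR followed by a population count. The sketch below illustrates that general technique only; it is not the XNE datapath, and the helper names `pack_bits` and `binary_dot` are hypothetical. It assumes weights and activations take the values +1 and -1, with +1 stored as a set bit.

```python
def pack_bits(values):
    """Pack a list of +1/-1 values into an int; bit i is set when values[i] == +1."""
    word = 0
    for i, v in enumerate(values):
        if v == 1:
            word |= 1 << i
    return word


def binary_dot(a_bits, b_bits, n):
    """Dot product of two n-element +1/-1 vectors stored as packed bits."""
    mask = (1 << n) - 1
    xnor = ~(a_bits ^ b_bits) & mask   # bit is 1 where the two signs agree
    matches = bin(xnor).count("1")     # population count of agreements
    return 2 * matches - n             # agreements minus disagreements


# Compare against the plain integer dot product on a small example.
a = [1, -1, 1, 1, -1, -1, 1, -1]
b = [1, 1, -1, 1, -1, 1, 1, -1]
reference = sum(x * y for x, y in zip(a, b))
result = binary_dot(pack_bits(a), pack_bits(b), len(a))
```

Because each multiply-accumulate collapses to one XNOR and one bit of a popcount, such a layer needs no multipliers, which is the usual argument for the low per-operation energy of BNN hardware.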

    Nanoparticle shape effects on squeezed MHD flow of water based Cu, Al2O3 and SWCNTs over a porous sensor surface

    The impact of nanoparticle shape on the squeezed MHD flow of water-based metallic nanoparticles over a porous sensor surface in the presence of a heat source has been investigated. Three nanoparticle shapes are taken into account: sphere (m = 3.0), cylinder (m = 6.3698) and laminar (m = 16.1576). The governing partial differential equations (PDEs) are transformed into ordinary differential equations (ODEs) by a suitable similarity transformation and solved numerically using the Runge-Kutta-Fehlberg method with a shooting technique. The solid volume fraction and the nanoparticle shape are found to strongly affect the squeezing flow. In the presence of a magnetic field and thermal radiation, sphere-shaped nanoparticles in Cu-water and cylinder-shaped nanoparticles in SWCNTs-water improve heat transfer more than the other nanoparticle shapes in different flow regimes
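
The shooting technique mentioned above turns a boundary-value problem into repeated initial-value solves: guess the missing initial slope, integrate, and correct the guess until the far boundary condition is met. The sketch below shows the idea on a toy problem chosen for illustration (y'' = -y, y(0) = 0, y(pi/2) = 1), not on the paper's flow equations, which the abstract does not give. It also swaps in the classical fixed-step fourth-order Runge-Kutta scheme for the adaptive Runge-Kutta-Fehlberg method to stay short; all function names are hypothetical.

```python
import math


def rk4_integrate(f, x0, state, x_end, steps):
    """Integrate y' = f(x, y) from x0 to x_end with classical fixed-step RK4."""
    h = (x_end - x0) / steps
    x, y = x0, list(state)
    for _ in range(steps):
        k1 = f(x, y)
        k2 = f(x + h / 2, [yi + h / 2 * k for yi, k in zip(y, k1)])
        k3 = f(x + h / 2, [yi + h / 2 * k for yi, k in zip(y, k2)])
        k4 = f(x + h, [yi + h * k for yi, k in zip(y, k3)])
        y = [yi + h / 6 * (a + 2 * b + 2 * c + d)
             for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]
        x += h
    return y


def toy_ode(x, y):
    """Toy problem y'' = -y written as a first-order system [y, y']."""
    return [y[1], -y[0]]


def miss(slope):
    """Boundary residual y(pi/2) - 1 for a guessed initial slope y'(0)."""
    y_end = rk4_integrate(toy_ode, 0.0, [0.0, slope], math.pi / 2, 200)
    return y_end[0] - 1.0


def shoot(s0, s1, tol=1e-10, max_iter=50):
    """Secant iteration on the initial slope until the residual vanishes."""
    f0, f1 = miss(s0), miss(s1)
    for _ in range(max_iter):
        if abs(f1) < tol or f1 == f0:
            break
        s0, s1, f0 = s1, s1 - f1 * (s1 - s0) / (f1 - f0), f1
        f1 = miss(s1)
    return s1


slope = shoot(0.0, 2.0)  # the exact solution y = sin(x) has y'(0) = 1
```

For a nonlinear system such as the similarity equations of a nanofluid flow, the same loop applies, but several unknown initial values are usually adjusted at once with a Newton-type update.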

    Motion estimation and CABAC VLSI co-processors for real-time high-quality H.264/AVC video coding

    Real-time, high-quality video coding is attracting wide interest in the research and industrial communities for different applications. H.264/AVC, a recent standard for high-performance video coding, can be successfully exploited in several scenarios including digital video broadcasting, high-definition TV and DVD-based systems, which must sustain up to tens of Mbits/s. To that end, this paper proposes optimized architectures for the most critical H.264/AVC tasks: motion estimation and context adaptive binary arithmetic coding (CABAC). Post-synthesis results on sub-micron CMOS standard-cell technologies show that the proposed architectures can process 720 × 480 video sequences in real time at 30 frames/s and sustain more than 50 Mbits/s. The achieved circuit complexity and power consumption budgets are suitable for integration in complex VLSI multimedia systems based either on an AHB bus-centric on-chip communication system or on novel Network-on-Chip (NoC) infrastructures for MPSoC (Multi-Processor System on Chip)
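
Motion estimation, one of the two tasks named above, is typically done by block matching: for each block of the current frame, search a window in the reference frame for the displacement that minimizes a cost such as the sum of absolute differences (SAD). The sketch below is a plain software illustration of exhaustive (full-search) block matching, not the architecture the paper proposes; the function names, the 8 × 8 test frame and the search range are assumptions made for the example.

```python
def sad(cur, ref, bx, by, dx, dy, size):
    """Sum of absolute differences between a block and a displaced reference block."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def full_search(cur, ref, bx, by, size, search):
    """Try every displacement within +/-search and keep the lowest-SAD one."""
    height, width = len(ref), len(ref[0])
    best_mv, best_cost = (0, 0), None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates whose block would fall outside the reference frame.
            if by + dy < 0 or by + dy + size > height:
                continue
            if bx + dx < 0 or bx + dx + size > width:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, size)
            if best_cost is None or cost < best_cost:
                best_mv, best_cost = (dx, dy), cost
    return best_mv, best_cost


# Synthetic 8 x 8 reference frame with a non-repeating pattern.
N = 8
ref = [[x * x + 3 * y * y + x * y for x in range(N)] for y in range(N)]
# Current frame: the reference content moved by 2 pixels in x and 1 in y.
cur = [[ref[y + 1][x + 2] if y + 1 < N and x + 2 < N else 0 for x in range(N)]
       for y in range(N)]

mv, cost = full_search(cur, ref, bx=2, by=2, size=4, search=2)
```

The nested loops make the cost grow with the square of the search range for every block, which is why this task is a common target for dedicated co-processors.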

    Application of Robot in CNC Manufacturing Process in Connection with Embedded System

    An embedded system is an electronic system, essentially a computer application with a dedicated function within a larger integrated mechanical and electrical system. Embedded systems are used in various areas worldwide. In this study, we discuss their wide range of applications and, finally, the use of embedded systems in connection with industrial robots and CNC technology in flexible manufacturing systems (FMS), where accurate control of the speed and position of DC motors, with precision and repeatability in motion control, has been achieved in servomotors and machine actuators. The paper treats specific areas of embedded systems and their applications as applications of specific processors and devices, such as robot applications of CNC in world-class manufacturing processes

    On-line multiobjective automatic control system generation by evolutionary algorithms

    Evolutionary algorithms are applied to the on-line generation of servo-motor control systems. In this paper, the evolving population of controllers is evaluated at run-time via hardware in the loop, rather than on a simulated model. Disturbances are also introduced at run-time in order to produce robust performance. Multiobjective optimisation of both PI and Fuzzy Logic controllers is considered. Finally, an on-line implementation of Genetic Programming is presented based around the Simulink standard blockset. The on-line designed controllers are shown to be robust to both system noise and external disturbances while still demonstrating excellent steady-state and dynamic characteristics

    Customer application protocol for data transfer between embedded processor and microcontroller systems

    This paper develops a new customer application protocol (CAP) to improve the efficiency of transferring data between embedded processor and microcontroller systems. The protocol is characterized by its fidelity and simplicity: it uses a small header to control and monitor the data flow between the two systems. This is achieved by constructing an embedded processor system with an Ethernet intellectual property (IP) core running lightweight IP (lwIP) to establish a connection with a microcontroller device. The embedded system is configured on Spartan6E FPGA slices. The system performance is tested by transferring audio samples and displaying them in ChipScope. The performance test of the designed embedded system with the developed protocol showed fast, efficient and high-precision data exchange between the processor and microcontroller systems