21 research outputs found

    Many-core and heterogeneous architectures: programming models and compilation toolchains

    Get PDF
    1noL'abstract è presente nell'allegato / the abstract is in the attachmentopen677. INGEGNERIA INFORMATInopartially_openembargoed_20211002Barchi, Francesc

    Rapid SoC Design: On Architectures, Methodologies and Frameworks

    Full text link
    Modern applications like machine learning, autonomous vehicles, and 5G networking require an order of magnitude boost in processing capability. For several decades, chip designers have relied on Moore’s Law - the doubling of transistor count every two years to deliver improved performance, higher energy efficiency, and an increase in transistor density. With the end of Dennard’s scaling and a slowdown in Moore’s Law, system architects have developed several techniques to deliver on the traditional performance and power improvements we have come to expect. More recently, chip designers have turned towards heterogeneous systems comprised of more specialized processing units to buttress the traditional processing units. These specialized units improve the overall performance, power, and area (PPA) metrics across a wide variety of workloads and applications. While the GPU serves as a classical example, accelerators for machine learning, approximate computing, graph processing, and database applications have become commonplace. This has led to an exponential growth in the variety (and count) of these compute units found in modern embedded and high-performance computing platforms. The various techniques adopted to combat the slowing of Moore’s Law directly translates to an increase in complexity for modern system-on-chips (SoCs). This increase in complexity in turn leads to an increase in design effort and validation time for hardware and the accompanying software stacks. This is further aggravated by fabrication challenges (photo-lithography, tooling, and yield) faced at advanced technology nodes (below 28nm). The inherent complexity in modern SoCs translates into increased costs and time-to-market delays. This holds true across the spectrum, from mobile/handheld processors to high-performance data-center appliances. This dissertation presents several techniques to address the challenges of rapidly birthing complex SoCs. The first part of this dissertation focuses on foundations and architectures that aid in rapid SoC design. It presents a variety of architectural techniques that were developed and leveraged to rapidly construct complex SoCs at advanced process nodes. The next part of the dissertation focuses on the gap between a completed design model (in RTL form) and its physical manifestation (a GDS file that will be sent to the foundry for fabrication). It presents methodologies and a workflow for rapidly walking a design through to completion at arbitrary technology nodes. It also presents progress on creating tools and a flow that is entirely dependent on open-source tools. The last part presents a framework that not only speeds up the integration of a hardware accelerator into an SoC ecosystem, but emphasizes software adoption and usability.PHDElectrical and Computer EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/168119/1/ajayi_1.pd

    Characterization and optimization of network traffic in cortical simulation

    Get PDF
    Considering the great variety of obstacles the Exascale systems have to face in the next future, a deeper attention will be given in this thesis to the interconnect and the power consumption. The data movement challenge involves the whole hierarchical organization of components in HPC systems — i.e. registers, cache, memory, disks. Running scientific applications needs to provide the most effective methods of data transport among the levels of hierarchy. On current petaflop systems, memory access at all the levels is the limiting factor in almost all applications. This drives the requirement for an interconnect achieving adequate rates of data transfer, or throughput, and reducing time delays, or latency, between the levels. Power consumption is identified as the largest hardware research challenge. The annual power cost to operate the system would be above 2.5 B$ per year for an Exascale system using current technology. The research for alternative power-efficient computing device is mandatory for the procurement of the future HPC systems. In this thesis, a preliminary approach will be offered to the critical process of co-design. Co-desing is defined as the simultaneos design of both hardware and software, to implement a desired function. This process both integrates all components of the Exascale initiative and illuminates the trade-offs that must be made within this complex undertaking

    Applications and Techniques for Fast Machine Learning in Science

    Get PDF
    In this community review report, we discuss applications and techniques for fast machine learning (ML) in science - the concept of integrating powerful ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML across a number of scientific domains; techniques for training and implementing performant and resource-efficient ML algorithms; and computing architectures, platforms, and technologies for deploying these algorithms. We also present overlapping challenges across the multiple scientific domains where common solutions can be found. This community report is intended to give plenty of examples and inspiration for scientific discovery through integrated and accelerated ML solutions. This is followed by a high-level overview and organization of technical advances, including an abundance of pointers to source material, which can enable these breakthroughs

    Convergence of Intelligent Data Acquisition and Advanced Computing Systems

    Get PDF
    This book is a collection of published articles from the Sensors Special Issue on "Convergence of Intelligent Data Acquisition and Advanced Computing Systems". It includes extended versions of the conference contributions from the 10th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS’2019), Metz, France, as well as external contributions

    The Fifth NASA Symposium on VLSI Design

    Get PDF
    The fifth annual NASA Symposium on VLSI Design had 13 sessions including Radiation Effects, Architectures, Mixed Signal, Design Techniques, Fault Testing, Synthesis, Signal Processing, and other Featured Presentations. The symposium provides insights into developments in VLSI and digital systems which can be used to increase data systems performance. The presentations share insights into next generation advances that will serve as a basis for future VLSI design

    Neurophysiological mechanisms of sensorimotor recovery from stroke

    Get PDF
    Ischemic stroke often results in the devastating loss of nervous tissue in the cerebral cortex, leading to profound motor deficits when motor territory is lost, and ultimately resulting in a substantial reduction in quality of life for the stroke survivor. The International Classification of Functioning, Disability and Health (ICF) was developed in 2002 by the World Health Organization (WHO) and provides a framework for clinically defining impairment after stroke. While the reduction of burdens due to neurological disease is stated as a mission objective of the National Institute of Neurological Disorders and Stroke (NINDS), recent clinical trials have been unsuccessful in translating preclinical research breakthroughs into actionable therapeutic treatment strategies with meaningful progress towards this goal. This means that research expanding another NINDS mission is now more important than ever: improving fundamental knowledge about the brain and nervous system in order to illuminate the way forward. Past work in the monkey model of ischemic stroke has suggested there may be a relationship between motor improvements after injury and the ability of the animal to reintegrate sensory and motor information during behavior. This relationship may be subserved by sprouting cortical axonal processes that originate in the spared premotor cortex after motor cortical injury in squirrel monkeys. The axons were observed to grow for relatively long distances (millimeters), significantly changing direction so that it appears that they specifically navigate around the injury site and reorient toward the spared sensory cortex. Critically, it remains unknown whether such processes ever form functional synapses, and if they do, whether such synapses perform meaningful calculations or other functions during behavior. The intent of this dissertation was to study this phenomenon in both intact rats and rats with a focal ischemia in primary motor cortex (M1) contralateral to the preferred forelimb during a pellet retrieval task. As this proved to be a challenging and resource-intensive endeavor, a primary objective of the dissertation became to provide the tools to facilitate such a project to begin with. This includes the creation of software, hardware, and novel training and behavioral paradigms for the rat model. At the same time, analysis of previous experimental data suggested that plasticity in the neural activity of the bilateral motor cortices of rats performing pellet retrievals after focal M1 ischemia may exhibit its most salient changes with respect to functional changes in behavior via mechanisms that were different than initially hypothesized. Specifically, a major finding of this dissertation is the finding that evidence of plasticity in the unit activity of bilateral motor cortical areas of the reaching rat is much stronger at the level of population features. These features exhibit changes in dynamics that suggest a shift in network fixed points, which may relate to the stability of filtering performed during behavior. It is therefore predicted that in order to define recovery by comparison to restitution, a specific type of fixed point dynamics must be present in the cortical population state. A final suggestion is that the stability or presence of these dynamics is related to the reintegration of sensory information to the cortex, which may relate to the positive impact of physical therapy during rehabilitation in the postacute window. Although many more rats will be needed to state any of these findings as a definitive fact, this line of inquiry appears to be productive for identifying targets related to sensorimotor integration which may enhance the efficacy of future therapeutic strategies

    Technology 2001: The Second National Technology Transfer Conference and Exposition, volume 1

    Get PDF
    Papers from the technical sessions of the Technology 2001 Conference and Exposition are presented. The technical sessions featured discussions of advanced manufacturing, artificial intelligence, biotechnology, computer graphics and simulation, communications, data and information management, electronics, electro-optics, environmental technology, life sciences, materials science, medical advances, robotics, software engineering, and test and measurement
    corecore