11,798 research outputs found

    AutoAccel: Automated Accelerator Generation and Optimization with Composable, Parallel and Pipeline Architecture

    Full text link
    CPU-FPGA heterogeneous architectures are attracting ever-increasing attention in an attempt to advance computational capabilities and energy efficiency in today's datacenters. These architectures provide programmers with the ability to reprogram the FPGAs for flexible acceleration of many workloads. Nonetheless, this advantage is often overshadowed by the poor programmability of FPGAs whose programming is conventionally a RTL design practice. Although recent advances in high-level synthesis (HLS) significantly improve the FPGA programmability, it still leaves programmers facing the challenge of identifying the optimal design configuration in a tremendous design space. This paper aims to address this challenge and pave the path from software programs towards high-quality FPGA accelerators. Specifically, we first propose the composable, parallel and pipeline (CPP) microarchitecture as a template of accelerator designs. Such a well-defined template is able to support efficient accelerator designs for a broad class of computation kernels, and more importantly, drastically reduce the design space. Also, we introduce an analytical model to capture the performance and resource trade-offs among different design configurations of the CPP microarchitecture, which lays the foundation for fast design space exploration. On top of the CPP microarchitecture and its analytical model, we develop the AutoAccel framework to make the entire accelerator generation automated. AutoAccel accepts a software program as an input and performs a series of code transformations based on the result of the analytical-model-based design space exploration to construct the desired CPP microarchitecture. Our experiments show that the AutoAccel-generated accelerators outperform their corresponding software implementations by an average of 72x for a broad class of computation kernels

    A Decoupled 3D Facial Shape Model by Adversarial Training

    Get PDF
    Data-driven generative 3D face models are used to compactly encode facial shape data into meaningful parametric representations. A desirable property of these models is their ability to effectively decouple natural sources of variation, in particular identity and expression. While factorized representations have been proposed for that purpose, they are still limited in the variability they can capture and may present modeling artifacts when applied to tasks such as expression transfer. In this work, we explore a new direction with Generative Adversarial Networks and show that they contribute to better face modeling performances, especially in decoupling natural factors, while also achieving more diverse samples. To train the model we introduce a novel architecture that combines a 3D generator with a 2D discriminator that leverages conventional CNNs, where the two components are bridged by a geometry mapping layer. We further present a training scheme, based on auxiliary classifiers, to explicitly disentangle identity and expression attributes. Through quantitative and qualitative results on standard face datasets, we illustrate the benefits of our model and demonstrate that it outperforms competing state of the art methods in terms of decoupling and diversity.Comment: camera-ready version for ICCV'1

    Protein folding on the ribosome studied using NMR spectroscopy

    Get PDF
    NMR spectroscopy is a powerful tool for the investigation of protein folding and misfolding, providing a characterization of molecular structure, dynamics and exchange processes, across a very wide range of timescales and with near atomic resolution. In recent years NMR methods have also been developed to study protein folding as it might occur within the cell, in a de novo manner, by observing the folding of nascent polypeptides in the process of emerging from the ribosome during synthesis. Despite the 2.3 MDa molecular weight of the bacterial 70S ribosome, many nascent polypeptides, and some ribosomal proteins, have sufficient local flexibility that sharp resonances may be observed in solution-state NMR spectra. In providing information on dynamic regions of the structure, NMR spectroscopy is therefore highly complementary to alternative methods such as X-ray crystallography and cryo-electron microscopy, which have successfully characterized the rigid core of the ribosome particle. However, the low working concentrations and limited sample stability associated with ribosome-nascent chain complexes means that such studies still present significant technical challenges to the NMR spectroscopist. This review will discuss the progress that has been made in this area, surveying all NMR studies that have been published to date, and with a particular focus on strategies for improving experimental sensitivity

    Integrated Thermal-structural-electromagnetic Design Optimization of Large Space Antenna Reflectors

    Get PDF
    The requirements for low mass and high electromagnetic (EM) performance in large, flexible space antenna structures is motivating the development of a systematic procedure for antenna design. In contrast to previous work which concentrated on reducing rms distortions of the reflector surface, thereby indirectly increasing antenna performance, the current work involves a direct approach to increasing electromagnetic performance using mathematical optimization. The thermal, structural, and EM analyses are fully integrated in the context of an optimization procedure, and consequently, the interaction of the various responses is accounted for directly and automatically. Preliminary results are presented for sizing cross-sectional areas of a tetrahedral truss reflector. The results indicate potential for this integrated procedure from the standpoint of mass reduction, performance increase, and efficiency of the design process

    Voltage stabilization in DC microgrids: an approach based on line-independent plug-and-play controllers

    Full text link
    We consider the problem of stabilizing voltages in DC microGrids (mGs) given by the interconnection of Distributed Generation Units (DGUs), power lines and loads. We propose a decentralized control architecture where the primary controller of each DGU can be designed in a Plug-and-Play (PnP) fashion, allowing the seamless addition of new DGUs. Differently from several other approaches to primary control, local design is independent of the parameters of power lines. Moreover, differently from the PnP control scheme in [1], the plug-in of a DGU does not require to update controllers of neighboring DGUs. Local control design is cast into a Linear Matrix Inequality (LMI) problem that, if unfeasible, allows one to deny plug-in requests that might be dangerous for mG stability. The proof of closed-loop stability of voltages exploits structured Lyapunov functions, the LaSalle invariance theorem and properties of graph Laplacians. Theoretical results are backed up by simulations in PSCAD
    • …
    corecore