19,973 research outputs found

    Wavelet-Based High-Order Adaptive Modeling of Lossy Interconnects

    Get PDF
    Abstract—This paper presents a numerical-modeling strategy for simulation of fast transients in lossy electrical interconnects. The proposed algorithm makes use of wavelet representations of voltages and currents along the structure, with the aim of reducing the computational complexity of standard time-domain solvers. A special weak procedure for the implementation of possibly dynamic and nonlinear boundary conditions allows to preserve stability as well as a high approximation order, thus leading to very accurate schemes. On the other hand, the wavelet expansion allows the computation of the solution by using few significant coefficients which are automatically determined at each time step. A dynamically refinable mesh is then used to perform a sparse time-stepping. Several numerical results illustrate the high efficiency of the proposed algorithm, which has been tuned and optimized for best performance in fast digital applications typically found on modern PCB structures. Index Terms—Finite difference methods, time-domain analysis, transmission lines, wavelet transforms. I

    Transient Non-linear Thermal FEM Simulation of Smart Power Switches and Verification by Measurements

    Get PDF
    Thermal FEM (Finite Element Method) simulations can be used to predict the thermal behavior of power semiconductors in application. Most power semiconductors are made of silicon. Silicon thermal material properties are significantly temperature dependent. In this paper, validity of a common non-linear silicon material model is verified by transient non-linear thermal FEM simulations of Smart Power Switches and measurements. For verification, over-temperature protection behavior of Smart Power Switches is employed. This protection turns off the switch at a pre-defined temperature which is used as a temperature reference in the investigation. Power dissipation generated during a thermal overload event of two Smart Power devices is measured and used as an input stimulus to transient thermal FEM simulations. The duration time of the event together with the temperature reference is confronted with simulation results and thus the validity of the silicon model is proved. In addition, the impact of non-linear thermal properties of silicon on the thermal impedance of power semiconductors is shown.Comment: Submitted on behalf of TIMA Editions (http://irevues.inist.fr/tima-editions

    Detailed characteristics of drop-laden mixing layers: Large eddy simulation predictions compared to direct numerical simulation

    Get PDF
    Results are compared from direct numerical simulation (DNS) and large eddy simulation (LES) of a temporal mixing layer laden with evaporating drops to assess the ability of LES to reproduce detailed characteristics of DNS. The LES used computational drops, each of which represented eight physical drops, and a reduced flow field resolution using a grid spacing four times larger than that of the DNS. The LES also used models for the filtered source terms, which express the coupling of the drops with the flow, and for the unresolved subgrid-scale (SGS) fluxes of species mass, momentum, and enthalpy. The LESs were conducted using one of three different SGS-flux models: dynamic-coefficient gradient (GRD), dynamic-coefficient Smagorinsky (SMD), and constant-coefficient scale similarity (SSC). The comparison of the LES with the filtered-and-coarsened (FC) DNS considered detailed aspects of the flow that are of interest in ignition or full combustion. All LESs captured the largest-scale vortex, the global amount of vapor emanating from the drops, and the overall size distribution of the drops. All LESs tended to underpredict the global amount of irreversible entropy production (dissipation). The SMD model was found unable to capture either the global or local vorticity variation and had minimal small-scale activity in dynamic and thermodynamic variables compared to the FC-DNS. The SMD model was also deficient in predicting the spatial distribution of drops and of the dissipation. In contrast, the GRD and SSC models did mimic the small-scale activity of the FC-DNS and the spatial distribution of drops and of the dissipation. Therefore, the GRD and SSC models are recommended, while the SMD model seems inappropriate for combustion or other problems where the local activity must be predicted

    A low-power, high-performance speech recognition accelerator

    Get PDF
    © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. Hardware acceleration reduces energy-consumption of ASR systems, while delivering high-performance. In this paper, we present an accelerator for largevocabulary, speaker-independent, continuous speech-recognition. It focuses on the Viterbi search algorithm representing the main bottleneck in an ASR system. The proposed design consists of innovative techniques to improve the memory subsystem, since memory is the main bottleneck for performance and power in these accelerators' design. It includes a prefetching scheme tailored to the needs of ASR systems that hides main memory latency for a large fraction of the memory accesses, negligibly impacting area. Additionally, we introduce a novel bandwidth-saving technique that removes off-chip memory accesses by 20 percent. Finally, we present a power saving technique that significantly reduces the leakage power of the accelerators scratchpad memories, providing between 8.5 and 29.2 percent reduction in entire power dissipation. Overall, the proposed design outperforms implementations running on the CPU by orders of magnitude, and achieves speedups between 1.7x and 5.9x for different speech decoders over a highly optimized CUDA implementation running on Geforce-GTX-980 GPU, while reducing the energy by 123-454x.Peer ReviewedPostprint (author's final draft

    A non-hybrid method for the PDF equations of turbulent flows on unstructured grids

    Full text link
    In probability density function (PDF) methods of turbulent flows, the joint PDF of several flow variables is computed by numerically integrating a system of stochastic differential equations for Lagrangian particles. A set of parallel algorithms is proposed to provide an efficient solution of the PDF transport equation, modeling the joint PDF of turbulent velocity, frequency and concentration of a passive scalar in geometrically complex configurations. An unstructured Eulerian grid is employed to extract Eulerian statistics, to solve for quantities represented at fixed locations of the domain (e.g. the mean pressure) and to track particles. All three aspects regarding the grid make use of the finite element method (FEM) employing the simplest linear FEM shape functions. To model the small-scale mixing of the transported scalar, the interaction by exchange with the conditional mean model is adopted. An adaptive algorithm that computes the velocity-conditioned scalar mean is proposed that homogenizes the statistical error over the sample space with no assumption on the shape of the underlying velocity PDF. Compared to other hybrid particle-in-cell approaches for the PDF equations, the current methodology is consistent without the need for consistency conditions. The algorithm is tested by computing the dispersion of passive scalars released from concentrated sources in two different turbulent flows: the fully developed turbulent channel flow and a street canyon (or cavity) flow. Algorithmic details on estimating conditional and unconditional statistics, particle tracking and particle-number control are presented in detail. Relevant aspects of performance and parallelism on cache-based shared memory machines are discussed.Comment: Accepted in Journal of Computational Physics, Feb. 20, 200
    • …
    corecore