19,973 research outputs found
Wavelet-Based High-Order Adaptive Modeling of Lossy Interconnects
Abstract—This paper presents a numerical-modeling strategy for simulation of fast transients in lossy electrical interconnects. The proposed algorithm makes use of wavelet representations of voltages and currents along the structure, with the aim of reducing the computational complexity of standard time-domain solvers. A special weak procedure for the implementation of possibly dynamic and nonlinear boundary conditions allows to preserve stability as well as a high approximation order, thus leading to very accurate schemes. On the other hand, the wavelet expansion allows the computation of the solution by using few significant coefficients which are automatically determined at each time step. A dynamically refinable mesh is then used to perform a sparse time-stepping. Several numerical results illustrate the high efficiency of the proposed algorithm, which has been tuned and optimized for best performance in fast digital applications typically found on modern PCB structures. Index Terms—Finite difference methods, time-domain analysis, transmission lines, wavelet transforms. I
Transient Non-linear Thermal FEM Simulation of Smart Power Switches and Verification by Measurements
Thermal FEM (Finite Element Method) simulations can be used to predict the
thermal behavior of power semiconductors in application. Most power
semiconductors are made of silicon. Silicon thermal material properties are
significantly temperature dependent. In this paper, validity of a common
non-linear silicon material model is verified by transient non-linear thermal
FEM simulations of Smart Power Switches and measurements. For verification,
over-temperature protection behavior of Smart Power Switches is employed. This
protection turns off the switch at a pre-defined temperature which is used as a
temperature reference in the investigation. Power dissipation generated during
a thermal overload event of two Smart Power devices is measured and used as an
input stimulus to transient thermal FEM simulations. The duration time of the
event together with the temperature reference is confronted with simulation
results and thus the validity of the silicon model is proved. In addition, the
impact of non-linear thermal properties of silicon on the thermal impedance of
power semiconductors is shown.Comment: Submitted on behalf of TIMA Editions
(http://irevues.inist.fr/tima-editions
Detailed characteristics of drop-laden mixing layers: Large eddy simulation predictions compared to direct numerical simulation
Results are compared from direct numerical simulation (DNS) and large eddy simulation (LES) of a temporal mixing layer laden with evaporating drops to assess the ability of LES to reproduce detailed characteristics of DNS. The LES used computational drops, each of which represented eight physical drops, and a reduced flow field resolution using a grid spacing four times larger than that of the DNS. The LES also used models for the filtered source terms, which express the coupling of the drops with the flow, and for the unresolved subgrid-scale (SGS) fluxes of species mass, momentum, and enthalpy. The LESs were conducted using one of three different SGS-flux models: dynamic-coefficient gradient (GRD), dynamic-coefficient Smagorinsky (SMD), and constant-coefficient scale similarity (SSC). The comparison of the LES with the filtered-and-coarsened (FC) DNS considered detailed aspects of the flow that are of interest in ignition or full combustion. All LESs captured the largest-scale vortex, the global amount of vapor emanating from the drops, and the overall size distribution of the drops. All LESs tended to underpredict the global amount of irreversible entropy production (dissipation). The SMD model was found unable to capture either the global or local vorticity variation and had minimal small-scale activity in dynamic and thermodynamic variables compared to the FC-DNS. The SMD model was also deficient in predicting the spatial distribution of drops and of the dissipation. In contrast, the GRD and SSC models did mimic the small-scale activity of the FC-DNS and the spatial distribution of drops and of the dissipation. Therefore, the GRD and SSC models are recommended, while the SMD model seems inappropriate for combustion or other problems where the local activity must be predicted
A low-power, high-performance speech recognition accelerator
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Automatic Speech Recognition (ASR) is becoming increasingly ubiquitous, especially in the mobile segment. Fast and accurate ASR comes at high energy cost, not being affordable for the tiny power-budgeted mobile devices. Hardware acceleration reduces energy-consumption of ASR systems, while delivering high-performance. In this paper, we present an accelerator for largevocabulary, speaker-independent, continuous speech-recognition. It focuses on the Viterbi search algorithm representing the main bottleneck in an ASR system. The proposed design consists of innovative techniques to improve the memory subsystem, since memory is the main bottleneck for performance and power in these accelerators' design. It includes a prefetching scheme tailored to the needs of ASR systems that hides main memory latency for a large fraction of the memory accesses, negligibly impacting area. Additionally, we introduce a novel bandwidth-saving technique that removes off-chip memory accesses by 20 percent. Finally, we present a power saving technique that significantly reduces the leakage power of the accelerators scratchpad memories, providing between 8.5 and 29.2 percent reduction in entire power dissipation. Overall, the proposed design outperforms implementations running on the CPU by orders of magnitude, and achieves speedups between 1.7x and 5.9x for different speech decoders over a highly optimized CUDA implementation running on Geforce-GTX-980 GPU, while reducing the energy by 123-454x.Peer ReviewedPostprint (author's final draft
A non-hybrid method for the PDF equations of turbulent flows on unstructured grids
In probability density function (PDF) methods of turbulent flows, the joint
PDF of several flow variables is computed by numerically integrating a system
of stochastic differential equations for Lagrangian particles. A set of
parallel algorithms is proposed to provide an efficient solution of the PDF
transport equation, modeling the joint PDF of turbulent velocity, frequency and
concentration of a passive scalar in geometrically complex configurations. An
unstructured Eulerian grid is employed to extract Eulerian statistics, to solve
for quantities represented at fixed locations of the domain (e.g. the mean
pressure) and to track particles. All three aspects regarding the grid make use
of the finite element method (FEM) employing the simplest linear FEM shape
functions. To model the small-scale mixing of the transported scalar, the
interaction by exchange with the conditional mean model is adopted. An adaptive
algorithm that computes the velocity-conditioned scalar mean is proposed that
homogenizes the statistical error over the sample space with no assumption on
the shape of the underlying velocity PDF. Compared to other hybrid
particle-in-cell approaches for the PDF equations, the current methodology is
consistent without the need for consistency conditions. The algorithm is tested
by computing the dispersion of passive scalars released from concentrated
sources in two different turbulent flows: the fully developed turbulent channel
flow and a street canyon (or cavity) flow. Algorithmic details on estimating
conditional and unconditional statistics, particle tracking and particle-number
control are presented in detail. Relevant aspects of performance and
parallelism on cache-based shared memory machines are discussed.Comment: Accepted in Journal of Computational Physics, Feb. 20, 200
- …