Search CORE

3,052 research outputs found

Stable Unitary Integrators for the Numerical Implementation of Continuous Unitary Transformations

Author: Refael Gil
Savitz Samuel
Publication venue: 'American Physical Society (APS)'
Publication date: 15/09/2017
Field of study

The technique of continuous unitary transformations has recently been used to provide physical insight into a diverse array of quantum mechanical systems. However, the question of how to best numerically implement the flow equations has received little attention. The most immediately apparent approach, using standard Runge-Kutta numerical integration algorithms, suffers from both severe inefficiency due to stiffness and the loss of unitarity. After reviewing the formalism of continuous unitary transformations and Wegner's original choice for the infinitesimal generator of the flow, we present a number of approaches to resolving these issues including a choice of generator which induces what we call the "uniform tangent decay flow" and three numerical integrators specifically designed to perform continuous unitary transformations efficiently while preserving the unitarity of flow. We conclude by applying one of the flow algorithms to a simple calculation that visually demonstrates the many-body localization transition.Comment: 13 pages, 4 figures, Comments welcom

arXiv.org e-Print Archive

Caltech Authors

Customisable arithmetic hardware designs

Author: Cheung Chak-Chung Ray
Cheung Chak-Chung Ray
Publication venue
Publication date: 01/01/2007
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

An Efficient Implementation of an Exponential Random Number Generator in a Field Programmable Gate Array (FPGA)

Author: Gautham Smitha
Publication venue: VCU Scholars Compass
Publication date: 29/04/2010
Field of study

Many physical, biological, ecological and behavioral events occur at times and rates that are exponentially distributed. Modeling these systems requires simulators that can accurately generate a large quantity of exponentially distributed random numbers, which is a computationally intensive task. To improve the performance of these simulators, one approach is to move portions of the computationally inefficient simulation tasks from software to custom hardware implemented in Field Programmable Gate Arrays (FPGAs). In this work, we study efficient FPGA implementations of exponentially distributed random number generators to improve simulator performance. Our approach is to generate uniformly distributed random numbers using standard techniques and scale them using the inverse cumulative distribution function (CDF). Scaling is implemented by curve fitting piecewise linear, quadratic, cubic, and higher order functions to solve for the inverse CDF. As the complexity of the scaling function increases (in terms of order and the number of pieces), number accuracy increases and additional FPGA resources (logic cells and block RAMs) are consumed. We analyze these tradeoffs and show how a designer with particular accuracy requirements and FPGA resource constraints can implement an accurate and efficient exponentially distributed random number generator

VCU Scholars Compass

Acceleration techniques for dependability simulation

Author: Barnette James David
Publication venue
Publication date
Field of study

As computer systems increase in complexity, the need to project system performance from the earliest design and development stages increases. We have to employ simulation for detailed dependability studies of large systems. However, as the complexity of the simulation model increases, the time required to obtain statistically significant results also increases. This paper discusses an approach that is application independent and can be readily applied to any process-based simulation model. Topics include background on classical discrete event simulation and techniques for random variate generation and statistics gathering to support simulation

NASA Technical Reports Server

Application-Specific Number Representation

Author: Fu Haohuan
Fu Haohuan
Publication venue: Computing, Imperial College London
Publication date: 01/02/2009
Field of study

Reconfigurable devices, such as Field Programmable Gate Arrays (FPGAs), enable application- specific number representations. Well-known number formats include fixed-point, floating- point, logarithmic number system (LNS), and residue number system (RNS). Such different number representations lead to different arithmetic designs and error behaviours, thus produc- ing implementations with different performance, accuracy, and cost. To investigate the design options in number representations, the first part of this thesis presents a platform that enables automated exploration of the number representation design space. The second part of the thesis shows case studies that optimise the designs for area, latency or throughput from the perspective of number representations. Automated design space exploration in the first part addresses the following two major issues: ² Automation requires arithmetic unit generation. This thesis provides optimised arithmetic library generators for logarithmic and residue arithmetic units, which support a wide range of bit widths and achieve significant improvement over previous designs. ² Generation of arithmetic units requires specifying the bit widths for each variable. This thesis describes an automatic bit-width optimisation tool called R-Tool, which combines dynamic and static analysis methods, and supports different number systems (fixed-point, floating-point, and LNS numbers). Putting it all together, the second part explores the effects of application-specific number representation on practical benchmarks, such as radiative Monte Carlo simulation, and seismic imaging computations. Experimental results show that customising the number representations brings benefits to hardware implementations: by selecting a more appropriate number format, we can reduce the area cost by up to 73.5% and improve the throughput by 14.2% to 34.1%; by performing the bit-width optimisation, we can further reduce the area cost by 9.7% to 17.3%. On the performance side, hardware implementations with customised number formats achieve 5 to potentially over 40 times speedup over software implementations

Spiral - Imperial College Digital Repository