Search CORE

690 research outputs found

A bibliography on parallel and vector numerical algorithms

Author: Ortega J. M.
Voigt R. G.
Publication venue
Publication date
Field of study

This is a bibliography of numerical methods. It also includes a number of other references on machine architecture, programming language, and other topics of interest to scientific computing. Certain conference proceedings and anthologies which have been published in book form are listed also

NASA Technical Reports Server

High Performance Reconfigurable Computing for Linear Algebra: Design and Performance Analysis

Author: Sun Junqing
Publication venue: TRACE: Tennessee Research and Creative Exchange
Publication date: 01/05/2008
Field of study

Field Programmable Gate Arrays (FPGAs) enable powerful performance acceleration for scientific computations because of their intrinsic parallelism, pipeline ability, and flexible architecture. This dissertation explores the computational power of FPGAs for an important scientific application: linear algebra. First of all, optimized linear algebra subroutines are presented based on enhancements to both algorithms and hardware architectures. Compared to microprocessors, these routines achieve significant speedup. Second, computing with mixed-precision data on FPGAs is proposed for higher performance. Experimental analysis shows that mixed-precision algorithms on FPGAs can achieve the high performance of using lower-precision data while keeping higher-precision accuracy for finding solutions of linear equations. Third, an execution time model is built for reconfigurable computers (RC), which plays an important role in performance analysis and optimal resource utilization of FPGAs. The accuracy and efficiency of parallel computing performance models often depend on mean maximum computations. Despite significant prior work, there have been no sufficient mathematical tools for this important calculation. This work presents an Effective Mean Maximum Approximation method, which is more general, accurate, and efficient than previous methods. Together, these research results help address how to make linear algebra applications perform better on high performance reconfigurable computing architectures

University of Tennessee, Knoxville: Trace

Solution of partial differential equations on vector and parallel computers

Author: Ortega J. M.
Voigt R. G.
Publication venue
Publication date
Field of study

The present status of numerical methods for partial differential equations on vector and parallel computers was reviewed. The relevant aspects of these computers are discussed and a brief review of their development is included, with particular attention paid to those characteristics that influence algorithm selection. Both direct and iterative methods are given for elliptic equations as well as explicit and implicit methods for initial boundary value problems. The intent is to point out attractive methods as well as areas where this class of computer architecture cannot be fully utilized because of either hardware restrictions or the lack of adequate algorithms. Application areas utilizing these computers are briefly discussed

NASA Technical Reports Server

Electronically reconfigurable metal-on-silicon metamaterial

Author: David R. Smith
Jae Seung Lee
L. Solymar
N. Engheta
Nan M. Jokerst
P. Bartley
Paul Schmalenberg
R. Marques
S. M. Sze
Sulochana Dhar
Talmage Tyler
Vinh Nguyen
Yaroslav Urzhumov
Publication venue: 'American Physical Society (APS)'
Publication date: 06/08/2012
Field of study

Reconfigurable metamaterial-based apertures can play a unique role in both imaging and in beam-forming applications, where current technology relies mostly on the fabrication and integration of large detector or antenna arrays. Here, we report the experimental demonstration of a voltage-controlled, silicon-based electromagnetic metamaterial operating in the W-band (75-110 GHz). In this composite semiconductor metamaterial, patterned gold metamaterial elements serve both to manage electromagnetic wave propagation while simultaneously acting as electrical Schottky contacts that control the local conductivity of the semiconductor substrate. The active device layers consist of a patterned metal on a 2-{\mu}m-thick n-doped silicon layer, adhesively bonded to a transparent Pyrex wafer. The transmittance of the composite metamaterial can be modulated over a given frequency band as a function of bias voltage. We demonstrate a quantitative understanding of the composite device through the application of numerical approaches that simultaneously treat the semiconductor junction physics as well as wave propagation.Comment: 28 double-spaced pages, 8 figure

arXiv.org e-Print Archive

Crossref

DukeSpace

An Improved BKW Algorithm for LWE with Applications to Cryptography and Lattices

Author: A Blum
A Lenstra
C Gentry
D Micciancio
D Wagner
DJ Bernstein
ED Mulder
J Hoffstein
JC Lagarias
M Liu
MJ Coster
MR Albrecht
MR Albrecht
MR Albrecht
N Döttling
O Regev
P Kirchner
Q Guo
R Lindner
RM Gray
S Arora
T Pöppelmann
V Lyubashevsky
V Lyubashevsky
W Banaszczyk
Y Chen
Z Brakerski
É Levieil
Publication venue
Publication date: 29/06/2015
Field of study

In this paper, we study the Learning With Errors problem and its binary variant, where secrets and errors are binary or taken in a small interval. We introduce a new variant of the Blum, Kalai and Wasserman algorithm, relying on a quantization step that generalizes and fine-tunes modulus switching. In general this new technique yields a significant gain in the constant in front of the exponent in the overall complexity. We illustrate this by solving p within half a day a LWE instance with dimension n = 128, modulus

q = n^2

, Gaussian noise

\alpha = 1/(\sqrt{n/\pi} \log^2 n)

and binary secret, using

2^{28}

samples, while the previous best result based on BKW claims a time complexity of

2^{74}

with

2^{60}

samples for the same parameters. We then introduce variants of BDD, GapSVP and UniqueSVP, where the target point is required to lie in the fundamental parallelepiped, and show how the previous algorithm is able to solve these variants in subexponential time. Moreover, we also show how the previous algorithm can be used to solve the BinaryLWE problem with n samples in subexponential time

2^{(\ln 2/2+o(1))n/\log \log n}

. This analysis does not require any heuristic assumption, contrary to other algebraic approaches; instead, it uses a variant of an idea by Lyubashevsky to generate many samples from a small number of samples. This makes it possible to asymptotically and heuristically break the NTRU cryptosystem in subexponential time (without contradicting its security assumption). We are also able to solve subset sum problems in subexponential time for density

o(1)

, which is of independent interest: for such density, the previous best algorithm requires exponential time. As a direct application, we can solve in subexponential time the parameters of a cryptosystem based on this problem proposed at TCC 2010.Comment: CRYPTO 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Cryptology ePrint Archive

Recommended from our members

Hybrid Analog-Digital Co-Processing for Scientific Computation

Author: Huang Yipeng
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2018
Field of study

In the past 10 years computer architecture research has moved to more heterogeneity and less adherence to conventional abstractions. Scientists and engineers hold an unshakable belief that computing holds keys to unlocking humanity's Grand Challenges. Acting on that belief they have looked deeper into computer architecture to find specialized support for their applications. Likewise, computer architects have looked deeper into circuits and devices in search of untapped performance and efficiency. The lines between computer architecture layers---applications, algorithms, architectures, microarchitectures, circuits and devices---have blurred. Against this backdrop, a menagerie of computer architectures are on the horizon, ones that forgo basic assumptions about computer hardware, and require new thinking of how such hardware supports problems and algorithms. This thesis is about revisiting hybrid analog-digital computing in support of diverse modern workloads. Hybrid computing had extensive applications in early computing history, and has been revisited for small-scale applications in embedded systems. But architectural support for using hybrid computing in modern workloads, at scale and with high accuracy solutions, has been lacking. I demonstrate solving a variety of scientific computing problems, including stochastic ODEs, partial differential equations, linear algebra, and nonlinear systems of equations, as case studies in hybrid computing. I solve these problems on a system of multiple prototype analog accelerator chips built by a team at Columbia University. On that team I made contributions toward programming the chips, building the digital interface, and validating the chips' functionality. The analog accelerator chip is intended for use in conjunction with a conventional digital host computer. The appeal and motivation for using an analog accelerator is efficiency and performance, but it comes with limitations in accuracy and problem sizes that we have to work around. The first problem is how to do problems in this unconventional computation model. Scientific computing phrases problems as differential equations and algebraic equations. Differential equations are a continuous view of the world, while algebraic equations are a discrete one. Prior work in analog computing mostly focused on differential equations; algebraic equations played a minor role in prior work in analog computing. The secret to using the analog accelerator to support modern workloads on conventional computers is that these two viewpoints are interchangeable. The algebraic equations that underlie most workloads can be solved as differential equations, and differential equations are naturally solvable in the analog accelerator chip. A hybrid analog-digital computer architecture can focus on solving linear and nonlinear algebra problems to support many workloads. The second problem is how to get accurate solutions using hybrid analog-digital computing. The reason that the analog computation model gives less accurate solutions is it gives up representing numbers as digital binary numbers, and instead uses the full range of analog voltage and current to represent real numbers. Prior work has established that encoding data in analog signals gives an energy efficiency advantage as long as the analog data precision is limited. While the analog accelerator alone may be useful for energy-constrained applications where inputs and outputs are imprecise, we are more interested in using analog in conjunction with digital for precise solutions. This thesis gives novel insight that the trick to do so is to solve nonlinear problems where low-precision guesses are useful for conventional digital algorithms. The third problem is how to solve large problems using hybrid analog-digital computing. The reason the analog computation model can't handle large problems is it gives up step-by-step discrete-time operation, instead allowing variables to evolve smoothly in continuous time. To make that happen the analog accelerator works by chaining hardware for mathematical operations end-to-end. During computation analog data flows through the hardware with no overheads in control logic and memory accesses. The downside is then the needed hardware size grows alongside problem sizes. While scientific computing researchers have for a long time split large problems into smaller subproblems to fit in digital computer constraints, this thesis is a first attempt to consider these divide-and-conquer algorithms as an essential tool in using the analog model of computation. As we enter the post-Moore’s law era of computing, unconventional architectures will offer specialized models of computation that uniquely support specific problem types. Two prominent examples are deep neural networks and quantum computers. Recent trends in computer science research show these unconventional architectures will soon have broad adoption. In this thesis I show another specialized, unconventional architecture is to use analog accelerators to solve problems in scientific computing. Computer architecture researchers will discover other important models of computation in the future. This thesis is an example of the discovery process, implementation, and evaluation of how an unconventional architecture supports specialized workloads

Columbia University Academic Commons

High-performance and hardware-aware computing: proceedings of the second International Workshop on New Frontiers in High-performance and Hardware-aware Computing (HipHaC\u2711), San Antonio, Texas, USA, February 2011 ; (in conjunction with HPCA-17)

Author: Buchty Rainer
Weiß Jan-Philipp
Publication venue: KIT Scientific Publishing, Karlsruhe
Publication date: 01/01/2011
Field of study

High-performance system architectures are increasingly exploiting heterogeneity. The HipHaC workshop aims at combining new aspects of parallel, heterogeneous, and reconfigurable microprocessor technologies with concepts of high-performance computing and, particularly, numerical solution methods. Compute- and memory-intensive applications can only benefit from the full hardware potential if all features on all levels are taken into account in a holistic approach

KITopen

Self-Evaluation Applied Mathematics 2003-2008 University of Twente

Author: Mouthaan A.J.
Vegt J.J.W. van der
Publication venue: Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente
Publication date: 01/01/2009
Field of study

This report contains the self-study for the research assessment of the Department of Applied Mathematics (AM) of the Faculty of Electrical Engineering, Mathematics and Computer Science (EEMCS) at the University of Twente (UT). The report provides the information for the Research Assessment Committee for Applied Mathematics, dealing with mathematical sciences at the three universities of technology in the Netherlands. It describes the state of affairs pertaining to the period 1 January 2003 to 31 December 2008

University of Twente Research Information

Harnessing optical micro-combs for microwave photonics

Author: Chu Sai Tak
Little Brent E.
Mitchell Arnan
Morandotti Roberto
Moss David J.
Moss DJ (reprint author), Swinburne Univ Technol, Ctr Microphoton, Hawthorn, Vic 3122, Australia.
Nguyen Thach G.
Wu Jiayang
Xu Xingyuan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/04/2018
Field of study

In the past decade, optical frequency combs generated by high-Q micro-resonators, or micro-combs, which feature compact device footprints, high energy efficiency, and high-repetition-rates in broad optical bandwidths, have led to a revolution in a wide range of fields including metrology, mode-locked lasers, telecommunications, RF photonics, spectroscopy, sensing, and quantum optics. Among these, an application that has attracted great interest is the use of micro-combs for RF photonics, where they offer enhanced functionalities as well as reduced size and power consumption over other approaches. This article reviews the recent advances in this emerging field. We provide an overview of the main achievements that have been obtained to date, and highlight the strong potential of micro-combs for RF photonics applications. We also discuss some of the open challenges and limitations that need to be met for practical applications.Comment: 32 Pages, 13 Figures, 172 Reference

arXiv.org e-Print Archive

Institutional Repository of Xi'an Institute of Optics and Precision Mechanics, CAS

Additively Manufactured Shape-changing RF Devices Enabled by Origami-inspired Structures

Author: Cui Yepu
Publication venue: Georgia Institute of Technology
Publication date: 10/01/2023
Field of study

The work to be presented in this dissertation explores the possibility of implementing origami-inspired shape-changing structures into RF designs to enable continuous performance tunability as well as deployability. The research not only experimented novel structures that have unique mechanical behaviour, but also developed automated additive manufacturing (AM) fabrication process that pushes the boundary of realizable frequency from Sub-6 GHz to mm-wave. High-performance origami-inspired reconfigurable frequency selective surfaces (FSSs) and reflectarray antennas are realized for the first time at mm-wave frequencies via AM techniques. The research also investigated the idea of combining mechanical tuning and active tuning methods in a hybrid manner to realize the first truly conformal beam-forming phased array antenna that can be applied onto any arbitrary surface and can be re-calibrated with a 3D depth camera.Ph.D

Scholarly Materials And Research @ Georgia Tech