Search CORE

85,533 research outputs found

Characterization of Alaskan HMA Mixtures with the Simple Performance Tester

Author: Li Peng
Liu Juanyu
Publication venue: Alaska University Transportation Center, Alaska Department of Transportation and Public Facilities
Publication date: 01/01/2014
Field of study

INE/AUTC 12.2

ScholarWorks@UA

Parallel accelerated cyclic reduction preconditioner for three-dimensional elliptic PDEs with variable coefficients

Author: Chávez Gustavo
Keyes David
Turkiyyah George
Zampini Stefano
Publication venue: 'Elsevier BV'
Publication date: 23/12/2017
Field of study

We present a robust and scalable preconditioner for the solution of large-scale linear systems that arise from the discretization of elliptic PDEs amenable to rank compression. The preconditioner is based on hierarchical low-rank approximations and the cyclic reduction method. The setup and application phases of the preconditioner achieve log-linear complexity in memory footprint and number of operations, and numerical experiments exhibit good weak and strong scalability at large processor counts in a distributed memory environment. Numerical experiments with linear systems that feature symmetry and nonsymmetry, definiteness and indefiniteness, constant and variable coefficients demonstrate the preconditioner applicability and robustness. Furthermore, it is possible to control the number of iterations via the accuracy threshold of the hierarchical matrix approximations and their arithmetic operations, and the tuning of the admissibility condition parameter. Together, these parameters allow for optimization of the memory requirements and performance of the preconditioner.Comment: 24 pages, Elsevier Journal of Computational and Applied Mathematics, Dec 201

arXiv.org e-Print Archive

eScholarship - University of California

Statistical lossless compression of space imagery and general data in a reconfigurable architecture

Author: Canagarajah CN
Chen X
Nunez-Yanez JL
Vitulli R
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/2008
Field of study

Explore Bristol Research

Motion estimation and CABAC VLSI co-processors for real-time high-quality H.264/AVC video coding

Author: Casula M.
Fanucci L.
Martina Maurizio
Masera Guido
Saponara S.
Publication venue: Elsevier
Publication date: 01/01/2010
Field of study

Real-time and high-quality video coding is gaining a wide interest in the research and industrial community for different applications. H.264/AVC, a recent standard for high performance video coding, can be successfully exploited in several scenarios including digital video broadcasting, high-definition TV and DVD-based systems, which require to sustain up to tens of Mbits/s. To that purpose this paper proposes optimized architectures for H.264/AVC most critical tasks, Motion estimation and context adaptive binary arithmetic coding. Post synthesis results on sub-micron CMOS standard-cells technologies show that the proposed architectures can actually process in real-time 720 × 480 video sequences at 30 frames/s and grant more than 50 Mbits/s. The achieved circuit complexity and power consumption budgets are suitable for their integration in complex VLSI multimedia systems based either on AHB bus centric on-chip communication system or on novel Network-on-Chip (NoC) infrastructures for MPSoC (Multi-Processor System on Chip

Archivio della Ricerca - Università di Pisa

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Anatomy of quantum chaotic eigenstates

Author: A Bouzouina
A Bäcker
A Bäcker
A Bäcker
A Hassell
A Katok
A Schnirelman
A Voros
AH Barnett
AJ Lichtenberg
B Crespi
B Ekhardt
B Gutkin
B Helffer
B Shiffman
C-H Chang
D Kelmer
E Bogomolny
E Bogomolny
E Bogomolny
E Lindenstrauss
E Vergini
EB Bogomolny
EJ Heller
EJ Heller
F Faure
F Nazarov
G Berkolaiko
G Blum
H Donnelly
H Iwaniec
I Melbourne
IC Percival
J Brüning
J Marklof
J-M Tualle
JH Hannay
JH Hannay
JP Keating
JP Keating
L Hörmander
L Kaplan
L Kaplan
LA Bunimovich
M Degli Esposti
M Degli Esposti
M Feingold
M. Saraceno
MV Berry
MV Berry
MV Berry
N Anantharaman
N Anantharaman
N Anantharaman
N Anantharaman
N Chernov
NL Balasz
NL Balasz
P Bleher
P Bàlint
P Gérard
P Kurlberg
P Kurlberg
P Kurlberg
P Leboeuf
P Leboeuf
R Aurich
R Aurich
R Aurich
R Courant
R Schubert
R Schubert
S Brooks
S Nonnenmacher
S Zelditch
S Zelditch
S Zelditch
S Zelditch
S Zelditch
SW McDonald
W Luo
Y Colin de Verdière
Z Rudnick
Z Rudnick
Å Pleijel
Publication venue
Publication date: 06/01/2012
Field of study

The eigenfunctions of quantized chaotic systems cannot be described by explicit formulas, even approximate ones. This survey summarizes (selected) analytical approaches used to describe these eigenstates, in the semiclassical limit. The levels of description are macroscopic (one wants to understand the quantum averages of smooth observables), and microscopic (one wants informations on maxima of eigenfunctions, "scars" of periodic orbits, structure of the nodal sets and domains, local correlations), and often focusses on statistical results. Various models of "random wavefunctions" have been introduced to understand these statistical properties, with usually good agreement with the numerical data. We also discuss some specific systems (like arithmetic ones) which depart from these random models.Comment: Corrected typos, added a few references and updated some result

arXiv.org e-Print Archive

CiteSeerX

Crossref

HAL-CEA

An Application-Specific VLIW Processor with Vector Instruction Set for CNN Acceleration

Author: Ascheid Gerd
Bytyn Andreas
Leupers Rainer
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

In recent years, neural networks have surpassed classical algorithms in areas such as object recognition, e.g. in the well-known ImageNet challenge. As a result, great effort is being put into developing fast and efficient accelerators, especially for Convolutional Neural Networks (CNNs). In this work we present ConvAix, a fully C-programmable processor, which -- contrary to many existing architectures -- does not rely on a hard-wired array of multiply-and-accumulate (MAC) units. Instead it maps computations onto independent vector lanes making use of a carefully designed vector instruction set. The presented processor is targeted towards latency-sensitive applications and is capable of executing up to 192 MAC operations per cycle. ConvAix operates at a target clock frequency of 400 MHz in 28nm CMOS, thereby offering state-of-the-art performance with proper flexibility within its target domain. Simulation results for several 2D convolutional layers from well known CNNs (AlexNet, VGG-16) show an average ALU utilization of 72.5% using vector instructions with 16 bit fixed-point arithmetic. Compared to other well-known designs which are less flexible, ConvAix offers competitive energy efficiency of up to 497 GOP/s/W while even surpassing them in terms of area efficiency and processing speed.Comment: Accepted for publication in the proceedings of the 2019 IEEE International Symposium on Circuits and Systems (ISCAS

arXiv.org e-Print Archive

Crossref

Publikationsserver der RWTH Aachen University

Solving the global atmospheric equations through heterogeneous reconfigurable platforms

Author: Fu H
Gan L
Huang X
Luk W
Xue W
Yang C
Yang G
Zhang Y
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/03/2014
Field of study

Spiral - Imperial College Digital Repository