Thermal Infrared Imaging Experiments of C-Type Asteroid 162173 Ryugu on Hayabusa2

Author: A. Fujiwara
Axel Hagermann
D.A. Paige
F. Vilas
G.J. Consolmagno
H. Hiesinger
H. Hihara
H. Yano
H.H. Kieffer
Hirohide Demura
Hiroki Senshu
J. Biele
J. Helbert
J. Veverka
Jun Takita
Jörn Helbert
Ken Endo
Kohei Kitazato
M. Delbo
M. Grott
M. Ishiguro
M.P. Golombek
Makoto Taguchi
Naoya Sakatani
O. Groussin
P. Michel
P.R. Christensen
Ryosuke Nakamura
S. Tachibana
S.C. Chase
Satoshi Tanaka
Sunao Hasegawa
T. Fukuhara
T. Okada
T. Spohn
T.-M. Ho
T.G. Müller
Takehiko Arai
Takehiko Wada
Takeshi Imamura
Tatsuaki Okada
Tetsuya Fukuhara
Thomas G. Müller
Tomohiko Sekiguchi
Toru Kouyama
Tsuneo Matsunaga
W.F. Bottke Jr.
Y. Tsuda
Yamato Horikawa
Yoshiko Ogawa
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

The thermal infrared imager TIR onboard Hayabusa2 has been developed to investigate the thermophysical properties of the C-type, near-Earth asteroid 162173 Ryugu. TIR is one of the remote science instruments on Hayabusa2 designed to understand the nature of a volatile-rich small solar system body, but it also has significant mission objectives: to provide information on surface physical properties and conditions for sampling site selection, and to support the assessment of safe landing operations. TIR is based on a two-dimensional uncooled micro-bolometer array inherited from the Longwave Infrared Camera LIR on Akatsuki (Fukuhara et al., 2011). TIR takes images of thermal infrared emission at 8 to 12 μm with a field of view of 16° × 12° and a spatial resolution of 0.05° per pixel. TIR covers the temperature range from 150 to 460 K, including the well-calibrated range from 230 to 420 K. Temperature accuracy is 2 K or better for summed images, and the relative accuracy, or noise equivalent temperature difference (NETD), at each pixel is 0.4 K or lower within the well-calibrated temperature range. TIR takes a pair of images, one with the shutter open and one with the shutter closed (the corresponding dark frame), and produces a true thermal image by dark frame subtraction. Data processing involves summation of multiple images, image processing including the StarPixel compression (Hihara et al., 2014), and transfer to the data recorder in the spacecraft digital electronics (DE). We report the scientific and mission objectives of TIR, the requirements and constraints for the instrument specifications, the designed instrumentation, the pre-flight and in-flight performance of TIR, and its observation plan during the Hayabusa2 mission.
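
The dark frame subtraction and image summation mentioned in the abstract can be illustrated with a minimal sketch. It is not the flight software: the function names, the 2×2 frame size, and the raw count values are all hypothetical, and the abstract does not say whether TIR sums frames before or after subtracting the dark frame, so the order used here is an assumption.

```python
from typing import List

Frame = List[List[int]]  # raw detector counts, one inner list per image row


def subtract_dark(open_frame: Frame, dark_frame: Frame) -> Frame:
    """Subtract the shutter-closed (dark) frame from the shutter-open frame, pixel by pixel."""
    return [
        [o - d for o, d in zip(open_row, dark_row)]
        for open_row, dark_row in zip(open_frame, dark_frame)
    ]


def sum_frames(frames: List[Frame]) -> Frame:
    """Sum several equally sized frames pixel by pixel."""
    rows = len(frames[0])
    cols = len(frames[0][0])
    return [
        [sum(frame[r][c] for frame in frames) for c in range(cols)]
        for r in range(rows)
    ]


# Hypothetical 2x2 example: two shutter-open frames and one dark frame.
open_frames = [[[10, 12], [11, 13]], [[11, 11], [12, 12]]]
dark = [[3, 3], [4, 4]]

# Assumed order: subtract the dark frame from each image, then sum the results.
thermal = sum_frames([subtract_dark(f, dark) for f in open_frames])
print(thermal)
```

Summing several dark-subtracted frames averages down random pixel noise, which is consistent with the abstract quoting the 2 K accuracy for summed images.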

Institute of Transport Research:Publications

Crossref

Stirling Online Research Repository (RIOXX)

Springer - Publisher Connector

Open Research Online (The Open University)

Scipedia

Stirling Online Research Repository

MPG.PuRe

GAMER: a GPU-Accelerated Adaptive Mesh Refinement Code for Astrophysics

Author: Aubert
Bagla
Bryan
Campbell
Collins
Frigo
Fryxell
Gingold
Godunov
Hallman
Hockney
Hsi-Yu Schive
Klypin
Kravtsov
Landau
Martin
NVIDIA
O'Shea
Pen
Press
Ricker
Tzihong Chiueh
Woo
Yu-Chih Tsai
Publication venue: IOP Publishing
Publication date: 24/12/2009

We present the newly developed code GAMER (GPU-accelerated Adaptive MEsh Refinement code), which adopts a novel approach to improve the performance of adaptive mesh refinement (AMR) astrophysical simulations by a large factor using the graphics processing unit (GPU). The AMR implementation is based on a hierarchy of grid patches with an oct-tree data structure. We adopt a three-dimensional relaxing TVD scheme for the hydrodynamic solver and a multi-level relaxation scheme for the Poisson solver. Both solvers have been implemented on the GPU, so that hundreds of patches can be advanced in parallel. The computational overhead associated with data transfer between CPU and GPU is carefully reduced by using the GPU's capability for asynchronous memory copies, and the time spent computing the ghost-zone values for each patch is hidden by overlapping it with the GPU computations. We demonstrate the accuracy of the code by performing several standard test problems in astrophysics. GAMER is a parallel code that can be run on a multi-GPU cluster system. We measure the performance of the code by performing purely baryonic cosmological simulations on different hardware configurations, in which detailed timing analyses compare the computations with and without GPU acceleration. Maximum speed-up factors of 12.19 and 10.47 are demonstrated using 1 GPU with 4096^3 effective resolution and 16 GPUs with 8192^3 effective resolution, respectively.

Comment: 60 pages, 22 figures, 3 tables. More accuracy tests are included. Accepted for publication in ApJ.
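
The oct-tree hierarchy of grid patches described in the abstract can be sketched as follows. This is an illustrative data structure only, not GAMER's implementation: the `Patch` class, its fields, and the value of `PATCH_CELLS` are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

PATCH_CELLS = 8  # assumed number of cells along each side of a patch (hypothetical)


@dataclass
class Patch:
    level: int                          # refinement level, 0 for the root
    corner: Tuple[float, float, float]  # lower-left-front corner of the patch
    size: float                         # physical side length of the patch
    children: List["Patch"] = field(default_factory=list)

    def refine(self) -> List["Patch"]:
        """Split this patch into 8 child patches, each with half the side length."""
        half = self.size / 2.0
        x0, y0, z0 = self.corner
        self.children = [
            Patch(self.level + 1, (x0 + i * half, y0 + j * half, z0 + k * half), half)
            for i in (0, 1) for j in (0, 1) for k in (0, 1)
        ]
        return self.children

    def cell_width(self) -> float:
        """Cell width on this patch; it halves with every refinement level."""
        return self.size / PATCH_CELLS


def count_leaves(patch: Patch) -> int:
    """Count the patches that have no children, i.e. the finest patches in each region."""
    if not patch.children:
        return 1
    return sum(count_leaves(child) for child in patch.children)


root = Patch(level=0, corner=(0.0, 0.0, 0.0), size=1.0)
kids = root.refine()   # 8 patches at level 1
kids[0].refine()       # refine only one octant further
print(count_leaves(root), kids[0].children[0].cell_width())
```

Because every patch holds the same number of cells regardless of level, many patches can be handed to the same solver kernel in one batch, which fits the abstract's statement that hundreds of patches are advanced in parallel.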

arXiv.org e-Print Archive

CiteSeerX

Crossref

National Taiwan University Repository

Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine

Author: Andri Renzo
Benini Luca
Cavigelli Lukas
Rossi Davide
Publication date: 01/01/2019

Deep neural networks have achieved impressive results in computer vision and machine learning. Unfortunately, state-of-the-art networks are extremely compute- and memory-intensive, which makes them unsuitable for mW-devices such as IoT end-nodes. Aggressive quantization of these networks dramatically reduces the computation and memory footprint. Binary-weight neural networks (BWNs) follow this trend, pushing weight quantization to the limit. Hardware accelerators for BWNs presented up to now have focused on core efficiency, disregarding the I/O bandwidth and system-level efficiency that are crucial for deploying accelerators in ultra-low power devices. We present Hyperdrive, a BWN accelerator that dramatically reduces the I/O bandwidth by exploiting a novel binary-weight streaming approach. It can be used for convolutional neural network architectures and input resolutions of arbitrary size by exploiting the natural scalability of the compute units at both chip level and system level: Hyperdrive chips are arranged systolically in a 2D mesh and process the entire feature map together in parallel. Hyperdrive achieves 4.3 TOp/s/W system-level efficiency (i.e., including I/Os), 3.1x higher than state-of-the-art BWN accelerators, even though its core uses resource-intensive FP16 arithmetic for increased robustness.
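
To make the idea of binary weights concrete, the sketch below computes a dot product in which every weight is stored as a single bit. It only illustrates the general BWN arithmetic and says nothing about Hyperdrive's hardware or its weight streaming scheme. The function name and the bit encoding (1 for +1, 0 for -1) are assumptions, and ordinary Python floats stand in for the FP16 activations mentioned in the abstract.

```python
from typing import Sequence


def binary_weight_dot(activations: Sequence[float], weight_bits: Sequence[int]) -> float:
    """Dot product with weights restricted to +1 and -1.

    Assumed encoding: bit 1 means weight +1, bit 0 means weight -1.
    No multiplications are needed; each activation is added or subtracted.
    """
    if len(activations) != len(weight_bits):
        raise ValueError("activations and weight_bits must have the same length")
    acc = 0.0
    for a, bit in zip(activations, weight_bits):
        acc += a if bit else -a
    return acc


# Weights (+1, -1, +1) applied to activations (1.0, 2.0, 3.0): 1 - 2 + 3
result = binary_weight_dot([1.0, 2.0, 3.0], [1, 0, 1])
print(result)
```

Storing one bit per weight instead of a 16-bit value cuts weight storage and traffic by a factor of 16, which is the kind of reduction that makes streaming weights rather than feature maps attractive.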

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks

Author: Chandra Vikas
Chau Benson
Esmaeilzadeh Hadi
Kim Joon Kyung
Lai Liangzhen
Park Jongse
Sharma Hardik
Suda Naveen
Publication date: 30/05/2018

Fully realizing the potential of acceleration for Deep Neural Networks (DNNs) requires understanding and leveraging algorithmic properties. This paper builds upon the algorithmic insight that the bitwidth of operations in DNNs can be reduced without compromising their classification accuracy. However, to prevent accuracy loss, the bitwidth varies significantly across DNNs, and it may even be adjusted for each layer. Thus, a fixed-bitwidth accelerator would either offer limited benefits, because it must accommodate the worst-case bitwidth requirements, or degrade final accuracy. To alleviate these deficiencies, this work introduces dynamic bit-level fusion/decomposition as a new dimension in the design of DNN accelerators. We explore this dimension by designing Bit Fusion, a bit-flexible accelerator that comprises an array of bit-level processing elements that dynamically fuse to match the bitwidth of individual DNN layers. This flexibility in the architecture minimizes computation and communication at the finest granularity possible with no loss in accuracy. We evaluate the benefits of Bit Fusion using eight real-world feed-forward and recurrent DNNs. The proposed microarchitecture is implemented in Verilog and synthesized in 45 nm technology. Using the synthesis results and cycle-accurate simulation, we compare the benefits of Bit Fusion to two state-of-the-art DNN accelerators, Eyeriss and Stripes. In the same area, frequency, and process technology, Bit Fusion offers 3.9x speedup and 5.1x energy savings over Eyeriss. Compared to Stripes, Bit Fusion provides 2.6x speedup and 3.9x energy reduction at the 45 nm node when Bit Fusion's area and frequency are set to those of Stripes. Scaled to the GPU technology node of 16 nm, Bit Fusion almost matches the performance of a 250-Watt Titan Xp, which uses 8-bit vector instructions, while consuming merely 895 milliwatts of power.
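
The arithmetic behind bit-level fusion can be shown with a small sketch: a wide multiplication is decomposed into narrow partial products that are shifted and added. This illustrates the general decomposition only, not the Bit Fusion microarchitecture. The 2-bit element width, the function names, and the restriction to unsigned operands are assumptions of the example.

```python
from typing import List


def split_digits(value: int, total_bits: int, brick_bits: int = 2) -> List[int]:
    """Split an unsigned value into brick_bits-wide digits, least significant first."""
    mask = (1 << brick_bits) - 1
    return [(value >> shift) & mask for shift in range(0, total_bits, brick_bits)]


def fused_multiply(a: int, b: int, a_bits: int, b_bits: int, brick_bits: int = 2) -> int:
    """Multiply two unsigned operands using only brick_bits x brick_bits products.

    Each narrow product is shifted into place and accumulated, mimicking how
    narrow multipliers could be combined to serve a wider operation.
    """
    total = 0
    for i, da in enumerate(split_digits(a, a_bits, brick_bits)):
        for j, db in enumerate(split_digits(b, b_bits, brick_bits)):
            total += (da * db) << (brick_bits * (i + j))
    return total


def brick_count(a_bits: int, b_bits: int, brick_bits: int = 2) -> int:
    """Number of narrow products one multiplication of the given widths needs."""
    return (a_bits // brick_bits) * (b_bits // brick_bits)


print(fused_multiply(200, 123, 8, 8))           # same value as 200 * 123
print(brick_count(8, 8), brick_count(2, 2))     # narrow products needed per multiply
```

Under these assumptions an 8-bit by 8-bit multiplication occupies 16 narrow multipliers while a 2-bit by 2-bit one occupies a single multiplier, so the same array can perform many more low-bitwidth operations at once, which is the flexibility the abstract describes.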

arXiv.org e-Print Archive

Crossref

STV-based Video Feature Processing for Action Recognition

Author: Wang Jing
Xu Zhijie
Publication venue: Elsevier BV
Publication date: 01/08/2012

In comparison to still image-based processes, video features can provide rich and intuitive information about dynamic events that occur over a period of time, such as human actions, crowd behaviours, and other subject pattern changes. Although substantial progress has been made in image processing over the last decade, with successful applications in face matching and object recognition, video-based event detection remains one of the most difficult challenges in computer vision research because of its complex continuous or discrete input signals, arbitrary dynamic feature definitions, and often ambiguous analytical methods. In this paper, a 3D shape-matching method based on Spatio-Temporal Volumes (STV) and region intersection (RI) is proposed to facilitate the definition and recognition of human actions recorded in videos. The distinctive characteristics and the performance gain of the devised approach stem from a coefficient factor-boosted 3D region intersection and matching mechanism developed in this research. This paper also reports an investigation into techniques for efficient STV data filtering that reduce the number of voxels (volumetric pixels) to be processed in each operational cycle of the implemented system. The encouraging features and the improvements in operational performance registered in the experiments are discussed at the end.
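
A spatio-temporal volume and a region intersection score can be sketched in a few lines. This is a simplified illustration, not the paper's method: the abstract does not define the coefficient factor-boosted matching, so the score below is a plain intersection-over-union of voxel sets, and the function names and the tiny 2×2 silhouettes are hypothetical.

```python
from typing import List, Set, Tuple

Voxel = Tuple[int, int, int]  # (x, y, t): pixel position and frame index
Mask = List[List[int]]        # binary silhouette for one frame, 1 = foreground


def build_stv(masks: List[Mask]) -> Set[Voxel]:
    """Stack per-frame silhouettes along the time axis into a set of occupied voxels."""
    return {
        (x, y, t)
        for t, mask in enumerate(masks)
        for y, row in enumerate(mask)
        for x, value in enumerate(row)
        if value
    }


def region_intersection_score(a: Set[Voxel], b: Set[Voxel]) -> float:
    """Assumed similarity: shared voxels divided by voxels occupied by either volume."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Two hypothetical two-frame actions on a 2x2 image.
action_a = build_stv([[[1, 1], [0, 0]], [[1, 0], [0, 0]]])
action_b = build_stv([[[1, 0], [0, 0]], [[1, 0], [0, 0]]])
score = region_intersection_score(action_a, action_b)
print(score)
```

Storing only occupied voxels means any filtering step that discards voxels directly shrinks the sets being intersected, which is why the abstract ties STV filtering to the per-cycle processing cost.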

University of Huddersfield Repository

Huddersfield Research Portal