64 research outputs found
Primary arm array during directional solidification of a single-crystal binary alloy: Large-scale phase-field study
AbstractPrimary arm arrays formed during the directional solidification of a single-crystal binary alloy were investigated by performing large-scale phase-field simulations using the GPU supercomputer TSUBAME2.5Â at Tokyo Institute of Technology. The primary arm array and spacing were investigated by Voronoi decomposition and Delaunay triangulation, respectively. It was concluded that a hexagonal array was dominant for both the dendrite and cell structures and that pentaâhepta defects, which are typical defects in hexagonal patterns, were formed. The primary arms continuously moved such that the number of hexagons increased, and the distribution of primary arm spacing became uniform over time even after the number of primary arms was constant. The order of array was highest in the growth condition of the dendrite close to the cell-to-dendrite transition region. In addition, we proposed a realistic and accurate evaluation method of primary arm array by removing small sides from the Voronoi polygons
Generating observation guided ensembles for data assimilation with denoising diffusion probabilistic model
This paper presents an ensemble data assimilation method using the pseudo
ensembles generated by denoising diffusion probabilistic model. Since the model
is trained against noisy and sparse observation data, this model can produce
divergent ensembles close to observations. Thanks to the variance in generated
ensembles, our proposed method displays better performance than the
well-established ensemble data assimilation method when the simulation model is
imperfect
Spectral evolution of GRB 060904A observed with Swift and Suzaku -- Possibility of Inefficient Electron Acceleration
We observed an X-ray afterglow of GRB 060904A with the Swift and Suzaku
satellites. We found rapid spectral softening during both the prompt tail phase
and the decline phase of an X-ray flare in the BAT and XRT data. The observed
spectra were fit by power-law photon indices which rapidly changed from to within a few hundred
seconds in the prompt tail. This is one of the steepest X-ray spectra ever
observed, making it quite difficult to explain by simple electron acceleration
and synchrotron radiation. Then, we applied an alternative spectral fitting
using a broken power-law with exponential cutoff (BPEC) model. It is valid to
consider the situation that the cutoff energy is equivalent to the synchrotron
frequency of the maximum energy electrons in their energy distribution. Since
the spectral cutoff appears in the soft X-ray band, we conclude the electron
acceleration has been inefficient in the internal shocks of GRB 060904A. These
cutoff spectra suddenly disappeared at the transition time from the prompt tail
phase to the shallow decay one. After that, typical afterglow spectra with the
photon indices of 2.0 are continuously and preciously monitored by both XRT and
Suzaku/XIS up to 1 day since the burst trigger time. We could successfully
trace the temporal history of two characteristic break energies (peak energy
and cutoff energy) and they show the time dependence of while the following afterglow spectra are quite stable. This fact
indicates that the emitting material of prompt tail is due to completely
different dynamics from the shallow decay component. Therefore we conclude the
emission sites of two distinct phenomena obviously differ from each other.Comment: 19 pages, 9 figures, accepted for publication in PASJ (Suzaku 2nd
Special Issue
Spectral Lag Relations in GRB Pulses Detected with HETE-2
Using a pulse-fit method, we investigate the spectral lags between the
traditional gamma-ray band (50-400 keV) and the X-ray band (6-25 keV) for 8
GRBs with known redshifts (GRB 010921, GRB 020124, GRB 020127, GRB 021211, GRB
030528, GRB 040924, GRB 041006, GRB 050408) detected with the WXM and FREGATE
instruments aboard the HETE-2 satellite. We find several relations for the
individual GRB pulses between the spectral lag and other observables, such as
the luminosity, pulse duration, and peak energy (Epeak). The obtained results
are consistent with those for BATSE, indicating that the BATSE correlations are
still valid at lower energies (6-25 keV). Furthermore, we find that the photon
energy dependence for the spectral lags can reconcile the simple curvature
effect model. We discuss the implication of these results from various points
of view.Comment: 13 pages, 9 figures, accepted for the publication in PASJ (minor
corrections
AN5D: Automated Stencil Framework for High-Degree Temporal Blocking on GPUs
Stencil computation is one of the most widely-used compute patterns in high
performance computing applications. Spatial and temporal blocking have been
proposed to overcome the memory-bound nature of this type of computation by
moving memory pressure from external memory to on-chip memory on GPUs. However,
correctly implementing those optimizations while considering the complexity of
the architecture and memory hierarchy of GPUs to achieve high performance is
difficult. We propose AN5D, an automated stencil framework which is capable of
automatically transforming and optimizing stencil patterns in a given C source
code, and generating corresponding CUDA code. Parameter tuning in our framework
is guided by our performance model. Our novel optimization strategy reduces
shared memory and register pressure in comparison to existing implementations,
allowing performance scaling up to a temporal blocking degree of 10. We achieve
the highest performance reported so far for all evaluated stencil benchmarks on
the state-of-the-art Tesla V100 GPU
HETE-2 Observations of the X-Ray Flash XRF 040916
A long X-ray flash was detected and localized by the instruments aboard the
High Energy Transient Explorer II (HETE-2) at 00:03:30 UT on 2004 September 16.
The position was reported to the GRB Coordinates Network (GCN) approximately 2
hours after the burst. This burst consists of two peaks separated by 200 s,
with durations of 110 s and 60 s. We have analyzed the energy spectra of the
1st and 2nd peaks observed with the Wide Field X-Ray Monitor (WXM) and the
French Gamma Telescope (FREGATE). We discuss the origin of the 2nd peak in
terms of flux variabilities and timescales. We find that it is most likely part
of the prompt emission, and is explained by the long-acting engine model. This
feature is similar to some bright X-ray flares detected in the early afterglow
phase of bursts observed by the Swift satellite.Comment: 9 pages, 4 figures, Accepted for publication in PAS
High-productivity Framework for Large-scale GPU/CPU Stencil Applications
AbstractA high-productivity framework for multi-GPU and multi-CPU computation of stencil applications is proposed. Our framework is implemented in C++ and CUDA languages. It automatically translates user-written stencil functions that update a grid point and generates both GPU and CPU codes. The programmers write user code just in the C++ language, and can execute the translated user code on either multiple multicore CPUs or multiple GPUs with optimization. The user code can be executed on multiple GPUs with the auto-tuning mechanism and the overlapping method to hide communication cost by computation. It can be also executed on multiple CPUs with OpenMP. The compressible flow code on GPU exploiting the optimizations provided by the framework has achieved 2.7 times faster than the non-optimized version
Large-scale Phase-field Studies of Three-dimensional Dendrite Competitive Growth at the Converging Grain Boundary during Directional Solidification of a Bicrystal Binary Alloy
Large-scale phase-field studies of three-dimensional (3D) dendrite competitive growth at the converging grain boundary (GB) of a bicrystal binary alloy were carried out using the GPU-rich supercomputer TSUBAME 2.5 at Tokyo Institute of Technology. First, a series of thin-sample simulations were performed to investigate the effects of thin-sample thickness, unfavorably oriented (UO) grain inclination angle, and dendrite arrangement on an unusual overgrowth phenomenon whereby the favorably oriented (FO) grain is overgrown by the UO grain. It was concluded that the unusual overgrowth easily occurs as the thickness of the thin sample and the UO grain inclination angle decrease. It was also concluded that the interaction between FO and UO dendrites at the converging GB depends on the dendrite arrangement for relatively large dendrite spacing. Next, realistic large-scale simulations whereby multiple dendrites interact at the converging GB were performed. Unusual overgrowth was also observed in such large-scale simulations, and this phenomenon easily occurred at smaller UO dendrite inclination angles. Furthermore, it was also concluded that the FO and UO dendrites rearrange toward a space-to-face interaction. Because the interaction between FO and UO dendrites differs according to the location on the GB, a zigzag GB was formed, especially at small UO grain inclination angles
- âŠ