19,249 research outputs found

Inner product computation for sparse iterative solvers on distributed supercomputer

Author: Gu T. -X.
Liu X. -P.
Zhu S. -X.
Publication venue
Publication date: 01/01/2012
Field of study

Recent years have shown that iterative Krylov methods, without redesign, are not suitable for distributed supercomputers because of intensive global communications. It is well accepted that re-engineering Krylov methods for a prescribed computer architecture is necessary and important to achieve higher performance and scalability. The paper focuses on simple and practical ways to reorganize Krylov methods and improve their performance on current heterogeneous distributed supercomputers. In contrast with most current software development for Krylov methods, which usually focuses on efficient matrix-vector multiplications, the paper focuses on how to compute inner products on supercomputers and explains why inner product computation on current heterogeneous distributed supercomputers is crucial for scalable Krylov methods. Communication complexity analysis shows how the inner product computation can become the performance bottleneck of (inner) product-type iterative solvers on distributed supercomputers due to global communications. Principles for reducing such global communications are discussed. The importance of minimizing communications is demonstrated by experiments using up to 900 processors. The experiments were carried out on a Dawning 5000A, one of the fastest and earliest heterogeneous supercomputers in the world. Both the analysis and the experiments indicate that inner product computation is very likely to be the most challenging kernel for inner product-based iterative solvers to achieve exascale

Oxford University Research Archive
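
The abstract of the record above attributes the bottleneck to the global communication behind each inner product. A minimal sketch of one way to reduce it, not taken from the paper: when two inner products are needed at the same point of an iteration, their local partial sums can be packed into a single global reduction. Everything here is an assumption for illustration. The ranks are simulated as list slices, no MPI library is used, and `ReductionCounter`, `two_dots_separate` and `two_dots_batched` are hypothetical names.

```python
# Illustrative sketch only: ranks are simulated as list slices, no MPI is used.
# All names below are hypothetical and do not come from the paper.

def split(vec, nranks):
    """Partition a vector into contiguous chunks, one per simulated rank."""
    size = (len(vec) + nranks - 1) // nranks
    return [vec[i * size:(i + 1) * size] for i in range(nranks)]


class ReductionCounter:
    """Stands in for a global all-reduce and counts how often it is called."""

    def __init__(self):
        self.calls = 0

    def allreduce_sum(self, per_rank_values):
        # per_rank_values holds one tuple of partial sums per rank;
        # the result is the element-wise sum over all ranks.
        self.calls += 1
        return tuple(sum(column) for column in zip(*per_rank_values))


def local_dot(a, b):
    """Purely local work: no communication is involved."""
    return sum(x * y for x, y in zip(a, b))


def two_dots_separate(r, w, nranks, comm):
    """Compute (r, r) and (r, w) with one global reduction each."""
    rs, ws = split(r, nranks), split(w, nranks)
    (rr,) = comm.allreduce_sum([(local_dot(a, a),) for a in rs])
    (rw,) = comm.allreduce_sum([(local_dot(a, b),) for a, b in zip(rs, ws)])
    return rr, rw


def two_dots_batched(r, w, nranks, comm):
    """Compute the same two inner products with a single global reduction."""
    rs, ws = split(r, nranks), split(w, nranks)
    partials = [(local_dot(a, a), local_dot(a, b)) for a, b in zip(rs, ws)]
    return comm.allreduce_sum(partials)
```

Both functions return the same pair of values; the batched variant calls the simulated reduction once instead of twice. On a real machine the saving would depend on the latency of the reduction, which this sketch does not model.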

Minimizing synchronizations in sparse iterative solvers for distributed supercomputers

Author: Gu T.-X.
Liu X.-P.
Zhu S.-X.
Publication venue
Publication date: 01/01/2013
Field of study

Eliminating synchronizations is one of the important techniques for minimizing communications in modern high performance computing. This paper discusses principles of reducing communications due to global synchronizations in sparse iterative solvers on distributed supercomputers. We demonstrate how to minimize global synchronizations by rescheduling a typical Krylov subspace method. The benefit of minimizing synchronizations is shown in theoretical analysis and is verified by numerical experiments using up to 900 processors. The experiments also show that the communication complexity of some structured sparse matrix-vector multiplications and of global communications on the underlying supercomputer is of the order $P^{1/2.5}$ and $P^{4/5}$ respectively, where $P$ is the number of processors. The experiments were carried out on a Dawning 5000A

Oxford University Research Archive
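
The two growth orders quoted in the abstract above, with exponents 1/2.5 and 4/5 in the number of processors P, can be compared with a few lines of arithmetic. This is a sketch under stated assumptions: the abstract gives only the exponents, so the constant factors are unknown and only relative growth between two processor counts is computed. The function names and the baseline of 100 processors are hypothetical choices for illustration.

```python
# Exponents as quoted in the abstract; constant factors are not given there,
# so only relative growth between two processor counts is meaningful here.
SPMV_EXP = 1 / 2.5    # structured sparse matrix-vector multiplication
GLOBAL_EXP = 4 / 5    # global communications


def growth(p_small, p_large, exponent):
    """Factor by which a cost proportional to P**exponent grows
    when the processor count goes from p_small to p_large."""
    return (p_large / p_small) ** exponent


def compare(p_small, p_large):
    """Return (mat-vec growth, global-communication growth)."""
    return (growth(p_small, p_large, SPMV_EXP),
            growth(p_small, p_large, GLOBAL_EXP))


if __name__ == "__main__":
    # Hypothetical baseline of 100 processors, scaled to the 900 used
    # in the reported experiments.
    spmv, glob = compare(100, 900)
    print(f"mat-vec communication grows by a factor of {spmv:.2f}")
    print(f"global communication grows by a factor of {glob:.2f}")
```

Going from 100 to 900 processors, the mat-vec term grows by a factor of about 2.4 and the global term by about 5.8. Because 4/5 is twice 1/2.5, the second factor is always the square of the first.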

Risk, cohabitation and marriage

Author: Gu X.
Rao Sahib P.
Publication venue
Publication date
Field of study

This paper introduces imperfect information, learning, and risk aversion in a two-sided matching model. The model provides a theoretical framework for the commonly occurring phenomenon of cohabitation followed by marriage, and is consistent with empirical findings on these institutions. The paper has three major results. First, individuals set higher standards for marriage than for cohabitation. When the true worth of a cohabiting partner is revealed, some cohabiting unions are converted into marriage while others are not. Second, individuals cohabit within classes. Third, the premium that compensates individuals for the higher risk involved in marriage over a cohabiting partnership is derived. This premium can be decomposed into two parts. The first part is a function of the individual's level of risk aversion, while the second part is a function of the difference in risk between marriage and cohabitation.

Research Papers in Economics

Fast Monte Carlo Simulation for Patient-specific CT/CBCT Imaging Dose Calculation

Author: Briesmeister J F
DeMarco J J
Gu X
Gu X J
Hao Yan
Hirayama H
Jacques R
Jarry G
Jia X
L’Ecuyer P
Marsaglia G
Men C
Men C H
NVIDIA
Salvat F
Steve B Jiang
Woodcock E
Xuejun Gu
Xun Jia
Publication venue: 'IOP Publishing'
Publication date: 15/11/2011
Field of study

Recently, X-ray imaging dose from computed tomography (CT) or cone beam CT (CBCT) scans has become a serious concern. Patient-specific imaging dose calculation has been proposed for the purpose of dose management. While Monte Carlo (MC) dose calculation can be quite accurate for this purpose, it suffers from low computational efficiency. In response to this problem, we have developed an MC dose calculation package, gCTD, on GPU architecture under the NVIDIA CUDA platform for fast and accurate estimation of the X-ray imaging dose received by a patient during a CT or CBCT scan. Techniques have been developed specifically for the GPU architecture to achieve high computational efficiency. Dose calculations using CBCT scanning geometry in a homogeneous water phantom and a heterogeneous Zubal head phantom have shown good agreement between gCTD and EGSnrc, indicating the accuracy of our code. In terms of efficiency, gCTD attains a speed-up of ~400 times in the homogeneous water phantom and ~76.6 times in the Zubal phantom compared to EGSnrc. As for absolute computation time, imaging dose calculation for the Zubal phantom can be accomplished in ~17 sec with an average relative standard deviation of 0.4%. Though our gCTD code has been developed and tested in the context of CBCT scans, with simple modification of geometry it can be used for assessing imaging dose in CT scans as well. Comment: 18 pages, 7 figures, and 1 table

arXiv.org e-Print Archive

Crossref

Risk, cohabitation and marriage

Author: Gu X.
Rao Sahib P.
Publication venue: s.n.
Publication date: 01/01/2000
Field of study

ARTS repository - University of Groningen

Value encoding in the globus pallidus: fMRI reveals an interaction effect between reward and dopamine drive

Author: Dolan R.J.
Fiore V.G.
Gu X.
Nolte T.
Rigoli F.
Smittenaar P.
Publication venue: Elsevier BV
Publication date: 01/01/2018
Field of study

The external part of the globus pallidus (GPe) is a core nucleus of the basal ganglia (BG) whose activity is disrupted under conditions of low dopamine release, as in Parkinson's disease. Current models assume decreased dopamine release in the dorsal striatum results in deactivation of dorsal GPe, which in turn affects motor expression via a regulatory effect on other nuclei of the BG. However, recent studies in healthy and pathological animal models have reported neural dynamics that do not match with this view of the GPe as a relay in the BG circuit. Thus, the computational role of the GPe in the BG is still to be determined. We previously proposed a neural model that revisits the functions of the nuclei of the BG, and this model predicts that GPe encodes values which are amplified under a condition of low striatal dopaminergic drive. To test this prediction, we used an fMRI paradigm involving a within-subject placebo-controlled design, using the dopamine antagonist risperidone, wherein healthy volunteers performed a motor selection and maintenance task under low and high reward conditions. ROI-based fMRI analysis revealed an interaction between reward and dopamine drive manipulations, with increased BOLD activity in GPe in a high compared to low reward condition, and under risperidone compared to placebo. These results confirm the core prediction of our computational model, and provide a new perspective on neural dynamics in the BG and their effects on motor selection and cognitive disorders

Ground-State Entanglement in Interacting Bosonic Graphs

Author: Duan L.-M.
Giorda P.
Godsil Chris D.
Gu S.
Gu S.-J.
Hamma A.
Horodecki M.
Latorre J. I.
P Giorda
P Zanardi
Paskauskas R.
Schuch N.
Wang X.
Zanardi P.
Publication venue: 'IOP Publishing'
Publication date: 10/11/2003
Field of study

We consider a collection of bosonic modes corresponding to the vertices of a graph $\Gamma$. Quantum tunneling can occur only along the edges of $\Gamma$ and a local self-interaction term is present. Quantum entanglement of one vertex with respect to the rest of the graph is analyzed in the ground state of the system as a function of the tunneling amplitude $\tau$. The topology of $\Gamma$ plays a major role in determining the tunneling amplitude $\tau^*$ which leads to the maximum ground-state entanglement. Whereas in most cases one finds the intuitively expected result $\tau^*=\infty$, we show that there exists a family of graphs for which the optimal value of $\tau$ is pushed down to a finite value. We also show that, for complete graphs, our bi-partite entanglement provides useful insights in the analysis of the cross-over between insulating and superfluid ground states. Comment: 5 pages (LaTeX) 5 eps figures included

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

CERN Document Server

Six-dimensional weak-strong simulations of head-on beam-beam compensation in RHIC

Author: Abreu N. P.
Fischer W.
Gu X.
Luo Y.
Pikin A.
Robert-Demolaize G.
Publication venue
Publication date: 21/10/2014
Field of study

To compensate for the large beam-beam tune spread and beam-beam resonance driving terms in the polarized proton operation in the Relativistic Heavy Ion Collider (RHIC), we will introduce a low-energy DC electron beam into each ring to collide head-on with the opposing proton beam. The device that provides the electron beam is called an electron lens. In this article, using a 6-D weak-strong beam-beam interaction simulation model, we investigate the effects of head-on beam-beam compensation with electron lenses on the proton beam dynamics in the RHIC 250 GeV polarized proton operation. This article is abridged from the published article [1]. Comment: 5 pages, contribution to the ICFA Mini-Workshop on Beam-Beam Effects in Hadron Colliders, CERN, Geneva, Switzerland, 18-22 Mar 201

arXiv.org e-Print Archive

CERN Document Server