49 research outputs found
Recommended from our members
LUsim: A Framework for Simulation-Based Performance Modeling and Prediction of Parallel Sparse LU Factorization
Sparse parallel factorization is among the most complicated and irregular algorithms to analyze and optimize. Performance depends both on system characteristics, such as the floating-point rate, the memory hierarchy, and the interconnect performance, and on input matrix characteristics, such as the number and location of nonzeros. We present LUsim, a simulation framework for modeling the performance of sparse LU factorization. Our framework uses micro-benchmarks to calibrate the parameters of machine characteristics and additional tools to facilitate real-time performance modeling. We are using LUsim to analyze an existing parallel sparse LU factorization code and to explore a latency-tolerant variant. We developed and validated a model of the factorization in SuperLU_DIST, then modeled and implemented a new variant of slud, replacing a blocking collective communication phase with a non-blocking asynchronous point-to-point one. Our strategy realized a mean improvement of 11 percent over a suite of test matrices.
On an ODE model for narrow-necked rotating liquid drops
SIGLE. Available from TIB Hannover: RO 7722(299) / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. DE. German.
Error correction models for fractionally cointegrated time series
This note provides a proof of Granger's (1986) error correction model for fractionally cointegrated variables and points out a necessary assumption that has not been noted before. Moreover, a simpler, alternative error correction model is proposed which can be employed to estimate fractionally cointegrated systems in three steps. (orig.) Available from TIB Hannover: RR 8460(2000,2) / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. SIGLE. DE. German.
On the existence of Hermitian-harmonic maps from complete Hermitian to complete Riemannian manifolds
SIGLE. Available from TIB Hannover: RR 4487(2003,9) / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. Deutsche Forschungsgemeinschaft (DFG), Bonn (Germany). DE. German.
Properties of nonlinear transformations of fractionally integrated processes
SIGLE. Available from TIB Hannover: RR 8460(2000,25) / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. DE. German.
Superconducting tunneling von VNx films
SIGLE. Copy held by FIZ Karlsruhe; available from UB/TIB Hannover / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. DE. German.
Very close pairs of quasi-stellar objects
It is pointed out that there are now known four very close pairs of QSOs with separations > 5 arcsec and very different redshifts. Several estimates of the probability that they are accidental configurations range between 10^-7 and 3.5 x 10^-3. We conclude either that this is further evidence that QSOs have significant non-cosmological redshift components, or that the pairs must be explained by gravitational lensing. (orig.) Available from TIB Hannover: RR 4697(916) / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. SIGLE. DE. German.
Flow shear suppression of turbulence using externally driven ion Bernstein and Alfven waves
The utilization of externally launched radio-frequency waves as a means of active confinement control through the generation of sheared poloidal flows is explored. For low-frequency waves, kinetic Alfven waves are proposed, and are shown to drive sheared E × B flows as a result of the radial variation in the electromagnetic Reynolds stress. In the high-frequency regime, ion Bernstein waves are considered, and shown to generate sheared poloidal rotation through the ponderomotive force. In either case, it is shown that modest amounts of absorbed power (approximately a few 100 kW) are required to suppress turbulence in a region of several cm radial width. 9 refs.
A more robust definition of subjective probability
SIGLE. Available from TIB Hannover: RO 2708(365) / FIZ - Fachinformationszentrum Karlsruhe / TIB - Technische Informationsbibliothek. DE. German.