344 research outputs found

    Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) in hypre and PETSc

    Full text link
    We describe our software package Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) publicly released recently. BLOPEX is available as a stand-alone serial library, as an external package to PETSc (``Portable, Extensible Toolkit for Scientific Computation'', a general purpose suite of tools for the scalable solution of partial differential equations and related problems developed by Argonne National Laboratory), and is also built into {\it hypre} (``High Performance Preconditioners'', scalable linear solvers package developed by Lawrence Livermore National Laboratory). The present BLOPEX release includes only one solver--the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) method for symmetric eigenvalue problems. {\it hypre} provides users with advanced high-quality parallel preconditioners for linear systems, in particular, with domain decomposition and multigrid preconditioners. With BLOPEX, the same preconditioners can now be efficiently used for symmetric eigenvalue problems. PETSc facilitates the integration of independently developed application modules with strict attention to component interoperability, and makes BLOPEX extremely easy to compile and use with preconditioners that are available via PETSc. We present the LOBPCG algorithm in BLOPEX for {\it hypre} and PETSc. We demonstrate numerically the scalability of BLOPEX by testing it on a number of distributed and shared memory parallel systems, including a Beowulf system, SUN Fire 880, an AMD dual-core Opteron workstation, and IBM BlueGene/L supercomputer, using PETSc domain decomposition and {\it hypre} multigrid preconditioning. We test BLOPEX on a model problem, the standard 7-point finite-difference approximation of the 3-D Laplacian, with the problem size in the range 10510810^5-10^8.Comment: Submitted to SIAM Journal on Scientific Computin

    Fast and Robust Parametric Estimation of Jointly Sparse Channels

    Get PDF
    We consider the joint estimation of multipath channels obtained with a set of receiving antennas and uniformly probed in the frequency domain. This scenario fits most of the modern outdoor communication protocols for mobile access or digital broadcasting among others. Such channels verify a Sparse Common Support property (SCS) which was used in a previous paper to propose a Finite Rate of Innovation (FRI) based sampling and estimation algorithm. In this contribution we improve the robustness and computational complexity aspects of this algorithm. The method is based on projection in Krylov subspaces to improve complexity and a new criterion called the Partial Effective Rank (PER) to estimate the level of sparsity to gain robustness. If P antennas measure a K-multipath channel with N uniformly sampled measurements per channel, the algorithm possesses an O(KPNlogN) complexity and an O(KPN) memory footprint instead of O(PN^3) and O(PN^2) for the direct implementation, making it suitable for K << N. The sparsity is estimated online based on the PER, and the algorithm therefore has a sense of introspection being able to relinquish sparsity if it is lacking. The estimation performances are tested on field measurements with synthetic AWGN, and the proposed algorithm outperforms non-sparse reconstruction in the medium to low SNR range (< 0dB), increasing the rate of successful symbol decodings by 1/10th in average, and 1/3rd in the best case. The experiments also show that the algorithm does not perform worse than a non-sparse estimation algorithm in non-sparse operating conditions, since it may fall-back to it if the PER criterion does not detect a sufficient level of sparsity. The algorithm is also tested against a method assuming a "discrete" sparsity model as in Compressed Sensing (CS). The conducted test indicates a trade-off between speed and accuracy.Comment: 11 pages, 9 figures, submitted to IEEE JETCAS special issue on Compressed Sensing, Sep. 201

    Systematic Data Extraction in High-Frequency Electromagnetic Fields

    Get PDF
    The focus of this work is on the investigation of billiards with its statistical eigenvalue properties. Specifically, superconducting microwave resonators with chaotic characteristics are simulated and the eigenfrequencies that are needed for the statistical analysis are computed. The eigenfrequency analysis requires many (in order of thousands) eigenfrequencies to be calculated and the accurate determination of the eigenfrequencies has a crucial significance. Consequently, the research interests cover all aspects from accurate numerical calculation of many eigenvalues and eigenvectors up to application development in order to get good performance out of the programs for distributed-memory and shared-memory multiprocessors. Furthermore, this thesis provides an overview and detailed evaluation of the used numerical approaches for large-scale eigenvalue calculations with respect to the accuracy, the computational time, and the memory consumption. The first approach for an accurate eigenfrequency extraction takes into consideration the evaluated electric field computations in Time Domain (TD) of a superconducting resonant structure. Upon excitation of the cavity, the electric field intensity is recorded at different detection probes inside the cavity. Thereafter, Fourier analysis of the recorded signals is performed and by means of signal-processing and fitting techniques, the requested eigenfrequencies are extracted by finding the optimal model parameters in the least squares sense. The second numerical approach is based on a numerical computation of electromagnetic fields in Frequency Domain (FD) and further employs the Lanczos method for the eigenvalue determination. Namely, when utilizing the Finite Integration Technique (FIT) to solve an electromagnetic problem for a superconducting cavity, which enclosures excited electromagnetic fields, the numerical solution of a standard large-scale eigenvalue problem is considered. Accordingly, if the numerical solution of the same problem is treated by the Finite Element Method (FEM) based on curvilinear tetrahedrons, it yields to the generalized large-scale eigenvalue problem. Afterward, the desired eigenvalues are calculated with the direct solution of the large (generalized) eigenvalue formulations. For this purpose, the implemented Lanczos solvers combine two major ingredients: the Lanczos algorithm with polynomial filtering on the one hand and its parallelization on the other

    USRA/RIACS

    Get PDF
    The Research Institute for Advanced Computer Science (RIACS) was established by the Universities Space Research Association (USRA) at the NASA Ames Research Center (ARC) on 6 June 1983. RIACS is privately operated by USRA, a consortium of universities with research programs in the aerospace sciences, under a cooperative agreement with NASA. The primary mission of RIACS is to provide research and expertise in computer science and scientific computing to support the scientific missions of NASA ARC. The research carried out at RIACS must change its emphasis from year to year in response to NASA ARC's changing needs and technological opportunities. A flexible scientific staff is provided through a university faculty visitor program, a post doctoral program, and a student visitor program. Not only does this provide appropriate expertise but it also introduces scientists outside of NASA to NASA problems. A small group of core RIACS staff provides continuity and interacts with an ARC technical monitor and scientific advisory group to determine the RIACS mission. RIACS activities are reviewed and monitored by a USRA advisory council and ARC technical monitor. Research at RIACS is currently being done in the following areas: Parallel Computing; Advanced Methods for Scientific Computing; Learning Systems; High Performance Networks and Technology; Graphics, Visualization, and Virtual Environments

    Parallel three-dimensional magnetotelluric inversion using adaptive finite-element method. Part I: theory and synthetic study

    Get PDF
    This paper presents a distributed magnetotelluric inversion scheme based on adaptive finite-element method (FEM). The key novel aspect of the introduced algorithm is the use of automatic mesh refinement techniques for both forward and inverse modelling. These techniques alleviate tedious and subjective procedure of choosing a suitable model parametrization. To avoid overparametrization, meshes for forward and inverse problems were decoupled. For calculation of accurate electromagnetic (EM) responses, automatic mesh refinement algorithm based on a goal-oriented error estimator has been adopted. For further efficiency gain, EM fields for each frequency were calculated using independent meshes in order to account for substantially different spatial behaviour of the fields over a wide range of frequencies. An automatic approach for efficient initial mesh design in inverse problems based on linearized model resolution matrix was developed. To make this algorithm suitable for large-scale problems, it was proposed to use a low-rank approximation of the linearized model resolution matrix. In order to fill a gap between initial and true model complexities and resolve emerging 3-D structures better, an algorithm for adaptive inverse mesh refinement was derived. Within this algorithm, spatial variations of the imaged parameter are calculated and mesh is refined in the neighborhoods of points with the largest variations. A series of numerical tests were performed to demonstrate the utility of the presented algorithms. Adaptive mesh refinement based on the model resolution estimates provides an efficient tool to derive initial meshes which account for arbitrary survey layouts, data types, frequency content and measurement uncertainties. Furthermore, the algorithm is capable to deliver meshes suitable to resolve features on multiple scales while keeping number of unknowns low. However, such meshes exhibit dependency on an initial model guess. Additionally, it is demonstrated that the adaptive mesh refinement can be particularly efficient in resolving complex shapes. The implemented inversion scheme was able to resolve a hemisphere object with sufficient resolution starting from a coarse discretization and refining mesh adaptively in a fully automatic process. The code is able to harness the computational power of modern distributed platforms and is shown to work with models consisting of millions of degrees of freedom. Significant computational savings were achieved by using locally refined decoupled meshe

    Electromagnetic model subdivision and iterative solvers for surface and volume double higher order numerical methods and applications

    Get PDF
    2019 Fall.Includes bibliographical references.Higher order methods have been established in the numerical analysis of electromagnetic structures decreasing the number of unknowns compared to the low order discretization. In order to decrease memory requirements even further, model subdivision in the computational analysis of electrically large structures has been used. The technique is based on clustering elements and solving/approximating subsystems separately, and it is often implemented in conjunction with iterative solvers. This thesis addresses unique theoretical and implementation details specific to model subdivision of the structures discretized by the Double Higher Order (DHO) elements analyzed by i) Finite Element Method - Mode Matching (FEM-MM) technique for closed-region (waveguide) structures and ii) Surface Integral Equation Method of Moments (SIE-MoM) in combination with (Multi-Level) Fast Multipole Method for open-region bodies. Besides standard application in decreasing the model size, DHO FEM-MM is applied to modeling communication system in tunnels by means of Standard Impedance Boundary Condition (SIBC), and excellent agreement is achieved with measurements performed in Massif Central tunnel. To increase accuracy of the SIE-MoM computation, novel method for numerical evaluation of the 2-D surface integrals in MoM matrix entries has been improved to achieve better accuracy than traditional method. To demonstrate its efficiency and practicality, SIE-MoM technique is applied to analysis of the rain event containing significant percentage of the oscillating drops recorded by 2D video disdrometer. An excellent agreement with previously-obtained radar measurements has been established providing the benefits of accurately modeling precipitation particles
    corecore