111 research outputs found

    Solutions of large-scale electromagnetics problems involving dielectric objects with the parallel multilevel fast multipole algorithm

    Get PDF
    Fast and accurate solutions of large-scale electromagnetics problems involving homogeneous dielectric objects are considered. Problems are formulated with the electric and magnetic current combined-field integral equation and discretized with the Rao-Wilton-Glisson functions. Solutions are performed iteratively by using the multi-level fast multipole algorithm (MLFMA). For the solution of large-scale problems discretized with millions of unknowns, MLFMA is parallelized on distributed-memory architectures using a rigorous technique, namely, the hierarchical partitioning strategy. Efficiency and accuracy of the developed implementation are demonstrated on very large problems involving as many as 100 million unknowns

    Analysis of a scalable, parallel, 2D MLFMA solver

    Get PDF

    Weak scalability analysis of the distributed-memory parallel MLFMA

    Get PDF
    Distributed-memory parallelization of the multilevel fast multipole algorithm (MLFMA) relies on the partitioning of the internal data structures of the MLFMA among the local memories of networked machines. For three existing data partitioning schemes (spatial, hybrid and hierarchical partitioning), the weak scalability, i.e., the asymptotic behavior for proportionally increasing problem size and number of parallel processes, is analyzed. It is demonstrated that none of these schemes are weakly scalable. A nontrivial change to the hierarchical scheme is proposed, yielding a parallel MLFMA that does exhibit weak scalability. It is shown that, even for modest problem sizes and a modest number of parallel processes, the memory requirements of the proposed scheme are already significantly lower, compared to existing schemes. Additionally, the proposed scheme is used to perform full-wave simulations of a canonical example, where the number of unknowns and CPU cores are proportionally increased up to more than 200 millions of unknowns and 1024 CPU cores. The time per matrix-vector multiplication for an increasing number of unknowns and CPU cores corresponds very well to the theoretical time complexity

    Hierarchical Parallelization of the Multilevel Fast Multipole Algorithm (MLFMA)

    Get PDF
    Cataloged from PDF version of article.Due to its O(NlogN) complexity, the multilevel fast multipole algorithm (MLFMA) is one of the most prized algorithms of computational electromagnetics and certain other disciplines. Various implementations of this algorithm have been used for rigorous solutions of large-scale scattering, radiation, and miscellaneous other electromagnetics problems involving 3-D objects with arbitrary geometries. Parallelization of MLFMA is crucial for solving real-life problems discretized with hundreds of millions of unknowns. This paper presents the hierarchical partitioning strategy, which provides a very efficient parallelization of MLFMA on distributed-memory architectures. We discuss the advantages of the hierarchical strategy over previous approaches and demonstrate the improved efficiency on scattering problems discretized with millions of unknowns

    Scalable parallel computation of the translation operator in three dimensions

    Get PDF
    We propose a novel algorithm for the parallel, distributed-memory computation of the translation operator in the three-dimensional multilevel fast multipole algorithm (MLFMA). Sequential algorithms can compute the translation operator with L multipoles and O(L-2) sampling points in O(L-2) time. State-of-the-art hierarchical parallelization schemes of the MLFMA rely on the distribution of radiation patterns and associated translation operators among P = O(L-2) parallel processes, necessitating the development of distributed-memory algorithms for the computation of the translation operator. Whereas a baseline parallel algorithm computes this translation operator in O(L) time, we propose an algorithm that achieves this in only O(log L) time. For large translation operators and a high number of parallel processes, our algorithm proves to be roughly ten times faster than the baseline algorithm

    Accurate and efficient algorithms for boundary element methods in electromagnetic scattering: a tribute to the work of F. Olyslager

    Get PDF
    Boundary element methods (BEMs) are an increasingly popular approach to model electromagnetic scattering both by perfect conductors and dielectric objects. Several mathematical, numerical, and computational techniques pullulated from the research into BEMs, enhancing its efficiency and applicability. In designing a viable implementation of the BEM, both theoretical and practical aspects need to be taken into account. Theoretical aspects include the choice of an integral equation for the sought after current densities on the geometry's boundaries and the choice of a discretization strategy (i.e. a finite element space) for this equation. Practical aspects include efficient algorithms to execute the multiplication of the system matrix by a test vector (such as a fast multipole method) and the parallelization of this multiplication algorithm that allows the distribution of the computation and communication requirements between multiple computational nodes. In honor of our former colleague and mentor, F. Olyslager, an overview of the BEMs for large and complex EM problems developed within the Electromagnetics Group at Ghent University is presented. Recent results that ramified from F. Olyslager's scientific endeavors are included in the survey

    A Hierarchical Partitioning Strategy for an Efficient Parallelization of the Multilevel Fast Multipole Algorithm

    Get PDF
    Cataloged from PDF version of article.We present a novel hierarchical partitioning strategy for the efficient parallelization of the multilevel fast multipole algorithm (MLFMA) on distributed-memory architectures to solve large-scale problems in electromagnetics. Unlike previous parallelization techniques, the tree structure of MLFMA is distributed among processors by partitioning both clusters and samples of fields at each level. Due to the improved load-balancing, the hierarchical strategy offers a higher parallelization efficiency than previous approaches, especially when the number of processors is large. We demonstrate the improved efficiency on scattering problems discretized with millions of unknowns. In addition, we present the effectiveness of our algorithm by solving very large scattering problems involving a conducting sphere of radius 210 wavelengths and a complicated real-life target with a maximum dimension of 880 wavelengths. Both of the objects are discretized with more than 200 million unknowns
    corecore