7 research outputs found
A fast algorithm to solve systems of nonlinear equations
[EN] A new HSS-based algorithm for solving systems of nonlinear equations is presented and its semilocal convergence is proved. Spectral properties of the new method are investigated. Performance profile for the new scheme is computed and compared with HSS algorithm. Besides, by a numerical example in which a two-dimensional nonlinear convection diffusion equation is solved, we compare the new method and the Newton-HSS method. Numerical results show that the new scheme solves the problem faster than the NewtonHSS scheme in terms of CPU -time and number of iterations. Moreover, the application of the new method is found to be fast, reliable, flexible, accurate, and has small CPU time.This research was partially supported by Ministerio de Economia y Competitividad, Spain under grants MTM2014-52016-C2-2-P and Generalitat Valenciana, Spain PROMETEO/2016/089.Amiri, A.; Cordero Barbero, A.; Darvishi, M.; Torregrosa Sánchez, JR. (2019). A fast algorithm to solve systems of nonlinear equations. Journal of Computational and Applied Mathematics. 354:242-258. https://doi.org/10.1016/j.cam.2018.03.048S24225835
Recommended from our members
Fermion Low Modes in Lattice QCD: Topology, the η' Mass and Algorithm Development
Lattice gauge theory is an important approach to understanding quantum chromodynamics (QCD) due to the large coupling constant in the theory at low energy. In this thesis, we report our study of the topological properties of the gauge fields and we calculate _η and _η' which are related to the topology of the gauge fields. We also develop two algorithms to speed up the inversion of the Dirac equation which is computationally demanding in lattice QCD calculations.
The topology of lattice gauge fields is important but difficult to study because of the large local fluctuations of the gauge fields. In chapter 2, we probe the topological properties of the gauge fields through the measurement of closed quark loops, field strength and low-lying eigenvectors of the Shamir domain wall operator. The closed quark loops suggest the slow evolution of topological modes during the generation of QCD configurations. The chirality of the low-lying eigenvectors is studied and the lattice eigenvectors are compared to the eigenvectors in the continuous theory. The topological charges are calculated from the eigenvectors and the results agree with the topological charges calculated from the smoothed gauge fields. The fermion correlators are also obtained from the eigenvectors.
The non-trivial topological properties of QCD gauge fields are important to the mass of the η and η', _η and _η'. Lattice QCD is an area where _{\eta} can be calculated by using gauge fields that are sampled over different topological sectors. We calculate _η and _η' in chapter 3 by including the fermion correlators and the topological charge density correlators. The errors of _η and _η' are reduced to the percent level and the mixing angle between the octet, singlet states in the SU(3) limit and the physical eigenstates is calculated.
An algorithm that reduces communication and increases the usage of the local computational power is developed in chapter 4. The algorithm uses the multisplitting algorithm as a preconditioner in the preconditioned conjugate gradient method. It speeds up the inversion of the Dirac equation during the evolution phase.
In chapter 5, we utilize two lattices, called the coarse lattice and the fine lattice, that lie on the renormalization group trajectory and have different lattice spacings. We find that the low-mode space of the coarse lattice corresponds to the low-mode space of the fine lattice. Because of the correspondence, the coarse lattice can be used to solve the low modes of the fine lattice. The coarse lattice is used in the restart algorithm and the preconditioned conjugate gradient algorithm where the latter is called the renormalization group based preconditioned conjugate gradient algorithm (RGPCG). By using the near-null vectors as the filter, RGPCG could reduce the operations of the matrix multiplications on the fine lattice by 33% to 44% for the inversion of Dirac equation. The algorithm works better than the conjugate gradient algorithm when multiple equations are solved
Improving performance of simplified computational fluid dynamics models via symmetric successive overrelaxation
The ability to model fluid flow and heat transfer in process equipment (e.g., shell-and-tube heat exchangers) is often critical. What is more, many different geometric variants may need to be evaluated during the design process. Although this can be done using detailed computational fluid dynamics (CFD) models, the time needed to evaluate a single variant can easily reach tens of hours on powerful computing hardware. Simplified CFD models providing solutions in much shorter time frames may, therefore, be employed instead. Still, even these models can prove to be too slow or not robust enough when used in optimization algorithms. Effort is thus devoted to further improving their performance by applying the symmetric successive overrelaxation (SSOR) preconditioning technique in which, in contrast to, e.g., incomplete lower–upper factorization (ILU), the respective preconditioning matrix can always be constructed. Because the efficacy of SSOR is influenced by the selection of forward and backward relaxation factors, whose direct calculation is prohibitively expensive, their combinations are experimentally investigated using several representative meshes. Performance is then compared in terms of the single-core computational time needed to reach a converged steady-state solution, and recommendations are made regarding relaxation factor combinations generally suitable for the discussed purpose. It is shown that SSOR can be used as a suitable fallback preconditioner for the fast-performing, but numerically sensitive, incomplete lower–upper factorization
A bibliography on parallel and vector numerical algorithms
This is a bibliography of numerical methods. It also includes a number of other references on machine architecture, programming language, and other topics of interest to scientific computing. Certain conference proceedings and anthologies which have been published in book form are listed also
Asynchronous and Multiprecision Linear Solvers - Scalable and Fault-Tolerant Numerics for Energy Efficient High Performance Computing
Asynchronous methods minimize idle times by removing synchronization barriers, and therefore allow the efficient usage of computer systems. The implied high tolerance with respect to communication latencies improves the fault tolerance. As asynchronous methods also enable the usage of the power and energy saving mechanisms provided by the hardware, they are suitable candidates for the highly parallel and heterogeneous hardware platforms that are expected for the near future