1,480 research outputs found
Matrix Structure Exploitation in Generalized Eigenproblems Arising in Density Functional Theory
In this short paper, the authors report a new computational approach in the
context of Density Functional Theory (DFT). It is shown how it is possible to
speed up the self-consistent cycle (iteration) characterizing one of the most
well-known DFT implementations: FLAPW. Generating the Hamiltonian and overlap
matrices and solving the associated generalized eigenproblems
constitute the two most time-consuming fractions of each iteration. Two
promising directions, implementing the new methodology, are presented that will
ultimately improve the performance of the generalized eigensolver and save
computational time.Comment: To appear in the proceedings of 8th International Conference on
Numerical Analysis and Applied Mathematics (ICNAAM 2010
What Are the Success Factors of Multilingual Families? Relationships Between Linguistic Attitudes and Community Dynamics
The research focuses on the influence of emotional, cognitive, and social climate on the language choices of multilingual families, and the impact they can have on their general well-being, intergenerational relationships, and the community context. The methodological framework of reference is Grounded Theory. Collected data concern language practices, attitudes, emotions, and generational, trigenerational, and social interactive dynamics of multilingual families. The results include key insights into the variables underlying the linguistic attitudes of multicultural families. Two Network Views suggest that linguistic attitudes, such as the conscious management of specific and complex dynamics activated in a multilingual family, can stimulate well-being
Quantum Deconstruction of 5D SQCD
We deconstruct the fifth dimension of 5D SCQD with general numbers of colors
and flavors and general 5D Chern-Simons level; the latter is adjusted by adding
extra quarks to the 4D quiver. We use deconstruction as a non-stringy UV
completion of the quantum 5D theory; to prove its usefulness, we compute
quantum corrections to the SQCD_5 prepotential. We also explore the
moduli/parameter space of the deconstructed SQCD_5 and show that for |K_CS| <
N_F/2 it continues to negative values of 1/(g_5)^2. In many cases there are
flop transitions connecting SQCD_5 to exotic 5D theories such as E0, and we
present several examples of such transitions. We compare deconstruction to
brane-web engineering of the same SQCD_5 and show that the phase diagram is the
same in both cases; indeed, the two UV completions are in the same universality
class, although they are not dual to each other. Hence, the phase structure of
an SQCD_5 (and presumably any other 5D gauge theory) is inherently
five-dimensional and does not depends on a UV completion.Comment: LaTeX+PStricks, 108 pages, 41 colored figures. Please print in colo
Chiral Rings of Deconstructive [SU(n_c)]^N Quivers
Dimensional deconstruction of 5D SQCD with general n_c, n_f and k_CS gives
rise to 4D N=1 gauge theories with large quivers of SU(n_c) gauge factors. We
construct the chiral rings of such [SU(n_c)]^N theories, off-shell and
on-shell. Our results are broadly similar to the chiral rings of single U(n_c)
theories with both adjoint and fundamental matter, but there are also some
noteworthy differences such as nonlocal meson-like operators where the quark
and antiquark fields belong to different nodes of the quiver. And because our
gauge groups are SU(n_c) rather than U(n_c), our chiral rings also contain a
whole zoo of baryonic and antibaryonic operators.Comment: 93 pages, LaTeX, PSTricks macros; 1 reference added in v
High-performance functional renormalization group calculations for interacting fermions
We derive a novel computational scheme for functional Renormalization Group
(fRG) calculations for interacting fermions on 2D lattices. The scheme is based
on the exchange parametrization fRG for the two-fermion interaction, with
additional insertions of truncated partitions of unity. These insertions
decouple the fermionic propagators from the exchange propagators and lead to a
separation of the underlying equations. We demonstrate that this separation is
numerically advantageous and may pave the way for refined, large-scale
computational investigations even in the case of complex multiband systems.
Furthermore, on the basis of speedup data gained from our implementation, it is
shown that this new variant facilitates efficient calculations on a large
number of multi-core CPUs. We apply the scheme to the , Hubbard model on
a square lattice to analyze the convergence of the results with the bond length
of the truncation of the partition of unity. In most parameter areas, a fast
convergence can be observed. Finally, we compare to previous results in order
to relate our approach to other fRG studies.Comment: 26 pages, 9 figure
Lipschitz regularity for degenerate elliptic integrals with p, q-growth
We establish the local Lipschitz continuity and the higher differentiability of vector-valued local minimizers of a class of energy integrals of the Calculus of Variations. The main novelty is that we deal with possibly degenerate energy densities with respect to the x -variable
Hybrid CPU-GPU generation of the Hamiltonian and overlap matrices in FLAPW methods
In this paper we focus on the integration of high-performance numerical libraries in ab initio codes and the portability of performance and scalability. The target of our work is FLEUR, a software for electronic structure calculations developed in the Forschungszentrum J\"ulich over the course of two decades. The presented work follows up on a previous effort to modernize legacy code by re-engineering and rewriting it in terms of highly optimized libraries. We illustrate how this initial effort to get efficient and portable shared-memory code enables fast porting of the code to emerging heterogeneous architectures. More specifically, we port the code to nodes equipped with multiple GPUs. We divide our study in two parts. First, we show considerable speedups attained by minor and relatively straightforward code changes to off-load parts of the computation to the GPUs. Then, we identify further possible improvements to achieve even higher performance and scalability. On a system consisting of 16-cores and 2 GPUs, we observe speedups of up to 5x with respect to our optimized shared-memory code, which in turn means between 7.5x and 12.5x speedup with respect to the original FLEUR code
- …