7 research outputs found

    Random geometry at an infinite-randomness fixed point

    Full text link
    We study the low-energy physics of the critical (2+1)-dimensional random transverse-field Ising model. The one-dimensional version of the model is a paradigmatic example of a system governed by an infinite-randomness fixed point, for which many results on the distributions of observables are known via an asymptotically exact renormalization group (RG) approach. In two dimensions, the same RG rules have been implemented numerically, and demonstrate a flow to infinite randomness. However, analytical understanding of the structure of this RG has remained elusive due to the development of geometrical structure in the graph of interacting spins. To understand the character of the fixed point, we consider the RG flow acting on a joint ensemble of graphs and couplings. We propose that the RG effectively occurs in two stages: (1) randomization of the interaction graph until it belongs to a certain ensemble of random triangulations of the plane, and (2) a flow of the distributions of couplings to infinite randomness while the graph ensemble remains invariant. This picture is substantiated by a numerical RG in which one obtains a steady-state graph degree distribution and subsequently infinite-randomness scaling distributions of the couplings. Both of these aspects of the RG flow can be approximately reproduced in simplified analytical models.Comment: 28 pages, 13 figure

    Flatter, faster: scaling momentum for optimal speedup of SGD

    Full text link
    Commonly used optimization algorithms often show a trade-off between good generalization and fast training times. For instance, stochastic gradient descent (SGD) tends to have good generalization; however, adaptive gradient methods have superior training times. Momentum can help accelerate training with SGD, but so far there has been no principled way to select the momentum hyperparameter. Here we study training dynamics arising from the interplay between SGD with label noise and momentum in the training of overparametrized neural networks. We find that scaling the momentum hyperparameter 1β1-\beta with the learning rate to the power of 2/32/3 maximally accelerates training, without sacrificing generalization. To analytically derive this result we develop an architecture-independent framework, where the main assumption is the existence of a degenerate manifold of global minimizers, as is natural in overparametrized models. Training dynamics display the emergence of two characteristic timescales that are well-separated for generic values of the hyperparameters. The maximum acceleration of training is reached when these two timescales meet, which in turn determines the scaling limit we propose. We confirm our scaling rule for synthetic regression problems (matrix sensing and teacher-student paradigm) and classification for realistic datasets (ResNet-18 on CIFAR10, 6-layer MLP on FashionMNIST), suggesting the robustness of our scaling rule to variations in architectures and datasets.Comment: v2: expanded introduction section, corrected minor typos. v1: 12+13 pages, 3 figure

    TBG V: Exact Analytic Many-Body Excitations In Twisted Bilayer Graphene Coulomb Hamiltonians: Charge Gap, Goldstone Modes and Absence of Cooper Pairing

    Full text link
    We find exact analytic expressions for the energies and wavefunctions of the charged and neutral excitations above the exact ground states (at rational filling per unit cell) of projected Coulomb Hamiltonians in twisted bilayer graphene. Our exact expressions are valid for any form of the Coulomb interaction and any form of AAAA and AB/BAAB/BA tunneling. The single charge excitation energy is a convolution of the Coulomb potential with a quantum geometric tensor of the TBG bands. The neutral excitations are (high-symmetry group) magnons, and their dispersion is analytically calculated in terms of the form factors of the active bands in TBG. The two-charge excitation energy and wavefunctions are also obtained, and a sufficient condition on the graphene eigenstates for obtaining a Cooper-pair from Coulomb interactions is obtained. For the actual TBG bands at the first magic angle, we can analytically show that the Cooper pair binding energy is zero in all such projected Coulomb models, implying that either phonons and/or non-zero kinetic energy are needed for superconductivity. Since the [Phys. Rev. Lett. 122, 246401] showed that the kinetic energy bounds on the superexchange energy are less 10310^{-3} in Coulomb units, the phonon mechanism becomes then very likely. If nonetheless the superconductivity is due to kinetic terms which render the bands non-flat, one prediction of our theory is that the highest TcT_c would not occur at the highest DOS.Comment: references adde

    TBG VI: An Exact Diagonalization Study of Twisted Bilayer Graphene at Non-Zero Integer Fillings

    Full text link
    Using exact diagonalization, we study the projected Hamiltonian with Coulomb interaction in the 8 flat bands of first magic angle twisted bilayer graphene. Employing the U(4) (U(4)×\timesU(4)) symmetries in the nonchiral (chiral) flat band limit, we reduced the Hilbert space to an extent which allows for study around ν=±3,±2,±1\nu=\pm 3,\pm2,\pm1 fillings. In the first chiral limit w0/w1=0w_0/w_1=0 where w0w_0 (w1w_1) is the AAAA (ABAB) stacking hopping, we find that the ground-states at these fillings are extremely well-described by Slater determinants in a so-called Chern basis, and the exactly solvable charge ±1\pm1 excitations found in [arXiv:2009.14200] are the lowest charge excitations up to system sizes 8×88\times8 (for restricted Hilbert space) in the chiral-flat limit. We also find that the Flat Metric Condition (FMC) used in [arXiv:2009.11301,2009.11872,2009.12376,2009.13530,2009.14200] for obtaining a series of exact ground-states and excitations holds in a large parameter space. For ν=3\nu=-3, the ground state is the spin and valley polarized Chern insulator with νC=±1\nu_C=\pm1 at w0/w10.9w_0/w_1\lesssim0.9 (0.3) with (without) FMC. At ν=2\nu=-2, we can only numerically access the valley polarized sector, and we find a spin ferromagnetic phase when w0/w10.5tw_0/w_1\gtrsim0.5t where t[0,1]t\in[0,1] is the factor of rescaling of the actual TBG bandwidth, and a spin singlet phase otherwise, confirming the perturbative calculation [arXiv:2009.13530]. The analytic FMC ground state is, however, predicted in the intervalley coherent sector which we cannot access [arXiv:2009.13530]. For ν=3\nu=-3 with/without FMC, when w0/w1w_0/w_1 is large, the finite-size gap Δ\Delta to the neutral excitations vanishes, leading to phase transitions. Further analysis of the ground state momentum sectors at ν=3\nu=-3 suggests a competition among (nematic) metal, momentum MMM_M (π\pi) stripe and KMK_M-CDW orders at large w0/w1w_0/w_1.Comment: 21+23 pages, 13+15 figure

    Electron-Electron Interactions in Twisted Bilayer Graphene

    No full text
    In this paper we use exact diagonalization studies to explore the phase diagram of twisted bilayer graphene at the first magic angle. We show the physics in the flat band limit is closely related to the physics at a realistic bandwidth, inducing spin and valley polarization. We also show that the physics at realistic parameters can be described as a perturbation on the chiral limit with a large U(4)×U(4)U(4) \times U(4) symmetry which simplifies calculation greatly. This result may be used in the future to more easily calculate the many-body response as a function of single-body properties even when the interaction between electrons is strong

    Electron-Electron Interactions in Twisted Bilayer Graphene

    No full text
    In this paper we use exact diagonalization studies to explore the phase diagram of twisted bilayer graphene at the first magic angle. We show the physics in the flat band limit is closely related to the physics at a realistic bandwidth, inducing spin and valley polarization. We also show that the physics at realistic parameters can be described as a perturbation on the chiral limit with a large U(4)×U(4)U(4) \times U(4) symmetry which simplifies calculation greatly. This result may be used in the future to more easily calculate the many-body response as a function of single-body properties even when the interaction between electrons is strong
    corecore