7 research outputs found
Random geometry at an infinite-randomness fixed point
We study the low-energy physics of the critical (2+1)-dimensional random
transverse-field Ising model. The one-dimensional version of the model is a
paradigmatic example of a system governed by an infinite-randomness fixed
point, for which many results on the distributions of observables are known via
an asymptotically exact renormalization group (RG) approach. In two dimensions,
the same RG rules have been implemented numerically, and demonstrate a flow to
infinite randomness. However, analytical understanding of the structure of this
RG has remained elusive due to the development of geometrical structure in the
graph of interacting spins. To understand the character of the fixed point, we
consider the RG flow acting on a joint ensemble of graphs and couplings. We
propose that the RG effectively occurs in two stages: (1) randomization of the
interaction graph until it belongs to a certain ensemble of random
triangulations of the plane, and (2) a flow of the distributions of couplings
to infinite randomness while the graph ensemble remains invariant. This picture
is substantiated by a numerical RG in which one obtains a steady-state graph
degree distribution and subsequently infinite-randomness scaling distributions
of the couplings. Both of these aspects of the RG flow can be approximately
reproduced in simplified analytical models.Comment: 28 pages, 13 figure
Flatter, faster: scaling momentum for optimal speedup of SGD
Commonly used optimization algorithms often show a trade-off between good
generalization and fast training times. For instance, stochastic gradient
descent (SGD) tends to have good generalization; however, adaptive gradient
methods have superior training times. Momentum can help accelerate training
with SGD, but so far there has been no principled way to select the momentum
hyperparameter. Here we study training dynamics arising from the interplay
between SGD with label noise and momentum in the training of overparametrized
neural networks. We find that scaling the momentum hyperparameter
with the learning rate to the power of maximally accelerates training,
without sacrificing generalization. To analytically derive this result we
develop an architecture-independent framework, where the main assumption is the
existence of a degenerate manifold of global minimizers, as is natural in
overparametrized models. Training dynamics display the emergence of two
characteristic timescales that are well-separated for generic values of the
hyperparameters. The maximum acceleration of training is reached when these two
timescales meet, which in turn determines the scaling limit we propose. We
confirm our scaling rule for synthetic regression problems (matrix sensing and
teacher-student paradigm) and classification for realistic datasets (ResNet-18
on CIFAR10, 6-layer MLP on FashionMNIST), suggesting the robustness of our
scaling rule to variations in architectures and datasets.Comment: v2: expanded introduction section, corrected minor typos. v1: 12+13
pages, 3 figure
TBG V: Exact Analytic Many-Body Excitations In Twisted Bilayer Graphene Coulomb Hamiltonians: Charge Gap, Goldstone Modes and Absence of Cooper Pairing
We find exact analytic expressions for the energies and wavefunctions of the
charged and neutral excitations above the exact ground states (at rational
filling per unit cell) of projected Coulomb Hamiltonians in twisted bilayer
graphene. Our exact expressions are valid for any form of the Coulomb
interaction and any form of and tunneling. The single charge
excitation energy is a convolution of the Coulomb potential with a quantum
geometric tensor of the TBG bands. The neutral excitations are (high-symmetry
group) magnons, and their dispersion is analytically calculated in terms of the
form factors of the active bands in TBG. The two-charge excitation energy and
wavefunctions are also obtained, and a sufficient condition on the graphene
eigenstates for obtaining a Cooper-pair from Coulomb interactions is obtained.
For the actual TBG bands at the first magic angle, we can analytically show
that the Cooper pair binding energy is zero in all such projected Coulomb
models, implying that either phonons and/or non-zero kinetic energy are needed
for superconductivity. Since the [Phys. Rev. Lett. 122, 246401] showed that the
kinetic energy bounds on the superexchange energy are less in Coulomb
units, the phonon mechanism becomes then very likely. If nonetheless the
superconductivity is due to kinetic terms which render the bands non-flat, one
prediction of our theory is that the highest would not occur at the
highest DOS.Comment: references adde
TBG VI: An Exact Diagonalization Study of Twisted Bilayer Graphene at Non-Zero Integer Fillings
Using exact diagonalization, we study the projected Hamiltonian with Coulomb
interaction in the 8 flat bands of first magic angle twisted bilayer graphene.
Employing the U(4) (U(4)U(4)) symmetries in the nonchiral (chiral) flat
band limit, we reduced the Hilbert space to an extent which allows for study
around fillings. In the first chiral limit
where () is the () stacking hopping, we find that the
ground-states at these fillings are extremely well-described by Slater
determinants in a so-called Chern basis, and the exactly solvable charge
excitations found in [arXiv:2009.14200] are the lowest charge excitations up to
system sizes (for restricted Hilbert space) in the chiral-flat
limit. We also find that the Flat Metric Condition (FMC) used in
[arXiv:2009.11301,2009.11872,2009.12376,2009.13530,2009.14200] for obtaining a
series of exact ground-states and excitations holds in a large parameter space.
For , the ground state is the spin and valley polarized Chern insulator
with at (0.3) with (without) FMC. At
, we can only numerically access the valley polarized sector, and we
find a spin ferromagnetic phase when where is
the factor of rescaling of the actual TBG bandwidth, and a spin singlet phase
otherwise, confirming the perturbative calculation [arXiv:2009.13530]. The
analytic FMC ground state is, however, predicted in the intervalley coherent
sector which we cannot access [arXiv:2009.13530]. For with/without
FMC, when is large, the finite-size gap to the neutral
excitations vanishes, leading to phase transitions. Further analysis of the
ground state momentum sectors at suggests a competition among
(nematic) metal, momentum () stripe and -CDW orders at large
.Comment: 21+23 pages, 13+15 figure
Electron-Electron Interactions in Twisted Bilayer Graphene
In this paper we use exact diagonalization studies to explore the phase diagram of twisted bilayer graphene at the first magic angle. We show the physics in the flat band limit is closely related to the physics at a realistic bandwidth, inducing spin and valley polarization. We also show that the physics at realistic parameters can be described as a perturbation on the chiral limit with a large symmetry which simplifies calculation greatly. This result may be used in the future to more easily calculate the many-body response as a function of single-body properties even when the interaction between electrons is strong
Electron-Electron Interactions in Twisted Bilayer Graphene
In this paper we use exact diagonalization studies to explore the phase diagram of twisted bilayer graphene at the first magic angle. We show the physics in the flat band limit is closely related to the physics at a realistic bandwidth, inducing spin and valley polarization. We also show that the physics at realistic parameters can be described as a perturbation on the chiral limit with a large symmetry which simplifies calculation greatly. This result may be used in the future to more easily calculate the many-body response as a function of single-body properties even when the interaction between electrons is strong