Search CORE

7 research outputs found

Random geometry at an infinite-randomness fixed point

Author: Cowsik Aditya
Mahadevan Aditya
Pandey Akshat
Publication venue
Publication date: 20/04/2023
Field of study

We study the low-energy physics of the critical (2+1)-dimensional random transverse-field Ising model. The one-dimensional version of the model is a paradigmatic example of a system governed by an infinite-randomness fixed point, for which many results on the distributions of observables are known via an asymptotically exact renormalization group (RG) approach. In two dimensions, the same RG rules have been implemented numerically, and demonstrate a flow to infinite randomness. However, analytical understanding of the structure of this RG has remained elusive due to the development of geometrical structure in the graph of interacting spins. To understand the character of the fixed point, we consider the RG flow acting on a joint ensemble of graphs and couplings. We propose that the RG effectively occurs in two stages: (1) randomization of the interaction graph until it belongs to a certain ensemble of random triangulations of the plane, and (2) a flow of the distributions of couplings to infinite randomness while the graph ensemble remains invariant. This picture is substantiated by a numerical RG in which one obtains a steady-state graph degree distribution and subsequently infinite-randomness scaling distributions of the couplings. Both of these aspects of the RG flow can be approximately reproduced in simplified analytical models.Comment: 28 pages, 13 figure

arXiv.org e-Print Archive

Flatter, faster: scaling momentum for optimal speedup of SGD

Author: Can Tankut
Cowsik Aditya
Glorioso Paolo
Publication venue
Publication date: 13/06/2023
Field of study

Commonly used optimization algorithms often show a trade-off between good generalization and fast training times. For instance, stochastic gradient descent (SGD) tends to have good generalization; however, adaptive gradient methods have superior training times. Momentum can help accelerate training with SGD, but so far there has been no principled way to select the momentum hyperparameter. Here we study training dynamics arising from the interplay between SGD with label noise and momentum in the training of overparametrized neural networks. We find that scaling the momentum hyperparameter

1-\beta

with the learning rate to the power of

2/3

maximally accelerates training, without sacrificing generalization. To analytically derive this result we develop an architecture-independent framework, where the main assumption is the existence of a degenerate manifold of global minimizers, as is natural in overparametrized models. Training dynamics display the emergence of two characteristic timescales that are well-separated for generic values of the hyperparameters. The maximum acceleration of training is reached when these two timescales meet, which in turn determines the scaling limit we propose. We confirm our scaling rule for synthetic regression problems (matrix sensing and teacher-student paradigm) and classification for realistic datasets (ResNet-18 on CIFAR10, 6-layer MLP on FashionMNIST), suggesting the robustness of our scaling rule to variations in architectures and datasets.Comment: v2: expanded introduction section, corrected minor typos. v1: 12+13 pages, 3 figure

arXiv.org e-Print Archive

TBG V: Exact Analytic Many-Body Excitations In Twisted Bilayer Graphene Coulomb Hamiltonians: Charge Gap, Goldstone Modes and Absence of Cooper Pairing

Author: Bernevig B. Andrei
Cowsik Aditya
Lian Biao
Regnault Nicolas
Song Zhi-Da
Xie Fang
Publication venue
Publication date: 05/10/2020
Field of study

We find exact analytic expressions for the energies and wavefunctions of the charged and neutral excitations above the exact ground states (at rational filling per unit cell) of projected Coulomb Hamiltonians in twisted bilayer graphene. Our exact expressions are valid for any form of the Coulomb interaction and any form of

AA

and

AB/BA

tunneling. The single charge excitation energy is a convolution of the Coulomb potential with a quantum geometric tensor of the TBG bands. The neutral excitations are (high-symmetry group) magnons, and their dispersion is analytically calculated in terms of the form factors of the active bands in TBG. The two-charge excitation energy and wavefunctions are also obtained, and a sufficient condition on the graphene eigenstates for obtaining a Cooper-pair from Coulomb interactions is obtained. For the actual TBG bands at the first magic angle, we can analytically show that the Cooper pair binding energy is zero in all such projected Coulomb models, implying that either phonons and/or non-zero kinetic energy are needed for superconductivity. Since the [Phys. Rev. Lett. 122, 246401] showed that the kinetic energy bounds on the superexchange energy are less

10^{-3}

in Coulomb units, the phonon mechanism becomes then very likely. If nonetheless the superconductivity is due to kinetic terms which render the bands non-flat, one prediction of our theory is that the highest

T_c

would not occur at the highest DOS.Comment: references adde

arXiv.org e-Print Archive

Princeton University Open Access Repository

Hal-Diderot

TBG VI: An Exact Diagonalization Study of Twisted Bilayer Graphene at Non-Zero Integer Fillings

Author: Bernevig B. Andrei
Cowsik Aditya
Lian Biao
Regnault Nicolas
Song Zhi-Da
Xie Fang
Publication venue
Publication date: 01/10/2020
Field of study

Using exact diagonalization, we study the projected Hamiltonian with Coulomb interaction in the 8 flat bands of first magic angle twisted bilayer graphene. Employing the U(4) (U(4)

\times

U(4)) symmetries in the nonchiral (chiral) flat band limit, we reduced the Hilbert space to an extent which allows for study around

\nu=\pm 3,\pm2,\pm1

fillings. In the first chiral limit

w_0/w_1=0

where

w_0

(

w_1

) is the

AA

(

AB

) stacking hopping, we find that the ground-states at these fillings are extremely well-described by Slater determinants in a so-called Chern basis, and the exactly solvable charge

\pm1

excitations found in [arXiv:2009.14200] are the lowest charge excitations up to system sizes

8\times8

(for restricted Hilbert space) in the chiral-flat limit. We also find that the Flat Metric Condition (FMC) used in [arXiv:2009.11301,2009.11872,2009.12376,2009.13530,2009.14200] for obtaining a series of exact ground-states and excitations holds in a large parameter space. For

\nu=-3

, the ground state is the spin and valley polarized Chern insulator with

\nu_C=\pm1

w_0/w_1\lesssim0.9

(0.3) with (without) FMC. At

\nu=-2

, we can only numerically access the valley polarized sector, and we find a spin ferromagnetic phase when

w_0/w_1\gtrsim0.5t

where

t\in[0,1]

is the factor of rescaling of the actual TBG bandwidth, and a spin singlet phase otherwise, confirming the perturbative calculation [arXiv:2009.13530]. The analytic FMC ground state is, however, predicted in the intervalley coherent sector which we cannot access [arXiv:2009.13530]. For

\nu=-3

with/without FMC, when

w_0/w_1

is large, the finite-size gap

\Delta

to the neutral excitations vanishes, leading to phase transitions. Further analysis of the ground state momentum sectors at

\nu=-3

suggests a competition among (nematic) metal, momentum

M_M

(

\pi

) stripe and

K_M

-CDW orders at large

w_0/w_1

.Comment: 21+23 pages, 13+15 figure

arXiv.org e-Print Archive

Hal-Diderot

Electron-Electron Interactions in Twisted Bilayer Graphene

Author: Cowsik Aditya
Publication venue
Publication date: 24/07/2020
Field of study

In this paper we use exact diagonalization studies to explore the phase diagram of twisted bilayer graphene at the first magic angle. We show the physics in the flat band limit is closely related to the physics at a realistic bandwidth, inducing spin and valley polarization. We also show that the physics at realistic parameters can be described as a perturbation on the chiral limit with a large

U(4) \times U(4)

symmetry which simplifies calculation greatly. This result may be used in the future to more easily calculate the many-body response as a function of single-body properties even when the interaction between electrons is strong

Dataspace

Electron-Electron Interactions in Twisted Bilayer Graphene

Author: Cowsik Aditya
Publication venue
Publication date: 02/10/2020
Field of study

U(4) \times U(4)

Dataspace