When Can Nonconvex Optimization Problems be Solved with Gradient Descent? A Few Case Studies
Gradient descent and related algorithms are ubiquitously used to solve optimization problems arising in machine learning and signal processing. In many cases these problems are nonconvex, yet such simple algorithms are still effective. In an attempt to better understand this phenomenon, we study a number of nonconvex problems, proving that they can be solved efficiently with gradient descent. We first consider complete, orthogonal dictionary learning and present a geometric analysis that allows us to obtain efficient convergence rates for gradient descent that hold with high probability. We also show that similar geometric structure is present in other nonconvex problems, such as generalized phase retrieval.
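As a concrete illustration of the kind of problem studied here, the following minimal numerical sketch (not the analysis in the thesis; the dimensions, step size, and spectral initialization are illustrative choices) runs plain gradient descent on the standard nonconvex least-squares objective for real-valued generalized phase retrieval and recovers the signal up to a global sign.

# Minimal sketch: gradient descent on f(z) = (1/4m) * sum_i ((a_i^T z)^2 - y_i)^2,
# the nonconvex least-squares objective for real-valued phase retrieval.
import numpy as np

rng = np.random.default_rng(0)
n, m = 50, 400                      # signal dimension, number of measurements
x = rng.standard_normal(n)          # ground-truth signal
A = rng.standard_normal((m, n))     # measurement vectors a_i as rows
y = (A @ x) ** 2                    # phaseless measurements

# Spectral initialization: leading eigenvector of (1/m) * sum_i y_i a_i a_i^T,
# rescaled to the estimated signal norm.
Y = (A * y[:, None]).T @ A / m
eigvals, eigvecs = np.linalg.eigh(Y)
z = eigvecs[:, -1] * np.sqrt(y.mean())

step = 0.1 / y.mean()               # step size scaled by the estimated ||x||^2
for _ in range(500):
    Az = A @ z
    grad = (A * ((Az ** 2 - y) * Az)[:, None]).mean(axis=0)
    z -= step * grad

# Distance to the truth up to the global sign ambiguity; should be near zero.
print(min(np.linalg.norm(z - x), np.linalg.norm(z + x)))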
Turning next to neural networks, we also calculate conditions on certain classes of networks under which signals and gradients propagate through the network in a stable manner during the initial stages of training. Initialization schemes derived from these calculations allow recurrent networks to be trained on long-sequence tasks, and in the case of networks with low-precision activation functions they make explicit a tradeoff between the reduction in precision and the maximal depth of a model that can be trained with gradient descent.
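The following toy experiment (an illustrative sketch, not the calculation in the thesis; the network size, gain values, and number of steps are arbitrary) shows the kind of behavior such propagation conditions control: iterating a vanilla tanh recurrent update with differently scaled random weight matrices makes the hidden activations vanish, saturate, or stay in a stable range over many time steps.

# Toy signal-propagation check for a vanilla tanh RNN with zero input.
import numpy as np

def final_hidden_norm(W, steps=300, seed=1):
    """Iterate h <- tanh(W h) and return the hidden-state norm after `steps`."""
    rng = np.random.default_rng(seed)
    h = rng.standard_normal(W.shape[0]) * 0.1
    for _ in range(steps):
        h = np.tanh(W @ h)
    return np.linalg.norm(h)

n = 256
rng = np.random.default_rng(0)
inits = {
    "small gain": rng.standard_normal((n, n)) * 0.5 / np.sqrt(n),  # signal dies out
    "large gain": rng.standard_normal((n, n)) * 3.0 / np.sqrt(n),  # activations saturate
    "orthogonal": np.linalg.qr(rng.standard_normal((n, n)))[0],    # near-critical, stable
}
for name, W in inits.items():
    print(name, final_hidden_norm(W))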
We finally consider manifold classification with a deep feed-forward neural network, for a particularly simple configuration of the manifolds. We provide an end-to-end analysis of the training process, proving that under certain conditions on the architectural hyperparameters of the network, it can successfully classify any point on the manifolds with high probability in a timely manner, given a sufficient number of independent samples from the manifolds. Our analysis relates the depth and width of the network to its fitting capacity and statistical regularity, respectively, in the early stages of training.
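A toy instance of this setting (illustrative only; the choice of manifolds, width, learning rate, and sample sizes is hypothetical and not the configuration analyzed in the thesis): two concentric circles stand in for simple one-dimensional manifolds, and a one-hidden-layer ReLU network trained with plain gradient descent learns to classify independent samples from them.

# Classify points from two concentric circles with a one-hidden-layer ReLU net.
import numpy as np

rng = np.random.default_rng(0)

def sample_circle(n, radius, label):
    theta = rng.uniform(0, 2 * np.pi, n)
    pts = radius * np.stack([np.cos(theta), np.sin(theta)], axis=1)
    return pts, np.full(n, label, dtype=float)

X0, y0 = sample_circle(200, 1.0, 0.0)
X1, y1 = sample_circle(200, 2.0, 1.0)
X, y = np.vstack([X0, X1]), np.concatenate([y0, y1])

width, lr = 64, 0.5
W1 = rng.standard_normal((2, width)) / np.sqrt(2)
b1 = np.zeros(width)
w2 = rng.standard_normal(width) / np.sqrt(width)
b2 = 0.0

for _ in range(5000):
    Z = X @ W1 + b1                        # hidden pre-activations
    H = np.maximum(Z, 0)                   # ReLU features
    p = 1 / (1 + np.exp(-(H @ w2 + b2)))   # predicted class probabilities
    d_logit = (p - y) / len(y)             # gradient of mean cross-entropy w.r.t. logits
    dZ = np.outer(d_logit, w2) * (Z > 0)   # backprop through the ReLU layer
    W1 -= lr * (X.T @ dZ)
    b1 -= lr * dZ.sum(axis=0)
    w2 -= lr * (H.T @ d_logit)
    b2 -= lr * d_logit.sum()

print("train accuracy:", ((p > 0.5) == (y > 0.5)).mean())  # should approach 1.0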
On quantum backpropagation, information reuse, and cheating measurement collapse
The success of modern deep learning hinges on the ability to train neural networks at scale. Through clever reuse of intermediate information, backpropagation facilitates training through gradient computation at a total cost roughly proportional to running the function, rather than incurring an additional factor proportional to the number of parameters, which can now be in the trillions. Naively, one expects that quantum measurement collapse entirely rules out the reuse of quantum information as in backpropagation. But recent developments in shadow tomography, which assumes access to multiple copies of a quantum state, have challenged that notion. Here, we investigate whether parameterized quantum models can train as efficiently as classical neural networks. We show that achieving backpropagation scaling is impossible without access to multiple copies of a state. With this added ability, we introduce an algorithm with foundations in shadow tomography that matches backpropagation scaling in quantum resources while reducing classical auxiliary computational costs to open problems in shadow tomography. These results highlight the nuance of reusing quantum information for practical purposes and clarify the unique difficulties in training large quantum models, which could alter the course of quantum machine learning.
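To make the scaling gap concrete, the sketch below (a classical simulation of a toy single-qubit circuit, not the shadow-tomography-based algorithm of the paper) estimates gradients with the standard parameter-shift rule, which needs two extra circuit evaluations per parameter; classical backpropagation, by reusing intermediate state, obtains every component at roughly the cost of one forward and one backward pass, and it is this reuse that measurement collapse naively forbids.

# Parameter-shift gradients for a toy parameterized single-qubit circuit:
# cost scales as 2 * (number of parameters) circuit evaluations.
import numpy as np

def rx(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -1j * s], [-1j * s, c]])

def rz(theta):
    return np.array([[np.exp(-1j * theta / 2), 0], [0, np.exp(1j * theta / 2)]])

def expectation_z(params):
    """Apply alternating RX/RZ rotations to |0> and measure <Z>."""
    state = np.array([1.0 + 0j, 0.0])
    for i, p in enumerate(params):
        state = (rx(p) if i % 2 == 0 else rz(p)) @ state
    pauli_z = np.array([[1.0, 0.0], [0.0, -1.0]])
    return float(np.real(state.conj() @ pauli_z @ state))

def parameter_shift_grad(params):
    """Exact gradient via f'(t) = (f(t + pi/2) - f(t - pi/2)) / 2, per parameter."""
    grad = np.zeros_like(params)
    for i in range(len(params)):
        shift = np.zeros_like(params)
        shift[i] = np.pi / 2
        grad[i] = 0.5 * (expectation_z(params + shift) - expectation_z(params - shift))
    return grad

params = np.random.default_rng(0).uniform(0, 2 * np.pi, size=6)
print(parameter_shift_grad(params))  # 12 circuit evaluations for 6 parameters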
Dynamics of magnetization at infinite temperature in a Heisenberg spin chain
Understanding universal aspects of quantum dynamics is an unresolved problem in statistical mechanics. In particular, the spin dynamics of the 1D Heisenberg model were conjectured to belong to the Kardar-Parisi-Zhang (KPZ) universality class based on the scaling of the infinite-temperature spin-spin correlation function. In a chain of 46 superconducting qubits, we study the probability distribution, P(M), of the magnetization M transferred across the chain's center. The first two moments of P(M) show superdiffusive behavior, a hallmark of KPZ universality. However, the third and fourth moments rule out the KPZ conjecture and allow for evaluating other theories. Our results highlight the importance of studying higher moments in determining dynamic universality classes and provide key insights into universal behavior in quantum systems.
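For reference, the sketch below (with synthetic placeholder numbers, not the experimental data) computes the standardized moments of a sample of transferred-magnetization values: the growth of the variance with evolution time (expected to scale as t^(2/3) for KPZ-type superdiffusion, i.e. dynamical exponent z = 3/2) signals superdiffusion, while the skewness and excess kurtosis are the higher moments used to discriminate between candidate universality classes.

# Standardized moments of a transferred-magnetization sample.
import numpy as np

def standardized_moments(samples):
    """Return mean, variance, skewness, and excess kurtosis of the samples."""
    mean = samples.mean()
    centered = samples - mean
    var = centered.var()
    skewness = (centered ** 3).mean() / var ** 1.5
    excess_kurtosis = (centered ** 4).mean() / var ** 2 - 3.0
    return mean, var, skewness, excess_kurtosis

# Placeholder Gaussian samples standing in for measured values of M.
rng = np.random.default_rng(0)
samples = rng.normal(loc=0.0, scale=2.0, size=10_000)
print(standardized_moments(samples))  # Gaussian: skewness and excess kurtosis near 0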