Global Convergence of Over-parameterized Deep Equilibrium Models

Lin, Zhouchen; Ling, Zenan; Wang, Qiuhao; Xie, Xingyu; Zhang, Zongpeng

Global Convergence of Over-parameterized Deep Equilibrium Models

Authors: Zhouchen Lin
Zenan Ling
Qiuhao Wang
Xingyu Xie
Zongpeng Zhang
Publication date: 27 May 2022
Publisher

Abstract

A deep equilibrium model (DEQ) is implicitly defined through an equilibrium point of an infinite-depth weight-tied model with an input-injection. Instead of infinite computations, it solves an equilibrium point directly with root-finding and computes gradients with implicit differentiation. The training dynamics of over-parameterized DEQs are investigated in this study. By supposing a condition on the initial equilibrium point, we show that the unique equilibrium point always exists during the training process, and the gradient descent is proved to converge to a globally optimal solution at a linear convergence rate for the quadratic loss function. In order to show that the required initial condition is satisfied via mild over-parameterization, we perform a fine-grained analysis on random DEQs. We propose a novel probabilistic framework to overcome the technical difficulty in the non-asymptotic analysis of infinite-depth weight-tied models

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2205.13814

Last time updated on 14/08/2022