We explore element-wise convex combinations of two permutation-aligned neural
network parameter vectors Θ_A and Θ_B of size d. We conduct
extensive experiments examining various distributions of such model
combinations, parametrized by elements of the hypercube [0,1]^d and its
vicinity. Our findings reveal that broad regions of the hypercube form surfaces
of low loss, indicating that the notion of linear mode connectivity
extends to a more general phenomenon, which we call mode combinability.
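As a minimal sketch of this combination scheme (the names combine, theta_a, theta_b, and lam below are ours, purely illustrative, not from the paper):

```python
import numpy as np

def combine(theta_a: np.ndarray, theta_b: np.ndarray, lam: np.ndarray) -> np.ndarray:
    """Element-wise convex combination of two permutation-aligned
    parameter vectors of size d, with coefficients lam in [0, 1]^d.

    Setting lam to a constant vector c recovers ordinary linear
    interpolation c * theta_a + (1 - c) * theta_b; varying lam
    per coordinate explores the full hypercube.
    """
    return lam * theta_a + (1.0 - lam) * theta_b

# Example: one combination drawn uniformly from the hypercube [0, 1]^d.
d = 1_000_000
theta_a, theta_b = np.random.randn(d), np.random.randn(d)
lam = np.random.uniform(size=d)
theta = combine(theta_a, theta_b, lam)
```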
We also make several novel observations regarding linear mode connectivity and model
re-basin. We demonstrate a transitivity property: two models re-based to a
common third model are themselves linearly mode connected.
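In symbols (our paraphrase, using the standard loss-barrier notion of linear mode connectivity, not necessarily the paper's exact formalization): if π_A and π_B are the permutations that re-base Θ_A and Θ_B to a common third model Θ_C, then for the loss L,

$$
\mathcal{L}\bigl(\lambda\,\pi_A(\Theta_A) + (1-\lambda)\,\pi_B(\Theta_B)\bigr)
\;\lesssim\;
\max\bigl\{\mathcal{L}(\Theta_A),\, \mathcal{L}(\Theta_B)\bigr\}
\quad \text{for all } \lambda \in [0,1].
$$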
We also demonstrate a robustness property: even under significant perturbations of
the neuron matchings, the resulting combinations remain working models. Moreover, we analyze the
functional and weight similarity of model combinations and show that such
combinations are non-vacuous, in the sense that there are significant functional
differences between the resulting models.
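One simple way such functional differences could be quantified (an illustrative metric of ours, not necessarily the one used in the paper) is the prediction disagreement rate between two model combinations on a shared test set:

```python
import numpy as np

def prediction_disagreement(logits_a: np.ndarray, logits_b: np.ndarray) -> float:
    """Fraction of inputs on which two models' predicted classes differ.

    logits_a, logits_b: arrays of shape (n_examples, n_classes) produced
    by two model combinations on the same test set. A substantially
    nonzero rate indicates the combinations are functionally distinct,
    i.e. non-vacuous.
    """
    return float(np.mean(logits_a.argmax(axis=1) != logits_b.argmax(axis=1)))
```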