EqCo: Equivalent Rules for Self-supervised Contrastive Learning

Abstract

In this paper, we propose a method, named EqCo (Equivalent Rules for Contrastive Learning), to make self-supervised learning insensitive to the number of negative samples in InfoNCE-based contrastive learning frameworks. Inspired by the InfoMax principle, we point out that the margin term in the contrastive loss needs to be adaptively scaled according to the number of negative pairs in order to maintain a steady mutual information bound and gradient magnitude. EqCo bridges the performance gap across a wide range of negative sample sizes, so that we can use only a few negative pairs (e.g., 16 per query) to perform self-supervised contrastive training on large-scale vision datasets like ImageNet, with almost no accuracy drop. This is in sharp contrast to the widely used large-batch training and memory bank mechanisms in current practice. Equipped with EqCo, our simplified MoCo (SiMo) achieves accuracy comparable to MoCo v2 on ImageNet (linear evaluation protocol) while involving only 4 negative pairs per query instead of 65536, suggesting that a large quantity of negative samples might not be a critical factor in the InfoNCE loss.
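To make the abstract's central idea concrete, below is a minimal PyTorch sketch of an InfoNCE loss with an adaptive margin on the negative term. The specific form used here, adding log(alpha / K) to each negative logit (equivalently, multiplying the summed negative term in the softmax denominator by alpha / K, where alpha is an assumed "equivalent" number of negatives such as 65536), is an illustrative assumption inferred from the abstract, not a verbatim reproduction of the paper's formulation.

```python
import math

import torch
import torch.nn.functional as F


def eqco_infonce_loss(q, k_pos, k_neg, tau=0.1, alpha=65536):
    """InfoNCE loss with an EqCo-style adaptive margin (illustrative sketch).

    Args:
        q:     (B, D) L2-normalized query embeddings.
        k_pos: (B, D) L2-normalized positive key embeddings.
        k_neg: (K, D) L2-normalized negative key embeddings; K may be small.
        tau:   softmax temperature.
        alpha: hypothetical "equivalent" number of negatives; the margin
               below rescales the K actual negatives so the loss behaves
               as if alpha negatives were present.
    """
    K = k_neg.shape[0]
    l_pos = (q * k_pos).sum(dim=1, keepdim=True) / tau  # (B, 1) positive logits
    l_neg = (q @ k_neg.t()) / tau                       # (B, K) negative logits
    # Adding log(alpha / K) to every negative logit multiplies the summed
    # negative term in the softmax denominator by alpha / K, which is the
    # assumed mechanism for keeping the mutual information bound and the
    # gradient magnitude steady as K varies.
    margin = math.log(alpha / K)
    logits = torch.cat([l_pos, l_neg + margin], dim=1)
    labels = torch.zeros(q.shape[0], dtype=torch.long, device=q.device)
    return F.cross_entropy(logits, labels)
```

Under this reading, training with, say, K = 16 actual negatives and alpha = 65536 is intended to yield the same loss scale as vanilla InfoNCE with 65536 negatives, matching the abstract's claim that a handful of negative pairs can suffice.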
