The softmax loss and its variants are widely used as objectives for embedding
learning, especially in applications like face recognition. However, the intra-
and inter-class objectives in the softmax loss are entangled, so a
well-optimized inter-class objective relaxes the intra-class objective, and
vice versa. In this paper, we propose to dissect the softmax loss into
independent intra- and inter-class objectives (D-Softmax). With D-Softmax as
the objective, we gain a clear understanding of both the intra- and
inter-class objectives, making it straightforward to tune each part to its
best state. Furthermore, we find that the computation of the inter-class
objective is redundant and propose two sampling-based variants of D-Softmax to
reduce the computation cost. When training with regular-scale data, face
verification experiments show that D-Softmax is favorably comparable to
existing losses such as SphereFace and ArcFace. When training with
massive-scale data, experiments show that the fast variants of D-Softmax
significantly accelerate training (e.g., by 64x) with only a minor sacrifice
in performance, outperforming existing softmax acceleration methods in both
performance and
efficiency.

Comment: Accepted to AAAI-2020 (oral presentation).
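For a concrete picture of what dissecting the softmax loss into independent parts could look like, the sketch below separates an intra-class term (pulling the target logit above a fixed constant) from an inter-class term (pushing non-target logits below the same constant), so optimizing one no longer relaxes the other. This is a minimal illustration under assumed choices: the scale s, the constant d, and the exact functional form are not taken from the paper.

```python
# Illustrative sketch only: a decoupled intra/inter-class objective in the
# spirit described by the abstract. The constant d, scale s, and the exact
# functional form are assumptions for illustration, not the paper's definition.
import torch
import torch.nn.functional as F


def decoupled_softmax_loss(embeddings, weights, labels, s=30.0, d=10.0):
    """Compute separate intra- and inter-class terms on cosine logits.

    embeddings: (B, D) sample embeddings
    weights:    (C, D) class weight vectors
    labels:     (B,)   ground-truth class indices
    """
    # Cosine similarities between samples and class centers, scaled by s.
    logits = s * F.linear(F.normalize(embeddings), F.normalize(weights))  # (B, C)

    target = logits.gather(1, labels.unsqueeze(1)).squeeze(1)  # (B,) target logit

    # Intra-class term: push the target logit up against a fixed constant d,
    # rather than against the (entangled) non-target logits.
    intra = F.softplus(d - target)  # = -log(e^target / (e^target + e^d))

    # Inter-class term: push non-target logits down against the same constant,
    # independently of how large the target logit already is.
    mask = F.one_hot(labels, num_classes=logits.size(1)).bool()
    non_target = logits.masked_fill(mask, float("-inf"))
    inter = torch.logsumexp(
        torch.cat([non_target, torch.full_like(target, d).unsqueeze(1)], dim=1),
        dim=1,
    ) - d  # = -log(e^d / (e^d + sum_{j != y} e^logit_j))

    return (intra + inter).mean()
```

The sampling-based fast variants mentioned in the abstract would, in this picture, subsample the non-target classes entering the inter-class term; that part is omitted from the sketch.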