HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization

de Hemptinne, Jean-Charles; Faney, Thibault; Gallinari, Patrick; Qu, Jingang; Wang, Ze; Yousef, Soleiman

Publication date: 12 March 2023

Abstract

Because of domain shift, machine learning systems typically fail to generalize well to domains that differ from those of the training data; domain generalization (DG) aims to address this problem. Although various DG methods have been developed, most of them lack interpretability and require domain labels, which are unavailable in many real-world scenarios. This paper presents a novel DG method, HMOE: Hypernetwork-based Mixture of Experts (MoE), which does not rely on domain labels and is more interpretable. MoE has proven effective at identifying heterogeneous patterns in data, and in the DG problem this heterogeneity arises precisely from domain shift. HMOE uses hypernetworks that take vectors as input to generate the experts' weights, which allows the experts to share useful meta-knowledge and makes it possible to explore their similarities in a low-dimensional vector space. We compare HMOE with other DG algorithms under a fair and unified benchmark, DomainBed. Our extensive experiments show that HMOE can divide mixed-domain data into distinct clusters that are surprisingly more consistent with human intuition than the original domain labels. Compared to other DG methods, HMOE shows competitive performance and achieves state-of-the-art (SOTA) results in some cases.
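To make the central idea of the abstract concrete, the following is a minimal sketch of a mixture of experts whose expert weights are produced by a shared hypernetwork from low-dimensional vectors. It is not the authors' implementation. Everything beyond what the abstract states is an assumption made for illustration: the one-embedding-per-expert setup, the linear hypernetwork, the linear experts, the softmax gate over the input, all dimensions, and all names (`embeddings`, `hyper`, `gate_w`, `generate_expert`, `forward`). Weights are random rather than trained.

```python
import math
import random

random.seed(0)

# Hypothetical sizes, chosen only for illustration.
IN_DIM, OUT_DIM = 4, 3        # input / output size of each expert
EMB_DIM, N_EXPERTS = 2, 3     # embedding size, number of experts
N_PARAMS = IN_DIM * OUT_DIM + OUT_DIM  # weights + bias of one linear expert


def rand_matrix(rows, cols):
    """Random matrix as a list of rows (stand-in for learned parameters)."""
    return [[random.gauss(0.0, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(v, m):
    """Row vector v (length r) times matrix m (r x c) -> list of length c."""
    return [sum(v[i] * m[i][j] for i in range(len(v))) for j in range(len(m[0]))]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    top = max(z)
    exps = [math.exp(v - top) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


# Assumption: one low-dimensional vector per expert.
embeddings = rand_matrix(N_EXPERTS, EMB_DIM)
# Assumption: a single linear hypernetwork shared by all experts.
hyper = rand_matrix(EMB_DIM, N_PARAMS)
# Assumption: a linear gate that scores each expert from the input.
gate_w = rand_matrix(IN_DIM, N_EXPERTS)


def generate_expert(embedding):
    """Map an embedding vector to the weights and bias of a linear expert."""
    flat = matvec(embedding, hyper)
    weights = [flat[i * OUT_DIM:(i + 1) * OUT_DIM] for i in range(IN_DIM)]
    bias = flat[IN_DIM * OUT_DIM:]
    return weights, bias


def forward(x):
    """Gate-weighted sum of the outputs of the generated experts."""
    gate = softmax(matvec(x, gate_w))
    y = [0.0] * OUT_DIM
    for g, emb in zip(gate, embeddings):
        weights, bias = generate_expert(emb)
        out = [o + b for o, b in zip(matvec(x, weights), bias)]
        y = [acc + g * o for acc, o in zip(y, out)]
    return y, gate


x = [random.gauss(0.0, 1.0) for _ in range(IN_DIM)]
y, gate = forward(x)
```

Because every expert is generated by the same `hyper` matrix, the experts share parameters through it, and each expert is fully identified by its short embedding vector, so experts can be compared in that low-dimensional space. In this sketch the gate vector also indicates which expert an input is mostly routed to, which is one simple way such a model could group unlabeled mixed-domain inputs.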

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2211.08253

Last time updated on 18/12/2022