Leveraging Multimodal Shapley Values to Address Multimodal Collapse and Improve Fine-Grained E-Commerce Product Classification

Obayemi, Ajibola; Nguyen, Khuong An

journal article

oai:pure.royalholloway.ac.uk:openaire/d8544deb-8e72-4f83-90ce-e13fd0541f1a

Leveraging Multimodal Shapley Values to Address Multimodal Collapse and Improve Fine-Grained E-Commerce Product Classification

Authors: Ajibola Obayemi
Khuong An Nguyen
Publication date: 4 March 2025
Publisher: IEEE
Doi

Abstract

Multimodal models can experience multimodal collapse, leading to sub-optimal performance on tasks like fine-grained e-commerce product classification. To address this, we introduce an approach that leverages multimodal Shapley values (MM-SHAP) to quantify the individual contributions of each modality to the model's predictions. By employing weighted stacked ensembles of unimodal and multimodal models, with weights derived from these Shapley values (MM-SHAP), we enhance the overall performance and mitigate the effects of multimodal collapse. Using this approach we improve previous results (F1-score) from 0.67 to 0.79

contributionToPeriodical

Similar works

Full text

Open in the Core reader

Download PDF

Royal Holloway - Pure

oai:pure.royalholloway.ac.uk:o...

Last time updated on 18/10/2025

This paper was published in Royal Holloway - Pure.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.

Licence: info:eu-repo/semantics/openAccess