Multimodal sequential fashion attribute prediction

Abstract

We address multimodal product attribute prediction for fashion items based on product images and titles. The product attributes, such as type, sub-type, cut, or fit, form a chain, with earlier attribute values constraining the values of subsequent attributes. We propose to address this task with a sequential prediction model that learns to capture the dependencies between the different attribute values in the chain. Our experiments on three product datasets show that the sequential model outperforms two non-sequential baselines on all experimental datasets. Compared to the other models, the sequential model is also better able to generate attribute chains not seen during training. We also measure the contributions of the image and textual inputs and show that while text-only models always outperform image-only models, only the multimodal sequential model combining both image and text improves over the text-only model on all experimental datasets.
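To make the setup concrete, the following is a minimal sketch of the kind of multimodal sequential model the abstract describes: fused image and title features condition a recurrent decoder that predicts attribute values one chain position at a time. This is an illustrative assumption, not the paper's actual architecture; all module names, dimensions, and the choice of a GRU decoder with teacher forcing are hypothetical.

```python
# A minimal sketch (PyTorch) of a multimodal sequential attribute predictor,
# assuming pre-extracted image and title embeddings. All names and sizes
# here are illustrative, not taken from the paper.
import torch
import torch.nn as nn

class SequentialAttributePredictor(nn.Module):
    def __init__(self, img_dim=2048, txt_dim=768, hidden_dim=512,
                 num_attr_values=1000, emb_dim=128):
        super().__init__()
        # Fuse image and text features into the decoder's initial state
        self.fuse = nn.Linear(img_dim + txt_dim, hidden_dim)
        # Embeddings for previously predicted attribute values
        self.attr_emb = nn.Embedding(num_attr_values, emb_dim)
        # GRU decoder predicts attribute values one chain position at a time,
        # so each prediction can depend on the earlier values in the chain
        self.decoder = nn.GRU(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, num_attr_values)

    def forward(self, img_feat, txt_feat, prev_attrs):
        # img_feat: (B, img_dim), txt_feat: (B, txt_dim)
        # prev_attrs: (B, T) attribute value ids, shifted right (teacher forcing)
        h0 = torch.tanh(self.fuse(torch.cat([img_feat, txt_feat], dim=-1)))
        emb = self.attr_emb(prev_attrs)                  # (B, T, emb_dim)
        out, _ = self.decoder(emb, h0.unsqueeze(0))      # (B, T, hidden_dim)
        return self.out(out)                             # logits over values

# Example: a batch of 4 products, chains of length 3
# (e.g., type -> sub-type -> fit)
model = SequentialAttributePredictor()
logits = model(torch.randn(4, 2048), torch.randn(4, 768),
               torch.randint(0, 1000, (4, 3)))
print(logits.shape)  # torch.Size([4, 3, 1000])
```

In this sketch the chain dependency is carried by the decoder state and the embedding of the previous attribute value; a non-sequential baseline would instead predict every attribute independently from the fused features.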