
3,676 research outputs found

Learning Fashion Compatibility with Bidirectional LSTMs

Author: Davis Larry S.
Han Xintong
Jiang Yu-Gang
Wu Zuxuan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 18/07/2017

The ubiquity of online fashion shopping demands effective recommendation services for customers. In this paper, we study two types of fashion recommendation: (i) suggesting an item that matches existing components in a set to form a stylish outfit (a collection of fashion items), and (ii) generating an outfit with multimodal (images/text) specifications from a user. To this end, we propose to jointly learn a visual-semantic embedding and the compatibility relationships among fashion items in an end-to-end fashion. More specifically, we consider a fashion outfit to be a sequence (usually from top to bottom and then accessories) and each item in the outfit as a time step. Given the fashion items in an outfit, we train a bidirectional LSTM (Bi-LSTM) model to sequentially predict the next item conditioned on previous ones to learn their compatibility relationships. Further, we learn a visual-semantic space by regressing image features to their semantic representations aiming to inject attribute and category information as a regularization for training the LSTM. The trained network can not only perform the aforementioned recommendations effectively but also predict the compatibility of a given outfit. We conduct extensive experiments on our newly collected Polyvore dataset, and the results provide strong qualitative and quantitative evidence that our framework outperforms alternative methods. Comment: ACM MM 1

arXiv.org e-Print Archive

Crossref

Towards a multimedia knowledge-based agent with social competence and human interaction capabilities

Author: André E
Blat J
Dasiopoulou S
Domínguez M
Kamateri E
Kompatsiaris I
Lamel L
Lingenfelser F
Llorach G
Mehlmann G
Mille S
Minker W
Pragst L
Stam A
Stellingwerff L
Sukno F
Ultes S
Vieru B
Vrochidis S
Wanner L
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2016

We present work in progress on an intelligent embodied conversational agent for the basic care and healthcare domain. In contrast to most existing agents, the presented agent is intended to have the linguistic, cultural, social and emotional competence needed to interact with the elderly and migrants. It is composed of an ontology-based and reasoning-driven dialogue manager, multimodal communication analysis and generation modules, and a search engine for retrieving from the web the multimedia background content needed to conduct a conversation on a given topic. The presented work is funded by the European Commission under the contract number H2020-645012-RIA

OPUS Augsburg

Crossref

UPF Digital Repository

CUED - Cambridge University Engineering Department

Proceedings of the international conference on cooperative multimodal communication CMC/95, Eindhoven, May 24-26, 1995: proceedings

Publication venue: DENK: Samenwerkingsorgaan Brabantse Universiteiten
Publication date: 01/01/1995

Pure OAI Repository