Search CORE

46,893 research outputs found

Appearance-invariant place recognition by adversarially learning disentangled representation  

Author: Arjovsky
Arroyo
Bay
Bengio
Bingham
Cao Qin
Carlevaris-Bianco
Chen
Chen
Dermot Kerr
Donahue
Glover
Guanghao Lv
Gulrajani
Gálvez-López
Huang
Kenshimov
Lample
Latif
Liu
Liu
Lowry
Maddern
Makhzani
McManus
Michael J. Swain Ballard
Milford
Mur-Artal
Naseer
Naseer
Neubert
Odena
Oliva
Sonya Coleman
Sünderhauf
Sünderhauf
Taigman
Valgren
Wulfmeier
Wulfmeier
Yan Liu
Yunzhou Zhang
Publication venue: 'Elsevier BV'
Publication date: 30/09/2020
Field of study

Crossref

Ulster University's Research Portal

Clue: Cross-modal Coherence Modeling for Caption Generation

Author: Alikhani Malihe
Li Shengjie
Sharma Piyush
Soricut Radu
Stone Matthew
Publication venue
Publication date: 02/05/2020
Field of study

We use coherence relations inspired by computational models of discourse to study the information needs and goals of image captioning. Using an annotation protocol specifically devised for capturing image--caption coherence relations, we annotate 10,000 instances from publicly-available image--caption pairs. We introduce a new task for learning inferences in imagery and text, coherence relation prediction, and show that these coherence annotations can be exploited to learn relation classifiers as an intermediary step, and also train coherence-aware, controllable image captioning models. The results show a dramatic improvement in the consistency and quality of the generated captions with respect to information needs specified via coherence relations.Comment: Accepted as a long paper to ACL 202

arXiv.org e-Print Archive