Search CORE

5,668 research outputs found

Does Multimodality Help Human and Machine for Translation and Image Captioning?

Author: Aransa Walid
Barrault Loïc
Bougares Fethi
Caglayan Ozan
García-Martínez Mercedes
Masana Marc
van de Weijer Joost
Wang Yaxing
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2016
Field of study

This paper presents the systems developed by LIUM and CVC for the WMT16 Multimodal Machine Translation challenge. We explored various comparative methods, namely phrase-based systems and attentional recurrent neural networks models trained using monomodal or multimodal data. We also performed a human evaluation in order to estimate the usefulness of multimodal data for human machine translation and image description generation. Our systems obtained the best results for both tasks according to the automatic evaluation metrics BLEU and METEOR.Comment: 7 pages, 2 figures, v4: Small clarification in section 4 title and conten

arXiv.org e-Print Archive

Crossref

Linking Image and Text with 2-Way Nets

Author: Eisenschtat Aviv
Wolf Lior
Publication venue
Publication date: 10/02/2017
Field of study

Linking two data sources is a basic building block in numerous computer vision problems. Canonical Correlation Analysis (CCA) achieves this by utilizing a linear optimizer in order to maximize the correlation between the two views. Recent work makes use of non-linear models, including deep learning techniques, that optimize the CCA loss in some feature space. In this paper, we introduce a novel, bi-directional neural network architecture for the task of matching vectors from two data sources. Our approach employs two tied neural network channels that project the two views into a common, maximally correlated space using the Euclidean loss. We show a direct link between the correlation-based loss and Euclidean loss, enabling the use of Euclidean loss for correlation maximization. To overcome common Euclidean regression optimization problems, we modify well-known techniques to our problem, including batch normalization and dropout. We show state of the art results on a number of computer vision matching tasks including MNIST image matching and sentence-image matching on the Flickr8k, Flickr30k and COCO datasets.Comment: 14 pages, 2 figures, 6 table

arXiv.org e-Print Archive

Crossref

Particle swarm optimization with sequential niche technique for dynamic finite element model updating

Author: Abdel-Ghaffar
Adeli
Ahmadian
Allemang
AS/NZS
Bakir
Beasley
Begambre
Bodeux
Brownjohn
Caicedo
Chang
Cho
Clerc
Deb
Ewins
Friswell
Fuggini
García-Palencia
Gent
Glisic
Goldberg
Hampshire
Hu
Hua
Jaishi
Jaishi
Jiang
Jiang
Jiang
Kennedy
Knowles
Konstantinos
Lee
Lozano-Galant
Marwala
Mottershead
Mottershead
Möller
Nazmy
Osornio-Rios
Park
Pedersen
Perera
Perera
Perera
Perera
Pintér
Plevris
Price
Proakis
Putha
Qiao
Raich
Raich
Ren
Saada
SAP2000
Shafahi
Tao
Teughels
Tikhonov
Titurus
Trelea
Tu
Van Overschee
Van Overschee
Walsh
Wilson
Xiang
Zhang
Zheng
Zhou
Zivanovic
Zárate
Publication venue: 'Wiley'
Publication date: 09/09/2014
Field of study

Peer reviewedPostprin

Aberdeen University Research

Crossref