27,718 research outputs found
A framework for evaluating automatic image annotation algorithms
Several Automatic Image Annotation (AIA) algorithms have been introduced recently and found to outperform previous models. However, each of them has been evaluated using different descriptors, different collections or parts of collections, or "easy" settings, which renders their results non-comparable. We show that collection-specific properties, rather than the actual models, are responsible for the high reported performance measures. In this paper we introduce a framework for the evaluation of image annotation models, which we use to evaluate two state-of-the-art AIA algorithms. Our findings reveal that a simple Support Vector Machine (SVM) approach using global MPEG-7 features outperforms state-of-the-art AIA models across several collection settings. These models appear to depend heavily on the set of features and the data used, and it is easy to exploit collection-specific properties, such as tag popularity (especially in the commonly used Corel 5K dataset), and still achieve good performance.
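The SVM baseline the abstract describes can be sketched as one binary classifier per tag over global image descriptors. This is a minimal illustration with scikit-learn, assuming synthetic feature vectors and a hypothetical tag set; the actual descriptors and dataset splits come from the paper's framework, not from this sketch.

```python
import numpy as np
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC

# Hypothetical data: rows are global image descriptors (stand-ins for
# MPEG-7-style colour/texture vectors); Y is a binary tag-indicator matrix.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(200, 64))                 # 200 images, 64-dim features
Y_train = (rng.random((200, 5)) < 0.3).astype(int)   # 5 candidate tags
X_test = rng.normal(size=(10, 64))

# One binary SVM per tag: an image receives every tag whose classifier fires.
clf = OneVsRestClassifier(SVC(kernel="rbf", probability=True))
clf.fit(X_train, Y_train)

# Annotate each test image with its top-ranked tags by per-tag probability.
scores = clf.predict_proba(X_test)                   # shape (10, 5)
top_tags = scores.argsort(axis=1)[:, ::-1][:, :2]    # 2 best tags per image
print(top_tags.shape)
```

Ranking tags by score (rather than thresholding) matches the common AIA evaluation protocol of annotating each image with a fixed number of top tags.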
CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning
To accelerate software development, much research has been performed to help
people understand and reuse the huge amount of available code resources. Two
important tasks have been widely studied: code retrieval, which aims to
retrieve code snippets relevant to a given natural language query from a code
base, and code annotation, where the goal is to annotate a code snippet with a
natural language description. Despite their advancement in recent years, the
two tasks are mostly explored separately. In this work, we investigate a novel
perspective of Code annotation for Code retrieval (hence called "CoaCor"),
where a code annotation model is trained to generate a natural language
annotation that can represent the semantic meaning of a given code snippet and
can be leveraged by a code retrieval model to better distinguish relevant code
snippets from others. To this end, we propose an effective framework based on
reinforcement learning, which explicitly encourages the code annotation model
to generate annotations that can be used for the retrieval task. Through
extensive experiments, we show that code annotations generated by our framework
are much more detailed and more useful for code retrieval, and they can further
improve the performance of existing code retrieval models significantly.
Comment: 10 pages, 2 figures. Accepted by The Web Conference (WWW) 201
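The reinforcement-learning idea above can be sketched in miniature: the annotation model samples an annotation, a retrieval score for the source snippet serves as the reward, and a REINFORCE-style loss scales the annotation's log-likelihood by that reward. Everything below (the token-overlap retrieval score, the sampled tokens, the baseline value) is an illustrative assumption, not the paper's actual model or reward.

```python
import math

def retrieval_score(annotation_tokens, snippet_tokens):
    """Toy retrieval relevance: token overlap between annotation and code."""
    overlap = len(set(annotation_tokens) & set(snippet_tokens))
    return overlap / max(len(set(annotation_tokens)), 1)

def reinforce_loss(log_probs, reward, baseline=0.0):
    """REINFORCE: scale the sequence log-likelihood by (reward - baseline)."""
    return -(reward - baseline) * sum(log_probs)

# One hypothetical training step for a sampled annotation.
snippet = ["def", "sort", "items", "return", "sorted", "items"]
sampled_annotation = ["sort", "a", "list", "of", "items"]
log_probs = [math.log(0.4), math.log(0.3), math.log(0.5),
             math.log(0.2), math.log(0.6)]  # from the annotation model

reward = retrieval_score(sampled_annotation, snippet)      # 2/5 = 0.4
loss = reinforce_loss(log_probs, reward, baseline=0.2)
print(round(reward, 3), round(loss, 3))
```

Minimizing this loss raises the probability of annotations whose reward beats the baseline, which is the mechanism that "explicitly encourages" retrieval-useful annotations.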
STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset
In recent years, automatic generation of image descriptions (captions), that
is, image captioning, has attracted a great deal of attention. In this paper,
we particularly consider generating Japanese captions for images. Since most
available caption datasets have been constructed for the English language, there
are few datasets for Japanese. To tackle this problem, we construct a
large-scale Japanese image caption dataset based on images from MS-COCO, which
is called STAIR Captions. STAIR Captions consists of 820,310 Japanese captions
for 164,062 images. In the experiment, we show that a neural network trained
using STAIR Captions can generate more natural and better Japanese captions,
compared to those generated using English-Japanese machine translation after
generating English captions.
Comment: Accepted as ACL2017 short paper. 5 pages.
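Since STAIR Captions is built on MS-COCO images, a loader for it can follow the standard COCO annotation layout (an "images" list plus an "annotations" list keyed by image id). The inline sample record below is a made-up illustration of that layout, not actual STAIR Captions data.

```python
# Minimal sketch of grouping COCO-style caption annotations by image id.
# The sample dict stands in for a parsed STAIR Captions JSON file.
sample = {
    "images": [{"id": 1, "file_name": "COCO_val2014_000000000001.jpg"}],
    "annotations": [
        {"id": 10, "image_id": 1, "caption": "犬が芝生の上を走っている。"},
    ],
}

def captions_by_image(coco_dict):
    """Group caption strings by image id, COCO-style."""
    grouped = {}
    for ann in coco_dict["annotations"]:
        grouped.setdefault(ann["image_id"], []).append(ann["caption"])
    return grouped

grouped = captions_by_image(sample)
print(grouped[1])  # ['犬が芝生の上を走っている。'] ("A dog is running on the lawn.")
```

With roughly five captions per image (820,310 captions for 164,062 images), each grouped list would typically hold five Japanese caption strings.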