Search CORE

21,267 research outputs found

Hierarchically-Attentive RNN for Album Summarization and Storytelling

Author: Bansal Mohit
Berg Tamara L.
Yu Licheng
Publication venue
Publication date: 01/01/2017
Field of study

We address the problem of end-to-end visual storytelling. Given a photo album, our model first selects the most representative (summary) photos, and then composes a natural language story for the album. For this task, we make use of the Visual Storytelling dataset and a model composed of three hierarchically-attentive Recurrent Neural Nets (RNNs) to: encode the album photos, select representative (summary) photos, and compose the story. Automatic and human evaluations show our model achieves better performance on selection, generation, and retrieval than baselines.Comment: To appear at EMNLP-2017 (7 pages

arXiv.org e-Print Archive

Crossref

Covering your face on Facebook.Managing identity through untagging and deletion

Author: Strano Michele
Wattei Jill
Publication venue: School of Information Technology Murdoch University
Publication date: 01/01/2010
Field of study

This paper describes the ways in which Facebook users manage their\ud online identities through untagging and deleting photos to make sure images are\ud interpreted in a desirable way. Using data collected from an online survey and\ud thirty in-depth interviews with American adult Facebook users, the authors argue\ud that identity management can best be understood as the combination of\ud constructive and destructive practices through which users control not only their\ud self-presentation (projection), but also the statements others make about them\ud (suppression)

Elektronisch archivierte Theorie - Sammelpunkt

Person Recognition in Personal Photo Collections

Author: Benenson Rodrigo
Fritz Mario
Oh Seong Joon
Schiele Bernt
Publication venue
Publication date: 01/01/2015
Field of study

Recognising persons in everyday photos presents major challenges (occluded faces, different clothing, locations, etc.) for machine vision. We propose a convnet based person recognition system on which we provide an in-depth analysis of informativeness of different body cues, impact of training data, and the common failure modes of the system. In addition, we discuss the limitations of existing benchmarks and propose more challenging ones. Our method is simple and is built on open source and open data, yet it improves the state of the art results on a large dataset of social media photos (PIPA).Comment: Accepted to ICCV 2015, revise

arXiv.org e-Print Archive

CISPA – Helmholtz-Zentrum für Informationssicherheit

Crossref

MPG.PuRe

A Novel Hybrid CNN-AIS Visual Pattern Recognition Engine

Author: A Borji
A Krizhevsky
E Hart
F Lauer
G Dudek
LN Castro De
N Pinto
S Knerr
T Serre
Y LeCun
Y Shao
Publication venue
Publication date: 11/03/2015
Field of study

Machine learning methods are used today for most recognition problems. Convolutional Neural Networks (CNN) have time and again proved successful for many image processing tasks primarily for their architecture. In this paper we propose to apply CNN to small data sets like for example, personal albums or other similar environs where the size of training dataset is a limitation, within the framework of a proposed hybrid CNN-AIS model. We use Artificial Immune System Principles to enhance small size of training data set. A layer of Clonal Selection is added to the local filtering and max pooling of CNN Architecture. The proposed Architecture is evaluated using the standard MNIST dataset by limiting the data size and also with a small personal data sample belonging to two different classes. Experimental results show that the proposed hybrid CNN-AIS based recognition engine works well when the size of training data is limited in siz

arXiv.org e-Print Archive

Crossref

Smartphone picture organization: a hierarchical approach

Author: Dimiccoli Mariella
Lonn Stefan
Radeva Petia
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

We live in a society where the large majority of the population has a camera-equipped smartphone. In addition, hard drives and cloud storage are getting cheaper and cheaper, leading to a tremendous growth in stored personal photos. Unlike photo collections captured by a digital camera, which typically are pre-processed by the user who organizes them into event-related folders, smartphone pictures are automatically stored in the cloud. As a consequence, photo collections captured by a smartphone are highly unstructured and because smartphones are ubiquitous, they present a larger variability compared to pictures captured by a digital camera. To solve the need of organizing large smartphone photo collections automatically, we propose here a new methodology for hierarchical photo organization into topics and topic-related categories. Our approach successfully estimates latent topics in the pictures by applying probabilistic Latent Semantic Analysis, and automatically assigns a name to each topic by relying on a lexical database. Topic-related categories are then estimated by using a set of topic-specific Convolutional Neuronal Networks. To validate our approach, we ensemble and make public a large dataset of more than 8,000 smartphone pictures from 40 persons. Experimental results demonstrate major user satisfaction with respect to state of the art solutions in terms of organization.Peer ReviewedPreprin

arXiv.org e-Print Archive

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Digital.CSIC

Memory texts and memory work: Performances of memory in and with visual media

Author: Annette Kuhn
Chalfen R.
Kuhn A.
Langford M.
Linkman A.
Metz C.
Misztal B.A.
Russell C.
Stewart S.
Publication venue: 'SAGE Publications'
Publication date: 01/10/2010
Field of study

The online version of this article can be found at: http://mss.sagepub.com/content/early/2010/05/24/175069801037003

Crossref

Queen Mary Research Online