Search CORE

713 research outputs found

Language-Based Image Editing with Recurrent Attentive Models

Author: Chen Jianbo
Gao Jianfeng
Liu Jingjing
Liu Xiaodong
Shen Yelong
Publication venue
Publication date: 10/06/2018
Field of study

We investigate the problem of Language-Based Image Editing (LBIE). Given a source image and a natural language description, we want to generate a target image by editing the source image based on the description. We propose a generic modeling framework for two sub-tasks of LBIE: language-based image segmentation and image colorization. The framework uses recurrent attentive models to fuse image and language features. Instead of using a fixed step size, we introduce for each region of the image a termination gate to dynamically determine after each inference step whether to continue extrapolating additional information from the textual description. The effectiveness of the framework is validated on three datasets. First, we introduce a synthetic dataset, called CoSaL, to evaluate the end-to-end performance of our LBIE system. Second, we show that the framework leads to state-of-the-art performance on image segmentation on the ReferIt dataset. Third, we present the first language-based colorization result on the Oxford-102 Flowers dataset.Comment: Accepted to CVPR 2018 as a Spotligh

arXiv.org e-Print Archive

Crossref

Colorization and Automated Segmentation of Human T2 MR Brain Images for Characterization of Soft Tissues

Author: A Anjos
A Panning
A Rehman
AG Ryan
Alex Vitkin
B Veksler
C Squvaget
C Valdés Hernández Mdel
C Valdés Hernández Mdel
CAZ Barcelos
CM Bauer
D Imperati
D Krstinic
D Ongur
DE Ilea
DL Pham
E Parzen
EI Zacharaki
Ghulam Gilanie
GN Holland
GNH Rai
H Ullah
H Ullah
H Ullah
Hafeez-Ullah
HM Ajmal
IM Vlasova
Javed A. Kamran
KL Weiss
KL Weiss
KN Plataniotis
M Lei
M Murgasova
M Rosenblatt
Malik S. Mehmood
Masroor Ikram
ME Celebi
MM Ahmed
MN Wu
Muhammad Attique
Muhammad S. Naweed
N Otsu
N Sharma
P Kanungo
PS Heckbert
SB Chaabane
T Horiuchi
T Horiuchi
TW Ridler
V Bochko
VG Jacob
Y Rathore
Y Zhang
Y Zhang
Yu-Feng Zang
Publication venue: Public Library of Science
Publication date: 27/03/2012
Field of study

Characterization of tissues like brain by using magnetic resonance (MR) images and colorization of the gray scale image has been reported in the literature, along with the advantages and drawbacks. Here, we present two independent methods; (i) a novel colorization method to underscore the variability in brain MR images, indicative of the underlying physical density of bio tissue, (ii) a segmentation method (both hard and soft segmentation) to characterize gray brain MR images. The segmented images are then transformed into color using the above-mentioned colorization method, yielding promising results for manual tracing. Our color transformation incorporates the voxel classification by matching the luminance of voxels of the source MR image and provided color image by measuring the distance between them. The segmentation method is based on single-phase clustering for 2D and 3D image segmentation with a new auto centroid selection method, which divides the image into three distinct regions (gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) using prior anatomical knowledge). Results have been successfully validated on human T2-weighted (T2) brain MR images. The proposed method can be potentially applied to gray-scale images from other imaging modalities, in bringing out additional diagnostic tissue information contained in the colorized image processing approach as described

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

A survey of comics research in computer science

Author: Augereau Olivier
Iwata Motoi
Kise Koichi
Publication venue
Publication date: 15/04/2018
Field of study

Graphical novels such as comics and mangas are well known all over the world. The digital transition started to change the way people are reading comics, more and more on smartphones and tablets and less and less on paper. In the recent years, a wide variety of research about comics has been proposed and might change the way comics are created, distributed and read in future years. Early work focuses on low level document image analysis: indeed comic books are complex, they contains text, drawings, balloon, panels, onomatopoeia, etc. Different fields of computer science covered research about user interaction and content generation such as multimedia, artificial intelligence, human-computer interaction, etc. with different sets of values. We propose in this paper to review the previous research about comics in computer science, to state what have been done and to give some insights about the main outlooks

arXiv.org e-Print Archive

Directory of Open Access Journals

The Missing Data Encoder: Cross-Channel Image Completion\\with Hide-And-Seek Adversarial Network

Author: Cord Matthieu
Dapogny Arnaud
Perez Patrick
Publication venue
Publication date: 06/05/2019
Field of study

Image completion is the problem of generating whole images from fragments only. It encompasses inpainting (generating a patch given its surrounding), reverse inpainting/extrapolation (generating the periphery given the central patch) as well as colorization (generating one or several channels given other ones). In this paper, we employ a deep network to perform image completion, with adversarial training as well as perceptual and completion losses, and call it the ``missing data encoder'' (MDE). We consider several configurations based on how the seed fragments are chosen. We show that training MDE for ``random extrapolation and colorization'' (MDE-REC), i.e. using random channel-independent fragments, allows a better capture of the image semantics and geometry. MDE training makes use of a novel ``hide-and-seek'' adversarial loss, where the discriminator seeks the original non-masked regions, while the generator tries to hide them. We validate our models both qualitatively and quantitatively on several datasets, showing their interest for image completion, unsupervised representation learning as well as face occlusion handling

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications