56 research outputs found

Supervised Deep Learning for Content-Aware Image Retargeting with Fourier Convolutions

Author: Givkashi MohammadHossein
Karimi Nader
Naderi MohammadReza
Samavi Shadrokh
Shirani Shahram
Publication date: 12/06/2023

Image retargeting aims to alter the size of the image with attention to the contents. One of the main obstacles to training deep learning models for image retargeting is the need for a vast labeled dataset. Labeled datasets are unavailable for training deep learning models in the image retargeting tasks. As a result, we present a new supervised approach for training deep learning models. We use the original images as ground truth and create inputs for the model by resizing and cropping the original images. A second challenge is generating different image sizes in inference time. However, regular convolutional neural networks cannot generate images of different sizes than the input image. To address this issue, we introduced a new method for supervised learning. In our approach, a mask is generated to show the desired size and location of the object. Then the mask and the input image are fed to the network. Comparing image retargeting methods and our proposed method demonstrates the model's ability to produce high-quality retargeted images. Afterward, we compute the image quality assessment score for each output image based on different techniques and illustrate the effectiveness of our approach. Comment: 18 pages, 5 figures

arXiv.org e-Print Archive

Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting

Author: Cho Donghyeon
Kweon In So
Oh Tae-Hyun
Park Jinsun
Tai Yu-Wing
Publication date: 09/08/2017

This paper proposes a weakly- and self-supervised deep convolutional neural network (WSSDCNN) for content-aware image retargeting. Our network takes a source image and a target aspect ratio, and then directly outputs a retargeted image. Retargeting is performed through a shift map, which is a pixel-wise mapping from the source to the target grid. Our method implicitly learns an attention map, which leads to a content-aware shift map for image retargeting. As a result, discriminative parts in an image are preserved, while background regions are adjusted seamlessly. In the training phase, pairs of an image and its image-level annotation are used to compute content and structure losses. We demonstrate the effectiveness of our proposed method for a retargeting application with insightful analyses. Comment: 10 pages, 11 figures. To appear in ICCV 2017, Spotlight Presentation

arXiv.org e-Print Archive

포항공과대학교 (Pohang University of Science and Technology)

Algorithms for video retargeting

Author: A Fox
A Shamir
A Vetro
B Bai
B Tseng
Benjamin Guthier
D Farin
DG Lowe
F Mokhtarian
H Bay
H Schneiderman
HA Rowley
I Nurnett
JF Canny
Johannes Kiess
JS Kim
K Curran
L Itti
M Fischler
M Hossain
M Rubinstein
M Zwicker
N Björk
O Steiger
P Beek
P Krähenbühl
P Schaber
R Han
R Mohan
RO Duda
S Kopf
S Nepal
Stephan Kopf
T Ren
T Shanableh
Thomas Haenselmann
V Cardellini
W Dong
W Lum
WH Cheng
Wolfgang Effelsberg
Y Boykov
Y Guo
Y Li
Y Linde
YF Ma
YS Wang
Z Lei
Z Obrenovic
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Gradient-based global features for seam carving

Author: B Yan
D Martin
DD Conge
DG Lowe
E Salma
F Shafieyan
Izumi Ito
J Chen
J Shen
M Frankovich
M Rubinstein
N Dalal
Q Yan
R Achanta
S Avidan
S Goferman
S-S Lin
T Basha
TK Wattanachote
Y Guo
Y Tanaka
Y-S Wang
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Improved content aware scene retargeting for retinitis pigmentosa patients

Author: Al-Atabany Walid I
Degenaar Patrick A
Tong Tzyy
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: In this paper we present a novel scene retargeting technique to reduce the visual scene while maintaining the size of the key features. The algorithm is scalable to implementation on portable devices, and thus has potential for augmented reality systems to provide visual support for those with tunnel vision. We therefore test the efficacy of our algorithm on shrinking the visual scene into the remaining field of view for those patients. Methods: Simple spatial compression of visual scenes makes objects appear further away. We have therefore developed an algorithm which removes low importance information, maintaining the size of the significant features. Previous approaches in this field have included seam carving, which removes low importance seams from the scene, and shrinkability, which dynamically shrinks the scene according to a generated importance map. The former method causes significant artifacts and the latter is inefficient. In this work we have developed a new algorithm combining the best aspects of these two previous methods. In particular, our approach is to generate a shrinkability importance map using a seam-based approach. We then use it to dynamically shrink the scene in a similar fashion to the shrinkability method. Importantly, we have implemented it so that it can be used in real time without prior knowledge of future frames. Results: We have evaluated and compared our algorithm to the seam carving and image shrinkability approaches from a content preservation perspective and a compression quality perspective. Our technique has also been evaluated and tested in a trial that included 20 participants with simulated tunnel vision. Results show the robustness of our method at reducing scenes by up to 50% with minimal distortion. We also demonstrate efficacy in its use for those with simulated tunnel vision of 22 degrees of field of view or less.
Conclusions: Our approach allows us to perform content-aware video resizing in real time using only information from previous frames to avoid jitter. Our method also has a great benefit over the ordinary resizing method and even over other image retargeting methods. We show that the benefit derived from this algorithm is significant to patients with fields of view of 20° or less.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Human Attention Modelization and Data Reduction

Author: Dominique De Beul
Matei Mancas
Nicolas Riche
Xavier Siebert
Publication venue: 'IntechOpen'
Publication date: 23/03/2012

IntechOpen

CiteSeerX

Crossref

Adaptation of Images and Videos for Different Screen Sizes

Author: Kiess Johannes
Publication date: 01/01/2014

With the increasing popularity of smartphones and similar mobile devices, the demand for media to consume on the go rises. As most images and videos today are captured with HD or even higher resolutions, there is a need to adapt them in a content-aware fashion before they can be watched comfortably on screens with small sizes and varying aspect ratios. This process is called retargeting. Most distortions during this process are caused by a change of the aspect ratio. Thus, retargeting mainly focuses on adapting the aspect ratio of a video while the rest can be scaled uniformly. The main objective of this dissertation is to contribute to the modern image and video retargeting, especially regarding the potential of the seam carving operator. There are still unsolved problems in this research field that should be addressed in order to improve the quality of the results or speed up the performance of the retargeting process. This dissertation presents novel algorithms that are able to retarget images, videos and stereoscopic videos while dealing with problems like the preservation of straight lines or the reduction of the required memory space and computation time. Additionally, a GPU implementation is used to achieve the retargeting of videos in real-time. Furthermore, an enhancement of face detection is presented which is able to distinguish between faces that are important for the retargeting and faces that are not. Results show that the developed techniques are suitable for the desired scenarios

MAnnheim DOCument Server

Methods for reducing visual discomfort in stereoscopic 3D: A review

Author: Akeley
Bando
Banks
Basha
Blohm
Carnegie
Chang
Chen
Chen
Chen
Choi
Fry
Harris
Heinzle
Hoffman
Hoffman
Holliman
Hong
Howarth
Hwang
Iatsun
Ideses
Jiang
Jiang
Jung
Jung
Jung
Jung
Jung
Kang
Kasim Terzić
Kim
Kim
Kim
Kim
Kim
Kitrosser
Konrad
Kooi
Koppal
Lambooij
Lambooij
Lang
Le Callet
Lee
Lee
Lee
Lee
Leroy
Li
Li
Li
Li
Lipton
Liu
Love
López
Ma
MacKenzie
MacKenzie
Masia
McIntire
Meesters
Mendiburu
Miles Hansard
Moorthy
Mu
Nojiri
Oh
Oh
Pajak
Park
Park
Park
Park
Percival
Pritch
Qi
Read
Rolland
Sakamoto
Sanftmann
Scher
Schor
Schor
Schor
Schor
Seuntiëns
Shao
Shao
Sheard
Sheedy
Shibata
Shibata
Shiwa
Sohn
Sohn
Sohn
Solimini
Tasli
Templin
Torii
Urvoy
Wang
Wang
Wang
Wang
Ware
Winkler
Wopking
Xia
Xue
Yan
Yano
Yoo
Yun
Zellinger
Zeng
Zeri
Zhang
Zhou
Zitnick
Publication venue: 'Elsevier BV'
Publication date: 11/08/2016

This work was supported by the EPSRC Grant EP/M01469X/1, “Geometric Evaluation of Stereoscopic Video”

Crossref

Elsevier - Publisher Connector

Queen Mary Research Online

University of St. Andrews - Pure