Search CORE

310 research outputs found

Saliency-aware Stereoscopic Video Retargeting

Author: Imani Hassan
Islam Md Baharul
Wong Lai-Kuan
Publication venue
Publication date: 18/04/2023
Field of study

Stereo video retargeting aims to resize an image to a desired aspect ratio. The quality of retargeted videos can be significantly impacted by the stereo videos spatial, temporal, and disparity coherence, all of which can be impacted by the retargeting process. Due to the lack of a publicly accessible annotated dataset, there is little research on deep learning-based methods for stereo video retargeting. This paper proposes an unsupervised deep learning-based stereo video retargeting network. Our model first detects the salient objects and shifts and warps all objects such that it minimizes the distortion of the salient parts of the stereo frames. We use 1D convolution for shifting the salient objects and design a stereo video Transformer to assist the retargeting process. To train the network, we use the parallax attention mechanism to fuse the left and right views and feed the retargeted frames to a reconstruction module that reverses the retargeted frames to the input frames. Therefore, the network is trained in an unsupervised manner. Extensive qualitative and quantitative experiments and ablation studies on KITTI stereo 2012 and 2015 datasets demonstrate the efficiency of the proposed method over the existing state-of-the-art methods. The code is available at https://github.com/z65451/SVR/.Comment: 8 pages excluding references. CVPRW conferenc

arXiv.org e-Print Archive

Stereoscopic Seam Carving With Temporal Consistency

Author: Effelsberg Wolfgang
Guthier Benjamin
Kiess Johannes
Kopf Stephan
Publication venue
Publication date: 01/01/2013
Field of study

In this paper, we present a novel technique for seam carving of stereoscopic video. It removes seams of pixels in areas that are most likely not noticed by the viewer. When applying seam carving to stereoscopic video rather than monoscopic still images, new challenges arise. The detected seams must be consistent between the left and the right view, so that no depth information is destroyed. When removing seams in two consecutive frames, temporal consistency between the removed seams must be established to avoid flicker in the resulting video. By making certain assumptions, the available depth information can be harnessed to improve the quality achieved by seam carving. Assuming that closer pixels are more important, the algorithm can focus on removing distant pixels first. Furthermore, we assume that coherent pixels belonging to the same object have similar depth. By avoiding to cut through edges in the depth map, we can thus avoid cutting through object boundaries

MAnnheim DOCument Server

Supervised Deep Learning for Content-Aware Image Retargeting with Fourier Convolutions

Author: Givkashi MohammadHossein
Karimi Nader
Naderi MohammadReza
Samavi Shadrokh
Shirani Shahram
Publication venue
Publication date: 12/06/2023
Field of study

Image retargeting aims to alter the size of the image with attention to the contents. One of the main obstacles to training deep learning models for image retargeting is the need for a vast labeled dataset. Labeled datasets are unavailable for training deep learning models in the image retargeting tasks. As a result, we present a new supervised approach for training deep learning models. We use the original images as ground truth and create inputs for the model by resizing and cropping the original images. A second challenge is generating different image sizes in inference time. However, regular convolutional neural networks cannot generate images of different sizes than the input image. To address this issue, we introduced a new method for supervised learning. In our approach, a mask is generated to show the desired size and location of the object. Then the mask and the input image are fed to the network. Comparing image retargeting methods and our proposed method demonstrates the model's ability to produce high-quality retargeted images. Afterward, we compute the image quality assessment score for each output image based on different techniques and illustrate the effectiveness of our approach.Comment: 18 pages, 5 figure

arXiv.org e-Print Archive

Gradient-based global features for seam carving

Author: B Yan
D Martin
DD Conge
DG Lowe
E Salma
F Shafieyan
Izumi Ito
J Chen
J Shen
M Frankovich
M Rubinstein
M Rubinstein
N Dalal
Q Yan
R Achanta
S Avidan
S Goferman
S-S Lin
T Basha
TK Wattanachote
Y Guo
Y Tanaka
Y-S Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Stereo Day-for-Night: Retargeting Disparity for Scotopic Vision

Author: Durand F.
Hans-Peter Seidel
Hess R.
Karol Myszkowski
Peter Vangorp
Petr Kellnhofer
Tobias Ritschel
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

Crossref

Edge Hill University Research Information Repository

MPG.PuRe