Search CORE

127 research outputs found

Saliency-aware Stereoscopic Video Retargeting

Author: Imani Hassan
Islam Md Baharul
Wong Lai-Kuan
Publication venue
Publication date: 18/04/2023
Field of study

Stereo video retargeting aims to resize an image to a desired aspect ratio. The quality of retargeted videos can be significantly impacted by the stereo videos spatial, temporal, and disparity coherence, all of which can be impacted by the retargeting process. Due to the lack of a publicly accessible annotated dataset, there is little research on deep learning-based methods for stereo video retargeting. This paper proposes an unsupervised deep learning-based stereo video retargeting network. Our model first detects the salient objects and shifts and warps all objects such that it minimizes the distortion of the salient parts of the stereo frames. We use 1D convolution for shifting the salient objects and design a stereo video Transformer to assist the retargeting process. To train the network, we use the parallax attention mechanism to fuse the left and right views and feed the retargeted frames to a reconstruction module that reverses the retargeted frames to the input frames. Therefore, the network is trained in an unsupervised manner. Extensive qualitative and quantitative experiments and ablation studies on KITTI stereo 2012 and 2015 datasets demonstrate the efficiency of the proposed method over the existing state-of-the-art methods. The code is available at https://github.com/z65451/SVR/.Comment: 8 pages excluding references. CVPRW conferenc

arXiv.org e-Print Archive

Human Attention Modelization and Data Reduction

Author: Dominique De Beul
Matei Mancas
Nicolas Riche
Xavier Siebert
Publication venue: 'IntechOpen'
Publication date: 23/03/2012
Field of study

IntechOpen

CiteSeerX

Crossref

Intelligent visual media processing: when graphics meets vision

Author: Cheng Ming-Ming
Hou Qi-Bin
Rosin Paul L
Zhang Song-Hai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

The computer graphics and computer vision communities have been working closely together in recent years, and a variety of algorithms and applications have been developed to analyze and manipulate the visual media around us. There are three major driving forces behind this phenomenon: i) the availability of big data from the Internet has created a demand for dealing with the ever increasing, vast amount of resources; ii) powerful processing tools, such as deep neural networks, provide e�ective ways for learning how to deal with heterogeneous visual data; iii) new data capture devices, such as the Kinect, bridge between algorithms for 2D image understanding and 3D model analysis. These driving forces have emerged only recently, and we believe that the computer graphics and computer vision communities are still in the beginning of their honeymoon phase. In this work we survey recent research on how computer vision techniques bene�t computer graphics techniques and vice versa, and cover research on analysis, manipulation, synthesis, and interaction. We also discuss existing problems and suggest possible further research directions

Online Research @ Cardiff

Perceptually Guided Photo Retargeting

Author: Hong Richang
Nie Liqiang
Shao Ling
Xia Yingjie
Yan Yan
Zhang Luming
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 22/04/2016
Field of study

We propose perceptually guided photo retargeting, which shrinks a photo by simulating a human's process of sequentially perceiving visually/semantically important regions in a photo. In particular, we first project the local features (graphlets in this paper) onto a semantic space, wherein visual cues such as global spatial layout and rough geometric context are exploited. Thereafter, a sparsity-constrained learning algorithm is derived to select semantically representative graphlets of a photo, and the selecting process can be interpreted by a path which simulates how a human actively perceives semantics in a photo. Furthermore, we learn the prior distribution of such active graphlet paths (AGPs) from training photos that are marked as esthetically pleasing by multiple users. The learned priors enforce the corresponding AGP of a retargeted photo to be maximally similar to those from the training photos. On top of the retargeting model, we further design an online learning scheme to incrementally update the model with new photos that are esthetically pleasing. The online update module makes the algorithm less dependent on the number and contents of the initial training data. Experimental results show that: 1) the proposed AGP is over 90% consistent with human gaze shifting path, as verified by the eye-tracking data, and 2) the retargeting algorithm outperforms its competitors significantly, as AGP is more indicative of photo esthetics than conventional saliency maps

Crossref

University of East Anglia digital repository

Methods for reducing visual discomfort in stereoscopic 3D: A review

Author: Akeley
Bando
Banks
Basha
Blohm
Carnegie
Chang
Chen
Chen
Chen
Choi
Fry
Harris
Heinzle
Hoffman
Hoffman
Holliman
Hong
Howarth
Hwang
Iatsun
Ideses
Jiang
Jiang
Jung
Jung
Jung
Jung
Jung
Kang
Kasim Terzić
Kim
Kim
Kim
Kim
Kim
Kitrosser
Konrad
Kooi
Koppal
Lambooij
Lambooij
Lang
Le Callet
Lee
Lee
Lee
Lee
Leroy
Li
Li
Li
Li
Lipton
Liu
Love
López
Ma
MacKenzie
MacKenzie
Masia
McIntire
Meesters
Mendiburu
Miles Hansard
Moorthy
Mu
Nojiri
Oh
Oh
Pajak
Park
Park
Park
Park
Percival
Pritch
Qi
Read
Rolland
Sakamoto
Sanftmann
Scher
Schor
Schor
Schor
Schor
Seuntiëns
Shao
Shao
Sheard
Sheedy
Shibata
Shibata
Shiwa
Sohn
Sohn
Sohn
Solimini
Tasli
Templin
Torii
Urvoy
Wang
Wang
Wang
Wang
Ware
Winkler
Wopking
Xia
Xue
Yan
Yano
Yoo
Yun
Zellinger
Zeng
Zeri
Zhang
Zhou
Zitnick
Publication venue: 'Elsevier BV'
Publication date: 11/08/2016
Field of study

This work was supported by the EPSRC Grant EP/M01469X/1, “Geometric Evaluation of Stereoscopic Video”

Crossref

Elsevier - Publisher Connector

Queen Mary Research Online

University of St. Andrews - Pure

Adaptation of Images and Videos for Different Screen Sizes

Author: Kiess Johannes
Publication venue
Publication date: 01/01/2014
Field of study

With the increasing popularity of smartphones and similar mobile devices, the demand for media to consume on the go rises. As most images and videos today are captured with HD or even higher resolutions, there is a need to adapt them in a content-aware fashion before they can be watched comfortably on screens with small sizes and varying aspect ratios. This process is called retargeting. Most distortions during this process are caused by a change of the aspect ratio. Thus, retargeting mainly focuses on adapting the aspect ratio of a video while the rest can be scaled uniformly. The main objective of this dissertation is to contribute to the modern image and video retargeting, especially regarding the potential of the seam carving operator. There are still unsolved problems in this research field that should be addressed in order to improve the quality of the results or speed up the performance of the retargeting process. This dissertation presents novel algorithms that are able to retarget images, videos and stereoscopic videos while dealing with problems like the preservation of straight lines or the reduction of the required memory space and computation time. Additionally, a GPU implementation is used to achieve the retargeting of videos in real-time. Furthermore, an enhancement of face detection is presented which is able to distinguish between faces that are important for the retargeting and faces that are not. Results show that the developed techniques are suitable for the desired scenarios

MAnnheim DOCument Server

Finding Objects of Interest in Images using Saliency and Superpixels

Author: Achanta Radhakrishna
Publication venue: Lausanne, EPFL
Publication date: 21/10/2010
Field of study

The ability to automatically find objects of interest in images is useful in the areas of compression, indexing and retrieval, re-targeting, and so on. There are two classes of such algorithms – those that find any object of interest with no prior knowledge, independent of the task, and those that find specific objects of interest known a priori. The former class of algorithms tries to detect objects in images that stand-out, i.e. are salient, by virtue of being different from the rest of the image and consequently capture our attention. The detection is generic in this case as there is no specific object we are trying to locate. The latter class of algorithms detects specific known objects of interest and often requires training using features extracted from known examples. In this thesis we address various aspects of finding objects of interest under the topics of saliency detection and object detection. We present two saliency detection algorithms that rely on the principle of center-surround contrast. These two algorithms are shown to be superior to several state-of-the-art techniques in terms of precision and recall measures with respect to a ground truth. They output full-resolution saliency maps, are simpler to implement, and are computationally more efficient than most existing algorithms. We further establish the relevance of our saliency detection algorithms by using them for the known applications of object segmentation and image re-targeting. We first present three different techniques for salient object segmentation using our saliency maps that are based on clustering, graph-cuts, and geodesic distance based labeling. We then demonstrate the use of our saliency maps for a popular technique of content-aware image resizing and compare the result with that of existing methods. Our saliency maps prove to be a much more effective replacement for conventional gradient maps for providing automatic content-awareness. Just as it is important to find regions of interest in images, it is also important to find interesting images within a large collection of images. We therefore extend the notion of saliency detection in images to image databases. We propose an algorithm for finding salient images in a database. Apart from finding such images we also present two novel techniques for creating visually appealing summaries in the form of collages and mosaics. Finally, we address the problem of finding specific known objects of interest in images. Specifically, we deal with the feature extraction step that is a pre-requisite for any technique in this domain. In this context, we first present a superpixel segmentation algorithm that outperforms previous algorithms in terms quantitative measures of under-segmentation error and boundary recall. Our superpixel segmentation algorithm also offers several other advantages over existing algorithms like compactness, uniform size, control on the number of superpixels, and computational efficiency. We prove the effectiveness of our superpixels by deploying them in existing algorithms, specifically, an object class detection technique and a graph based algorithm, and improving their performance. We also present the result of using our superpixels in a technique for detecting mitochondria in noisy medical images

Infoscience - École polytechnique fédérale de Lausanne

MODELLING APPEARANCE AND GEOMETRY FROM IMAGES

Author: Melendez Francisco
Publication venue
Publication date: 01/08/2011
Field of study

The University of Manchester - Institutional Repository

Spatiotemporal Saliency Detection: State of Art

Author: Sultana Kadri, Pooja, Manju Bala
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/04/2016
Field of study

Saliency detection has become a very prominent subject for research in recent time. Many techniques has been defined for the saliency detection.In this paper number of techniques has been explained that include the saliency detection from the year 2000 to 2015, almost every technique has been included.all the methods are explained briefly including their advantages and disadvantages. Comparison between various techniques has been done. With the help of table which includes authors name,paper name,year,techniques,algorithms and challenges. A comparison between levels of acceptance rates and accuracy levels are made

International Journal on Recent and Innovation Trends in Computing and Communication