21 research outputs found

    Low Compute and Fully Parallel Computer Vision with HashMatch

    Get PDF
    Numerous computer vision problems such as stereo depth estimation, object-class segmentation and foreground/background segmentation can be formulated as per-pixel image labeling tasks. Given one or more images as input, the desired output of these methods is usually a spatially smooth assignment of labels. The large number of such computer vision problems has led to significant research effort, with the state of the art moving from CRF-based approaches to deep CNNs and, more recently, hybrids of the two. Although these approaches have significantly advanced the state of the art, the vast majority focus solely on improving quantitative results and are not designed for low-compute scenarios. In this paper, we present HashMatch, a new general framework for a variety of computer vision labeling tasks. Our approach is designed to be both fully parallel, i.e. each pixel is processed independently, and low-compute, with a model complexity an order of magnitude lower than existing CNN- and CRF-based approaches. We evaluate HashMatch extensively on several problems such as disparity estimation, image retrieval, feature approximation and background subtraction, for which it achieves high computational efficiency while producing high-quality results.
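    The abstract's core idea, each pixel choosing its own label via compact binary codes, can be illustrated with a short sketch. The following is a minimal stand-in, not the authors' implementation: it hashes image patches with random hyperplanes (where HashMatch learns its hash functions) and picks each pixel's disparity by winner-take-all over Hamming distances; all names and parameters below are illustrative.

```python
import numpy as np

def hash_patches(img, patch=5, bits=32, seed=0):
    """Per-pixel binary codes from random-hyperplane hashing of patches."""
    rng = np.random.default_rng(seed)
    h, w = img.shape
    r = patch // 2
    padded = np.pad(img, r, mode="edge")
    # Flattened patch around every pixel: shape (h, w, patch * patch).
    feats = np.stack([padded[dy:dy + h, dx:dx + w]
                      for dy in range(patch) for dx in range(patch)], axis=-1)
    planes = rng.standard_normal((patch * patch, bits))
    return (feats.reshape(h * w, -1) @ planes > 0).reshape(h, w, bits)

def disparity(left, right, max_disp=16):
    """Winner-take-all disparity: every pixel decided independently."""
    cl, cr = hash_patches(left), hash_patches(right)
    h, w, _ = cl.shape
    cost = np.full((h, w, max_disp), np.inf)
    for d in range(max_disp):
        # Hamming distance between left pixel (y, x) and right pixel (y, x - d).
        cost[:, d:, d] = (cl[:, d:] != cr[:, :w - d]).sum(-1)
    return cost.argmin(-1)

left = np.random.rand(32, 48)
right = np.roll(left, -3, axis=1)   # toy pair: true disparity is 3
est = disparity(left, right)
print((est[:, 8:-8] == 3).mean())   # most interior pixels should recover it
```

    Because each pixel's argmin depends only on its own codes, the per-pixel step is trivially parallelizable, which is the property the abstract emphasizes.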

    Learning an interactive segmentation system

    No full text
    Many successful applications of computer vision to image or video manipulation are interactive by nature. However, the parameters of such systems are often trained with the user neglected. Traditionally, interactive systems have been treated in the same manner as their fully automatic counterparts: their performance is evaluated by computing the accuracy of their solutions under some fixed set of user interactions. This paper proposes a new evaluation and learning method which brings the user into the loop. It is based on the use of an active robot user -- a simulated model of a human user. We show how this approach can be used to evaluate and learn the parameters of state-of-the-art interactive segmentation systems. We also show how simulated user models can be integrated into the popular max-margin method for parameter learning, and propose an algorithm to solve the resulting optimisation problem.
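    As a rough illustration of the robot-user idea, the sketch below (assumed interfaces, not the paper's code) simulates a user who repeatedly marks a pixel in the largest remaining error region and re-runs an arbitrary seed-driven segmenter, recording accuracy after each interaction.

```python
import numpy as np
from scipy import ndimage

def robot_user_evaluate(segmenter, image, ground_truth, max_strokes=20):
    """Score an interactive segmenter by simulating a correcting user.

    `segmenter(image, seeds)` is an assumed interface: it returns a binary
    mask given per-pixel seeds (+1 foreground, -1 background, 0 unknown).
    """
    seeds = np.zeros(ground_truth.shape, dtype=np.int8)
    scores = []
    for _ in range(max_strokes):
        mask = segmenter(image, seeds)
        errors = mask != ground_truth
        if not errors.any():
            break
        # The robot user marks a pixel inside the largest error component,
        # mimicking where a human would plausibly correct next.
        labels, n = ndimage.label(errors)
        sizes = ndimage.sum(errors, labels, range(1, n + 1))
        ys, xs = np.nonzero(labels == np.argmax(sizes) + 1)
        y, x = ys[len(ys) // 2], xs[len(xs) // 2]
        seeds[y, x] = 1 if ground_truth[y, x] else -1
        scores.append(1.0 - errors.mean())
    return scores  # accuracy of each successive segmentation

# Toy segmenter: thresholds the image but lets seeds override locally.
img = np.random.rand(24, 24)
gt = img > 0.5
seg = lambda im, s: np.where(s != 0, s > 0, im > 0.45)
print(robot_user_evaluate(seg, img, gt, max_strokes=5))
```

    The same loop can serve as the inner evaluation of a parameter search, which is how the paper couples the simulated user with max-margin learning.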

    A sparse control model for image and video editing

    No full text

    A perceptually motivated online benchmark for image matting

    Get PDF

    A novel solution of using mixed reality in bowel and oral and maxillofacial surgical telepresence: 3D mean value cloning algorithm

    No full text
    Background and aim: Most mixed reality models used in surgical telepresence suffer from discrepancies in the boundary area and spatial-temporal inconsistency due to illumination variation across video frames. The aim of this work is to propose a new solution that produces a composite video by merging the augmented video of the surgery site with the virtual hand of the remote expert surgeon. The purpose of the proposed solution is to decrease processing time and enhance the accuracy of the merged video by decreasing overlay and visualization error and removing occlusion and artefacts. Methodology: The proposed system enhances the mean value cloning algorithm to maintain the spatial-temporal consistency of the final composite video. The enhanced algorithm incorporates 3D mean value coordinates and an improved mean value interpolant into the image cloning process, which reduces the sawtooth, smudging and discolouration artefacts around the blending region. Results: Compared to the state-of-the-art solution, the accuracy of the proposed solution in terms of overlay error improves from 1.01 mm to 0.80 mm, while accuracy in terms of visualization error improves from 98.8% to 99.4%. Processing time is reduced from 0.211 seconds to 0.173 seconds. Conclusion: Our solution makes the object of interest consistent with the light intensity of the target image by adding a space-distance term that helps maintain spatial consistency in the final merged video.
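    To make the cloning step concrete, here is a minimal 2D sketch of classic mean value cloning, the technique this paper extends (its 3D coordinates, improved interpolant and space-distance term are not reproduced; the circular seam, radii and difference values below are illustrative): the source-target difference on the patch boundary is diffused into the interior as a mean-value-coordinate-weighted average.

```python
import numpy as np

def mean_value_weights(p, verts):
    """Mean value coordinates of an interior point p w.r.t. a closed,
    counter-clockwise polygon `verts` of shape (n, 2)."""
    d = verts - p
    r = np.linalg.norm(d, axis=1)
    ang = np.arctan2(d[:, 1], d[:, 0])
    a = np.diff(np.append(ang, ang[0]))
    a = (a + np.pi) % (2 * np.pi) - np.pi   # signed angle from v_i to v_(i+1)
    t = np.tan(a / 2.0)
    w = (np.roll(t, 1) + t) / r             # (tan(a_(i-1)/2) + tan(a_i/2)) / |p - v_i|
    return w / w.sum()

# Toy membrane: the boundary is a sampled circle, and the (target - source)
# difference on the seam is interpolated inward as an MVC-weighted average.
n = 64
theta = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
verts = np.stack([16 + 12 * np.cos(theta), 16 + 12 * np.sin(theta)], axis=1)
seam_diff = np.sin(theta)                   # stand-in for (target - source) on the seam
membrane = np.zeros((32, 32))
for y in range(32):
    for x in range(32):
        if (x - 16) ** 2 + (y - 16) ** 2 < 11 ** 2:   # strictly inside the seam
            w = mean_value_weights(np.array([x, y], float), verts)
            membrane[y, x] = w @ seam_diff
# Adding `membrane` to the pasted source region gives the blended composite.
print(round(membrane[16, 16], 6))           # ~0: mean seam difference at the centre
```

    Because the weights depend only on boundary geometry, the membrane varies smoothly inside the patch, which is what suppresses visible seams; the paper's 3D variant additionally uses the space-distance term to keep this consistent across frames.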

    Manifold preserving edit propagation

    No full text
    ACM Transactions on Graphics, 31(6). DOI: 10.1145/2366145.2366151