Markovian Image Models: Applications in Unsupervised Image Segmentation (Markov mezők a képmodellezésben, alkalmazásuk az automatikus képszegmentálás területén)

Author: Kató Zoltán
Publication venue: OTKA
Publication date: 01/01/2007

1) We proposed a monogrid MRF model that combines color and texture features to improve the quality of segmentation results, and we solved the estimation of the model parameters. This work was published in the journal Image and Vision Computing.
2) We proposed an RJMCMC (Reversible Jump Markov Chain Monte Carlo) sampling method that can identify multi-dimensional Gaussian mixtures. Using this technique, we developed a fully automatic color image segmentation algorithm. The results were published at the BMVC 2004 international conference and in Image and Vision Computing.
3) We proposed a new multilayer MRF model that segments an image based on multiple cues (such as color, texture, or motion), and applied it to the color- and motion-based segmentation of video objects. This work was published at the HACIPPR 2005 and ACCV 2006 international conferences. Related work on optic flow computation and on color-, texture-, and motion-based GVF active contours, done with my student Mr. Peter Horvath, won first prize at the local Student Research Competition in 2004. The results were presented at the KEPAF 2004 conference.
4) We introduced a new shape prior, called 'gas of circles', using active contour models, so that the segmentation takes the shape of the target object into account. This work was done in collaboration with the Ariana group of INRIA, France, and my PhD student, Mr. Peter Horvath. The results were published at the ICPR 2006 and ICCVGIP 2006 conferences. A preliminary study on active contour models using shape moments was also carried out, and its results were published at HACIPPR 2005.
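
To make the idea of MRF-based segmentation concrete, here is a minimal sketch. It is not the algorithm described in the abstract above: it assumes a single grayscale feature instead of color and texture, fixed class means instead of estimated parameters, and uses iterated conditional modes (ICM) with a Potts smoothness prior as a simple stand-in optimizer. The function name `icm_segment` and all parameter values are hypothetical.

```python
# Sketch: MRF labelling with Gaussian data terms and a Potts smoothness prior,
# minimised by iterated conditional modes (ICM). Illustrative only.

def icm_segment(image, means, sigma=1.0, beta=1.0, sweeps=5):
    """Return a label grid for `image` (a list of rows of floats).

    means: assumed class means, one per label (fixed here, not estimated).
    beta:  Potts weight; each 4-neighbour with a different label costs beta.
    """
    h, w = len(image), len(image[0])
    n_labels = len(means)

    def data_cost(value, k):
        # Negative log of a Gaussian likelihood, constants dropped.
        return (value - means[k]) ** 2 / (2.0 * sigma ** 2)

    # Initialise with the per-pixel maximum-likelihood label.
    labels = [[min(range(n_labels), key=lambda k: data_cost(image[y][x], k))
               for x in range(w)] for y in range(h)]

    for _ in range(sweeps):
        changed = False
        for y in range(h):
            for x in range(w):
                neighbours = [labels[ny][nx]
                              for ny, nx in ((y - 1, x), (y + 1, x),
                                             (y, x - 1), (y, x + 1))
                              if 0 <= ny < h and 0 <= nx < w]

                def energy(k):
                    # Data term plus Potts penalty for disagreeing neighbours.
                    return (data_cost(image[y][x], k)
                            + beta * sum(n != k for n in neighbours))

                best = min(range(n_labels), key=energy)
                if best != labels[y][x]:
                    labels[y][x] = best
                    changed = True
        if not changed:
            break  # reached a local minimum of the energy
    return labels


# Toy image: dark left half, bright right half, one outlier at row 1, column 1.
noisy = [
    [0.1, 0.0, 0.9, 1.0],
    [0.0, 0.8, 1.1, 0.9],
    [0.2, 0.1, 1.0, 1.0],
]
result = icm_segment(noisy, means=[0.0, 1.0], sigma=0.5, beta=1.0)
```

In this toy case the outlier pixel would be labelled "bright" by its intensity alone, but the smoothness prior pulls it to the label of its three dark neighbours. ICM only finds a local minimum, which is one reason sampling-based methods such as those named in the abstract are used in practice.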

Repository of the Academy's Library

Self-Supervised Relative Depth Learning for Urban Scene Understanding

Authors: A Geiger, A Owens, BKP Horn, E Shelhamer, F Liu, G Larsson, Gabriel J. Brostow, L Wiskott, M Noroozi, Olaf Ronneberger, P Bideau, P Dollár, R Gao, R Garg, R Zhang
Publication date: 02/04/2018

As an agent moves through the world, the apparent motion of scene elements is (usually) inversely proportional to their depth. It is natural for a learning agent to associate image patterns with the magnitude of their displacement over time: as the agent moves, faraway mountains don't move much; nearby trees move a lot. This natural relationship between the appearance of objects and their motion is a rich source of information about the world. In this work, we start by training a deep network, using fully automatic supervision, to predict relative scene depth from single images. The relative depth training images are automatically derived from simple videos of cars moving through a scene, using recent motion segmentation techniques and no human-provided labels. This proxy task of predicting relative depth from a single image induces features in the network that yield large improvements over a network trained from scratch on a set of downstream tasks, including semantic segmentation, joint road segmentation and car detection, and monocular (absolute) depth estimation. The improvement on the semantic segmentation task is greater than that produced by any other automatically supervised method. Moreover, for monocular depth estimation, our unsupervised pre-training method even outperforms supervised pre-training with ImageNet. In addition, we demonstrate benefits from learning to predict (unsupervised) relative depth in the specific videos associated with various downstream tasks: we adapt to the specific scenes in those tasks in an unsupervised manner to improve performance. In summary, for semantic segmentation, we present state-of-the-art results among methods that do not use supervised pre-training, and for monocular depth estimation we even exceed the performance of supervised ImageNet pre-trained models, achieving results comparable with state-of-the-art methods.
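
The inverse relation between apparent motion and depth mentioned in the abstract can be illustrated with a small sketch. It assumes a pinhole camera translating sideways past a static scene with no rotation, which is a simplification and not necessarily the setup used in the paper. The function names, focal length, and depths are hypothetical values chosen for illustration.

```python
# Sketch: motion parallax under pure sideways camera translation.
# Assumptions (not from the source): pinhole camera, static scene, no rotation.

def apparent_motion(focal_px, translation_m, depth_m):
    """Image displacement in pixels of a static point at the given depth."""
    if depth_m <= 0:
        raise ValueError("depth must be positive")
    return focal_px * translation_m / depth_m


def relative_depth(displacements):
    """Map displacement magnitudes to relative depth in [0, 1].

    Larger motion means closer, so depth is taken proportional to 1/motion
    and then min-max normalised. Only the ordering is meaningful.
    """
    eps = 1e-6  # guards against division by zero for motionless pixels
    inverse = [1.0 / max(d, eps) for d in displacements]
    lo, hi = min(inverse), max(inverse)
    if hi == lo:
        return [0.0 for _ in inverse]
    return [(v - lo) / (hi - lo) for v in inverse]


focal_px = 700.0                 # hypothetical focal length in pixels
step_m = 0.5                     # camera moves half a metre between frames
depths_m = [5.0, 50.0, 5000.0]   # nearby tree, building, faraway mountain

motions = [apparent_motion(focal_px, step_m, z) for z in depths_m]
rel = relative_depth(motions)
```

Here the tree moves far more pixels than the mountain, and inverting the motion magnitudes recovers the depth ordering without any human-provided labels. Real video also contains camera rotation and independently moving objects, which is why the abstract refers to motion segmentation techniques for deriving the training signal.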

arXiv.org e-Print Archive

Crossref