2,183 research outputs found
Automated tracking of colloidal clusters with sub-pixel accuracy and precision
Quantitative tracking of features from video images is a basic technique
employed in many areas of science. Here, we present a method for the tracking
of features that partially overlap, in order to be able to track so-called
colloidal molecules. Our approach implements two improvements into existing
particle tracking algorithms. Firstly, we use the history of previously
identified feature locations to successfully find their positions in
consecutive frames. Secondly, we present a framework for non-linear
least-squares fitting to summed radial model functions and analyze the accuracy
(bias) and precision (random error) of the method on artificial data. We find
that our tracking algorithm correctly identifies overlapping features with an
accuracy below 0.2% of the feature radius and a precision of 0.1 to 0.01 pixels
for a typical image of a colloidal cluster. Finally, we use our method to
extract the three-dimensional diffusion tensor from the Brownian motion of
colloidal dimers.Comment: 20 pages, 8 figures. Non-revised preprint version, please refer to
http://dx.doi.org/10.1088/1361-648X/29/4/04400
Holistic interpretation of visual data based on topology:semantic segmentation of architectural facades
The work presented in this dissertation is a step towards effectively incorporating contextual knowledge in the task of semantic segmentation. To date, the use of context has been confined to the genre of the scene with a few exceptions in the field. Research has been directed towards enhancing appearance descriptors. While this is unarguably important, recent studies show that computer vision has reached a near-human level of performance in relying on these descriptors when objects have stable distinctive surface properties and in proper imaging conditions. When these conditions are not met, humans exploit their knowledge about the intrinsic geometric layout of the scene to make local decisions. Computer vision lags behind when it comes to this asset. For this reason, we aim to bridge the gap by presenting algorithms for semantic segmentation of building facades making use of scene topological aspects. We provide a classification scheme to carry out segmentation and recognition simultaneously.The algorithm is able to solve a single optimization function and yield a semantic interpretation of facades, relying on the modeling power of probabilistic graphs and efficient discrete combinatorial optimization tools. We tackle the same problem of semantic facade segmentation with the neural network approach.We attain accuracy figures that are on-par with the state-of-the-art in a fully automated pipeline.Starting from pixelwise classifications obtained via Convolutional Neural Networks (CNN). These are then structurally validated through a cascade of Restricted Boltzmann Machines (RBM) and Multi-Layer Perceptron (MLP) that regenerates the most likely layout. In the domain of architectural modeling, there is geometric multi-model fitting. We introduce a novel guided sampling algorithm based on Minimum Spanning Trees (MST), which surpasses other propagation techniques in terms of robustness to noise. We make a number of additional contributions such as measure of model deviation which captures variations among fitted models
Optimization of facade segmentation based on layout priors
We propose an algorithm that provides a pixel-wise classification of building facades. Building facades provide a rich environment for testing semantic segmentation techniques. They come in a variety of styles affecting appearance and layout. On the other hand, they exhibit a degree of stability in the arrangement of structures across different instances. Furthermore, a single image is often composed of a repetitive architectural pattern. We integrate appearance, layout and repetition cues in a single energy function, that is optimized through the TRW-S algorithm to provide a classification of superpixels. The appearance energy is based on scores of a Random Forrest classifier. The feature space is composed of higher-level vectors encoding distance to structure clusters. Layout priors are obtained from locations and structural adjacencies in training data. In addition, priors result from translational symmetry cues acquired from the scene itself through clustering via the α -expansion graphcut algorithm. We are on par with state-of-the-art. We are able to fine tune classifications at the superpixel level, while most methods model all architectural features with bounding rectangles
- …