989 research outputs found

    Regressing Local to Global Shape Properties for Online Segmentation and Tracking

    Full text link

    Regressing local to global shape properties for online segmentation and tracking

    Get PDF
    We propose a regression based learning framework that learns a set of shapes online, which can then be used to recover occluded object shapes. We represent shapes using their 2D discrete cosine transforms, and the key insight we propose is to regress low frequency harmonics, which represent the global properties of the shape, from high frequency harmonics, that encode the details of the object's shape. We learn the regression model using Locally Weighted Projection Regression (LWPR) which expedites online, incremental learning. After sufficient observation of a set of unoccluded shapes, the learned model can detect occlusion and recover the full shapes from the occluded ones. We demonstrate the ideas using a level-set based tracking system that provides shape and pose, however, the framework could be embedded in any segmentation-based tracking system. Our experiments demonstrate the efficacy of the method on a variety of objects using both real data and artificial data.Carl Yuheng Ren, Victor Adrian Prisacariu, Ian Rei

    A random cell motility gradient downstream of FGF controls elongation of amniote embryos

    Get PDF
    Vertebrate embryos are characterized by an elongated antero-posterior (AP) body axis, which forms by progressive cell deposition from a posterior growth zone in the embryo. Here, we used tissue ablation in the chicken embryo to demonstrate that the caudal presomitic mesoderm (PSM) has a key role in axis elongation. Using time-lapse microscopy, we analysed the movements of fluorescently labelled cells in the PSM during embryo elongation, which revealed a clear posterior-to-anterior gradient of cell motility and directionality in the PSM. We tracked the movement of the PSM extracellular matrix in parallel with the labelled cells and subtracted the extracellular matrix movement from the global motion of cells. After subtraction, cell motility remained graded but lacked directionality, indicating that the posterior cell movements associated with axis elongation in the PSM are not intrinsic but reflect tissue deformation. The gradient of cell motion along the PSM parallels the fibroblast growth factor (FGF)/mitogen-activated protein kinase (MAPK) gradient1, which has been implicated in the control of cell motility in this tissue2. Both FGF signalling gain- and loss-of-function experiments lead to disruption of the motility gradient and a slowing down of axis elongation. Furthermore, embryos treated with cell movement inhibitors (blebbistatin or RhoK inhibitor), but not cell cycle inhibitors, show a slower axis elongation rate. We propose that the gradient of random cell motility downstream of FGF signalling in the PSM controls posterior elongation in the amniote embryo. Our data indicate that tissue elongation is an emergent property that arises from the collective regulation of graded, random cell motion rather than by the regulation of directionality of individual cellular movements

    Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation

    Full text link
    Deep neural networks with alternating convolutional, max-pooling and decimation layers are widely used in state of the art architectures for computer vision. Max-pooling purposefully discards precise spatial information in order to create features that are more robust, and typically organized as lower resolution spatial feature maps. On some tasks, such as whole-image classification, max-pooling derived features are well suited; however, for tasks requiring precise localization, such as pixel level prediction and segmentation, max-pooling destroys exactly the information required to perform well. Precise localization may be preserved by shallow convnets without pooling but at the expense of robustness. Can we have our max-pooled multi-layered cake and eat it too? Several papers have proposed summation and concatenation based methods for combining upsampled coarse, abstract features with finer features to produce robust pixel level predictions. Here we introduce another model --- dubbed Recombinator Networks --- where coarse features inform finer features early in their formation such that finer features can make use of several layers of computation in deciding how to use coarse features. The model is trained once, end-to-end and performs better than summation-based architectures, reducing the error from the previous state of the art on two facial keypoint datasets, AFW and AFLW, by 30\% and beating the current state-of-the-art on 300W without using extra data. We improve performance even further by adding a denoising prediction model based on a novel convnet formulation.Comment: accepted in CVPR 201
    corecore