Search CORE

48,896 research outputs found

Face Identification and Clustering

Author: Dhingra Atul
Publication venue
Publication date: 26/04/2017
Field of study

In this thesis, we study two problems based on clustering algorithms. In the first problem, we study the role of visual attributes using an agglomerative clustering algorithm to whittle down the search area where the number of classes is high to improve the performance of clustering. We observe that as we add more attributes, the clustering performance increases overall. In the second problem, we study the role of clustering in aggregating templates in a 1:N open set protocol using multi-shot video as a probe. We observe that by increasing the number of clusters, the performance increases with respect to the baseline and reaches a peak, after which increasing the number of clusters causes the performance to degrade. Experiments are conducted using recently introduced unconstrained IARPA Janus IJB-A, CS2, and CS3 face recognition datasets

arXiv.org e-Print Archive

Real-Time Salient Closed Boundary Tracking via Line Segments Perceptual Grouping

Author: Dehghan Masood
He Shida
Jagersand Martin
Qin Xuebin
Quintero Camilo Perez
Singh Abhineet
Publication venue
Publication date: 09/08/2017
Field of study

This paper presents a novel real-time method for tracking salient closed boundaries from video image sequences. This method operates on a set of straight line segments that are produced by line detection. The tracking scheme is coherently integrated into a perceptual grouping framework in which the visual tracking problem is tackled by identifying a subset of these line segments and connecting them sequentially to form a closed boundary with the largest saliency and a certain similarity to the previous one. Specifically, we define a new tracking criterion which combines a grouping cost and an area similarity constraint. The proposed criterion makes the resulting boundary tracking more robust to local minima. To achieve real-time tracking performance, we use Delaunay Triangulation to build a graph model with the detected line segments and then reduce the tracking problem to finding the optimal cycle in this graph. This is solved by our newly proposed closed boundary candidates searching algorithm called "Bidirectional Shortest Path (BDSP)". The efficiency and robustness of the proposed method are tested on real video sequences as well as during a robot arm pouring experiment.Comment: 7 pages, 8 figures, The 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017) submission ID 103

arXiv.org e-Print Archive

Crossref

Controlled Lagrangians and the stabilization of mechanical systems. II. Potential shaping

Author: Bloch Anthony M.
Chang Dong Eui
Leonard Naomi Ehrich
Marsden Jerrold E.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2001
Field of study

For pt.I, see ibid., vol.45, p.2253-70 (2000). We extend the method of controlled Lagrangians (CL) to include potential shaping, which achieves complete state-space asymptotic stabilization of mechanical systems. The CL method deals with mechanical systems with symmetry and provides symmetry-preserving kinetic shaping and feedback-controlled dissipation for state-space stabilization in all but the symmetry variables. Potential shaping complements the kinetic shaping by breaking symmetry and stabilizing the remaining state variables. The approach also extends the method of controlled Lagrangians to include a class of mechanical systems without symmetry such as the inverted pendulum on a cart that travels along an incline

CiteSeerX

Caltech Authors

Feature Tracking Cardiac Magnetic Resonance via Deep Learning and Spline Optimization

Author: Bluemke David A.
Noble J. Alison
Vigneault Davis M.
Xie Weidi
Publication venue
Publication date: 01/01/2017
Field of study

Feature tracking Cardiac Magnetic Resonance (CMR) has recently emerged as an area of interest for quantification of regional cardiac function from balanced, steady state free precession (SSFP) cine sequences. However, currently available techniques lack full automation, limiting reproducibility. We propose a fully automated technique whereby a CMR image sequence is first segmented with a deep, fully convolutional neural network (CNN) architecture, and quadratic basis splines are fitted simultaneously across all cardiac frames using least squares optimization. Experiments are performed using data from 42 patients with hypertrophic cardiomyopathy (HCM) and 21 healthy control subjects. In terms of segmentation, we compared state-of-the-art CNN frameworks, U-Net and dilated convolution architectures, with and without temporal context, using cross validation with three folds. Performance relative to expert manual segmentation was similar across all networks: pixel accuracy was ~97%, intersection-over-union (IoU) across all classes was ~87%, and IoU across foreground classes only was ~85%. Endocardial left ventricular circumferential strain calculated from the proposed pipeline was significantly different in control and disease subjects (-25.3% vs -29.1%, p = 0.006), in agreement with the current clinical literature.Comment: Accepted to Functional Imaging and Modeling of the Heart (FIMH) 201

arXiv.org e-Print Archive

Oxford University Research Archive

ROAM: a Rich Object Appearance Model with Application to Rotoscoping

Author: Miksik Ondrej
Pérez Patrick
Pérez-Rúa Juan-Manuel
Torr Philip H. S.
Publication venue
Publication date: 05/12/2016
Field of study

Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines. While pixel-wise segmentation techniques can help for this task, professional rotoscoping tools rely on parametric curves that offer the artists a much better interactive control on the definition, editing and manipulation of the segments of interest. Sticking to this prevalent rotoscoping paradigm, we propose a novel framework to capture and track the visual aspect of an arbitrary object in a scene, given a first closed outline of this object. This model combines a collection of local foreground/background appearance models spread along the outline, a global appearance model of the enclosed object and a set of distinctive foreground landmarks. The structure of this rich appearance model allows simple initialization, efficient iterative optimization with exact minimization at each step, and on-line adaptation in videos. We demonstrate qualitatively and quantitatively the merit of this framework through comparisons with tools based on either dynamic segmentation with a closed curve or pixel-wise binary labelling

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Oxford University Research Archive