1,077 research outputs found
Pose Independent Face Recognition by Localizing Local Binary Patterns via Deformation Components
In this paper we address the problem of pose independent face recognition with a gallery set containing one frontal face image per enrolled subject while the probe set is composed by just a face image undergoing pose variations. The approach uses a set of aligned 3D models to learn deformation components using a 3D Morph able Model (3DMM). This further allows fitting a 3DMM efficiently on an image using a Ridge regression solution, regularized on the face space estimated via PCA. Then the approach describes each profile face by computing Local Binary Pattern (LBP) histograms localized on each deformed vertex, projected on a rendered frontal view. In the experimental result we evaluate the proposed method on the CMU Multi-PIE to assess face recognition algorithm across pose. We show how our process leads to higher performance than regular baselines reporting high recognition rate considering a range of facial poses in the probe set, up to ±45°. Finally we remark that our approach can handle continuous pose variations and it is comparable with recent state-of-the-art approaches
Do-It-Yourself Single Camera 3D Pointer Input Device
We present a new algorithm for single camera 3D reconstruction, or 3D input
for human-computer interfaces, based on precise tracking of an elongated
object, such as a pen, having a pattern of colored bands. To configure the
system, the user provides no more than one labelled image of a handmade
pointer, measurements of its colored bands, and the camera's pinhole projection
matrix. Other systems are of much higher cost and complexity, requiring
combinations of multiple cameras, stereocameras, and pointers with sensors and
lights. Instead of relying on information from multiple devices, we examine our
single view more closely, integrating geometric and appearance constraints to
robustly track the pointer in the presence of occlusion and distractor objects.
By probing objects of known geometry with the pointer, we demonstrate
acceptable accuracy of 3D localization.Comment: 8 pages, 6 figures, 2018 15th Conference on Computer and Robot Visio
Statistical Shape Spaces for 3D Data: A Review
International audienceMethods and systems for capturing 3D geometry are becoming increasingly commonplace–and with them a plethora of 3D data. Much of this data is unfortunately corrupted by noise, missing data, occlusions or other outliers. However, when we are interested in the shape of a particular class of objects, such as human faces or bodies, we can use machine learning techniques, applied to clean, registered databases of these shapes, to make sense of raw 3D point clouds or other data. This has applications ranging from virtual change rooms to motion and gait analysis to surgical planning depending on the type of shape. In this chapter, we give an overview of these techniques, a brief review of the literature, and comparative evaluation of two such shape spaces for human faces
Digital Image Access & Retrieval
The 33th Annual Clinic on Library Applications of Data Processing, held at the University of Illinois at Urbana-Champaign in March of 1996, addressed the theme of "Digital Image Access & Retrieval." The papers from this conference cover a wide range of topics concerning digital imaging technology for visual resource collections. Papers covered three general areas: (1) systems, planning, and implementation; (2) automatic and semi-automatic indexing; and (3) preservation with the bulk of the conference focusing on indexing and retrieval.published or submitted for publicatio
Higher-Order Topology, Monopole Nodal Lines, and the Origin of Large Fermi Arcs in Transition Metal Dichalcogenides XTe (X=Mo,W)
In recent years, transition metal dichalcogenides (TMDs) have garnered great
interest as topological materials -- monolayers of centrosymmetric
-phase TMDs have been identified as 2D topological insulators (TIs), and
bulk crystals of noncentrosymmetric -phase MoTe and WTe have
been identified as type-II Weyl semimetals. However, ARPES and STM probes of
these TMDs have revealed huge, "arc-like" surface states that overwhelm, and
are sometimes mistaken for, the much smaller topological surface Fermi arcs of
bulk type-II Weyl points. In this letter, we use first-principles calculations
and (nested) Wilson loops to analyze the bulk and surface electronic structure
of both - and -MoTe, finding that -MoTe
(-MoTe gapped with symmetry-preserving distortion) is an
inversion-symmetry-indicated -nontrivial (--) higher-order TI (HOTI) driven by double band
inversion. Both structural phases of MoTe exhibit the same surface features
as WTe, revealing that the large Fermi arcs are in fact not topologically
trivial, but are rather the characteristic split and gapped fourfold surface
states of a HOTI. We also show that, when the effects of SOC are neglected,
-MoTe is a nodal-line semimetal with -nontrivial
monopole nodal lines (MNLSM). This finding confirms that MNLSMs driven by
double band inversion are the weak-SOC limit of HOTIs, implying that MNLSMs are
higher-order topological with flat-band-like hinge states, which
we find to originate from the corner modes of 2D "fragile" TIs.Comment: Final version, 5 pg main text + 18 pg supplement, 4 + 6 figures,
abstract abridged for arXiv posting - see paper for full abstrac
An improved multi-dimensional CMAC neural network: Receptive field function and placement
The standard CMAC has been shown to have fast learning computation as a result of modular receptive field placement, rectangular receptive field shape and a simple weight adaptation algorithm. The standard CMAC, however, suffers from slow convergence at some critical frequency due to the rectangular receptive field shape. A linearly-tapered field, which requires a uniform placement, was used in this research. The receptive field placement of the standard CMAC becomes less uniform locally for a larger receptive field width. This dissertation suggests a new field placement which is more uniform without extra computation. Results show that the slow convergence at the critical frequency is eliminated, and the interaction of the linearly-tapered field with the new placement achieves more accurate function approximation. A theoretical bound on the receptive field width as a function of the input dimension is proposed if a uniform placement is to be achieved. Also, a procedure for adapting receptive field density to minimize the weight usage for a given approximation accuracy is suggested
Developing advanced mathematical models for detecting abnormalities in 2D/3D medical structures.
Detecting abnormalities in two-dimensional (2D) and three-dimensional (3D) medical structures is among the most interesting and challenging research areas in the medical imaging field. Obtaining the desired accurate automated quantification of abnormalities in medical structures is still very challenging. This is due to a large and constantly growing number of different objects of interest and associated abnormalities, large variations of their appearances and shapes in images, different medical imaging modalities, and associated changes of signal homogeneity and noise for each object. The main objective of this dissertation is to address these problems and to provide proper mathematical models and techniques that are capable of analyzing low and high resolution medical data and providing an accurate, automated analysis of the abnormalities in medical structures in terms of their area/volume, shape, and associated abnormal functionality. This dissertation presents different preliminary mathematical models and techniques that are applied in three case studies: (i) detecting abnormal tissue in the left ventricle (LV) wall of the heart from delayed contrast-enhanced cardiac magnetic resonance images (MRI), (ii) detecting local cardiac diseases based on estimating the functional strain metric from cardiac cine MRI, and (iii) identifying the abnormalities in the corpus callosum (CC) brain structure—the largest fiber bundle that connects the two hemispheres in the brain—for subjects that suffer from developmental brain disorders. For detecting the abnormal tissue in the heart, a graph-cut mathematical optimization model with a cost function that accounts for the object’s visual appearance and shape is used to segment the the inner cavity. The model is further integrated with a geometric model (i.e., a fast marching level set model) to segment the outer border of the myocardial wall (the LV). Then the abnormal tissue in the myocardium wall (also called dead tissue, pathological tissue, or infarct area) is identified based on a joint Markov-Gibbs random field (MGRF) model of the image and its region (segmentation) map that accounts for the pixel intensities and the spatial interactions between the pixels. Experiments with real in-vivo data and comparative results with ground truth (identified by a radiologist) and other approaches showed that the proposed framework can accurately detect the pathological tissue and can provide useful metrics for radiologists and clinicians. To estimate the strain from cardiac cine MRI, a novel method based on tracking the LV wall geometry is proposed. To achieve this goal, a partial differential equation (PDE) method is applied to track the LV wall points by solving the Laplace equation between the LV contours of each two successive image frames over the cardiac cycle. The main advantage of the proposed tracking method over traditional texture-based methods is its ability to track the movement and rotation of the LV wall based on tracking the geometric features of the inner, mid-, and outer walls of the LV. This overcomes noise sources that come from scanner and heart motion. To identify the abnormalities in the CC from brain MRI, the CCs are aligned using a rigid registration model and are segmented using a shape-appearance model. Then, they are mapped to a simple unified space for analysis. This work introduces a novel cylindrical mapping model, which is conformal (i.e., one to one transformation and bijective), that enables accurate 3D shape analysis of the CC in the cylindrical domain. The framework can detect abnormalities in all divisions of the CC (i.e., splenium, rostrum, genu and body). In addition, it offers a whole 3D analysis of the CC abnormalities instead of only area-based analysis as done by previous groups. The initial classification results based on the centerline length and CC thickness suggest that the proposed CC shape analysis is a promising supplement to the current techniques for diagnosing dyslexia. The proposed techniques in this dissertation have been successfully tested on complex synthetic and MR images and can be used to advantage in many of today’s clinical applications of computer-assisted medical diagnostics and intervention
- …