Search CORE

647 research outputs found

Discriminative Appearance Models for Face Alignment

Author: Gao Hua
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2013
Field of study

The proposed face alignment algorithm uses local gradient features as the appearance representation. These features are obtained by pixel value comparison, which provide robustness against changes in illumination, as well as partial occlusion and local deformation due to the locality. The adopted features are modeled in three discriminative methods, which correspond to different alignment cost functions. The discriminative appearance modeling alleviate the generalization problem to some extent

KITopen

Data-Driven Shape Analysis and Processing

Author: Huang Qixing
Kalogerakis Evangelos
Kim Vladimir G.
Xu Kai
Publication venue
Publication date: 23/02/2015
Field of study

Data-driven methods play an increasingly important role in discovering geometric, structural, and semantic relationships between 3D shapes in collections, and applying this analysis to support intelligent modeling, editing, and visualization of geometric data. In contrast to traditional approaches, a key feature of data-driven approaches is that they aggregate information from a collection of shapes to improve the analysis and processing of individual shapes. In addition, they are able to learn models that reason about properties and relationships of shapes without relying on hard-coded rules or explicitly programmed instructions. We provide an overview of the main concepts and components of these techniques, and discuss their application to shape classification, segmentation, matching, reconstruction, modeling and exploration, as well as scene analysis and synthesis, through reviewing the literature and relating the existing works with both qualitative and numerical comparisons. We conclude our report with ideas that can inspire future research in data-driven shape analysis and processing.Comment: 10 pages, 19 figure

arXiv.org e-Print Archive

CiteSeerX

Deep Learning Beyond Traditional Supervision

Author: Chen Shixing
Publication venue: DigitalCommons@WayneState
Publication date: 01/01/2019
Field of study

With the rapid development of innovative models and huge success on various applications, the field of deep learning has attracted enormous attention in computer vision, machine learning, and artificial intelligence. Countless researches have validated the superior performance and unprecedented extensiveness of deep learning models, especially with the advantages of high performance computing by GPUs and parallel computation. Nonetheless, drawbacks including strong dependency on supervision (sufficient labeled data) and monotonous usage of categorized labels are negatively interfering the advancement of deep learning. In this dissertation, we plan to expose and exploit some possibilities of deep learning without using data and labels in the traditional supervision way. Specifically, we propose a pipeline to fulfill this process in a three-step manner: ranking instead of classification and regression, transfer leaning including domain adaptation, and finally data synthesis without supervised labels. First, we propose a novel ranking-based Convolutional Neural Network architecture. It can take advantage of both ranking algorithms and features learned with CNN models. Specifically, instead of using labels in classification or regression, it can take ordinal information into consideration. Meanwhile, features learned in CNN-based models can significantly outperform engineered features to achieve superior performance. Then, we propose a transfer learning framework which can also fulfill the functions of knowledge distillation and domain adaptation. In this step, we propose to solve the problem when inadequate or even no labels are available for a target domain by taking advantage of a source domain. Furthermore, our approach can utilize the information across platform and architecture as long as a forward pass of the source network is obtainable. Last, we propose an efficient and scalable model for cross-dataset one-shot person re-identification tasks. In this case, we address the problem to determine the relationship for a pair of query and gallery images from different camera styles. We adopt the concept from style transfer together with adversarial training to boost the performance and improve the robustness

Digital Commons@Wayne State University