42,456 research outputs found
Towards Addressing Key Visual Processing Challenges in Social Media Computing
abstract: Visual processing in social media platforms is a key step in gathering and understanding information in the era of Internet and big data. Online data is rich in content, but its processing faces many challenges including: varying scales for objects of interest, unreliable and/or missing labels, the inadequacy of single modal data and difficulty in analyzing high dimensional data. Towards facilitating the processing and understanding of online data, this dissertation primarily focuses on three challenges that I feel are of great practical importance: handling scale differences in computer vision tasks, such as facial component detection and face retrieval, developing efficient classifiers using partially labeled data and noisy data, and employing multi-modal models and feature selection to improve multi-view data analysis. For the first challenge, I propose a scale-insensitive algorithm to expedite and accurately detect facial landmarks. For the second challenge, I propose two algorithms that can be used to learn from partially labeled data and noisy data respectively. For the third challenge, I propose a new framework that incorporates feature selection modules into LDA models.Dissertation/ThesisDoctoral Dissertation Computer Science 201
Integrated Face Analytics Networks through Cross-Dataset Hybrid Training
Face analytics benefits many multimedia applications. It consists of a number
of tasks, such as facial emotion recognition and face parsing, and most
existing approaches generally treat these tasks independently, which limits
their deployment in real scenarios. In this paper we propose an integrated Face
Analytics Network (iFAN), which is able to perform multiple tasks jointly for
face analytics with a novel carefully designed network architecture to fully
facilitate the informative interaction among different tasks. The proposed
integrated network explicitly models the interactions between tasks so that the
correlations between tasks can be fully exploited for performance boost. In
addition, to solve the bottleneck of the absence of datasets with comprehensive
training data for various tasks, we propose a novel cross-dataset hybrid
training strategy. It allows "plug-in and play" of multiple datasets annotated
for different tasks without the requirement of a fully labeled common dataset
for all the tasks. We experimentally show that the proposed iFAN achieves
state-of-the-art performance on multiple face analytics tasks using a single
integrated model. Specifically, iFAN achieves an overall F-score of 91.15% on
the Helen dataset for face parsing, a normalized mean error of 5.81% on the
MTFL dataset for facial landmark localization and an accuracy of 45.73% on the
BNU dataset for emotion recognition with a single model.Comment: 10 page
An Improved Fatigue Detection System Based on Behavioral Characteristics of Driver
In recent years, road accidents have increased significantly. One of the
major reasons for these accidents, as reported is driver fatigue. Due to
continuous and longtime driving, the driver gets exhausted and drowsy which may
lead to an accident. Therefore, there is a need for a system to measure the
fatigue level of driver and alert him when he/she feels drowsy to avoid
accidents. Thus, we propose a system which comprises of a camera installed on
the car dashboard. The camera detect the driver's face and observe the
alteration in its facial features and uses these features to observe the
fatigue level. Facial features include eyes and mouth. Principle Component
Analysis is thus implemented to reduce the features while minimizing the amount
of information lost. The parameters thus obtained are processed through Support
Vector Classifier for classifying the fatigue level. After that classifier
output is sent to the alert unit.Comment: 4 pages, 2 figures, edited version of published paper in IEEE ICITE
201
- …