Search CORE

12,162 research outputs found

Deep Feature-based Face Detection on Mobile Devices

Author: Chellappa Rama
Patel Vishal M.
Sarkar Sayantan
Publication venue
Publication date: 15/02/2016
Field of study

We propose a deep feature-based face detector for mobile devices to detect user's face acquired by the front facing camera. The proposed method is able to detect faces in images containing extreme pose and illumination variations as well as partial faces. The main challenge in developing deep feature-based algorithms for mobile devices is the constrained nature of the mobile platform and the non-availability of CUDA enabled GPUs on such devices. Our implementation takes into account the special nature of the images captured by the front-facing camera of mobile devices and exploits the GPUs present in mobile devices without CUDA-based frameorks, to meet these challenges.Comment: ISBA 201

arXiv.org e-Print Archive

Crossref

MobiFace: A Novel Dataset for Mobile Face Tracking in the Wild

Author: Cheng Shiyang
Lin Yiming
Pantic Maja
Shen Jie
Publication venue
Publication date: 22/01/2019
Field of study

Face tracking serves as the crucial initial step in mobile applications trying to analyse target faces over time in mobile settings. However, this problem has received little attention, mainly due to the scarcity of dedicated face tracking benchmarks. In this work, we introduce MobiFace, the first dataset for single face tracking in mobile situations. It consists of 80 unedited live-streaming mobile videos captured by 70 different smartphone users in fully unconstrained environments. Over

95K

bounding boxes are manually labelled. The videos are carefully selected to cover typical smartphone usage. The videos are also annotated with 14 attributes, including 6 newly proposed attributes and 8 commonly seen in object tracking. 36 state-of-the-art trackers, including facial landmark trackers, generic object trackers and trackers that we have fine-tuned or improved, are evaluated. The results suggest that mobile face tracking cannot be solved through existing approaches. In addition, we show that fine-tuning on the MobiFace training data significantly boosts the performance of deep learning-based trackers, suggesting that MobiFace captures the unique characteristics of mobile face tracking. Our goal is to offer the community a diverse dataset to enable the design and evaluation of mobile face trackers. The dataset, annotations and the evaluation server will be on \url{https://mobiface.github.io/}.Comment: To appear on The 14th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2019

arXiv.org e-Print Archive

Crossref

Recommended from our members

Coal mine low power laser methane detection and alarm instrument

Author: Grattan K. T. V.
Hu J.
Jin G.
Liu T.
Meng H.
Sun T.
Wei Y.
Zhang H.
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 01/01/2019
Field of study

At present, the portable carrier catalytic methane detection and alarm instrument for coal mine generally has many problems, such as high power consumption, short standby time, low detection accuracy, few parameters and single function, which can not meet the rapid development needs of mine safety. In this paper, a low power portable laser methane detection and alarm instrument based on tunable laser absorption spectroscopy (TDLAS) is designed. The instrument can detect methane concentration, ambient temperature and ambient pressure at the same time. It has the functions of sound and light alarm, historical data storage and query, and integrates Wi-Fi to realize data wireless transmission. The instrument can work continuously for 36 hours, and the response time is less than 15 seconds. It has the function of self-diagnosis. The overall performance of the instrument has been greatly improved compared with the traditional mine methane portable instrument. A mobile methane alarm Internet of things(IOT) system for coal mine based on portable instrument has been developed, which realizes real-time upload of data and cloud analysis, makes the traditional mine gas monitoring and control system powerfully supplemented, greatly improves the detection level of coal mine gas, and has broad application prospects

City Research Online

Crossref

Fast Deep Matting for Portrait Animation on Mobile Phone

Author: Cho Donghyeon
Gastal Eduardo SL
He Kaiming
Huang Gao
Jégou Simon
Paszke Adam
Qin Hongwei
Redmon Joseph
Shen Xiaoyong
Szegedy Christian
Publication venue
Publication date: 26/07/2017
Field of study

Image matting plays an important role in image and video editing. However, the formulation of image matting is inherently ill-posed. Traditional methods usually employ interaction to deal with the image matting problem with trimaps and strokes, and cannot run on the mobile phone in real-time. In this paper, we propose a real-time automatic deep matting approach for mobile devices. By leveraging the densely connected blocks and the dilated convolution, a light full convolutional network is designed to predict a coarse binary mask for portrait images. And a feathering block, which is edge-preserving and matting adaptive, is further developed to learn the guided filter and transform the binary mask into alpha matte. Finally, an automatic portrait animation system based on fast deep matting is built on mobile devices, which does not need any interaction and can realize real-time matting with 15 fps. The experiments show that the proposed approach achieves comparable results with the state-of-the-art matting solvers.Comment: ACM Multimedia Conference (MM) 2017 camera-read

arXiv.org e-Print Archive

Crossref

GazeTouchPIN: Protecting Sensitive Data on Mobile Devices Using Secure Multimodal Authentication

Author: Alt Florian
Bulling Andreas
Hassib Mariam
Khamis Mohamed
Zezschwitz Emanuel von
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

Although mobile devices provide access to a plethora of sensitive data, most users still only protect them with PINs or patterns, which are vulnerable to side-channel attacks (e.g., shoulder surfing). How-ever, prior research has shown that privacy-aware users are willing to take further steps to protect their private data. We propose GazeTouchPIN, a novel secure authentication scheme for mobile devices that combines gaze and touch input. Our multimodal approach complicates shoulder-surfing attacks by requiring attackers to ob-serve the screen as well as the user’s eyes to and the password. We evaluate the security and usability of GazeTouchPIN in two user studies (N=30). We found that while GazeTouchPIN requires longer entry times, privacy aware users would use it on-demand when feeling observed or when accessing sensitive data. The results show that successful shoulder surfing attack rate drops from 68% to 10.4%when using GazeTouchPIN

Driver Distraction Identification with an Ensemble of Convolutional Neural Networks

Author: Abouelnaga Yehya
Eraqi Hesham M.
Moustafa Mohamed N.
Saad Mohamed H.
Publication venue
Publication date: 01/01/2019
Field of study

The World Health Organization (WHO) reported 1.25 million deaths yearly due to road traffic accidents worldwide and the number has been continuously increasing over the last few years. Nearly fifth of these accidents are caused by distracted drivers. Existing work of distracted driver detection is concerned with a small set of distractions (mostly, cell phone usage). Unreliable ad-hoc methods are often used.In this paper, we present the first publicly available dataset for driver distraction identification with more distraction postures than existing alternatives. In addition, we propose a reliable deep learning-based solution that achieves a 90% accuracy. The system consists of a genetically-weighted ensemble of convolutional neural networks, we show that a weighted ensemble of classifiers using a genetic algorithm yields in a better classification confidence. We also study the effect of different visual elements in distraction detection by means of face and hand localizations, and skin segmentation. Finally, we present a thinned version of our ensemble that could achieve 84.64% classification accuracy and operate in a real-time environment.Comment: arXiv admin note: substantial text overlap with arXiv:1706.0949

arXiv.org e-Print Archive

Directory of Open Access Journals