2,151 research outputs found
CNN Based Posture-Free Hand Detection
Although many studies suggest high performance hand detection methods, those
methods are likely to be overfitting. Fortunately, the Convolution Neural
Network (CNN) based approach provides a better way that is less sensitive to
translation and hand poses. However the CNN approach is complex and can
increase computational time, which at the end reduce its effectiveness on a
system where the speed is essential.In this study we propose a shallow CNN
network which is fast, and insensitive to translation and hand poses. It is
tested on two different domains of hand datasets, and performs in relatively
comparable performance and faster than the other state-of-the-art hand
CNN-based hand detection method. Our evaluation shows that the proposed shallow
CNN network performs at 93.9% accuracy and reaches much faster speed than its
competitors.Comment: 4 pages, 5 figures, in The 10th International Conference on
Information Technology and Electrical Engineering 2018, ISBN:
978-1-5386-4739-
Looking at faces in the wild
Recent advances in the face detection (FD) and recognition (FR) technology may give an impression that the problem of face matching is essentially solved, e.g. via deep learning models using thousands of samples per face for training and validation on the available benchmark data-sets. Human vision system seems to handle face localization and matching problem differently from the modern FR systems, since humans detect faces instantly even in most cluttered environments, and often require a single view of a face to reliably distinguish it from all others. This prompted us to take a biologically inspired look at building a cognitive architecture that uses artificial neural nets at the face detection stage and adapts a single image per person (SIPP) approach for face image matching
Efficient refinements on YOLOv3 for real-time detection and assessment of diabetic foot Wagner grades
Currently, the screening of Wagner grades of diabetic feet (DF) still relies
on professional podiatrists. However, in less-developed countries, podiatrists
are scarce, which led to the majority of undiagnosed patients. In this study,
we proposed the real-time detection and location method for Wagner grades of DF
based on refinements on YOLOv3. We collected 2,688 data samples and implemented
several methods, such as a visual coherent image mixup, label smoothing, and
training scheduler revamping, based on the ablation study. The experimental
results suggested that the refinements on YOLOv3 achieved an accuracy of 91.95%
and the inference speed of a single picture reaches 31ms with the NVIDIA Tesla
V100. To test the performance of the model on a smartphone, we deployed the
refinements on YOLOv3 models on an Android 9 system smartphone. This work has
the potential to lead to a paradigm shift for clinical treatment of the DF in
the future, to provide an effective healthcare solution for DF tissue analysis
and healing status.Comment: 11 pages with 11 figure
A Deep Pyramid Deformable Part Model for Face Detection
We present a face detection algorithm based on Deformable Part Models and
deep pyramidal features. The proposed method called DP2MFD is able to detect
faces of various sizes and poses in unconstrained conditions. It reduces the
gap in training and testing of DPM on deep features by adding a normalization
layer to the deep convolutional neural network (CNN). Extensive experiments on
four publicly available unconstrained face detection datasets show that our
method is able to capture the meaningful structure of faces and performs
significantly better than many competitive face detection algorithms
Tiny hand gesture recognition without localization via a deep convolutional network
Visual hand-gesture recognition is being increasingly desired for human-computer interaction interfaces. In many applications, hands only occupy about 10% of the image, whereas the most of it contains background, human face, and human body. Spatial localization of the hands in such scenarios could be a challenging task and ground truth bounding boxes need to be provided for training, which is usually not accessible. However, the location of the hand is not a requirement when the criteria is just the recognition of a gesture to command a consumer electronics device, such as mobiles phones and TVs. In this paper, a deep convolutional neural network is proposed to directly classify hand gestures in images without any segmentation or detection stage that could discard the irrelevant not-hand areas. The designed hand-gesture recognition network can classify seven sorts of hand gestures in a user-independent manner and on real time, achieving an accuracy of 97.1% in the dataset with simple backgrounds and 85.3% in the dataset with complex backgrounds
- …