Search CORE

725 research outputs found

Face Detection with Effective Feature Extraction

Author: C. Huang
D.G. Lowe
H. Bay
J. Friedman
J. Wu
P. Viola
S. Avidan
S.Z. Li
T. Mita
T. Ojala
Y. Freund
Publication venue
Publication date: 01/01/2010
Field of study

There is an abundant literature on face detection due to its important role in many vision applications. Since Viola and Jones proposed the first real-time AdaBoost based face detector, Haar-like features have been adopted as the method of choice for frontal face detection. In this work, we show that simple features other than Haar-like features can also be applied for training an effective face detector. Since, single feature is not discriminative enough to separate faces from difficult non-faces, we further improve the generalization performance of our simple features by introducing feature co-occurrences. We demonstrate that our proposed features yield a performance improvement compared to Haar-like features. In addition, our findings indicate that features play a crucial role in the ability of the system to generalize.Comment: 7 pages. Conference version published in Asian Conf. Comp. Vision 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Adelaide Research & Scholarship

OPUS - University of Technology Sydney

p-norms of histogram of oriented gradients for X-ray images

Author: Hamada Nuha H.
Kharbat Faten F.
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/10/2021
Field of study

Lebesgue spaces (Lp over Rn) play a significant role in mathematical analysis. They are widely used in machine learning and artificial intelligence to maximize performance or minimize error. The well-known histogram of oriented gradients (HOG) algorithm applies the 2-norm (Euclidean distance) to detect features in images. In this paper, we apply different p-norm values to identify the impact that changing these norms has on the original algorithm. The aim of this modification is to achieve better performance in classifying X-ray medical images related to of COVID-19 patients. The efficiency of the p-HOG algorithm is compared with the original HOG descriptor using a support vector machine implemented in Python. The results of the comparisons are promising, and the p-HOG algorithm shows greater efficiency in most cases

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Institute of Advanced Engineering and Science

Image Reconstruction from Bag-of-Visual-Words

Author: Harada Tatsuya
Kato Hiroharu
Publication venue
Publication date: 19/05/2015
Field of study

The objective of this work is to reconstruct an original image from Bag-of-Visual-Words (BoVW). Image reconstruction from features can be a means of identifying the characteristics of features. Additionally, it enables us to generate novel images via features. Although BoVW is the de facto standard feature for image recognition and retrieval, successful image reconstruction from BoVW has not been reported yet. What complicates this task is that BoVW lacks the spatial information for including visual words. As described in this paper, to estimate an original arrangement, we propose an evaluation function that incorporates the naturalness of local adjacency and the global position, with a method to obtain related parameters using an external image database. To evaluate the performance of our method, we reconstruct images of objects of 101 kinds. Additionally, we apply our method to analyze object classifiers and to generate novel images via BoVW

arXiv.org e-Print Archive

Crossref

Real-time food intake classification and energy expenditure estimation on a mobile device

Author: Lo B
Ravi D
Yang G
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/04/2015
Field of study

© 2015 IEEE.Assessment of food intake has a wide range of applications in public health and life-style related chronic disease management. In this paper, we propose a real-time food recognition platform combined with daily activity and energy expenditure estimation. In the proposed method, food recognition is based on hierarchical classification using multiple visual cues, supported by efficient software implementation suitable for realtime mobile device execution. A Fischer Vector representation together with a set of linear classifiers are used to categorize food intake. Daily energy expenditure estimation is achieved by using the built-in inertial motion sensors of the mobile device. The performance of the vision-based food recognition algorithm is compared to the current state-of-the-art, showing improved accuracy and high computational efficiency suitable for realtime feedback. Detailed user studies have also been performed to demonstrate the practical value of the software environment

Crossref

Spiral - Imperial College Digital Repository