Search CORE

115,215 research outputs found

Rotation-invariant features for multi-oriented text detection in natural images.

Author: Bai Xiang
Liu Wenyu
Ma Yi
Tu Zhuowen
Yao Cong
Zhang Xin
Publication venue: eScholarship, University of California
Publication date: 01/01/2013
Field of study

Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

UA-DETRAC: A New Benchmark and Protocol for Multi-Object Detection and Tracking

Author: Cai Zhaowei
Chang Ming-Ching
Du Dawei
Lei Zhen
Lim Jongwoo
Lyu Siwei
Qi Honggang
Wen Longyin
Yang Ming-Hsuan
Publication venue
Publication date: 24/01/2020
Field of study

In recent years, numerous effective multi-object tracking (MOT) methods are developed because of the wide range of applications. Existing performance evaluations of MOT methods usually separate the object tracking step from the object detection step by using the same fixed object detection results for comparisons. In this work, we perform a comprehensive quantitative study on the effects of object detection accuracy to the overall MOT performance, using the new large-scale University at Albany DETection and tRACking (UA-DETRAC) benchmark dataset. The UA-DETRAC benchmark dataset consists of 100 challenging video sequences captured from real-world traffic scenes (over 140,000 frames with rich annotations, including occlusion, weather, vehicle category, truncation, and vehicle bounding boxes) for object detection, object tracking and MOT system. We evaluate complete MOT systems constructed from combinations of state-of-the-art object detection and object tracking methods. Our analysis shows the complex effects of object detection accuracy on MOT system performance. Based on these observations, we propose new evaluation tools and metrics for MOT systems that consider both object detection and object tracking for comprehensive analysis.Comment: 18 pages, 11 figures, accepted by CVI

arXiv.org e-Print Archive

eScholarship - University of California

Unconstrained Face Detection and Open-Set Face Recognition Challenge

Author: Beyerer Jürgen
Boult Terrance E.
Chan Chi Ho
Dhamija Akshay Raj
Guo Guodong
Günther Manuel
Herrmann Christian
Hu Peiyun
Jazaery Mohamad Al
Jiang Min
Kittler Josef
Nouyed Mohammad Iqbal
Ramanan Deva
Stankiewicz Cezary
Yang Shufan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

Face detection and recognition benchmarks have shifted toward more difficult environments. The challenge presented in this paper addresses the next step in the direction of automatic detection and identification of people from outdoor surveillance cameras. While face detection has shown remarkable success in images collected from the web, surveillance cameras include more diverse occlusions, poses, weather conditions and image blur. Although face verification or closed-set face identification have surpassed human capabilities on some datasets, open-set identification is much more complex as it needs to reject both unknown identities and false accepts from the face detector. We show that unconstrained face detection can approach high detection rates albeit with moderate false accept rates. By contrast, open-set face recognition is currently weak and requires much more attention.Comment: This is an ERRATA version of the paper originally presented at the International Joint Conference on Biometrics. Due to a bug in our evaluation code, the results of the participants changed. The final conclusion, however, is still the sam

arXiv.org e-Print Archive

Enlighten

Improvements on coronal hole detection in SDO/AIA images using supervised classification

Author: Ahammer Helmut
De Visscher Ruben
Delouille Véronique
Hofmeister Stefan J.
Mampaey Benjamin
Reiss Martin A.
Temmer Manuela
Veronig Astrid M.
Publication venue
Publication date: 22/06/2015
Field of study

We demonstrate the use of machine learning algorithms in combination with segmentation techniques in order to distinguish coronal holes and filaments in SDO/AIA EUV images of the Sun. Based on two coronal hole detection techniques (intensity-based thresholding, SPoCA), we prepared data sets of manually labeled coronal hole and filament channel regions present on the Sun during the time range 2011 - 2013. By mapping the extracted regions from EUV observations onto HMI line-of-sight magnetograms we also include their magnetic characteristics. We computed shape measures from the segmented binary maps as well as first order and second order texture statistics from the segmented regions in the EUV images and magnetograms. These attributes were used for data mining investigations to identify the most performant rule to differentiate between coronal holes and filament channels. We applied several classifiers, namely Support Vector Machine, Linear Support Vector Machine, Decision Tree, and Random Forest and found that all classification rules achieve good results in general, with linear SVM providing the best performances (with a true skill statistic of ~0.90). Additional information from magnetic field data systematically improves the performance across all four classifiers for the SPoCA detection. Since the calculation is inexpensive in computing time, this approach is well suited for applications on real-time data. This study demonstrates how a machine learning approach may help improve upon an unsupervised feature extraction method.Comment: in press for SWS

arXiv.org e-Print Archive

EDP Sciences OAI-PMH repository (1.2.0)

Statistical and Electrical Features Evaluation for Electrical Appliances Energy Disaggregation

Author: Harell
Iosif Mporas
Pascal Schirmer
Zhang
Publication venue: 'MDPI AG'
Publication date: 11/06/2019
Field of study

In this paper we evaluate several well-known and widely used machine learning algorithms for regression in the energy disaggregation task. Specifically, the Non-Intrusive Load Monitoring approach was considered and the K-Nearest-Neighbours, Support Vector Machines, Deep Neural Networks and Random Forest algorithms were evaluated across five datasets using seven different sets of statistical and electrical features. The experimental results demonstrated the importance of selecting both appropriate features and regression algorithms. Analysis on device level showed that linear devices can be disaggregated using statistical features, while for non-linear devices the use of electrical features significantly improves the disaggregation accuracy, as non-linear appliances have non-sinusoidal current draw and thus cannot be well parametrized only by their active power consumption. The best performance in terms of energy disaggregation accuracy was achieved by the Random Forest regression algorithm.Peer reviewedFinal Published versio

Multidisciplinary Digital Publishing Institute

Crossref

University of Hertfordshire Research Archive