Search CORE

5,112 research outputs found

Offline signature verification using classifier combination of HOG and LBP features

Author: Kholmatov Alisher Anatolyevich
Tirkaz Caglar
Tırkaz Çağlar
Yanikoglu Berrin
Yanıkoğlu Berrin
Yılmaz Mustafa Berkay
Yilmaz Mustafa Berkay
Publication venue: 'Indiana University Press (Project Muse)'
Publication date: 01/07/2011
Field of study

We present an offline signature verification system based on a signature’s local histogram features. The signature is divided into zones using both the Cartesian and polar coordinate systems and two different histogram features are calculated for each zone: histogram of oriented gradients (HOG) and histogram of local binary patterns (LBP). The classification is performed using Support Vector Machines (SVMs), where two different approaches for training are investigated, namely global and user-dependent SVMs. User-dependent SVMs, trained separately for each user, learn to differentiate a user’s signature from others, whereas a single global SVM trained with difference vectors of query and reference signatures’ features of all users, learns how to weight dissimilarities. The global SVM classifier is trained using genuine and forgery signatures of subjects that are excluded from the test set, while userdependent SVMs are separately trained for each subject using genuine and random forgeries. The fusion of all classifiers (global and user-dependent classifiers trained with each feature type), achieves a 15.41% equal error rate in skilled forgery test, in the GPDS-160 signature database without using any skilled forgeries in training

Sabanci University Research Database

Supervised semantic labeling of places using information extracted from sensor data

Author: Althaus
Axel Rottmann
Chakrabarti
Choset
Friedman
Gonzalez
Haralick
Howard
Koenig
Kuipers
Moravec
O’Rourke
Patric Jensfelt
Rosenfeld
Rosenfeld
Rottmann
Rudolph Triebel
Russ
Schapire
Thrun
Thrun
Wolfram Burgard
Yamamoto
Óscar Martínez Mozos
Publication venue: 'Elsevier BV'
Publication date: 01/05/2007
Field of study

Indoor environments can typically be divided into places with different functionalities like corridors, rooms or doorways. The ability to learn such semantic categories from sensor data enables a mobile robot to extend the representation of the environment facilitating interaction with humans. As an example, natural language terms like “corridor” or “room” can be used to communicate the position of the robot in a map in a more intuitive way. In this work, we first propose an approach based on supervised learning to classify the pose of a mobile robot into semantic classes. Our method uses AdaBoost to boost simple features extracted from sensor range data into a strong classifier. We present two main applications of this approach. Firstly, we show how our approach can be utilized by a moving robot for an online classification of the poses traversed along its path using a hidden Markov model. In this case we additionally use as features objects extracted from images. Secondly, we introduce an approach to learn topological maps from geometric maps by applying our semantic classification procedure in combination with a probabilistic relaxation method. Alternatively, we apply associative Markov networks to classify geometric maps and compare the results with a relaxation approach. Experimental results obtained in simulation and with real robots demonstrate the effectiveness of our approach in various indoor environments

University of Lincoln Institutional Repository

Crossref

Fast and Accurate Algorithm for Eye Localization for Gaze Tracking in Low Resolution Images

Author: Anjith George
Aurobinda Routray
Bradski G.
Burrus C.S.S.
Cristinacce D.
Lewis J.P.
Tomasi C.
Young D.
Zhang X.
Zhu Z.
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 17/05/2016
Field of study

Iris centre localization in low-resolution visible images is a challenging problem in computer vision community due to noise, shadows, occlusions, pose variations, eye blinks, etc. This paper proposes an efficient method for determining iris centre in low-resolution images in the visible spectrum. Even low-cost consumer-grade webcams can be used for gaze tracking without any additional hardware. A two-stage algorithm is proposed for iris centre localization. The proposed method uses geometrical characteristics of the eye. In the first stage, a fast convolution based approach is used for obtaining the coarse location of iris centre (IC). The IC location is further refined in the second stage using boundary tracing and ellipse fitting. The algorithm has been evaluated in public databases like BioID, Gi4E and is found to outperform the state of the art methods.Comment: 12 pages, 10 figures, IET Computer Vision, 201

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

DROW: Real-Time Deep Learning based Wheelchair Detection in 2D Range Data

Author: Beyer Lucas
Hermans Alexander
Leibe Bastian
Publication venue
Publication date: 05/12/2016
Field of study

We introduce the DROW detector, a deep learning based detector for 2D range data. Laser scanners are lighting invariant, provide accurate range data, and typically cover a large field of view, making them interesting sensors for robotics applications. So far, research on detection in laser range data has been dominated by hand-crafted features and boosted classifiers, potentially losing performance due to suboptimal design choices. We propose a Convolutional Neural Network (CNN) based detector for this task. We show how to effectively apply CNNs for detection in 2D range data, and propose a depth preprocessing step and voting scheme that significantly improve CNN performance. We demonstrate our approach on wheelchairs and walkers, obtaining state of the art detection results. Apart from the training data, none of our design choices limits the detector to these two classes, though. We provide a ROS node for our detector and release our dataset containing 464k laser scans, out of which 24k were annotated.Comment: Lucas Beyer and Alexander Hermans contributed equall

arXiv.org e-Print Archive

Publikationsserver der RWTH Aachen University

StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection

Author: Hwang Soonmin
Kweon In So
Woo Sanghyun
Publication venue
Publication date: 18/09/2017
Field of study

One-stage object detectors such as SSD or YOLO already have shown promising accuracy with small memory footprint and fast speed. However, it is widely recognized that one-stage detectors have difficulty in detecting small objects while they are competitive with two-stage methods on large objects. In this paper, we investigate how to alleviate this problem starting from the SSD framework. Due to their pyramidal design, the lower layer that is responsible for small objects lacks strong semantics(e.g contextual information). We address this problem by introducing a feature combining module that spreads out the strong semantics in a top-down manner. Our final model StairNet detector unifies the multi-scale representations and semantic distribution effectively. Experiments on PASCAL VOC 2007 and PASCAL VOC 2012 datasets demonstrate that StairNet significantly improves the weakness of SSD and outperforms the other state-of-the-art one-stage detectors

arXiv.org e-Print Archive

Crossref

COST292 experimental framework for TRECVID 2008

Author: Aginako N.
Alatan A.
Alexandre L. A.
Avrithis Y.
Benois-Pineau J.
Chandramouli K.
Corvaglia M.
Damnjanovic U.
Dimou A.
Esen E.
Fatemi N.
Goya J.
Guerrini F.
Hanjalic A.
Jarina R.
Kapsalas P.
King P.
Kompatsiaris I.
Makris L.
Mansencal B.
Mezaris V.
Migliorati P.
Moumtzidou A.
Mylonas Ph.
Naci U.
Nikolopoulos S.
Paralic M.
Piatrik T.
Pinheiro A. M. G.
Poulin F.
Raileanu L.
Saracoglu A.
Spyrou E.
Tolias G.
Vrochidis S.
Zhang Q.
Publication venue: 'University of Aden - Faculty of Economics and Administration'
Publication date: 01/01/2008
Field of study

In this paper, we give an overview of the four tasks submitted to TRECVID 2008 by COST292. The high-level feature extraction framework comprises four systems. The first system transforms a set of low-level descriptors into the semantic space using Latent Semantic Analysis and utilises neural networks for feature detection. The second system uses a multi-modal classifier based on SVMs and several descriptors. The third system uses three image classifiers based on ant colony optimisation, particle swarm optimisation and a multi-objective learning algorithm. The fourth system uses a Gaussian model for singing detection and a person detection algorithm. The search task is based on an interactive retrieval application combining retrieval functionalities in various modalities with a user interface supporting automatic and interactive search over all queries submitted. The rushes task submission is based on a spectral clustering approach for removing similar scenes based on eigenvalues of frame similarity matrix and and a redundancy removal strategy which depends on semantic features extraction such as camera motion and faces. Finally, the submission to the copy detection task is conducted by two different systems. The first system consists of a video module and an audio module. The second system is based on mid-level features that are related to the temporal structure of videos

Archivio istituzionale della ricerca - Università di Brescia

PCA-RECT: An Energy-efficient Object Detection Approach for Event Cameras

Author: B Ramesh
C Brandli
C Posch
D Lowe
G Orchard
G Orchard
H Hikawa
JH Lee
S Ren
T Delbruck
TN Vikram
V Padala
X Lagorce
Z Ni
Publication venue
Publication date: 01/01/2019
Field of study

We present the first purely event-based, energy-efficient approach for object detection and categorization using an event camera. Compared to traditional frame-based cameras, choosing event cameras results in high temporal resolution (order of microseconds), low power consumption (few hundred mW) and wide dynamic range (120 dB) as attractive properties. However, event-based object recognition systems are far behind their frame-based counterparts in terms of accuracy. To this end, this paper presents an event-based feature extraction method devised by accumulating local activity across the image frame and then applying principal component analysis (PCA) to the normalized neighborhood region. Subsequently, we propose a backtracking-free k-d tree mechanism for efficient feature matching by taking advantage of the low-dimensionality of the feature representation. Additionally, the proposed k-d tree mechanism allows for feature selection to obtain a lower-dimensional dictionary representation when hardware resources are limited to implement dimensionality reduction. Consequently, the proposed system can be realized on a field-programmable gate array (FPGA) device leading to high performance over resource ratio. The proposed system is tested on real-world event-based datasets for object categorization, showing superior classification performance and relevance to state-of-the-art algorithms. Additionally, we verified the object detection method and real-time FPGA performance in lab settings under non-controlled illumination conditions with limited training data and ground truth annotations.Comment: Accepted in ACCV 2018 Workshops, to appea

arXiv.org e-Print Archive

Crossref

Western Sydney ResearchDirect

RUR53: an Unmanned Ground Vehicle for Navigation, Recognition and Manipulation

Author: Antonello Morris
Bagarello Nicola
Bortoletto Roberto
Carraro Marco
Castaman Nicola
Gandin Silvia
Ghidoni Stefano
Menegatti Emanuele
Munaro Matteo
Pagello Enrico
Tosello Elisa
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2020
Field of study

This paper proposes RUR53: an Unmanned Ground Vehicle able to autonomously navigate through, identify, and reach areas of interest; and there recognize, localize, and manipulate work tools to perform complex manipulation tasks. The proposed contribution includes a modular software architecture where each module solves specific sub-tasks and that can be easily enlarged to satisfy new requirements. Included indoor and outdoor tests demonstrate the capability of the proposed system to autonomously detect a target object (a panel) and precisely dock in front of it while avoiding obstacles. They show it can autonomously recognize and manipulate target work tools (i.e., wrenches and valve stems) to accomplish complex tasks (i.e., use a wrench to rotate a valve stem). A specific case study is described where the proposed modular architecture lets easy switch to a semi-teleoperated mode. The paper exhaustively describes description of both the hardware and software setup of RUR53, its performance when tests at the 2017 Mohamed Bin Zayed International Robotics Challenge, and the lessons we learned when participating at this competition, where we ranked third in the Gran Challenge in collaboration with the Czech Technical University in Prague, the University of Pennsylvania, and the University of Lincoln (UK).Comment: This article has been accepted for publication in Advanced Robotics, published by Taylor & Franci

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Padova