167 research outputs found

    Boosted Random ferns for object detection

    Get PDF
    © 20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.In this paper we introduce the Boosted Random Ferns (BRFs) to rapidly build discriminative classifiers for learning and detecting object categories. At the core of our approach we use standard random ferns, but we introduce four main innovations that let us bring ferns from an instance to a category level, and still retain efficiency. First, we define binary features on the histogram of oriented gradients-domain (as opposed to intensity-), allowing for a better representation of intra-class variability. Second, both the positions where ferns are evaluated within the sliding window, and the location of the binary features for each fern are not chosen completely at random, but instead we use a boosting strategy to pick the most discriminative combination of them. This is further enhanced by our third contribution, that is to adapt the boosting strategy to enable sharing of binary features among different ferns, yielding high recognition rates at a low computational cost. And finally, we show that training can be performed online, for sequentially arriving images. Overall, the resulting classifier can be very efficiently trained, densely evaluated for all image locations in about 0.1 seconds, and provides detection rates similar to competing approaches that require expensive and significantly slower processing times. We demonstrate the effectiveness of our approach by thorough experimentation in publicly available datasets in which we compare against state-of-the-art, and for tasks of both 2D detection and 3D multi-view estimation.Peer ReviewedPostprint (author's final draft

    Object Detection in 20 Years: A Survey

    Full text link
    Object detection, as of one the most fundamental and challenging problems in computer vision, has received great attention in recent years. Its development in the past two decades can be regarded as an epitome of computer vision history. If we think of today's object detection as a technical aesthetics under the power of deep learning, then turning back the clock 20 years we would witness the wisdom of cold weapon era. This paper extensively reviews 400+ papers of object detection in the light of its technical evolution, spanning over a quarter-century's time (from the 1990s to 2019). A number of topics have been covered in this paper, including the milestone detectors in history, detection datasets, metrics, fundamental building blocks of the detection system, speed up techniques, and the recent state of the art detection methods. This paper also reviews some important detection applications, such as pedestrian detection, face detection, text detection, etc, and makes an in-deep analysis of their challenges as well as technical improvements in recent years.Comment: This work has been submitted to the IEEE TPAMI for possible publicatio

    Detection and tracking of a human on a bicycle using HOG feature and particle filter

    Get PDF
    Detection of a human on a bicycle is an important research subject in an advanced safety vehicle driving system to decrease traffic accidents. The Histograms of Oriented Gradients (HOG) feature has been proposed as useful feature for detecting a standing human in various kinds of background. So, many researchers use currently the HOG feature to detect a human. Detecting a human on a bicycle is more difficult than detecting a standing human, because the appearance of a bicycle can change dramatically according to viewpoints. In this paper, we propose a method of detecting a human on a bicycle using HOG feature and RealAdaBoost algorithm. When detecting a human on a bicycle, occlusion is a cause of decreasing detection efficiency. Occlusion is a serious problem in car vision research, because there are often occlusion in real transportation environment. In such a case, the proposed method predicts the next position of a human on a bicycle using a tracking strategy. Experimental results and their evaluation show satisfactory performance of the proposed method

    Detection and tracking of a human on a bicycle using HOG feature and particle filter

    Get PDF
    Detection of a human on a bicycle is an important research subject in an advanced safety vehicle driving system to decrease traffic accidents. The Histograms of Oriented Gradients (HOG) feature has been proposed as useful feature for detecting a standing human in various kinds of background. So, many researchers use currently the HOG feature to detect a human. Detecting a human on a bicycle is more difficult than detecting a standing human, because the appearance of a bicycle can change dramatically according to viewpoints. In this paper, we propose a method of detecting a human on a bicycle using HOG feature and RealAdaBoost algorithm. When detecting a human on a bicycle, occlusion is a cause of decreasing detection efficiency. Occlusion is a serious problem in car vision research, because there are often occlusion in real transportation environment. In such a case, the proposed method predicts the next position of a human on a bicycle using a tracking strategy. Experimental results and their evaluation show satisfactory performance of the proposed method

    Automated Multi-Modal Search and Rescue using Boosted Histogram of Oriented Gradients

    Get PDF
    Unmanned Aerial Vehicles (UAVs) provides a platform for many automated tasks and with an ever increasing advances in computing, these tasks can be more complex. The use of UAVs is expanded in this thesis with the goal of Search and Rescue (SAR), where a UAV can assist fast responders to search for a lost person and relay possible search areas back to SAR teams. To identify a person from an aerial perspective, low-level Histogram of Oriented Gradients (HOG) feature descriptors are used over a segmented region, provided from thermal data, to increase classification speed. This thesis also introduces a dataset to support a Bird’s-Eye-View (BEV) perspective and tests the viability of low level HOG feature descriptors on this dataset. The low-level feature descriptors are known as Boosted Histogram of Oriented Gradients (BHOG) features, which discretizes gradients over varying sized cells and blocks that are trained with a Cascaded Gentle AdaBoost Classifier using our compiled BEV dataset. The classification is supported by multiple sensing modes with color and thermal videos to increase classification speed. The thermal video is segmented to indicate any Region of Interest (ROI) that are mapped to the color video where classification occurs. The ROI decreases classification time needed for the aerial platform by eliminating a per-frame sliding window. Testing reveals that with the use of only color data iv and a classifier trained for a profile of a person, there is an average recall of 78%, while the thermal detection results with an average recall of 76%. However, there is a speed up of 2 with a video of 240x320 resolution. The BEV testing reveals that higher resolutions are favored with a recall rate of 71% using BHOG features, and 92% using Haar-Features. In the lower resolution BEV testing, the recall rates are 42% and 55%, for BHOG and Haar-Features, respectively

    SCALE-ROBUST DEEP LEARNING FOR VISUAL RECOGNITION

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH
    corecore