124 research outputs found

    Optimal Geometric Matching for Patch-Based Object Detection

    Get PDF
    We present an efficient method to determine the optimal matching of two patch-based image object representations under rotation, scaling, and translation (RST). This use of patches is equivalent to a fullyconnected part-based model, for which the presented approach offers an efficient procedure to determine the best fit. While other approaches that use fully connected models have a high complexity in the number of parts used, we achieve linear complexity in that variable, because we only allow RST-matchings. The presented approach is used for object recognition in images: by matching images that contain certain objects to a test image, we can detect whether the test image contains an object of that class or not. We evaluate this approach on the Caltech data and obtain very competitive results

    Higher-Order Statistics in Visual Object Recognition

    Get PDF
    In this paper, I develop a higher-order statistical theory of matching models against images. The basic idea is not only to take into account {\em how much} of an object can be seen in the image, but also {\em what parts} of it are jointly present. I show that this additional information can improve the specificity (i.e., reduce the probability of false positive matches) of a recognition algorithm. I demonstrate formally that most commonly used quality of match measures employed by recognition algorithms are based on an independence assumption. Using the Minimum Description Length (MDL) principle and a simple scene-description language as a guide, I show that this independence assumption is not satisfied for common scenes, and propose several important higher-order statistical properties of matches that approximate some aspects of these statistical dependencies. I have implemented a recognition system that takes advantage of this additional statistical information and demonstrate its efficacy in comparisons with a standard recognition system based on bounded error matching. We also observe that the existing use of grouping and segmentation methods has significant effects on the performance of recognition systems that are similar to those resulting from the use of higher-order statistical information. Our analysis provides a statistical framework in which to understand the effects of grouping and segmentation on recognition and suggests ways to take better advantage of such information
    • …
    corecore