3,784 research outputs found

    A Novel Quadrilateral-based Tracking Method

    Get PDF
    This paper proposes a novel tracking method based on quadrilateral-based segmentation. The tracking method matches quadrilaterals in a region with those in a reference frame to construct region correspondences, from which trajectory of each region can be obtained. The Table Tennis sequence has been used to evaluate the tracking method. Experiment results show that the proposed tracking method can track the ping-pong ball and the racket reasonably well over a series of images, indicating matching quadrilaterals to track region is a viable approach.published_or_final_versio

    Quadrilateral-based region segmentation for tracking

    Get PDF
    We propose a novel quadrilateral based region segmentation method that is favorable for object tracking. Instead of using groups of pixels or regular blocks, it uses groups of connected quadrilaterals to represent regions. The proposed method derives the vertices of each quadrilateral from the edge map using the concept of center of masses. By merging the quadrilaterals, regions can be represented. The proposed method offers better data reduction than pixelwise region representation and better boundary approximation than block-based segmentation methods. Experimental results show that it generates a more reasonable region map, which is more suitable for object tracking, and a smaller number of regions than the seeded region growing, K-means clustering, and constrained gravitational clustering methods. © 2002 Society of Photo-Optical Instrumentation Engineers.published_or_final_versio

    Automated Top View Registration of Broadcast Football Videos

    Full text link
    In this paper, we propose a novel method to register football broadcast video frames on the static top view model of the playing surface. The proposed method is fully automatic in contrast to the current state of the art which requires manual initialization of point correspondences between the image and the static model. Automatic registration using existing approaches has been difficult due to the lack of sufficient point correspondences. We investigate an alternate approach exploiting the edge information from the line markings on the field. We formulate the registration problem as a nearest neighbour search over a synthetically generated dictionary of edge map and homography pairs. The synthetic dictionary generation allows us to exhaustively cover a wide variety of camera angles and positions and reduce this problem to a minimal per-frame edge map matching procedure. We show that the per-frame results can be improved in videos using an optimization framework for temporal camera stabilization. We demonstrate the efficacy of our approach by presenting extensive results on a dataset collected from matches of football World Cup 2014

    A sparsity-driven approach to multi-camera tracking in visual sensor networks

    Get PDF
    In this paper, a sparsity-driven approach is presented for multi-camera tracking in visual sensor networks (VSNs). VSNs consist of image sensors, embedded processors and wireless transceivers which are powered by batteries. Since the energy and bandwidth resources are limited, setting up a tracking system in VSNs is a challenging problem. Motivated by the goal of tracking in a bandwidth-constrained environment, we present a sparsity-driven method to compress the features extracted by the camera nodes, which are then transmitted across the network for distributed inference. We have designed special overcomplete dictionaries that match the structure of the features, leading to very parsimonious yet accurate representations. We have tested our method in indoor and outdoor people tracking scenarios. Our experimental results demonstrate how our approach leads to communication savings without significant loss in tracking performance

    Contextual Attention for Hand Detection in the Wild

    Get PDF
    We present Hand-CNN, a novel convolutional network architecture for detecting hand masks and predicting hand orientations in unconstrained images. Hand-CNN extends MaskRCNN with a novel attention mechanism to incorporate contextual cues in the detection process. This attention mechanism can be implemented as an efficient network module that captures non-local dependencies between features. This network module can be inserted at different stages of an object detection network, and the entire detector can be trained end-to-end. We also introduce large-scale annotated hand datasets containing hands in unconstrained images for training and evaluation. We show that Hand-CNN outperforms existing methods on the newly collected datasets and the publicly available PASCAL VOC human layout dataset. Data and code: https://www3.cs.stonybrook.edu/~cvl/projects/hand_det_attention

    Contextual Attention for Hand Detection in the Wild

    Get PDF
    We present Hand-CNN, a novel convolutional network architecture for detecting hand masks and predicting hand orientations in unconstrained images. Hand-CNN extends MaskRCNN with a novel attention mechanism to incorporate contextual cues in the detection process. This attention mechanism can be implemented as an efficient network module that captures non-local dependencies between features. This network module can be inserted at different stages of an object detection network, and the entire detector can be trained end-to-end. We also introduce a large-scale annotated hand dataset containing hands in unconstrained images for training and evaluation. We show that Hand-CNN outperforms existing methods on several datasets, including our hand detection benchmark and the publicly available PASCAL VOC human layout challenge. We also conduct ablation studies on hand detection to show the effectiveness of the proposed contextual attention module.Comment: 9 pages, 9 figure
    corecore