1,968 research outputs found

    Circulant temporal encoding for video retrieval and temporal alignment

    Get PDF
    We address the problem of specific video event retrieval. Given a query video of a specific event, e.g., a concert of Madonna, the goal is to retrieve other videos of the same event that temporally overlap with the query. Our approach encodes the frame descriptors of a video to jointly represent their appearance and temporal order. It exploits the properties of circulant matrices to efficiently compare the videos in the frequency domain. This offers a significant gain in complexity and accurately localizes the matching parts of videos. The descriptors can be compressed in the frequency domain with a product quantizer adapted to complex numbers. In this case, video retrieval is performed without decompressing the descriptors. We also consider the temporal alignment of a set of videos. We exploit the matching confidence and an estimate of the temporal offset computed for all pairs of videos by our retrieval approach. Our robust algorithm aligns the videos on a global timeline by maximizing the set of temporally consistent matches. The global temporal alignment enables synchronous playback of the videos of a given scene

    Vehicle detection and tracking using homography-based plane rectification and particle filtering

    Get PDF
    This paper presents a full system for vehicle detection and tracking in non-stationary settings based on computer vision. The method proposed for vehicle detection exploits the geometrical relations between the elements in the scene so that moving objects (i.e., vehicles) can be detected by analyzing motion parallax. Namely, the homography of the road plane between successive images is computed. Most remarkably, a novel probabilistic framework based on Kalman filtering is presented for reliable and accurate homography estimation. The estimated homography is used for image alignment, which in turn allows to detect the moving vehicles in the image. Tracking of vehicles is performed on the basis of a multidimensional particle filter, which also manages the exit and entries of objects. The filter involves a mixture likelihood model that allows a better adaptation of the particles to the observed measurements. The system is specially designed for highway environments, where it has been proven to yield excellent results

    Exploiting Points and Lines in Regression Forests for RGB-D Camera Relocalization

    Full text link
    Camera relocalization plays a vital role in many robotics and computer vision tasks, such as global localization, recovery from tracking failure and loop closure detection. Recent random forests based methods exploit randomly sampled pixel comparison features to predict 3D world locations for 2D image locations to guide the camera pose optimization. However, these image features are only sampled randomly in the images, without considering the spatial structures or geometric information, leading to large errors or failure cases with the existence of poorly textured areas or in motion blur. Line segment features are more robust in these environments. In this work, we propose to jointly exploit points and lines within the framework of uncertainty driven regression forests. The proposed approach is thoroughly evaluated on three publicly available datasets against several strong state-of-the-art baselines in terms of several different error metrics. Experimental results prove the efficacy of our method, showing superior or on-par state-of-the-art performance.Comment: published as a conference paper at 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS

    Long gravitational-wave transients and associated detection strategies for a network of terrestrial interferometers

    Get PDF
    Searches for gravitational waves (GWs) traditionally focus on persistent sources (e.g., pulsars or the stochastic background) or on transients sources (e.g., compact binary inspirals or core-collapse supernovae), which last for time scales of milliseconds to seconds. We explore the possibility of long GW transients with unknown waveforms lasting from many seconds to weeks. We propose a novel analysis technique to bridge the gap between short O(s) “burst” analyses and persistent stochastic analyses. Our technique utilizes frequency-time maps of GW strain cross power between two spatially separated terrestrial GW detectors. The application of our cross power statistic to searches for GW transients is framed as a pattern recognition problem, and we discuss several pattern-recognition techniques. We demonstrate these techniques by recovering simulated GW signals in simulated detector noise. We also recover environmental noise artifacts, thereby demonstrating a novel technique for the identification of such artifacts in GW interferometers. We compare the efficiency of this framework to other techniques such as matched filtering

    SCHLIEREN SEQUENCE ANALYSIS USING COMPUTER VISION

    Get PDF
    Computer vision-based methods are proposed for extraction and measurement of flow structures of interest in schlieren video. As schlieren data has increased with faster frame rates, we are faced with thousands of images to analyze. This presents an opportunity to study global flow structures over time that may not be evident from surface measurements. A degree of automation is desirable to extract flow structures and features to give information on their behavior through the sequence. Using an interdisciplinary approach, the analysis of large schlieren data is recast as a computer vision problem. The double-cone schlieren sequence is used as a testbed for the methodology; it is unique in that it contains 5,000 images, complex phenomena, and is feature rich. Oblique structures such as shock waves and shear layers are common in schlieren images. A vision-based methodology is used to provide an estimate of oblique structure angles through the unsteady sequence. The methodology has been applied to a complex flowfield with multiple shocks. A converged detection success rate between 94% and 97% for these structures is obtained. The modified curvature scale space is used to define features at salient points on shock contours. A challenge in developing methods for feature extraction in schlieren images is the reconciliation of existing techniques with features of interest to an aerodynamicist. Domain-specific knowledge of physics must therefore be incorporated into the definition and detec- tion phases. Known location and physically possible structure representations form a knowledge base that provides a unique feature definition and extraction. Model tip location and the motion of a shock intersection across several thousand frames are identified, localized, and tracked. Images are parsed into physically meaningful labels using segmentation. Using this representation, it is shown that in the double-cone flowfield, the dominant unsteady motion is associated with large scale random events within the aft-cone bow shock. Small scale organized motion is associated with the shock-separated flow on the fore-cone surface. We show that computer vision is a natural and useful extension to the evaluation of schlieren data, and that segmentation has the potential to permit new large scale measurements of flow motion

    Hough Transform Track Reconstruction in the Cathode Strip Chambers in ATLAS

    Get PDF
    The world's largest and highest energy particle accelerator, the Large Hadron Collider (LHC), will collide two highly energetic proton beams in an attempt to discover a wide range of new physics. Among which, the primary ambitions are the discovery of the Higgs boson and suppersymmetric particles. ATLAS, one of its primary particle detectors, was designed as a general-purpose detector covering a broad range of energies and physical processes. A special emphasis on accurate muon tracking has led the ATLAS collaboration to design a stand-alone Muon Spectrometer, an extremely large tracking system extending all the way around the detector. Due to its immense size and range, parts of the spectrometer were designed to withstand a high rate of radiation, sifting the muon signals from the rest of the signals (primarily neutrons and photons). The Cathode Strip Chambers (CSCs) are special multiwire proportional chambers placed in the high η\eta region on of the Muon Spectrometer, where flux of background particles is highest. Their purpose is to efficiently filter out the background particle, tracking only the muons traversing it with high degree of accuracy. In order to do that, this special algorithm was designed using a novel modification of the Hough Transform. This thesis will detail the key elements of this algorithm, how it is used for better muon track detection and parameterization, and give a preliminary evaluation of the perform ance of this algorithm
    corecore