7,595 research outputs found

    A robust and efficient video representation for action recognition

    Get PDF
    This paper introduces a state-of-the-art video representation and applies it to efficient action recognition and detection. We first propose to improve the popular dense trajectory features by explicit camera motion estimation. More specifically, we extract feature point matches between frames using SURF descriptors and dense optical flow. The matches are used to estimate a homography with RANSAC. To improve the robustness of homography estimation, a human detector is employed to remove outlier matches from the human body as human motion is not constrained by the camera. Trajectories consistent with the homography are considered as due to camera motion, and thus removed. We also use the homography to cancel out camera motion from the optical flow. This results in significant improvement on motion-based HOF and MBH descriptors. We further explore the recent Fisher vector as an alternative feature encoding approach to the standard bag-of-words histogram, and consider different ways to include spatial layout information in these encodings. We present a large and varied set of evaluations, considering (i) classification of short basic actions on six datasets, (ii) localization of such actions in feature-length movies, and (iii) large-scale recognition of complex events. We find that our improved trajectory features significantly outperform previous dense trajectories, and that Fisher vectors are superior to bag-of-words encodings for video recognition tasks. In all three tasks, we show substantial improvements over the state-of-the-art results

    Massive MIMO is a Reality -- What is Next? Five Promising Research Directions for Antenna Arrays

    Full text link
    Massive MIMO (multiple-input multiple-output) is no longer a "wild" or "promising" concept for future cellular networks - in 2018 it became a reality. Base stations (BSs) with 64 fully digital transceiver chains were commercially deployed in several countries, the key ingredients of Massive MIMO have made it into the 5G standard, the signal processing methods required to achieve unprecedented spectral efficiency have been developed, and the limitation due to pilot contamination has been resolved. Even the development of fully digital Massive MIMO arrays for mmWave frequencies - once viewed prohibitively complicated and costly - is well underway. In a few years, Massive MIMO with fully digital transceivers will be a mainstream feature at both sub-6 GHz and mmWave frequencies. In this paper, we explain how the first chapter of the Massive MIMO research saga has come to an end, while the story has just begun. The coming wide-scale deployment of BSs with massive antenna arrays opens the door to a brand new world where spatial processing capabilities are omnipresent. In addition to mobile broadband services, the antennas can be used for other communication applications, such as low-power machine-type or ultra-reliable communications, as well as non-communication applications such as radar, sensing and positioning. We outline five new Massive MIMO related research directions: Extremely large aperture arrays, Holographic Massive MIMO, Six-dimensional positioning, Large-scale MIMO radar, and Intelligent Massive MIMO.Comment: 20 pages, 9 figures, submitted to Digital Signal Processin

    Bird Species Categorization Using Pose Normalized Deep Convolutional Nets

    Get PDF
    We propose an architecture for fine-grained visual categorization that approaches expert human performance in the classification of bird species. Our architecture first computes an estimate of the object's pose; this is used to compute local image features which are, in turn, used for classification. The features are computed by applying deep convolutional nets to image patches that are located and normalized by the pose. We perform an empirical study of a number of pose normalization schemes, including an investigation of higher order geometric warping functions. We propose a novel graph-based clustering algorithm for learning a compact pose normalization space. We perform a detailed investigation of state-of-the-art deep convolutional feature implementations and fine-tuning feature learning for fine-grained classification. We observe that a model that integrates lower-level feature layers with pose-normalized extraction routines and higher-level feature layers with unaligned image features works best. Our experiments advance state-of-the-art performance on bird species recognition, with a large improvement of correct classification rates over previous methods (75% vs. 55-65%)

    Optimal observables and estimators for practical superresolution imaging

    Full text link
    Recent works identified resolution limits for the distance between incoherent point sources. However, it remains unclear how to choose suitable observables and estimators to reach these limits in practical situations. Here, we show how estimators saturating the Cram\'er-Rao bound for the distance between two thermal point sources can be constructed using an optimally designed observable in the presence of practical imperfections, such as misalignment, crosstalk and detector noise.Comment: 6 pages, 4 figures. Comments are welcom

    Nanoscale mosaicity revealed in peptide microcrystals by scanning electron nanodiffraction.

    Get PDF
    Changes in lattice structure across sub-regions of protein crystals are challenging to assess when relying on whole crystal measurements. Because of this difficulty, macromolecular structure determination from protein micro and nanocrystals requires assumptions of bulk crystallinity and domain block substructure. Here we map lattice structure across micron size areas of cryogenically preserved three-dimensional peptide crystals using a nano-focused electron beam. This approach produces diffraction from as few as 1500 molecules in a crystal, is sensitive to crystal thickness and three-dimensional lattice orientation. Real-space maps reconstructed from unsupervised classification of diffraction patterns across a crystal reveal regions of crystal order/disorder and three-dimensional lattice tilts on the sub-100nm scale. The nanoscale lattice reorientation observed in the micron-sized peptide crystal lattices studied here provides a direct view of their plasticity. Knowledge of these features facilitates an improved understanding of peptide assemblies that could aid in the determination of structures from nano- and microcrystals by single or serial crystal electron diffraction
    corecore