21 research outputs found

    Beyond standard benchmarks: Parameterizing performance evaluation in visual object tracking

    Get PDF
    Object-to-camera motion produces a variety of apparent motion patterns that significantly affect performance of short-term visual trackers. Despite being crucial for designing robust trackers, their influence is poorly explored in standard benchmarks due to weakly defined, biased and overlapping attribute annotations. In this paper we propose to go beyond pre-recorded benchmarks with post-hoc annotations by presenting an approach that utilizes omnidirectional videos to generate realistic, consistently annotated, short-term tracking scenarios with exactly parameterized motion patterns. We have created an evaluation system, constructed a fully annotated dataset of omnidirectional videos and the generators for typical motion patterns. We provide an in-depth analysis of major tracking paradigms which is complementary to the standard benchmarks and confirms the expressiveness of our evaluation approach

    Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics

    Full text link
    High-quality estimation of surface normal can help reduce ambiguity in many geometry understanding problems, such as collision avoidance and occlusion inference. This paper presents a technique for estimating the normal from 3D point clouds and 2D colour images. We have developed a transformer neural network that learns to utilise the hybrid information of visual semantic and 3D geometric data, as well as effective learning strategies. Compared to existing methods, the information fusion of the proposed method is more effective, which is supported by experiments. We have also built a simulation environment of outdoor traffic scenes in a 3D rendering engine to obtain annotated data to train the normal estimator. The model trained on synthetic data is tested on the real scenes in the KITTI dataset. And subsequent tasks built upon the estimated normal directions in the KITTI dataset show that the proposed estimator has advantage over existing methods

    Direct 3D Tomographic Reconstruction and Phase-Retrieval of Far-Field Coherent Diffraction Patterns

    Get PDF
    We present an alternative numerical reconstruction algorithm for direct tomographic reconstruction of a sample refractive indices from the measured intensities of its far-field coherent diffraction patterns. We formulate the well-known phase-retrieval problem in ptychography in a tomographic framework which allows for simultaneous reconstruction of the illumination function and the sample refractive indices in three dimensions. Our iterative reconstruction algorithm is based on the Levenberg-Marquardt algorithm. We demonstrate the performance of our proposed method with simulation studies

    Gait Recognition from Motion Capture Data

    Full text link
    Gait recognition from motion capture data, as a pattern classification discipline, can be improved by the use of machine learning. This paper contributes to the state-of-the-art with a statistical approach for extracting robust gait features directly from raw data by a modification of Linear Discriminant Analysis with Maximum Margin Criterion. Experiments on the CMU MoCap database show that the suggested method outperforms thirteen relevant methods based on geometric features and a method to learn the features by a combination of Principal Component Analysis and Linear Discriminant Analysis. The methods are evaluated in terms of the distribution of biometric templates in respective feature spaces expressed in a number of class separability coefficients and classification metrics. Results also indicate a high portability of learned features, that means, we can learn what aspects of walk people generally differ in and extract those as general gait features. Recognizing people without needing group-specific features is convenient as particular people might not always provide annotated learning data. As a contribution to reproducible research, our evaluation framework and database have been made publicly available. This research makes motion capture technology directly applicable for human recognition.Comment: Preprint. Full paper accepted at the ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), special issue on Representation, Analysis and Recognition of 3D Humans. 18 pages. arXiv admin note: substantial text overlap with arXiv:1701.00995, arXiv:1609.04392, arXiv:1609.0693

    An Evaluation Framework and Database for MoCap-Based Gait Recognition Methods

    Get PDF
    As a contribution to reproducible research, this paper presents a framework and a database to improve the development, evaluation and comparison of methods for gait recognition from Motion Capture (MoCap) data. The evaluation framework provides implementation details and source codes of state-of-the-art human-interpretable geometric features as well as our own approaches where gait features are learned by a modification of Fisher's Linear Discriminant Analysis with the Maximum Margin Criterion, and by a combination of Principal Component Analysis and Linear Discriminant Analysis. It includes a description and source codes of a mechanism for evaluating four class separability coefficients of feature space and four rank-based classifier performance metrics. This framework also contains a tool for learning a custom classifier and for classifying a custom query on a custom gallery. We provide an experimental database along with source codes for its extraction from the general CMU MoCap database

    Non-negative matrix factorization for self-calibration of photometric redshift scatter in weak lensing surveys

    Full text link
    Photo-z error is one of the major sources of systematics degrading the accuracy of weak lensing cosmological inferences. Zhang et al. (2010) proposed a self-calibration method combining galaxy-galaxy correlations and galaxy-shear correlations between different photo-z bins. Fisher matrix analysis shows that it can determine the rate of photo-z outliers at a level of 0.01-1% merely using photometric data and do not rely on any prior knowledge. In this paper, we develop a new algorithm to implement this method by solving a constrained nonlinear optimization problem arising in the self-calibration process. Based on the techniques of fixed-point iteration and non-negative matrix factorization, the proposed algorithm can efficiently and robustly reconstruct the scattering probabilities between the true-z and photo-z bins. The algorithm has been tested extensively by applying it to mock data from simulated stage IV weak lensing projects. We find that the algorithm provides a successful recovery of the scatter rates at the level of 0.01-1%, and the true mean redshifts of photo-z bins at the level of 0.001, which may satisfy the requirements in future lensing surveys.Comment: 12 pages, 6 figures. Accepted for publication in ApJ. Updated to match the published versio

    Site index estimation using airborne laser scanner data in Eucalyptus dunnii maide stands in Uruguay

    Get PDF
    Intensive silviculture demands new inventory tools for better forest management and planning. Airborne laser scanning (ALS) was shown to be one of the best alternatives for high-precision inventories applied to productive plantations. The aim of this study was to generate multiple stand-scale maps of the site index (SI) using ALS data in the intensive silviculture of Eucalyptus dunnii Maide plantations in Uruguay. Forty-three plots (314.16 m3) were established in intensive E. dunnii plantations in the departments of Río Negro and Paysandú (Uruguay). ALS data were obtained for an area of 1995 ha. Linear and Random Forest models were fitted to estimate the height and site index, and OrpheoToolBox (OTB) software was used for stand segmentation. Linear models for dominant height (DH) estimation had a better fit (R2 = 0.84, RMSE = 0.94 m, MAPE = 0.04, Bias = 0.002) than the Random Forest (R2 = 0.85, RMSE = 1.27 m, MAPE = 7.20, Bias=−0.173) model when including only the 99th percentile metric. The coefficient between RMSE values of the cross-validation and RMSE of the model had a higher value for the linear model (0.93) than the Random Forest (0.75). The SI was estimated by applying the RF model, which included the ALS metrics corresponding to the 99th height percentile and the 80th height bicentile (R2 = 0.65; RMSE = 1.62 m). OTB segmentation made it possible to define a minimum segment size of 2.03 ha (spatial radius = 30, range radius = 1 and minimum region size = 64). This study provides a new tool for better forest management and promotes the need for further progress in the application of ALS data in the intensive silviculture of Eucalyptus spp. plantations in Uruguay
    corecore