21 research outputs found
Beyond standard benchmarks: Parameterizing performance evaluation in visual object tracking
Object-to-camera motion produces a variety of apparent motion patterns that
significantly affect performance of short-term visual trackers. Despite being
crucial for designing robust trackers, their influence is poorly explored in
standard benchmarks due to weakly defined, biased and overlapping attribute
annotations. In this paper we propose to go beyond pre-recorded benchmarks with
post-hoc annotations by presenting an approach that utilizes omnidirectional
videos to generate realistic, consistently annotated, short-term tracking
scenarios with exactly parameterized motion patterns. We have created an
evaluation system, constructed a fully annotated dataset of omnidirectional
videos and the generators for typical motion patterns. We provide an in-depth
analysis of major tracking paradigms which is complementary to the standard
benchmarks and confirms the expressiveness of our evaluation approach
Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics
High-quality estimation of surface normal can help reduce ambiguity in many
geometry understanding problems, such as collision avoidance and occlusion
inference. This paper presents a technique for estimating the normal from 3D
point clouds and 2D colour images. We have developed a transformer neural
network that learns to utilise the hybrid information of visual semantic and 3D
geometric data, as well as effective learning strategies. Compared to existing
methods, the information fusion of the proposed method is more effective, which
is supported by experiments.
We have also built a simulation environment of outdoor traffic scenes in a 3D
rendering engine to obtain annotated data to train the normal estimator. The
model trained on synthetic data is tested on the real scenes in the KITTI
dataset. And subsequent tasks built upon the estimated normal directions in the
KITTI dataset show that the proposed estimator has advantage over existing
methods
Direct 3D Tomographic Reconstruction and Phase-Retrieval of Far-Field Coherent Diffraction Patterns
We present an alternative numerical reconstruction algorithm for direct
tomographic reconstruction of a sample refractive indices from the measured
intensities of its far-field coherent diffraction patterns. We formulate the
well-known phase-retrieval problem in ptychography in a tomographic framework
which allows for simultaneous reconstruction of the illumination function and
the sample refractive indices in three dimensions. Our iterative reconstruction
algorithm is based on the Levenberg-Marquardt algorithm. We demonstrate the
performance of our proposed method with simulation studies
Gait Recognition from Motion Capture Data
Gait recognition from motion capture data, as a pattern classification
discipline, can be improved by the use of machine learning. This paper
contributes to the state-of-the-art with a statistical approach for extracting
robust gait features directly from raw data by a modification of Linear
Discriminant Analysis with Maximum Margin Criterion. Experiments on the CMU
MoCap database show that the suggested method outperforms thirteen relevant
methods based on geometric features and a method to learn the features by a
combination of Principal Component Analysis and Linear Discriminant Analysis.
The methods are evaluated in terms of the distribution of biometric templates
in respective feature spaces expressed in a number of class separability
coefficients and classification metrics. Results also indicate a high
portability of learned features, that means, we can learn what aspects of walk
people generally differ in and extract those as general gait features.
Recognizing people without needing group-specific features is convenient as
particular people might not always provide annotated learning data. As a
contribution to reproducible research, our evaluation framework and database
have been made publicly available. This research makes motion capture
technology directly applicable for human recognition.Comment: Preprint. Full paper accepted at the ACM Transactions on Multimedia
Computing, Communications, and Applications (TOMM), special issue on
Representation, Analysis and Recognition of 3D Humans. 18 pages. arXiv admin
note: substantial text overlap with arXiv:1701.00995, arXiv:1609.04392,
arXiv:1609.0693
An Evaluation Framework and Database for MoCap-Based Gait Recognition Methods
As a contribution to reproducible research, this paper presents a framework and a database to improve the development, evaluation and comparison of methods for gait recognition from Motion Capture (MoCap) data. The evaluation framework provides implementation details and source codes of state-of-the-art human-interpretable geometric features as well as our own approaches where gait features are learned by a modification of Fisher's Linear Discriminant Analysis with the Maximum Margin Criterion, and by a combination of Principal Component Analysis and Linear Discriminant Analysis. It includes a description and source codes of a mechanism for evaluating four class separability coefficients of feature space and four rank-based classifier performance metrics. This framework also contains a tool for learning a custom classifier and for classifying a custom query on a custom gallery. We provide an experimental database along with source codes for its extraction from the general CMU MoCap database
Non-negative matrix factorization for self-calibration of photometric redshift scatter in weak lensing surveys
Photo-z error is one of the major sources of systematics degrading the
accuracy of weak lensing cosmological inferences. Zhang et al. (2010) proposed
a self-calibration method combining galaxy-galaxy correlations and galaxy-shear
correlations between different photo-z bins. Fisher matrix analysis shows that
it can determine the rate of photo-z outliers at a level of 0.01-1% merely
using photometric data and do not rely on any prior knowledge. In this paper,
we develop a new algorithm to implement this method by solving a constrained
nonlinear optimization problem arising in the self-calibration process. Based
on the techniques of fixed-point iteration and non-negative matrix
factorization, the proposed algorithm can efficiently and robustly reconstruct
the scattering probabilities between the true-z and photo-z bins. The algorithm
has been tested extensively by applying it to mock data from simulated stage IV
weak lensing projects. We find that the algorithm provides a successful
recovery of the scatter rates at the level of 0.01-1%, and the true mean
redshifts of photo-z bins at the level of 0.001, which may satisfy the
requirements in future lensing surveys.Comment: 12 pages, 6 figures. Accepted for publication in ApJ. Updated to
match the published versio
Site index estimation using airborne laser scanner data in Eucalyptus dunnii maide stands in Uruguay
Intensive silviculture demands new inventory tools for better forest management and planning. Airborne laser scanning (ALS) was shown to be one of the best alternatives for high-precision inventories applied to productive plantations. The aim of this study was to generate multiple stand-scale maps of the site index (SI) using ALS data in the intensive silviculture of Eucalyptus dunnii Maide plantations in Uruguay. Forty-three plots (314.16 m3) were established in intensive E. dunnii plantations in the departments of Río Negro and Paysandú (Uruguay). ALS data were obtained for an area of 1995 ha. Linear and Random Forest models were fitted to estimate the height and site index, and OrpheoToolBox (OTB) software was used for stand segmentation. Linear models for dominant height (DH) estimation had a better fit (R2 = 0.84, RMSE = 0.94 m, MAPE = 0.04, Bias = 0.002) than the Random Forest (R2 = 0.85, RMSE = 1.27 m, MAPE = 7.20, Bias=−0.173) model when including only the 99th percentile metric. The coefficient between RMSE values of the cross-validation and RMSE of the model had a higher value for the linear model (0.93) than the Random Forest (0.75). The SI was estimated by applying the RF model, which included the ALS metrics corresponding to the 99th height percentile and the 80th height bicentile (R2 = 0.65; RMSE = 1.62 m). OTB segmentation made it possible to define a minimum segment size of 2.03 ha (spatial radius = 30, range radius = 1 and minimum region size = 64). This study provides a new tool for better forest management and promotes the need for further progress in the application of ALS data in the intensive silviculture of Eucalyptus spp. plantations in Uruguay