Search CORE

21 research outputs found

Beyond standard benchmarks: Parameterizing performance evaluation in visual object tracking

Author: Kristan Matej
Leonardis Aleš
Lukežič Alan
Zajc Luka Čehovin
Publication venue
Publication date: 25/03/2017
Field of study

Object-to-camera motion produces a variety of apparent motion patterns that significantly affect performance of short-term visual trackers. Despite being crucial for designing robust trackers, their influence is poorly explored in standard benchmarks due to weakly defined, biased and overlapping attribute annotations. In this paper we propose to go beyond pre-recorded benchmarks with post-hoc annotations by presenting an approach that utilizes omnidirectional videos to generate realistic, consistently annotated, short-term tracking scenarios with exactly parameterized motion patterns. We have created an evaluation system, constructed a fully annotated dataset of omnidirectional videos and the generators for typical motion patterns. We provide an in-depth analysis of major tracking paradigms which is complementary to the standard benchmarks and confirms the expressiveness of our evaluation approach

arXiv.org e-Print Archive

Crossref

University of Birmingham Research Portal

Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics

Author: Li Jun
Lin Ancheng
Publication venue
Publication date: 18/11/2022
Field of study

High-quality estimation of surface normal can help reduce ambiguity in many geometry understanding problems, such as collision avoidance and occlusion inference. This paper presents a technique for estimating the normal from 3D point clouds and 2D colour images. We have developed a transformer neural network that learns to utilise the hybrid information of visual semantic and 3D geometric data, as well as effective learning strategies. Compared to existing methods, the information fusion of the proposed method is more effective, which is supported by experiments. We have also built a simulation environment of outdoor traffic scenes in a 3D rendering engine to obtain annotated data to train the normal estimator. The model trained on synthetic data is tested on the real scenes in the KITTI dataset. And subsequent tasks built upon the estimated normal directions in the KITTI dataset show that the proposed estimator has advantage over existing methods

arXiv.org e-Print Archive

Direct 3D Tomographic Reconstruction and Phase-Retrieval of Far-Field Coherent Diffraction Patterns

Author: Andersen Martin Skovgaard
Andreasen J. W.
Grønager Bastian E.
Ramos T.
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2019
Field of study

We present an alternative numerical reconstruction algorithm for direct tomographic reconstruction of a sample refractive indices from the measured intensities of its far-field coherent diffraction patterns. We formulate the well-known phase-retrieval problem in ptychography in a tomographic framework which allows for simultaneous reconstruction of the illumination function and the sample refractive indices in three dimensions. Our iterative reconstruction algorithm is based on the Levenberg-Marquardt algorithm. We demonstrate the performance of our proposed method with simulation studies

arXiv.org e-Print Archive

Online Research Database In Technology

Gait Recognition from Motion Capture Data

Author: Balazia Michal
Sojka Petr
Publication venue
Publication date: 24/08/2017
Field of study

Gait recognition from motion capture data, as a pattern classification discipline, can be improved by the use of machine learning. This paper contributes to the state-of-the-art with a statistical approach for extracting robust gait features directly from raw data by a modification of Linear Discriminant Analysis with Maximum Margin Criterion. Experiments on the CMU MoCap database show that the suggested method outperforms thirteen relevant methods based on geometric features and a method to learn the features by a combination of Principal Component Analysis and Linear Discriminant Analysis. The methods are evaluated in terms of the distribution of biometric templates in respective feature spaces expressed in a number of class separability coefficients and classification metrics. Results also indicate a high portability of learned features, that means, we can learn what aspects of walk people generally differ in and extract those as general gait features. Recognizing people without needing group-specific features is convenient as particular people might not always provide annotated learning data. As a contribution to reproducible research, our evaluation framework and database have been made publicly available. This research makes motion capture technology directly applicable for human recognition.Comment: Preprint. Full paper accepted at the ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), special issue on Representation, Analysis and Recognition of 3D Humans. 18 pages. arXiv admin note: substantial text overlap with arXiv:1701.00995, arXiv:1609.04392, arXiv:1609.0693

arXiv.org e-Print Archive

An Evaluation Framework and Database for MoCap-Based Gait Recognition Methods

Author: B Kwolek
J Sedmidubsky
M Balazia
S Ali
S Jiang
T Krzeszowski
V Ramu Reddy
X Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

As a contribution to reproducible research, this paper presents a framework and a database to improve the development, evaluation and comparison of methods for gait recognition from Motion Capture (MoCap) data. The evaluation framework provides implementation details and source codes of state-of-the-art human-interpretable geometric features as well as our own approaches where gait features are learned by a modification of Fisher's Linear Discriminant Analysis with the Maximum Margin Criterion, and by a combination of Principal Component Analysis and Linear Discriminant Analysis. It includes a description and source codes of a mechanism for evaluating four class separability coefficients of feature space and four rank-based classifier performance metrics. This framework also contains a tool for learning a custom classifier and for classifying a custom query on a custom gallery. We provide an experimental database along with source codes for its extraction from the general CMU MoCap database

arXiv.org e-Print Archive

Crossref

Univerzitní repozitář Masarykovy univerzity

International Evaluation of Research and Doctoral Training at the University of Helsinki 2005-2010 : RC-Specific Evaluation of ALKO - Algorithms and Data Analysis

Author
Publication venue
Publication date: 01/01/2012
Field of study

Helsingin yliopiston digitaalinen arkisto

Non-negative matrix factorization for self-calibration of photometric redshift scatter in weak lensing surveys

Author: Yu Yu
Zhang Le
Zhang Pengjie
Publication venue: 'American Astronomical Society'
Publication date: 10/10/2017
Field of study

Photo-z error is one of the major sources of systematics degrading the accuracy of weak lensing cosmological inferences. Zhang et al. (2010) proposed a self-calibration method combining galaxy-galaxy correlations and galaxy-shear correlations between different photo-z bins. Fisher matrix analysis shows that it can determine the rate of photo-z outliers at a level of 0.01-1% merely using photometric data and do not rely on any prior knowledge. In this paper, we develop a new algorithm to implement this method by solving a constrained nonlinear optimization problem arising in the self-calibration process. Based on the techniques of fixed-point iteration and non-negative matrix factorization, the proposed algorithm can efficiently and robustly reconstruct the scattering probabilities between the true-z and photo-z bins. The algorithm has been tested extensively by applying it to mock data from simulated stage IV weak lensing projects. We find that the algorithm provides a successful recovery of the scatter rates at the level of 0.01-1%, and the true mean redshifts of photo-z bins at the level of 0.001, which may satisfy the requirements in future lensing surveys.Comment: 12 pages, 6 figures. Accepted for publication in ApJ. Updated to match the published versio

arXiv.org e-Print Archive

Shanghai Astronomical Observatory,Chinese Academy of Sciences

Generalizing the mean intercept length tensor for gray-level images

Author: Baddour
Basri
Bigün
Canny
Chung
Ciarelli
Cowin
Driscoll
Fitzgibbon
Foley
Groemer
Healy
Homminga
Horn
Ilic
Jupp
Kanatani
Kanatani
Kreider
Launeau
Launeau
Magnus Borga
Mahajan
Mizuno
Moreno
Moreno
Odgaard
Odgaard
Pahr
Parfitt
Peeters
Podshivalov
Pollari
Ramamoorthi
Robin
Rodrigo Moreno
Saha
Saltykov
Sun
Tabor
Tabor
Tabor
Tabor
Varga
Vasilić
Voyiadjis
Wald
Weickert
Whitehouse
Wolfram
Wolski
Zysset
Örjan Smedby
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Site index estimation using airborne laser scanner data in Eucalyptus dunnii maide stands in Uruguay

Author: Arthus-Bacovich Rodrigo
Hirigoyen-Domínguez Andrés
Navarro Cerrillo Rafael M.
Rizzo Martín Iván Gabriel
Varo-Martínez Mª Ángeles
Publication venue: 'MDPI AG'
Publication date: 01/01/2023
Field of study

Intensive silviculture demands new inventory tools for better forest management and planning. Airborne laser scanning (ALS) was shown to be one of the best alternatives for high-precision inventories applied to productive plantations. The aim of this study was to generate multiple stand-scale maps of the site index (SI) using ALS data in the intensive silviculture of Eucalyptus dunnii Maide plantations in Uruguay. Forty-three plots (314.16 m3) were established in intensive E. dunnii plantations in the departments of Río Negro and Paysandú (Uruguay). ALS data were obtained for an area of 1995 ha. Linear and Random Forest models were fitted to estimate the height and site index, and OrpheoToolBox (OTB) software was used for stand segmentation. Linear models for dominant height (DH) estimation had a better fit (R2 = 0.84, RMSE = 0.94 m, MAPE = 0.04, Bias = 0.002) than the Random Forest (R2 = 0.85, RMSE = 1.27 m, MAPE = 7.20, Bias=−0.173) model when including only the 99th percentile metric. The coefficient between RMSE values of the cross-validation and RMSE of the model had a higher value for the linear model (0.93) than the Random Forest (0.75). The SI was estimated by applying the RF model, which included the ALS metrics corresponding to the 99th height percentile and the 80th height bicentile (R2 = 0.65; RMSE = 1.62 m). OTB segmentation made it possible to define a minimum segment size of 2.03 ha (spatial radius = 30, range radius = 1 and minimum region size = 64). This study provides a new tool for better forest management and promotes the need for further progress in the application of ALS data in the intensive silviculture of Eucalyptus spp. plantations in Uruguay

Repositorio Institucional de la Universidad de Córdoba

Automated angular and translational tomographic alignment and application to phase-contrast imaging

Author: Andreasen Jens Wenzel
Cunha Ramos Tiago Joao
Jørgensen Jakob Sauer
Publication venue: 'The Optical Society'
Publication date: 01/01/2017
Field of study

Crossref

Online Research Database In Technology