Search CORE

101 research outputs found

Enhanced GMM-based Filtering with Measurement Update Ordering and Innovation-based Pruning

Author: Guo F.
Li X.
Mihaylova L.S.
Yang L.
Zhang M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 06/09/2018
Field of study

The Gaussian mixture model (GMM) has been extensively investigated in nonlinear/non-Gaussian filtering problems. This paper presents two enhancements for GMM-based nonlinear filtering techniques, namely, the adaptive ordering of the measurement update and normalized innovation square (NIS)-based mixture component management. The first technique selects the order of measurement update by maximizing the marginal measurement likelihood to improve performance. The second approach takes the filtering history of a mixture component into account and prunes those components with NIS larger than a threshold to eliminate their impact on the filtering posterior. The advantage of the proposed enhancements is illustrated via simulations that consider source tracking using the time difference of arrival (TDOA) and frequency difference of arrival (FDOA) measurements received at two unmanned aerial vehicles (UAVs). A GMM-cubature quadrature Kalman filter (CQKF) is implemented and its performances with different measurement update and mixture component management strategies are compared. The superior performance obtained via the use of the two proposed techniques is demonstrated

White Rose Research Online

Multipath assisted positioning using machine learning

Author: Karasek Rostislav
Publication venue
Publication date: 01/01/2023
Field of study

The multipath propagation of the radio signal was considered a problem for positioning systems that had to be eliminated. However, a groundbreaking new approach called multipath assisted positioning caused a paradigm shift, where multipath propagation improves the positioning performance. Moreover, the multipath assisted positioning algorithm called Channel-SLAM shows the possibility of using a single physical transmitter in a multipath environment for positioning. In this thesis, I open a discussion on some problems that have vital importance for multipath assisted positioning algorithms with a focus on pedestrian positioning. Using the idea of multipath assisted positioning, I present a single frequency network positioning algorithm. I evaluated the single frequency network-based positioning algorithm for positioning in a real scenario using a terrestrial digital video broadcasting transmission. I propose a novel pedestrian transition model utilizing the inertial measurements from a handheld inertial measurement unit. The proposed pedestrian transition model improves the precision and reliability of the Channel-SLAM. Comparing the proposed transition model with the Rician transition model previously used in Channel-SLAM quantifies the performance improvement. This thesis proposes a joint data association technique that overcomes the strong dependence on the radio channel estimation algorithm used in Channel-SLAM. The joint data association allows reusing the previously observed virtual transmitters after an outage of multipath component tracking. The evaluation based on the walking pedestrian scenario shows that the joint data association algorithm provides superior positioning precision. The virtual transmitter position estimation yields a significant computational load in Channel-SLAM. I propose a method that represents the virtual transmitter by a Gaussian mixture model and learns its parameters. The evaluation shows that the proposed method outperforms the previous approach while decreasing the computational load. Also, the current methods for radio channel estimation yield a considerable computational load that prohibits a real-time deployment. The thesis investigates the possibility of using artificial neural networks trained to estimate the number of multipath components and corresponding delays in a noisy measurement of a channel impulse response. The artificial neural network-based delay estimator provides a superresolution performance and faster runtime than the classical approaches. The precision of the trained artificial neural network architecture is evaluated and compared to the Cramer-Rao lower bound theoretical limit and classical channel estimation algorithms

Institute of Transport Research:Publications

Fusing Multimedia Data Into Dynamic Virtual Environments

Author: Du Ruofei
Publication venue
Publication date: 01/01/2018
Field of study

In spite of the dramatic growth of virtual and augmented reality (VR and AR) technology, content creation for immersive and dynamic virtual environments remains a significant challenge. In this dissertation, we present our research in fusing multimedia data, including text, photos, panoramas, and multi-view videos, to create rich and compelling virtual environments. First, we present Social Street View, which renders geo-tagged social media in its natural geo-spatial context provided by 360° panoramas. Our system takes into account visual saliency and uses maximal Poisson-disc placement with spatiotemporal filters to render social multimedia in an immersive setting. We also present a novel GPU-driven pipeline for saliency computation in 360° panoramas using spherical harmonics (SH). Our spherical residual model can be applied to virtual cinematography in 360° videos. We further present Geollery, a mixed-reality platform to render an interactive mirrored world in real time with three-dimensional (3D) buildings, user-generated content, and geo-tagged social media. Our user study has identified several use cases for these systems, including immersive social storytelling, experiencing the culture, and crowd-sourced tourism. We next present Video Fields, a web-based interactive system to create, calibrate, and render dynamic videos overlaid on 3D scenes. Our system renders dynamic entities from multiple videos, using early and deferred texture sampling. Video Fields can be used for immersive surveillance in virtual environments. Furthermore, we present VRSurus and ARCrypt projects to explore the applications of gestures recognition, haptic feedback, and visual cryptography for virtual and augmented reality. Finally, we present our work on Montage4D, a real-time system for seamlessly fusing multi-view video textures with dynamic meshes. We use geodesics on meshes with view-dependent rendering to mitigate spatial occlusion seams while maintaining temporal consistency. Our experiments show significant enhancement in rendering quality, especially for salient regions such as faces. We believe that Social Street View, Geollery, Video Fields, and Montage4D will greatly facilitate several applications such as virtual tourism, immersive telepresence, and remote education

Digital Repository at the University of Maryland

Context Exploitation in Data Fusion

Author: Vaci Lubos
Publication venue: Universit\ue0 degli Studi di Udine
Publication date: 19/03/2018
Field of study

Complex and dynamic environments constitute a challenge for existing tracking algorithms. For this reason, modern solutions are trying to utilize any available information which could help to constrain, improve or explain the measurements. So called Context Information (CI) is understood as information that surrounds an element of interest, whose knowledge may help understanding the (estimated) situation and also in reacting to that situation. However, context discovery and exploitation are still largely unexplored research topics. Until now, the context has been extensively exploited as a parameter in system and measurement models which led to the development of numerous approaches for the linear or non-linear constrained estimation and target tracking. More specifically, the spatial or static context is the most common source of the ambient information, i.e. features, utilized for recursive enhancement of the state variables either in the prediction or the measurement update of the filters. In the case of multiple model estimators, context can not only be related to the state but also to a certain mode of the filter. Common practice for multiple model scenarios is to represent states and context as a joint distribution of Gaussian mixtures. These approaches are commonly referred as the join tracking and classification. Alternatively, the usefulness of context was also demonstrated in aiding the measurement data association. Process of formulating a hypothesis, which assigns a particular measurement to the track, is traditionally governed by the empirical knowledge of the noise characteristics of sensors and operating environment, i.e. probability of detection, false alarm, clutter noise, which can be further enhanced by conditioning on context. We believe that interactions between the environment and the object could be classified into actions, activities and intents, and formed into structured graphs with contextual links translated into arcs. By learning the environment model we will be able to make prediction on the target\u2019s future actions based on its past observation. Probability of target future action could be utilized in the fusion process to adjust tracker confidence on measurements. By incorporating contextual knowledge of the environment, in the form of a likelihood function, in the filter measurement update step, we have been able to reduce uncertainties of the tracking solution and improve the consistency of the track. The promising results demonstrate that the fusion of CI brings a significant performance improvement in comparison to the regular tracking approaches

Archivio istituzionale della ricerca - Università degli Studi di Udine

Motion-capture-based hand gesture recognition for computing and control

Author: Gardner Andrew
Publication venue: Louisiana Tech Digital Commons
Publication date: 01/07/2017
Field of study

This dissertation focuses on the study and development of algorithms that enable the analysis and recognition of hand gestures in a motion capture environment. Central to this work is the study of unlabeled point sets in a more abstract sense. Evaluations of proposed methods focus on examining their generalization to users not encountered during system training. In an initial exploratory study, we compare various classification algorithms based upon multiple interpretations and feature transformations of point sets, including those based upon aggregate features (e.g. mean) and a pseudo-rasterization of the capture space. We find aggregate feature classifiers to be balanced across multiple users but relatively limited in maximum achievable accuracy. Certain classifiers based upon the pseudo-rasterization performed best among tested classification algorithms. We follow this study with targeted examinations of certain subproblems. For the first subproblem, we introduce the a fortiori expectation-maximization (AFEM) algorithm for computing the parameters of a distribution from which unlabeled, correlated point sets are presumed to be generated. Each unlabeled point is assumed to correspond to a target with independent probability of appearance but correlated positions. We propose replacing the expectation phase of the algorithm with a Kalman filter modified within a Bayesian framework to account for the unknown point labels which manifest as uncertain measurement matrices. We also propose a mechanism to reorder the measurements in order to improve parameter estimates. In addition, we use a state-of-the-art Markov chain Monte Carlo sampler to efficiently sample measurement matrices. In the process, we indirectly propose a constrained k-means clustering algorithm. Simulations verify the utility of AFEM against a traditional expectation-maximization algorithm in a variety of scenarios. In the second subproblem, we consider the application of positive definite kernels and the earth mover\u27s distance (END) to our work. Positive definite kernels are an important tool in machine learning that enable efficient solutions to otherwise difficult or intractable problems by implicitly linearizing the problem geometry. We develop a set-theoretic interpretation of ENID and propose earth mover\u27s intersection (EMI). a positive definite analog to ENID. We offer proof of EMD\u27s negative definiteness and provide necessary and sufficient conditions for ENID to be conditionally negative definite, including approximations that guarantee negative definiteness. In particular, we show that ENID is related to various min-like kernels. We also present a positive definite preserving transformation that can be applied to any kernel and can be used to derive positive definite EMD-based kernels, and we show that the Jaccard index is simply the result of this transformation applied to set intersection. Finally, we evaluate kernels based on EMI and the proposed transformation versus ENID in various computer vision tasks and show that END is generally inferior even with indefinite kernel techniques. Finally, we apply deep learning to our problem. We propose neural network architectures for hand posture and gesture recognition from unlabeled marker sets in a coordinate system local to the hand. As a means of ensuring data integrity, we also propose an extended Kalman filter for tracking the rigid pattern of markers on which the local coordinate system is based. We consider fixed- and variable-size architectures including convolutional and recurrent neural networks that accept unlabeled marker input. We also consider a data-driven approach to labeling markers with a neural network and a collection of Kalman filters. Experimental evaluations with posture and gesture datasets show promising results for the proposed architectures with unlabeled markers, which outperform the alternative data-driven labeling method

Louisiana Tech Digital Commons

Tracking interacting targets in multi-modal sensors

Author: Taj Murtaza
Publication venue
Publication date: 01/01/2009
Field of study

PhDObject tracking is one of the fundamental tasks in various applications such as surveillance, sports, video conferencing and activity recognition. Factors such as occlusions, illumination changes and limited field of observance of the sensor make tracking a challenging task. To overcome these challenges the focus of this thesis is on using multiple modalities such as audio and video for multi-target, multi-modal tracking. Particularly, this thesis presents contributions to four related research topics, namely, pre-processing of input signals to reduce noise, multi-modal tracking, simultaneous detection and tracking, and interaction recognition. To improve the performance of detection algorithms, especially in the presence of noise, this thesis investigate filtering of the input data through spatio-temporal feature analysis as well as through frequency band analysis. The pre-processed data from multiple modalities is then fused within Particle filtering (PF). To further minimise the discrepancy between the real and the estimated positions, we propose a strategy that associates the hypotheses and the measurements with a real target, using a Weighted Probabilistic Data Association (WPDA). Since the filtering involved in the detection process reduces the available information and is inapplicable on low signal-to-noise ratio data, we investigate simultaneous detection and tracking approaches and propose a multi-target track-beforedetect Particle filtering (MT-TBD-PF). The proposed MT-TBD-PF algorithm bypasses the detection step and performs tracking in the raw signal. Finally, we apply the proposed multi-modal tracking to recognise interactions between targets in regions within, as well as outside the cameras’ fields of view. The efficiency of the proposed approaches are demonstrated on large uni-modal, multi-modal and multi-sensor scenarios from real world detections, tracking and event recognition datasets and through participation in evaluation campaigns

Queen Mary Research Online

OpenGrey Repository

Recommended from our members

Transformational maintenance by reuse of design histories

Author: Baxter Ira D.
Publication venue: eScholarship, University of California
Publication date: 01/01/1990
Field of study

This thesis provides theory and procedures for modifying software artifacts implemented by a formal transformation process. Installing modifications requires knowing not only what transformations were applied (a derivation history) to construct the artifact, but also why the application sequence ensures that the artifact meets its specification. The derivation history and the justification are collectively called a design history. A Design Maintenance System (DMS), when provided with a formal change called a maintenance delta, revises a design history to guide construction of a new artifact. A DMS can be used to integrate a stream of deltas into a history, providing implementations as a side effect, leading to an incremental-evolution model for software construction.We provide a broadly applicable formal model of transformation systems in which specifications are performance predicates, subsuming the functional specifications which are traditional for transformation systems. Such performance predicates provide vocabulary used in the design history to describe the effect of applying sets of transformations.A nonprocedural, performance-goal-oriented Transformation Control Language (TCL) is defined to control navigation of the design space for a transformation system. Recording the execution of a TCL metaprogram directly provides a design history.A complete classification of, and representation for, the set of possible maintenance deltas is given in terms of the inputs defined by the transformation system model. Such deltas include not only specification changes, but also changes to implementation support technologies. Delta integration procedures for revising derivation histories given functional or support technology deltas are provided, based on rearranging the order of transformations in the design space. Building on these operations, integration procedures that revise the design history for each type of delta are described. An agenda-oriented TCL execution process dovetails smoothly with the integration procedures.Our DMS is compared to a number of other maintenance systems. By using an explicit delta and verified commutativity, our DMS often reuses transformations correctly when others fail

eScholarship - University of California

Recent Advances in Social Data and Artificial Intelligence 2019

Author
Publication venue: 'MDPI AG'
Publication date: 12/08/2022
Field of study

The importance and usefulness of subjects and topics involving social data and artificial intelligence are becoming widely recognized. This book contains invited review, expository, and original research articles dealing with, and presenting state-of-the-art accounts pf, the recent advances in the subjects of social data and artificial intelligence, and potentially their links to Cyberspace

Directory of Open Access Books (DOAB)

Proceedings of the 18th Irish Conference on Artificial Intelligence and Cognitive Science

Author: Delany Sarah Jane
Madden Michael
Publication venue: Dublin Institute of Technology
Publication date: 29/08/2007
Field of study

These proceedings contain the papers that were accepted for publication at AICS-2007, the 18th Annual Conference on Artificial Intelligence and Cognitive Science, which was held in the Technological University Dublin; Dublin, Ireland; on the 29th to the 31st August 2007. AICS is the annual conference of the Artificial Intelligence Association of Ireland (AIAI)

Arrow@TUDublin

A Study of Syntactic Information Retrieval

Author: Liu Chang
Liu Jun
McClean Sally
Wang Hui
Wu Shengli
Publication venue: Springer
Publication date: 01/01/2007
Field of study

Queen's University Belfast Research Portal

Ulster University's Research Portal