597 research outputs found

    Extrinsic Methods for Coding and Dictionary Learning on Grassmann Manifolds

    Get PDF
    Sparsity-based representations have recently led to notable results in various visual recognition tasks. In a separate line of research, Riemannian manifolds have been shown useful for dealing with features and models that do not lie in Euclidean spaces. With the aim of building a bridge between the two realms, we address the problem of sparse coding and dictionary learning over the space of linear subspaces, which form Riemannian structures known as Grassmann manifolds. To this end, we propose to embed Grassmann manifolds into the space of symmetric matrices by an isometric mapping. This in turn enables us to extend two sparse coding schemes to Grassmann manifolds. Furthermore, we propose closed-form solutions for learning a Grassmann dictionary, atom by atom. Lastly, to handle non-linearity in data, we extend the proposed Grassmann sparse coding and dictionary learning algorithms through embedding into Hilbert spaces. Experiments on several classification tasks (gender recognition, gesture classification, scene analysis, face recognition, action recognition and dynamic texture classification) show that the proposed approaches achieve considerable improvements in discrimination accuracy, in comparison to state-of-the-art methods such as kernelized Affine Hull Method and graph-embedding Grassmann discriminant analysis.Comment: Appearing in International Journal of Computer Visio

    A time series distance measure for efficient clustering of input output signals by their underlying dynamics

    Full text link
    Starting from a dataset with input/output time series generated by multiple deterministic linear dynamical systems, this paper tackles the problem of automatically clustering these time series. We propose an extension to the so-called Martin cepstral distance, that allows to efficiently cluster these time series, and apply it to simulated electrical circuits data. Traditionally, two ways of handling the problem are used. The first class of methods employs a distance measure on time series (e.g. Euclidean, Dynamic Time Warping) and a clustering technique (e.g. k-means, k-medoids, hierarchical clustering) to find natural groups in the dataset. It is, however, often not clear whether these distance measures effectively take into account the specific temporal correlations in these time series. The second class of methods uses the input/output data to identify a dynamic system using an identification scheme, and then applies a model norm-based distance (e.g. H2, H-infinity) to find out which systems are similar. This, however, can be very time consuming for large amounts of long time series data. We show that the new distance measure presented in this paper performs as good as when every input/output pair is modelled explicitly, but remains computationally much less complex. The complexity of calculating this distance between two time series of length N is O(N logN).Comment: 6 pages, 4 figures, sent in for review to IEEE L-CSS (CDC 2017 option

    Data-based Parameter Estimation of Generalized Multidimensional Langevin Processes

    Get PDF
    The generalized Langevin equation is useful for modeling a wide range of physical processes. Unfortunately its parameters, especially the memory function, are difficult to determine for nontrivial processes. We establish relations between a time-discrete generalized Langevin model and discrete multivariate autoregressive (AR) or autoregressive moving average models (ARMA). This allows a wide range of discrete linear methods known from time series analysis to be applied. In particular, the determination of the memory function via the order of the respective AR or ARMA model is addressed. The method is illustrated on a one-dimensional test system and subsequently applied to the molecular dynamics time series of a biomolecule that exhibits an interesting relationship between the solvent method used, the respective molecular conformation, and the depth of the memory

    Radar HRRP Modeling using Dynamic System for Radar Target Recognition

    Get PDF
    High resolution range profile (HRRP) is being known as one of the most powerful tools for radar target recognition. The main problem with range profile for radar target recognition is its sensitivity to aspect angle. To overcome this problem, consecutive samples of HRRP were assumed to be identically independently distributed (IID) in small frames of aspect angles in most of the related works. Here, considering the physical circumstances of maneuver of an aerial target, we have proposed dynamic system which models the short dependency between consecutive samples of HRRP in segments of the whole HRRP sequence. Dynamic system (DS) is used to model the sequence of PCA (principal component analysis) coefficients extracted from the sequence of HRRPs. Considering this we have proposed a model called PCA+DS. We have also proposed a segmentation algorithm which segments the HRRP sequence reliably. Akaike information criterion (AIC) used to evaluate the quality of data modeling showed that our PCA+DS model outperforms factor analysis (FA) model. In addition, target recognition results using simulated data showed that our method based on PCA+DS achieves better recognition rates compared to the method based on FA

    Learning human actions by combining global dynamics and local appearance

    Get PDF
    In this paper, we address the problem of human action recognition through combining global temporal dynamics and local visual spatio-temporal appearance features. For this purpose, in the global temporal dimension, we propose to model the motion dynamics with robust linear dynamical systems (LDSs) and use the model parameters as motion descriptors. Since LDSs live in a non-Euclidean space and the descriptors are in non-vector form, we propose a shift invariant subspace angles based distance to measure the similarity between LDSs. In the local visual dimension, we construct curved spatio-temporal cuboids along the trajectories of densely sampled feature points and describe them using histograms of oriented gradients (HOG). The distance between motion sequences is computed with the Chi-Squared histogram distance in the bag-of-words framework. Finally we perform classification using the maximum margin distance learning method by combining the global dynamic distances and the local visual distances. We evaluate our approach for action recognition on five short clips data sets, namely Weizmann, KTH, UCF sports, Hollywood2 and UCF50, as well as three long continuous data sets, namely VIRAT, ADL and CRIM13. We show competitive results as compared with current state-of-the-art methods

    Graph Kernels via Functional Embedding

    Full text link
    We propose a representation of graph as a functional object derived from the power iteration of the underlying adjacency matrix. The proposed functional representation is a graph invariant, i.e., the functional remains unchanged under any reordering of the vertices. This property eliminates the difficulty of handling exponentially many isomorphic forms. Bhattacharyya kernel constructed between these functionals significantly outperforms the state-of-the-art graph kernels on 3 out of the 4 standard benchmark graph classification datasets, demonstrating the superiority of our approach. The proposed methodology is simple and runs in time linear in the number of edges, which makes our kernel more efficient and scalable compared to many widely adopted graph kernels with running time cubic in the number of vertices
    corecore