603 research outputs found

    Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package

    Get PDF
    Dynamic time warping is a popular technique for comparing time series, providing both a distance measure that is insensitive to local compression and stretches and the warping which optimally deforms one of the two input series onto the other. A variety of algorithms and constraints have been discussed in the literature. The dtw package provides an unification of them; it allows R users to compute time series alignments mixing freely a variety of continuity constraints, restriction windows, endpoints, local distance definitions, and so on. The package also provides functions for visualizing alignments and constraints using several classic diagram types.

    Similarity searching in sequence databases under time warping.

    Get PDF
    Wong, Siu Fung.Thesis submitted in: December 2003.Thesis (M.Phil.)--Chinese University of Hong Kong, 2004.Includes bibliographical references (leaves 77-84).Abstracts in English and Chinese.Abstract --- p.iiAcknowledgement --- p.viChapter 1 --- Introduction --- p.1Chapter 2 --- Preliminary --- p.6Chapter 2.1 --- Dynamic Time Warping (DTW) --- p.6Chapter 2.2 --- Spatial Indexing --- p.10Chapter 2.3 --- Relevance Feedback --- p.11Chapter 3 --- Literature Review --- p.13Chapter 3.1 --- Searching Sequences under Euclidean Metric --- p.13Chapter 3.2 --- Searching Sequences under Dynamic Time Warping Metric --- p.17Chapter 4 --- Subsequence Matching under Time Warping --- p.21Chapter 4.1 --- Subsequence Matching --- p.22Chapter 4.1.1 --- Sequential Search --- p.22Chapter 4.1.2 --- Indexing Scheme --- p.23Chapter 4.2 --- Lower Bound Technique --- p.25Chapter 4.2.1 --- Properties of Lower Bound Technique --- p.26Chapter 4.2.2 --- Existing Lower Bound Functions --- p.27Chapter 4.3 --- Point-Based indexing --- p.28Chapter 4.3.1 --- Lower Bound for subsequences matching --- p.28Chapter 4.3.2 --- Algorithm --- p.35Chapter 4.4 --- Rectangle-Based indexing --- p.37Chapter 4.4.1 --- Lower Bound for subsequences matching --- p.37Chapter 4.4.2 --- Algorithm --- p.41Chapter 4.5 --- Experimental Results --- p.43Chapter 4.5.1 --- Candidate ratio vs Width of warping window --- p.44Chapter 4.5.2 --- CPU time vs Number of subsequences --- p.45Chapter 4.5.3 --- CPU time vs Width of warping window --- p.46Chapter 4.5.4 --- CPU time vs Threshold --- p.46Chapter 4.6 --- Summary --- p.47Chapter 5 --- Relevance Feedback under Time Warping --- p.49Chapter 5.1 --- Integrating Relevance Feedback with DTW --- p.49Chapter 5.2 --- Query Reformulation --- p.53Chapter 5.2.1 --- Constraint Updating --- p.53Chapter 5.2.2 --- Weight Updating --- p.55Chapter 5.2.3 --- Overall Strategy --- p.58Chapter 5.3 --- Experiments and Evaluation --- p.59Chapter 5.3.1 --- Effectiveness of the strategy --- p.61Chapter 5.3.2 --- Efficiency of the strategy --- p.63Chapter 5.3.3 --- Usability --- p.64Chapter 5.4 --- Summary --- p.71Chapter 6 --- Conclusion --- p.72Chapter A --- Deduction of Data Bounding Hyper-rectangle --- p.74Chapter B --- Proof of Theorem2 --- p.76Bibliography --- p.77Publications --- p.8

    Retrieving, annotating and recognizing human activities in web videos

    Get PDF
    Recent e orts in computer vision tackle the problem of human activity understanding in video sequences. Traditionally, these algorithms require annotated video data to learn models. In this work, we introduce a novel data collection framework, to take advantage of the large amount of video data available on the web. We use this new framework to retrieve videos of human activities, and build training and evaluation datasets for computer vision algorithms. We rely on Amazon Mechanical Turk workers to obtain high accuracy annotations. An agglomerative clustering technique brings the possibility to achieve reliable and consistent annotations for temporal localization of human activities in videos. Using two datasets, Olympics Sports and our novel Daily Human Activities dataset, we show that our collection/annotation framework can make robust annotations of human activities in large amount of video data

    Componentes e pontos de quebra em séries temporais na análise de imagens de sensoriamento remoto

    Get PDF
    Orientador: Ricardo da Silva TorresDissertação (mestrado) - Universidade Estadual de Campinas, Instituto de ComputaçãoResumo: A detecção e caracterização de mudanças temporais são indicadores cruciais no processo de compreensão da maneira como mecanismos complexos funcionam e evoluem. Técnicas e imagens de sensoriamento remoto têm sido amplamente empregadas nas últimas décadas com objetivo de detectar e investigar mudanças temporais na superfície terrestre. Tal detecção em dados de séries temporais é passível de ser refinada ainda mais isolando-se as componentes aditivas de tendência e sazonalidade do ruído subjacente. Este trabalho investiga, em particular, o método Breaks For Additive Season and Trend (BFAST) para a análise, decomposição e detecção de pontos de quebra em séries temporais associadas a dados de sensoriamento remoto. Os outputs do método são, então, utilizados em três distintas ¿ mas altamente interconectadas ¿ linhas de pesquisa: em uma melhor compreensão de fenômenos climáticos; na correlação com dados de distúrbios antropológicos; e em problemas de classificação usando funções de dissimilaridade descobertas por um framework evolucionário baseado em Programação Genética (GP). Experimentos realizados demonstram que a decomposição e pontos de quebra produziram resultados efetivos quando aplicados aos estudos com dados ecológicos, mas não foram capazes de melhorar os resultados de classificação quando comparados ao uso das séries brutas. As realizações nesses três contextos também culminaram na criação de duas ferramentas de análise de séries temporais com código aberto baseadas na web, sendo que uma delas foi tão bem aceita pela comunidade-alvo, que atualmente encontra-se integrada em uma plataforma privada de computação em nuvemAbstract: Detecting and characterizing temporal changes are crucial indicators in the process of understanding how complex mechanisms work and evolve. The use of remote sensing images and techniques has been broadly employed over the past decades in order to detect and investigate temporal changes on the Earth surface. Such change detection in time series data may be even further refined by isolating the additive long-term (trend) and cyclical (seasonal) components from the underlying noise. This work investigates the particular Breaks For Additive Season and Trend (BFAST) method for the analysis, decomposition, and breakpoint detection of time series associated with remote sensing data. The derived outputs from that method are, then, used in three distinct ¿ but highly interconnected ¿ research venues: in a better comprehension of climatic phenomena; in the correlation to human-induced disturbances data; and in data classification problems using time series dissimilarity functions discovered by a Genetic-Programming-(GP)-based evolutionary framework. Performed experiments show that decomposition and breakpoints produced insightful and effective results when applied to the ecological data studies, but could not further improve the classification results when compared to its raw time series counterpart. The achievements in those three contexts also led to the creation of two open-source web-based time series analysis tools. One of those tools was so well received by the target community, that it is currently integrated into a private cloud computing platformMestradoCiência da ComputaçãoMestre em Ciência da Computação132847/2015-92015/02105-0CNPQFAPES

    Indexing and Retrieval of 3D Articulated Geometry Models

    Get PDF
    In this PhD research study, we focus on building a content-based search engine for 3D articulated geometry models. 3D models are essential components in nowadays graphic applications, and are widely used in the game, animation and movies production industry. With the increasing number of these models, a search engine not only provides an entrance to explore such a huge dataset, it also facilitates sharing and reusing among different users. In general, it reduces production costs and time to develop these 3D models. Though a lot of retrieval systems have been proposed in recent years, search engines for 3D articulated geometry models are still in their infancies. Among all the works that we have surveyed, reliability and efficiency are the two main issues that hinder the popularity of such systems. In this research, we have focused our attention mainly to address these two issues. We have discovered that most existing works design features and matching algorithms in order to reflect the intrinsic properties of these 3D models. For instance, to handle 3D articulated geometry models, it is common to extract skeletons and use graph matching algorithms to compute the similarity. However, since this kind of feature representation is complex, it leads to high complexity of the matching algorithms. As an example, sub-graph isomorphism can be NP-hard for model graph matching. Our solution is based on the understanding that skeletal matching seeks correspondences between the two comparing models. If we can define descriptive features, the correspondence problem can be solved by bag-based matching where fast algorithms are available. In the first part of the research, we propose a feature extraction algorithm to extract such descriptive features. We then convert the skeletal matching problems into bag-based matching. We further define metric similarity measure so as to support fast search. We demonstrate the advantages of this idea in our experiments. The improvement on precision is 12\% better at high recall. The indexing search of 3D model is 24 times faster than the state of the art if only the first relevant result is returned. However, improving the quality of descriptive features pays the price of high dimensionality. Curse of dimensionality is a notorious problem on large multimedia databases. The computation time scales exponentially as the dimension increases, and indexing techniques may not be useful in such situation. In the second part of the research, we focus ourselves on developing an embedding retrieval framework to solve the high dimensionality problem. We first argue that our proposed matching method projects 3D models on manifolds. We then use manifold learning technique to reduce dimensionality and maximize intra-class distances. We further propose a numerical method to sub-sample and fast search databases. To preserve retrieval accuracy using fewer landmark objects, we propose an alignment method which is also beneficial to existing works for fast search. The advantages of the retrieval framework are demonstrated in our experiments that it alleviates the problem of curse of dimensionality. It also improves the efficiency (3.4 times faster) and accuracy (30\% more accurate) of our matching algorithm proposed above. In the third part of the research, we also study a closely related area, 3D motions. 3D motions are captured by sticking sensor on human beings. These captured data are real human motions that are used to animate 3D articulated geometry models. Creating realistic 3D motions is an expensive and tedious task. Although 3D motions are very different from 3D articulated geometry models, we observe that existing works also suffer from the problem of temporal structure matching. This also leads to low efficiency in the matching algorithms. We apply the same idea of bag-based matching into the work of 3D motions. From our experiments, the proposed method has a 13\% improvement on precision at high recall and is 12 times faster than existing works. As a summary, we have developed algorithms for 3D articulated geometry models and 3D motions, covering feature extraction, feature matching, indexing and fast search methods. Through various experiments, our idea of converting restricted matching to bag-based matching improves matching efficiency and reliability. These have been shown in both 3D articulated geometry models and 3D motions. We have also connected 3D matching to the area of manifold learning. The embedding retrieval framework not only improves efficiency and accuracy, but has also opened a new area of research

    Subseries Join and Compression of Time Series Data Based on Non-uniform Segmentation

    Get PDF
    A time series is composed of a sequence of data items that are measured at uniform intervals. Many application areas generate or manipulate time series, including finance, medicine, digital audio, and motion capture. Efficiently searching a large time series database is still a challenging problem, especially when partial or subseries matches are needed. This thesis proposes a new denition of subseries join, a symmetric generalization of subseries matching, which finds similar subseries in two or more time series datasets. A solution is proposed to compute the subseries join based on a hierarchical feature representation. This hierarchical feature representation is generated by an anisotropic diffusion scale-space analysis and a non-uniform segmentation method. Each segment is represented by a minimal polynomial envelope in a reduced-dimensionality space. Based on the hierarchical feature representation, all features in a dataset are indexed in an R-tree, and candidate matching features of two datasets are found by an R-tree join operation. Given candidate matching features, a dynamic programming algorithm is developed to compute the final subseries join. To improve storage efficiency, a hierarchical compression scheme is proposed to compress features. The minimal polynomial envelope representation is transformed to a Bezier spline envelope representation. The control points of each Bezier spline are then hierarchically differenced and an arithmetic coding is used to compress these differences. To empirically evaluate their effectiveness, the proposed subseries join and compression techniques are tested on various publicly available datasets. A large motion capture database is also used to verify the techniques in a real-world application. The experiments show that the proposed subseries join technique can better tolerate noise and local scaling than previous work, and the proposed compression technique can also achieve about 85% higher compression rates than previous work with the same distortion error

    Multi-resolution approach to time series retrieval

    Full text link
    corecore