Search CORE

1,110 research outputs found

Learning High-Dimensional Markov Forest Distributions: Analysis of Error Rates

Author: Anandkumar Animashree
Tan Vincent Y. F.
Willsky Alan S.
Publication venue
Publication date: 01/01/2010
Field of study

The problem of learning forest-structured discrete graphical models from i.i.d. samples is considered. An algorithm based on pruning of the Chow-Liu tree through adaptive thresholding is proposed. It is shown that this algorithm is both structurally consistent and risk consistent and the error probability of structure learning decays faster than any polynomial in the number of samples under fixed model size. For the high-dimensional scenario where the size of the model d and the number of edges k scale with the number of samples n, sufficient conditions on (n,d,k) are given for the algorithm to satisfy structural and risk consistencies. In addition, the extremal structures for learning are identified; we prove that the independent (resp. tree) model is the hardest (resp. easiest) to learn using the proposed algorithm in terms of error rates for structure learning.Comment: Accepted to the Journal of Machine Learning Research (Feb 2011

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Caltech Authors

Scaling laws for learning high-dimensional Markov forest distributions

Author: Anandkumar Animashree
Tan Vincent Yan Fu
Willsky Alan S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2010
Field of study

The problem of learning forest-structured discrete graphical models from i.i.d. samples is considered. An algorithm based on pruning of the Chow-Liu tree through adaptive thresholding is proposed. It is shown that this algorithm is structurally consistent and the error probability of structure learning decays faster than any polynomial in the number of samples under fixed model size. For the high-dimensional scenario where the size of the model d and the number of edges k scale with the number of samples n, sufficient conditions on (n, d, k) are given for the algorithm to be structurally consistent. In addition, the extremal structures for learning are identified; we prove that the independent (resp. tree) model is the hardest (resp. easiest) to learn using the proposed algorithm in terms of error rates for structure learning.United States. Air Force Office of Scientific Research (Grant FA9559-08-1- 1080)United States. Army Research Office. Multidisciplinary University Research Initiative (Grant W911NF-06-1-0076)United States. Army Research Office. Multidisciplinary University Research Initiative (Grant FA9550-06-1-0324)Singapore. Agency for Science, Technology and Researc

DSpace@MIT

Crossref

A Large-Deviation Analysis of the Maximum-Likelihood Learning of Markov Tree Structures

Author: Anandkumar Animashree
Tan Vincent Y. F.
Tong Lang
Willsky Alan S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

The problem of maximum-likelihood (ML) estimation of discrete tree-structured distributions is considered. Chow and Liu established that ML-estimation reduces to the construction of a maximum-weight spanning tree using the empirical mutual information quantities as the edge weights. Using the theory of large-deviations, we analyze the exponent associated with the error probability of the event that the ML-estimate of the Markov tree structure differs from the true tree structure, given a set of independently drawn samples. By exploiting the fact that the output of ML-estimation is a tree, we establish that the error exponent is equal to the exponential rate of decay of a single dominant crossover event. We prove that in this dominant crossover event, a non-neighbor node pair replaces a true edge of the distribution that is along the path of edges in the true tree graph connecting the nodes in the non-neighbor pair. Using ideas from Euclidean information theory, we then analyze the scenario of ML-estimation in the very noisy learning regime and show that the error exponent can be approximated as a ratio, which is interpreted as the signal-to-noise ratio (SNR) for learning tree distributions. We show via numerical experiments that in this regime, our SNR approximation is accurate.Comment: Accepted to the IEEE Transactions on Information Theory on Nov 18, 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

DSpace@MIT

Large-deviation analysis and applications Of learning tree-structured graphical models

Author: Tan Vincent Yan Fu
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2011
Field of study

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student submitted PDF version of thesis.Includes bibliographical references (p. 213-228).The design and analysis of complexity-reduced representations for multivariate data is important in many scientific and engineering domains. This thesis explores such representations from two different perspectives: deriving and analyzing performance measures for learning tree-structured graphical models and salient feature subset selection for discrimination. Graphical models have proven to be a flexible class of probabilistic models for approximating high-dimensional data. Learning the structure of such models from data is an important generic task. It is known that if the data are drawn from tree-structured distributions, then the algorithm of Chow and Liu (1968) provides an efficient algorithm for finding the tree that maximizes the likelihood of the data. We leverage this algorithm and the theory of large deviations to derive the error exponent of structure learning for discrete and Gaussian graphical models. We determine the extremal tree structures for learning, that is, the structures that lead to the highest and lowest exponents. We prove that the star minimizes the exponent and the chain maximizes the exponent, which means that among all unlabeled trees, the star and the chain are the worst and best for learning respectively. The analysis is also extended to learning foreststructured graphical models by augmenting the Chow-Liu algorithm with a thresholding procedure. We prove scaling laws on the number of samples and the number variables for structure learning to remain consistent in high-dimensions. The next part of the thesis is concerned with discrimination. We design computationally efficient tree-based algorithms to learn pairs of distributions that are specifically adapted to the task of discrimination and show that they perform well on various datasets vis-`a-vis existing tree-based algorithms. We define the notion of a salient set for discrimination using information-theoretic quantities and derive scaling laws on the number of samples so that the salient set can be recovered asymptotically.by Vincent Yan Fu Tan.Ph.D

DSpace@MIT

Critical Market Crashes

Author: Andersen
Andersen
Assoe
Bak
Barber
Bassi
Bikhchandani
Blanchard
Blanchard
Boissevain
Bouchaud
Cai
Camerer
Campbell
Chaitin
Chauvet
Checki
Chen
Chowdhury
Coe
Cont
Corcos
Crutchfield
D Sornette
Devenow
Driffill
Drozdz
Falkovich
Fama
Feigenbaum
Feigenbaum
Feigenbaum
Feldman
Frankel
Frankel
Galbraith
Gaunersdorfer
Geller
Geller
Gluzman
Goldenfeld
Gould
Graham
Grant
Grassia
Gray
Grinblatt
Hamilton
Helbing
Holland
Holldobler
Holmes
Huberman
Ide
Johansen
Johansen
Johansen
Johansen
Johansen
Johansen
Johansen
Johansen
Johansen
Johansen
Kaminsky
Karplus
Keynes
Kindleberger
Knetter
Krawiecki
Laherrère
Lamont
Levy
Levy
Liggett
Liggett
Lux
Lux
Lux
Lux
MacDonald
Malamud
Malkiel
Minnich
Montroll
Mood
Moss de Oliveira
Onsager
Orléan
Orléan
Orléan
Orléan
Pandey
Phoa
Potters
Press
Roehner
Roehner
Roll
Romer
Saleur
Sato
Schaller
Scharfstein
Shefrin
Shiller
Shiller
Shleifer
Sircar
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Sornette
Stauffer
Stauffer
Stauffer
Takayasu
Trueman
Van Norden
Van Norden
Vandewalle
Vandewalle
Welch
Welch
White
Wilson
Wilson
Youssefmir
Zhou
Zwiebel
Publication venue: 'Elsevier BV'
Publication date: 01/01/2003
Field of study

This review is a partial synthesis of the book ``Why stock market crash'' (Princeton University Press, January 2003), which presents a general theory of financial crashes and of stock market instabilities that his co-workers and the author have developed over the past seven years. The study of the frequency distribution of drawdowns, or runs of successive losses shows that large financial crashes are ``outliers'': they form a class of their own as can be seen from their statistical signatures. If large financial crashes are ``outliers'', they are special and thus require a special explanation, a specific model, a theory of their own. In addition, their special properties may perhaps be used for their prediction. The main mechanisms leading to positive feedbacks, i.e., self-reinforcement, such as imitative behavior and herding between investors are reviewed with many references provided to the relevant literature outside the confine of Physics. Positive feedbacks provide the fuel for the development of speculative bubbles, preparing the instability for a major crash. We demonstrate several detailed mathematical models of speculative bubbles and crashes. The most important message is the discovery of robust and universal signatures of the approach to crashes. These precursory patterns have been documented for essentially all crashes on developed as well as emergent stock markets, on currency markets, on company stocks, and so on. The concept of an ``anti-bubble'' is also summarized, with two forward predictions on the Japanese stock market starting in 1999 and on the USA stock market still running. We conclude by presenting our view of the organization of financial markets.Comment: Latex 89 pages and 38 figures, in press in Physics Report

arXiv.org e-Print Archive

CiteSeerX

Crossref

Proceedings of the Third Annual Symposium on Mathematical Pattern Recognition and Image Analysis

Author: Guseman L. F., Jr.
Publication venue
Publication date
Field of study

Topics addressed include: multivariate spline method; normal mixture analysis applied to remote sensing; image data analysis; classifications in spatially correlated environments; probability density functions; graphical nonparametric methods; subpixel registration analysis; hypothesis integration in image understanding systems; rectification of satellite scanner imagery; spatial variation in remotely sensed images; smooth multidimensional interpolation; and optimal frequency domain textural edge detection filters

NASA Technical Reports Server

Testing for the Markov Property in Time Series via Deep Conditional Generative Learning

Author: Li Lexin
Shi Chengchun
Yao Qiwei
Zhou Yunzhe
Publication venue
Publication date: 30/05/2023
Field of study

The Markov property is widely imposed in analysis of time series data. Correspondingly, testing the Markov property, and relatedly, inferring the order of a Markov model, are of paramount importance. In this article, we propose a nonparametric test for the Markov property in high-dimensional time series via deep conditional generative learning. We also apply the test sequentially to determine the order of the Markov model. We show that the test controls the type-I error asymptotically, and has the power approaching one. Our proposal makes novel contributions in several ways. We utilize and extend state-of-the-art deep generative learning to estimate the conditional density functions, and establish a sharp upper bound on the approximation error of the estimators. We derive a doubly robust test statistic, which employs a nonparametric estimation but achieves a parametric convergence rate. We further adopt sample splitting and cross-fitting to minimize the conditions required to ensure the consistency of the test. We demonstrate the efficacy of the test through both simulations and the three data applications

arXiv.org e-Print Archive

Demand forecasting using exogenous leading indicators

Author: Aghezzaf El-Houssaine
Desmet Bram
Kourentzes Nikolaos
Sagaert Yves
Publication venue
Publication date: 01/01/2014
Field of study

Ghent University Academic Bibliography