Search CORE

983 research outputs found

Learning Latent Tree Graphical Models

Author: Anandkumar Animashree
Choi Myung Jin
Tan Vincent Y. F.
Willsky Alan S.
Publication venue
Publication date: 14/09/2010
Field of study

We study the problem of learning a latent tree graphical model where samples are available only from a subset of variables. We propose two consistent and computationally efficient algorithms for learning minimal latent trees, that is, trees without any redundant hidden nodes. Unlike many existing methods, the observed nodes (or variables) are not constrained to be leaf nodes. Our first algorithm, recursive grouping, builds the latent tree recursively by identifying sibling groups using so-called information distances. One of the main contributions of this work is our second algorithm, which we refer to as CLGrouping. CLGrouping starts with a pre-processing procedure in which a tree over the observed variables is constructed. This global step groups the observed nodes that are likely to be close to each other in the true latent tree, thereby guiding subsequent recursive grouping (or equivalent procedures) on much smaller subsets of variables. This results in more accurate and efficient learning of latent trees. We also present regularized versions of our algorithms that learn latent tree approximations of arbitrary distributions. We compare the proposed algorithms to other methods by performing extensive numerical experiments on various latent tree graphical models such as hidden Markov models and star graphs. In addition, we demonstrate the applicability of our methods on real-world datasets by modeling the dependency structure of monthly stock returns in the S&P index and of the words in the 20 newsgroups dataset

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Caltech Authors

Folding and unfolding phylogenetic trees and networks

Author: C Semple
D Gusfield
DH Huson
F Pardi
G Cardona
G Cardona
I Kanj
K Huber
Katharina T. Huber
L Iersel van
M Lott
Mike Steel
P Boldi
S Willson
T Marcussen
T Wu
Taoyang Wu
Vincent Moulton
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/06/2015
Field of study

Phylogenetic networks are rooted, labelled directed acyclic graphs which are commonly used to represent reticulate evolution. There is a close relationship between phylogenetic networks and multi-labelled trees (MUL-trees). Indeed, any phylogenetic network

N

can be "unfolded" to obtain a MUL-tree

U(N)

and, conversely, a MUL-tree

T

can in certain circumstances be "folded" to obtain a phylogenetic network

F(T)

that exhibits

T

. In this paper, we study properties of the operations

U

and

F

in more detail. In particular, we introduce the class of stable networks, phylogenetic networks

N

for which

F(U(N))

is isomorphic to

N

, characterise such networks, and show that they are related to the well-known class of tree-sibling networks.We also explore how the concept of displaying a tree in a network

N

can be related to displaying the tree in the MUL-tree

U(N)

. To do this, we develop a phylogenetic analogue of graph fibrations. This allows us to view

U(N)

as the analogue of the universal cover of a digraph, and to establish a close connection between displaying trees in

U(N)

and reconcilingphylogenetic trees with networks

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

PubMed Central

University of East Anglia digital repository