Search CORE

13,056 research outputs found

Direct Learning of Sparse Changes in Markov Networks by Density Ratio Estimation

Author: C.M. Bishop
H. Liu
H. Zou
J. Friedman
M. Gutmann
M. Sugiyama
M. Sugiyama
M.J. Wainwright
M.W. Schmidt
N. Meinshausen
O. Banerjee
P. Ravikumar
R. Tibshirani
R.M. Neal
S.I. Lee
T. Bulcke Van den
T. Hastie
V.N. Vapnik
Y. Tsuboi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Abstract. We propose a new method for detecting changes in Markov network structure between two sets of samples. Instead of naively fitting two Markov network models separately to the two data sets and figuring out their difference, we directly learn the network structure change by estimating the ratio of Markov network models. This density-ratio formulation naturally allows us to introduce sparsity in the network structure change, which highly contributes to enhancing interpretability. Furthermore, computation of the normalization term, which is a critical computational bottleneck of the naive approach, can be remarkably mitigated. Through experiments on gene expression and Twitter data analysis, we demonstrate the usefulness of our method.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Edinburgh Research Explorer

Structure Learning of Partitioned Markov Networks

Author: Fukumizu Kenji
Liu Song
Sugiyama Masashi
Suzuki Taiji
Publication venue
Publication date: 26/05/2016
Field of study

We learn the structure of a Markov Network between two groups of random variables from joint observations. Since modelling and learning the full MN structure may be hard, learning the links between two groups directly may be a preferable option. We introduce a novel concept called the \emph{partitioned ratio} whose factorization directly associates with the Markovian properties of random variables across two groups. A simple one-shot convex optimization procedure is proposed for learning the \emph{sparse} factorizations of the partitioned ratio and it is theoretically guaranteed to recover the correct inter-group structure under mild conditions. The performance of the proposed method is experimentally compared with the state of the art MN structure learning methods using ROC curves. Real applications on analyzing bipartisanship in US congress and pairwise DNA/time-series alignments are also reported.Comment: Camera Ready for ICML 2016. Fixed some minor typo

arXiv.org e-Print Archive

Explore Bristol Research

Lower Bounds for Two-Sample Structural Change Detection in Ising and Gaussian Models

Author: Gangrade Aditya
Nazer Bobak
Saligrama Venkatesh
Publication venue
Publication date: 27/10/2017
Field of study

The change detection problem is to determine if the Markov network structures of two Markov random fields differ from one another given two sets of samples drawn from the respective underlying distributions. We study the trade-off between the sample sizes and the reliability of change detection, measured as a minimax risk, for the important cases of the Ising models and the Gaussian Markov random fields restricted to the models which have network structures with

p

nodes and degree at most

d

, and obtain information-theoretic lower bounds for reliable change detection over these models. We show that for the Ising model,

\Omega\left(\frac{d^2}{(\log d)^2}\log p\right)

samples are required from each dataset to detect even the sparsest possible changes, and that for the Gaussian,

\Omega\left( \gamma^{-2} \log(p)\right)

samples are required from each dataset to detect change, where

\gamma

is the smallest ratio of off-diagonal to diagonal terms in the precision matrices of the distributions. These bounds are compared to the corresponding results in structure learning, and closely match them under mild conditions on the model parameters. Thus, our change detection bounds inherit partial tightness from the structure learning schemes in previous literature, demonstrating that in certain parameter regimes, the naive structure learning based approach to change detection is minimax optimal up to constant factors.Comment: Presented at the 55th Annual Allerton Conference on Communication, Control, and Computing, Oct. 201

arXiv.org e-Print Archive

Crossref

Representation Learning: A Review and New Perspectives

Author: Bengio Yoshua
Courville Aaron
Vincent Pascal
Publication venue
Publication date: 01/01/2014
Field of study

The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more or less the different explanatory factors of variation behind the data. Although specific domain knowledge can be used to help design representations, learning with generic priors can also be used, and the quest for AI is motivating the design of more powerful representation-learning algorithms implementing such priors. This paper reviews recent work in the area of unsupervised feature learning and deep learning, covering advances in probabilistic models, auto-encoders, manifold learning, and deep networks. This motivates longer-term unanswered questions about the appropriate objectives for learning good representations, for computing representations (i.e., inference), and the geometrical connections between representation learning, density estimation and manifold learning

arXiv.org e-Print Archive

CiteSeerX