Search CORE

651 research outputs found

What Is a Macrostate? Subjective Observations and Objective Dynamics

Author: Moore Cristopher
Shalizi Cosma Rohilla
Publication venue
Publication date: 01/01/2003
Field of study

We consider the question of whether thermodynamic macrostates are objective consequences of dynamics, or subjective reflections of our ignorance of a physical system. We argue that they are both; more specifically, that the set of macrostates forms the unique maximal partition of phase space which 1) is consistent with our observations (a subjective fact about our ability to observe the system) and 2) obeys a Markov process (an objective fact about the system's dynamics). We review the ideas of computational mechanics, an information-theoretic method for finding optimal causal models of stochastic processes, and argue that macrostates coincide with the ``causal states'' of computational mechanics. Defining a set of macrostates thus consists of an inductive process where we start with a given set of observables, and then refine our partition of phase space until we reach a set of states which predict their own future, i.e. which are Markovian. Macrostates arrived at in this way are provably optimal statistical predictors of the future values of our observables.Comment: 15 pages, no figure

arXiv.org e-Print Archive

CiteSeerX

PhilSci Archive

Predictive PAC Learning and Process Decompositions

Author: Kontorovich Aryeh
Shalizi Cosma Rohilla
Publication venue
Publication date: 19/09/2013
Field of study

We informally call a stochastic process learnable if it admits a generalization error approaching zero in probability for any concept class with finite VC-dimension (IID processes are the simplest example). A mixture of learnable processes need not be learnable itself, and certainly its generalization error need not decay at the same rate. In this paper, we argue that it is natural in predictive PAC to condition not on the past observations but on the mixture component of the sample path. This definition not only matches what a realistic learner might demand, but also allows us to sidestep several otherwise grave problems in learning from dependent data. In particular, we give a novel PAC generalization bound for mixtures of learnable processes with a generalization error that is not worse than that of each mixture component. We also provide a characterization of mixtures of absolutely regular (

\beta

-mixing) processes, of independent probability-theoretic interest.Comment: 9 pages, accepted in NIPS 201

arXiv.org e-Print Archive

CiteSeerX

Consistency of Maximum Likelihood for Continuous-Space Network Models

Author: Asta Dena
Shalizi Cosma Rohilla
Publication venue
Publication date: 01/10/2019
Field of study

Network analysis needs tools to infer distributions over graphs of arbitrary size from a single graph. Assuming the distribution is generated by a continuous latent space model which obeys certain natural symmetry and smoothness properties, we establish three levels of consistency for non-parametric maximum likelihood inference as the number of nodes grows: (i) the estimated locations of all nodes converge in probability on their true locations; (ii) the distribution over locations in the latent space converges on the true distribution; and (iii) the distribution over graphs of arbitrary size converges.Comment: 21 page

arXiv.org e-Print Archive

Projective, Sparse, and Learnable Latent Position Network Models

Author: Shalizi Cosma Rohilla
Spencer Neil A.
Publication venue
Publication date: 07/02/2020
Field of study

When modeling network data using a latent position model, it is typical to assume that the nodes' positions are independently and identically distributed. However, this assumption implies the average node degree grows linearly with the number of nodes, which is inappropriate when the graph is thought to be sparse. We propose an alternative assumption---that the latent positions are generated according to a Poisson point process---and show that it is compatible with various levels of sparsity. Unlike other notions of sparse latent position models in the literature, our framework also defines a projective sequence of probability models, thus ensuring consistency of statistical inference across networks of different sizes. We establish conditions for consistent estimation of the latent positions, and compare our results to existing frameworks for modeling sparse networks.Comment: 51 pages, 2 figure

arXiv.org e-Print Archive