26,152 research outputs found
Scalable Bayesian nonparametric measures for exploring pairwise dependence via Dirichlet Process Mixtures
In this article we propose novel Bayesian nonparametric methods using
Dirichlet Process Mixture (DPM) models for detecting pairwise dependence
between random variables while accounting for uncertainty in the form of the
underlying distributions. A key criteria is that the procedures should scale to
large data sets. In this regard we find that the formal calculation of the
Bayes factor for a dependent-vs.-independent DPM joint probability measure is
not feasible computationally. To address this we present Bayesian diagnostic
measures for characterising evidence against a "null model" of pairwise
independence. In simulation studies, as well as for a real data analysis, we
show that our approach provides a useful tool for the exploratory nonparametric
Bayesian analysis of large multivariate data sets
Graphical Log-linear Models: Fundamental Concepts and Applications
We present a comprehensive study of graphical log-linear models for
contingency tables. High dimensional contingency tables arise in many areas
such as computational biology, collection of survey and census data and others.
Analysis of contingency tables involving several factors or categorical
variables is very hard. To determine interactions among various factors,
graphical and decomposable log-linear models are preferred. First, we explore
connections between the conditional independence in probability and graphs;
thereafter we provide a few illustrations to describe how graphical log-linear
model are useful to interpret the conditional independences between factors. We
also discuss the problem of estimation and model selection in decomposable
models
Statistical testing of directions observations independence
Independence of observations is often assumed when adjusting geodetic network. Unlike the\ud
distance observations, no dependence of environmental conditions is known for horizontal\ud
direction observations. In order to determine the dependence of horizontal direction observations,\ud
we established test geodetic network of a station and four observation points. Measurements of\ud
the highest possible accuracy were carried out using Leica TS30 total station along with precise\ud
prisms GPH1P. Two series of hundred sets of angles were measured, with the first one in bad\ud
observation conditions. Using different methods, i.e. variance–covariance matrices, x2 test and analyses of time series, the independence of measured directions, reduced directions and horizontal angles were tested. The results show that the independence of horizontal direction\ud
observations is not obvious and certainly not in poor conditions. In this case, it would be appropriate for geodetic network adjustments to use variance–covariance matrix calculated from measurements instead of diagonal variance–covariance matrix
Concepts and a case study for a flexible class of graphical Markov models
With graphical Markov models, one can investigate complex dependences,
summarize some results of statistical analyses with graphs and use these graphs
to understand implications of well-fitting models. The models have a rich
history and form an area that has been intensively studied and developed in
recent years. We give a brief review of the main concepts and describe in more
detail a flexible subclass of models, called traceable regressions. These are
sequences of joint response regressions for which regression graphs permit one
to trace and thereby understand pathways of dependence. We use these methods to
reanalyze and interpret data from a prospective study of child development, now
known as the Mannheim Study of Children at Risk. The two related primary
features concern cognitive and motor development, at the age of 4.5 and 8 years
of a child. Deficits in these features form a sequence of joint responses.
Several possible risks are assessed at birth of the child and when the child
reached age 3 months and 2 years.Comment: 21 pages, 7 figures, 7 tables; invited, refereed chapter in a boo
Bayesian inference through encompassing priors and importance sampling for a class of marginal models for categorical data
We develop a Bayesian approach for selecting the model which is the most
supported by the data within a class of marginal models for categorical
variables formulated through equality and/or inequality constraints on
generalised logits (local, global, continuation or reverse continuation),
generalised log-odds ratios and similar higher-order interactions. For each
constrained model, the prior distribution of the model parameters is formulated
following the encompassing prior approach. Then, model selection is performed
by using Bayes factors which are estimated by an importance sampling method.
The approach is illustrated through three applications involving some datasets,
which also include explanatory variables. In connection with one of these
examples, a sensitivity analysis to the prior specification is also considered
Testing Dependence Among Serially Correlated Multi-category Variables
The contingency table literature on tests for dependence among discrete multi-category variables assume that draws are independent, and there are no tests that account for serial dependencies − a problem that is particularly important in economics and finance. This paper proposes a new test of independence based on the maximum canonical correlation between pairs of discrete variables. We also propose a trace canonical correlation test using dynamically augmented reduced rank regressions or an iterated weighting method in order to account for serial dependence. Such tests are useful, for example, when testing for predictability of one sequence of discrete random variables by means of another sequence of discrete random variables as in tests of market timing skills or business cycle analysis. The proposed tests allow for an arbitrary number of categories, are robust in the presence of serial dependencies and are simple to implement using multivariate regression methods
Binary Models for Marginal Independence
Log-linear models are a classical tool for the analysis of contingency
tables. In particular, the subclass of graphical log-linear models provides a
general framework for modelling conditional independences. However, with the
exception of special structures, marginal independence hypotheses cannot be
accommodated by these traditional models. Focusing on binary variables, we
present a model class that provides a framework for modelling marginal
independences in contingency tables. The approach taken is graphical and draws
on analogies to multivariate Gaussian models for marginal independence. For the
graphical model representation we use bi-directed graphs, which are in the
tradition of path diagrams. We show how the models can be parameterized in a
simple fashion, and how maximum likelihood estimation can be performed using a
version of the Iterated Conditional Fitting algorithm. Finally we consider
combining these models with symmetry restrictions
- …