3,621,545 research outputs found
Statistical Risk Models
We give complete algorithms and source code for constructing statistical risk
models, including methods for fixing the number of risk factors. One such
method is based on eRank (effective rank) and yields results similar to (and
further validates) the method set forth in an earlier paper by one of us. We
also give a complete algorithm and source code for computing eigenvectors and
eigenvalues of a sample covariance matrix which requires i) no costly
iterations and ii) the number of operations linear in the number of returns.
The presentation is intended to be pedagogical and oriented toward practical
applications.Comment: 44 pages; a trivial typo corrected, references updated; to appear in
The Journal of Investment Strategies. arXiv admin note: text overlap with
arXiv:1602.04902, arXiv:1508.04883, arXiv:1604.0874
Fluctuations in statistical models
Proceedings of 4th International Workshop "Critical Point and Onset of Deconfinement", July 9-13, 2007, Darmstadt, Germany: The multiplicity fluctuations of hadrons are studied within the statistical hadron-resonance gas model in the large volume limit. The role of quantum statistics and resonance decay effects are discussed. The microscopic correlator method is used to enforce conservation of three charges - baryon number, electric charge, and strangeness - in the canonical ensemble. In addition, in the micro-canonical ensemble energy conservation is included. An analytical method is used to account for resonance decays. The multiplicity distributions and the scaled variances for negatively and positively charged hadrons are calculated for the sets of thermodynamical parameters along the chemical freeze-out line of central Pb+Pb (Au+Au) collisions from SIS to LHC energies. Predictions obtained within different statistical ensembles are compared with the preliminary NA49 experimental results on central Pb+Pb collisions in the SPS energy range. The measured fluctuations are significantly narrower than the Poisson ones and clearly favor expectations for the micro-canonical ensemble. Thus, this is a first observation of the recently predicted suppression of the multiplicity fluctuations in relativistic gases in the thermodynamical limit due to conservation laws
Tropical Geometry of Statistical Models
This paper presents a unified mathematical framework for inference in
graphical models, building on the observation that graphical models are
algebraic varieties.
From this geometric viewpoint, observations generated from a model are
coordinates of a point in the variety, and the sum-product algorithm is an
efficient tool for evaluating specific coordinates. The question addressed here
is how the solutions to various inference problems depend on the model
parameters. The proposed answer is expressed in terms of tropical algebraic
geometry. A key role is played by the Newton polytope of a statistical model.
Our results are applied to the hidden Markov model and to the general Markov
model on a binary tree.Comment: 14 pages, 3 figures. Major revision. Applications now in companion
paper, "Parametric Inference for Biological Sequence Analysis
Statistical Models on Spherical Geometries
We use a one-dimensional random walk on -dimensional hyper-spheres to
determine the critical behavior of statistical systems in hyper-spherical
geometries. First, we demonstrate the properties of such a walk by studying the
phase diagram of a percolation problem. We find a line of second and first
order phase transitions separated by a tricritical point. Then, we analyze the
adsorption-desorption transition for a polymer growing near the attractive
boundary of a cylindrical cell membrane. We find that the fraction of adsorbed
monomers on the boundary vanishes exponentially when the adsorption energy
decreases towards its critical value.Comment: 8 pages, latex, 2 figures in p
Statistical inference in compound functional models
We consider a general nonparametric regression model called the compound
model. It includes, as special cases, sparse additive regression and
nonparametric (or linear) regression with many covariates but possibly a small
number of relevant covariates. The compound model is characterized by three
main parameters: the structure parameter describing the "macroscopic" form of
the compound function, the "microscopic" sparsity parameter indicating the
maximal number of relevant covariates in each component and the usual
smoothness parameter corresponding to the complexity of the members of the
compound. We find non-asymptotic minimax rate of convergence of estimators in
such a model as a function of these three parameters. We also show that this
rate can be attained in an adaptive way
Statistical Models with Uncertain Error Parameters
In a statistical analysis in Particle Physics, nuisance parameters can be
introduced to take into account various types of systematic uncertainties. The
best estimate of such a parameter is often modeled as a Gaussian distributed
variable with a given standard deviation (the corresponding "systematic
error"). Although the assigned systematic errors are usually treated as
constants, in general they are themselves uncertain. A type of model is
presented where the uncertainty in the assigned systematic errors is taken into
account. Estimates of the systematic variances are modeled as gamma distributed
random variables. The resulting confidence intervals show interesting and
useful properties. For example, when averaging measurements to estimate their
mean, the size of the confidence interval increases for decreasing
goodness-of-fit, and averages have reduced sensitivity to outliers. The basic
properties of the model are presented and several examples relevant for
Particle Physics are explored.Comment: 26 pages, 27 figure
- …
