4,159 research outputs found
Estimation and Model Selection of Semiparametric Multivariate Survival Functions under General Censorship
Many models of semiparametric multivariate survival functions are characterized by nonparametric marginal survival functions and parametric copula functions, where different copulas imply different dependence structures. This paper considers estimation and model selection for these semiparametric multivariate survival functions, allowing for misspecified parametric copulas and data subject to general censoring. We first establish convergence of the two-step estimator of the copula parameter to the pseudo-true value defined as the value of the parameter that minimizes the KLIC between the parametric copula induced multivariate density and the unknown true density. We then derive its root--n asymptotically normal distribution and provide a simple consistent asymptotic variance estimator by accounting for the impact of the nonparametric estimation of the marginal survival functions. These results are used to establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application of the model selection test to the Loss-ALAE insurance data set is provided.Multivariate survival models, Misspecified copulas, Penalized pseudo-likelihood ratio, Fixed or random censoring, Kaplan-Meier estimator
The Poisson transform for unnormalised statistical models
Contrary to standard statistical models, unnormalised statistical models only
specify the likelihood function up to a constant. While such models are natural
and popular, the lack of normalisation makes inference much more difficult.
Here we show that inferring the parameters of a unnormalised model on a space
can be mapped onto an equivalent problem of estimating the intensity
of a Poisson point process on . The unnormalised statistical model now
specifies an intensity function that does not need to be normalised.
Effectively, the normalisation constant may now be inferred as just another
parameter, at no loss of information. The result can be extended to cover
non-IID models, which includes for example unnormalised models for sequences of
graphs (dynamical graphs), or for sequences of binary vectors. As a
consequence, we prove that unnormalised parameteric inference in non-IID models
can be turned into a semi-parametric estimation problem. Moreover, we show that
the noise-contrastive divergence of Gutmann & Hyv\"arinen (2012) can be
understood as an approximation of the Poisson transform, and extended to
non-IID settings. We use our results to fit spatial Markov chain models of eye
movements, where the Poisson transform allows us to turn a highly non-standard
model into vanilla semi-parametric logistic regression
Conformal Prediction: a Unified Review of Theory and New Challenges
In this work we provide a review of basic ideas and novel developments about
Conformal Prediction -- an innovative distribution-free, non-parametric
forecasting method, based on minimal assumptions -- that is able to yield in a
very straightforward way predictions sets that are valid in a statistical sense
also in in the finite sample case. The in-depth discussion provided in the
paper covers the theoretical underpinnings of Conformal Prediction, and then
proceeds to list the more advanced developments and adaptations of the original
idea.Comment: arXiv admin note: text overlap with arXiv:0706.3188,
arXiv:1604.04173, arXiv:1709.06233, arXiv:1203.5422 by other author
Eliciting density ratio classes
AbstractThe probability distributions of uncertain quantities needed for predictive modelling and decision support are frequently elicited from subject matter experts. However, experts are often uncertain about quantifying their beliefs using precise probability distributions. Therefore, it seems natural to describe their uncertain beliefs using sets of probability distributions. There are various possible structures, or classes, for defining set membership of continuous random variables. The Density Ratio Class has desirable properties, but there is no established procedure for eliciting this class. Thus, we propose a method for constructing Density Ratio Classes that builds on conventional quantile or probability elicitation, but allows the expert to state intervals for these quantities. Parametric shape functions, ideally also suggested by the expert, are then used to bound the nonparametric set of shapes of densities that belong to the class and are compatible with the stated intervals. This leads to a natural metric for the size of the class based on the ratio of the total areas under upper and lower bounding shape functions. This ratio will be determined by the characteristics of the shape functions, the scatter of the elicited values, and the explicit expert imprecision, as characterized by the width of the stated intervals. We provide some examples, both didactic and real, and conclude with recommendations for the further development and application of the Density Ratio Class
- …