835 research outputs found

    Doubly robust confidence sequences for sequential causal inference

    Full text link
    This paper derives time-uniform confidence sequences (CS) for causal effects in experimental and observational settings. A confidence sequence for a target parameter ψ\psi is a sequence of confidence intervals (Ct)t=1(C_t)_{t=1}^\infty such that every one of these intervals simultaneously captures ψ\psi with high probability. Such CSs provide valid statistical inference for ψ\psi at arbitrary stopping times, unlike classical fixed-time confidence intervals which require the sample size to be fixed in advance. Existing methods for constructing CSs focus on the nonasymptotic regime where certain assumptions (such as known bounds on the random variables) are imposed, while doubly robust estimators of causal effects rely on (asymptotic) semiparametric theory. We use sequential versions of central limit theorem arguments to construct large-sample CSs for causal estimands, with a particular focus on the average treatment effect (ATE) under nonparametric conditions. These CSs allow analysts to update inferences about the ATE in lieu of new data, and experiments can be continuously monitored, stopped, or continued for any data-dependent reason, all while controlling the type-I error. Finally, we describe how these CSs readily extend to other causal estimands and estimators, providing a new framework for sequential causal inference in a wide array of problems

    Almost the Best of Three Worlds: Risk, Consistency and Optional Stopping for the Switch Criterion in Nested Model Selection

    Get PDF
    We study the switch distribution, introduced by Van Erven et al. (2012), applied to model selection and subsequent estimation. While switching was known to be strongly consistent, here we show that it achieves minimax optimal parametric risk rates up to a loglogn\log\log n factor when comparing two nested exponential families, partially confirming a conjecture by Lauritzen (2012) and Cavanaugh (2012) that switching behaves asymptotically like the Hannan-Quinn criterion. Moreover, like Bayes factor model selection but unlike standard significance testing, when one of the models represents a simple hypothesis, the switch criterion defines a robust null hypothesis test, meaning that its Type-I error probability can be bounded irrespective of the stopping rule. Hence, switching is consistent, insensitive to optional stopping and almost minimax risk optimal, showing that, Yang's (2005) impossibility result notwithstanding, it is possible to `almost' combine the strengths of AIC and Bayes factor model selection.Comment: To appear in Statistica Sinic

    Empirical processes for recurrent and transient random walks in random scenery

    Full text link
    In this paper, we are interested in the asymptotic behaviour of the sequence of processes (Wn(s,t))s,t[0,1](W_n(s,t))_{s,t\in[0,1]} with \begin{equation*} W_n(s,t):=\sum_{k=1}^{\lfloor nt\rfloor}\big(1_{\{\xi_{S_k}\leq s\}}-s\big) \end{equation*} where (ξx,xZd)(\xi_x, x\in\mathbb{Z}^d) is a sequence of independent random variables uniformly distributed on [0,1][0,1] and (Sn)nN(S_n)_{n\in\mathbb N} is a random walk evolving in Zd\mathbb{Z}^d, independent of the ξ\xi's. In Wendler (2016), the case where (Sn)nN(S_n)_{n\in\mathbb N} is a recurrent random walk in Z\mathbb{Z} such that (n1αSn)n1(n^{-\frac 1\alpha}S_n)_{n\geq 1} converges in distribution to a stable distribution of index α\alpha, with α(1,2]\alpha\in(1,2], has been investigated. Here, we consider the cases where (Sn)nN(S_n)_{n\in\mathbb N} is either: a) a transient random walk in Zd\mathbb{Z}^d, b) a recurrent random walk in Zd\mathbb{Z}^d such that (n1dSn)n1(n^{-\frac 1d}S_n)_{n\geq 1} converges in distribution to a stable distribution of index d{1,2}d\in\{1,2\}
    corecore