301 research outputs found

    Assessing the Distribution Consistency of Sequential Data

    Given n observations, we study the consistency of a batch of k new observations, in terms of their distribution function. We propose a non-parametric, non-likelihood test based on Edgeworth expansion of the distribution function. The key point is to approximate the distribution of the n+k observations by the distribution of n-k among the n observations. Edgeworth expansion gives the correcting term and the rate of convergence. We also study the discrete distribution case, for which Cramér's condition of smoothness is not satisfied. The rates of convergence for the various cases are compared. Comment: 20 pages, 0 figures

    Has Precognition Been Demonstrated?

    In general, when someone claims to have proof that "psychic gifts" such as telepathy or premonition exist, the academic world's reaction amounts to a shrug. When there is any reaction at all, of course. The current debate was launched by Daryl J. Bem, professor emeritus [1] of psychology at Cornell University, in The Journal of Personality and Social Psychology (a perfectly respectable journal). He describes the experiments he carried out in order, he says, to lead his colleagues to at least consider the possibility that such powers exist. As a result, some cry scandal in the New York Times. Others denounce the widespread misuse of statistics in the social sciences. And still others suspect a hoax.

    Spatio-temporal Functional Regression on Paleo-ecological Data

    The influence of climate on biodiversity is an important ecological question. Various theories try to link climate change to allelic richness and therefore to predict the impact of global warming on genetic diversity. We model the relationship between genetic diversity in the European beech forests and curves of temperature and precipitation reconstructed from pollen databases. Our model links the genetic measure to the climate curves through a linear functional regression. The interaction between the climate variables is assumed to be bilinear. Since the data are georeferenced, our methodology accounts for the spatial dependence among the observations. The practical issues of these extensions are discussed.

    Non-Adaptive Policies for 20 Questions Target Localization

    The problem of target localization with noise is addressed. The target is a sample from a continuous random variable with known distribution and the goal is to locate it with minimum mean squared error distortion. The localization scheme, or policy, proceeds by queries, or questions, asking whether or not the target belongs to some subset, as in the 20-question framework. These subsets are not constrained to be intervals and the answers to the queries are noisy. While this situation is well studied for adaptive querying, this paper focuses on non-adaptive querying policies based on dyadic questions. The asymptotic minimum achievable distortion under such policies is derived. Furthermore, a policy named the Aurelian is exhibited which asymptotically achieves this distortion.

    Spatial cluster detection using the number of connected components of a graph

    The aim of this work is to detect spatial clusters. We link the Erdős graph and the Poisson point process. We give the probability distribution function (pdf) of the number of connected components for an Erdős graph and obtain the pdf of the number of clusters for a Poisson process. Using this result, we directly obtain a test for complete spatial randomness (CSR) and also obtain the clusters that violate the CSR hypothesis. Border effects are computed. We illustrate our results on a tropical forest example.
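
The statistic this abstract is built on, the number of connected components of a graph drawn over the observed points, can be sketched as follows. This is only an illustration under stated assumptions, not the authors' procedure: the rule "join two points when they are closer than `radius`" and the names `count_components` and `radius` are hypothetical, and the abstract's exact distribution and border corrections are not reproduced here.

```python
import math


def count_components(points, radius):
    """Count connected components of the graph that joins any two
    points whose Euclidean distance is strictly below `radius`.

    Assumption: the distance-threshold rule is an illustrative choice;
    the abstract does not specify how edges are defined.
    """
    parent = list(range(len(points)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    components = len(points)
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) < radius:
                root_i, root_j = find(i), find(j)
                if root_i != root_j:
                    # Merging two distinct components removes one.
                    parent[root_i] = root_j
                    components -= 1
    return components


# Two nearby points and one isolated point.
sample = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0)]
print(count_components(sample, radius=0.5))
```

An observed count that is unusually low for the number of points would suggest clustering; how "unusually low" is calibrated is exactly what the distributional result in the abstract provides.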

    Supplementary Material for: Homogeneity and identity tests for unidimensional Poisson processes with an application to neurophysiological peri-stimulus time histograms–R version

    R version of the Supplementary material for "Homogeneity and identity tests for unidimensional Poisson processes with an application to neurophysiological peri-stimulus time histograms".

    An iterative procedure for differential analysis of gene expression

    Microarrays are a popular technology to study genes that are differentially expressed between two conditions. In this Note, we propose an iterative procedure to determine the largest subset of non-differentially expressed genes. We prove a pseudo-Markov relationship that allows practical computations. We obtain explicit expressions for the FDR and the level of the proposed test at each step. To cite this article: A. Bar-Hen, S. Robin, C. R. Acad. Sci. Paris, Ser. I ••• (••••)

    Stochastic block-models for multiplex networks: An application to a multilevel network of researchers

    Modelling relationships between individuals is a classical question in social sciences, and clustering individuals according to the observed patterns of interactions allows us to uncover a latent structure in the data. The stochastic block model is a popular approach for grouping individuals with respect to their social behaviour. When several relationships of various types can occur jointly between individuals, the data are represented by multiplex networks where more than one edge can exist between the nodes. We extend stochastic block models to multiplex networks to obtain a clustering based on more than one kind of relationship. We propose to estimate the parameters (such as the marginal probabilities of assignment to groups, or blocks, and the matrix of probabilities of connections between groups) through a variational expectation-maximization procedure. Consistency of the estimates is studied. The number of groups is chosen by using the integrated completed likelihood criterion, which is a penalized likelihood criterion. Multiplex stochastic block models arise in many situations, but our applied example is motivated by a network of French cancer researchers. The two possible links (edges) between researchers are a direct connection or a connection through their laboratories. Our results show strong interactions between these two kinds of connection, and the groups that are obtained are discussed to emphasize the common features of researchers grouped together.