74 research outputs found

    Automated extraction of mutual independence patterns using Bayesian comparison of partition models

    Full text link
    Mutual independence is a key concept in statistics that characterizes the structural relationships between variables. Existing methods to investigate mutual independence rely on the definition of two competing models, one being nested into the other and used to generate a null distribution for a statistic of interest, usually under the asymptotic assumption of large sample size. As such, these methods have a very restricted scope of application. In the present manuscript, we propose to change the investigation of mutual independence from a hypothesis-driven task that can only be applied in very specific cases to a blind and automated search within patterns of mutual independence. To this end, we treat the issue as one of model comparison that we solve in a Bayesian framework. We show the relationship between such an approach and existing methods in the case of multivariate normal distributions as well as cross-classified multinomial distributions. We propose a general Markov chain Monte Carlo (MCMC) algorithm to numerically approximate the posterior distribution on the space of all patterns of mutual independence. The relevance of the method is demonstrated on synthetic data as well as two real datasets, showing the unique insight provided by this approach.Comment: IEEE Transactions on Pattern Analysis and Machine Intelligence (in press

    Exploration of phylogenetic data using a global sequence analysis method

    Get PDF
    BACKGROUND: Molecular phylogenetic methods are based on alignments of nucleic or peptidic sequences. The tremendous increase in molecular data permits phylogenetic analyses of very long sequences and of many species, but also requires methods to help manage large datasets. RESULTS: Here we explore the phylogenetic signal present in molecular data by genomic signatures, defined as the set of frequencies of short oligonucleotides present in DNA sequences. Although violating many of the standard assumptions of traditional phylogenetic analyses – in particular explicit statements of homology inherent in character matrices – the use of the signature does permit the analysis of very long sequences, even those that are unalignable, and is therefore most useful in cases where alignment is questionable. We compare the results obtained by traditional phylogenetic methods to those inferred by the signature method for two genes: RAG1, which is easily alignable, and 18S RNA, where alignments are often ambiguous for some regions. We also apply this method to a multigene data set of 33 genes for 9 bacteria and one archea species as well as to the whole genome of a set of 16 γ-proteobacteria. In addition to delivering phylogenetic results comparable to traditional methods, the comparison of signatures for the sequences involved in the bacterial example identified putative candidates for horizontal gene transfers. CONCLUSION: The signature method is therefore a fast tool for exploring phylogenetic data, providing not only a pretreatment for discovering new sequence relationships, but also for identifying cases of sequence evolution that could confound traditional phylogenetic analysis

    Detection of melanoma from dermoscopic images of naevi acquired under uncontrolled conditions.

    No full text
    International audienceBACKGROUND AND OBJECTIVE: Several systems for the diagnosis of melanoma from images of naevi obtained under controlled conditions have demonstrated comparable efficiency with dermatologists. However, their robustness to analyze daily routine images was sometimes questionable. The purpose of this work is to investigate to what extent the automatic melanoma diagnosis may be achieved from the analysis of uncontrolled images of pigmented skin lesions. MATERIALS AND METHODS: Images were acquired during regular practice by two dermatologists using Reflex 24 x 36 cameras combined with Heine Delta 10 dermascopes. The images were then digitalized using a scanner. In addition, five senior dermatologists were asked to give the diagnosis and therapeutic decision (exeresis) for 227 images of naevi, together with an opinion about the existence of malignancy-predictive features. Meanwhile, a learning by sample classifier for the diagnosis of melanoma was constructed, which combines image-processing with machine-learning techniques. After an automatic segmentation, geometric and colorimetric parameters were extracted from images and selected according to their efficiency in predicting malignancy features. A diagnosis was subsequently provided based on selected parameters. An extensive comparison of dermatologists' and computer results was subsequently performed. RESULTS AND CONCLUSION: The KL-PLS-based classifier shows comparable performances with respect to dermatologists (sensitivity: 95% and specificity: 60%). The algorithm provides an original insight into the clinical knowledge of pigmented skin lesions

    Conception et developpement d'un systeme expert pour la therapeutique des cancers de l'ovaire. Etude des procedures d'evaluation des systemes experts

    No full text
    SIGLEINIST T 76867 / INIST-CNRS - Institut de l'Information Scientifique et TechniqueFRFranc

    Cumulants of multiinformation density in the case of a multivariate normal distribution

    No full text
    International audienceWe consider a generalization of information density to a partitioning into N ≄ 2 subvectors. We calculate its cumulant-generating function and its cumulants, showing that these quantities are only a function of all the regression coeïŹƒcients associated with the partitionin

    Estimating the concentration parameter of a von Mises distribution: a systematic simulation benchmark

    No full text
    International audienceIn directional statistics, the von Mises distribution is a key element in the analysis of circular data. While there is a general agreement regarding the estimation of its location parameter ÎŒ, several methods have been proposed to estimate the concentration parameter Îș. We here provide a thorough evaluation of the behavior of 12 such estimators for datasets of size N ranging from 2 to 8192 generated with a Îș ranging from 0 to 100. We provide detailed results as well as a global analysis of the results, showing that (1) for a given Îș, most estimators have behaviors that are very similar for large datasets (N≄16) and more variable for small datasets, and (2) for a given estimator, results are very similar if we consider the mean absolute error for Îș≀1 and the mean relative absolute error for Îș≄1

    Automated extraction of mutual independence patterns using Bayesian comparison of partition models

    No full text
    International audienceMutual independence is a key concept in statistics that characterizes the structural relationships between variables. Existing methods to investigate mutual independence rely on the definition of two competing models, one being nested into the other and used to generate a null distribution for a statistic of interest, usually under the asymptotic assumption of large sample size. As such, these methods have a very restricted scope of application. In the present manuscript, we propose to change the investigation of mutual independence from a hypothesis-driven task that can only be applied in very specific cases to a blind and automated search within patterns of mutual independence. To this end, we treat the issue as one of model comparison that we solve in a Bayesian framework. We show the relationship between such an approach and existing methods in the case of multivariate normal distributions as well as cross-classified multinomial distributions. We propose a general Markov chain Monte Carlo (MCMC) algorithm to numerically approximate the posterior distribution on the space of all patterns of mutual independence. The relevance of the method is demonstrated on synthetic data as well as two real datasets, showing the unique insight provided by this approach

    Exponential decay of pairwise correlation in Gaussian graphical models with an equicorrelational one-dimensional connection pattern

    No full text
    International audienceWe consider Gaussian graphical models associated with an equicorrelational and one dimensional conditional independence graph. We show that pairwise correlation decays exponentially as a function of distance. We also provide a limit when the number of variables tend to infinity and quantify the difference between the finite and infinite cases
    • 

    corecore