247 research outputs found

    Accelerating Bayesian hierarchical clustering of time series data with a randomised algorithm

    Get PDF
    We live in an era of abundant data. This has necessitated the development of new and innovative statistical algorithms to get the most from experimental data. For example, faster algorithms make practical the analysis of larger genomic data sets, allowing us to extend the utility of cutting-edge statistical methods. We present a randomised algorithm that accelerates the clustering of time series data using the Bayesian Hierarchical Clustering (BHC) statistical method. BHC is a general method for clustering any discretely sampled time series data. In this paper we focus on a particular application to microarray gene expression data. We define and analyse the randomised algorithm, before presenting results on both synthetic and real biological data sets. We show that the randomised algorithm leads to substantial gains in speed with minimal loss in clustering quality. The randomised time series BHC algorithm is available as part of the R package BHC, which is available for download from Bioconductor (version 2.10 and above) via http://bioconductor.org/packages/2.10/bioc/html/BHC.html. We have also made available a set of R scripts which can be used to reproduce the analyses carried out in this paper. These are available from the following URL. https://sites.google.com/site/randomisedbhc/

    Apocrine Hidradenocarcinoma of the Scalp: A Classification Conundrum

    Get PDF
    Introduction The classification of malignant sweat gland lesions is complex. Traditionally, cutaneous sweat gland tumors have been classified by either eccrine or apocrine features. Methods A case report of a 33-year-old Hispanic man with a left scalp mass diagnosed as a malignancy of adnexal origin preoperatively is discussed. After presentation at our multidisciplinary tumor board, excision with ipsilateral neck dissection was undertaken. Results Final pathology revealed an apocrine hidradenocarcinoma. The classification and behavior of this entity are discussed in this report. Conclusion Apocrine hidradenocarcinoma can be viewed as an aggressive malignant lesion of cutaneous sweat glands on a spectrum that involves both eccrine and apoeccrine lesions

    Graph-based analysis of the metabolic exchanges between two co-resident intracellular symbionts, baumannia cicadellinicola and sulcia muelleri with their insect host, homalodisca coagulata

    Get PDF
    International audienceEndosymbiotic bacteria from different species can live inside cells of the same eukaryotic organism. Metabolic exchanges occur between host and bacteria but also between different endocytobionts. Since a complete genome annotation is available for both, we built the metabolic network of two endosymbiotic bacteria, Sulcia muelleri and Baumannia cicadellinicola, that live inside specific cells of the sharpshooter Homalodisca coagulata and studied the metabolic exchanges involving transfers of carbon atoms between the three. We automatically determined the set of metabolites potentially exogenously acquired (seeds) for both metabolic networks. We show that the number of seeds needed by both bacteria in the carbon metabolism is extremely reduced. Moreover, only three seeds are common to both metabolic networks, indicating that the complementarity of the two metabolisms is not only manifested in the metabolic capabilities of each bacterium, but also by their different use of the same environment. Furthermore, our results show that the carbon metabolism of S. muelleri may be completely independent of the metabolic network of B. cicadellinicola. On the contrary, the carbon metabolism of the latter appears dependent on the metabolism of S. muelleri, at least for two essential amino acids, threonine and lysine. Next, in order to define which subsets of seeds (precursor sets) are sufficient to produce the metabolites involved in a symbiotic function, we used a graph-based method, PITUFO, that we recently developed. Our results highly refine our knowledge about the complementarity between the metabolisms of the two bacteria and their host. We thus indicate seeds that appear obligatory in the synthesis of metabolites are involved in the symbiotic function. Our results suggest both B. cicadellinicola and S. muelleri may be completely independent of the metabolites provided by the co-resident endocytobiont to produce the carbon backbone of the metabolites provided to the symbiotic system (., thr and lys are only exploited by B. cicadellinicola to produce its proteins)

    Structure and dynamics of the operon map of Buchnera aphidicola sp. strain APS

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Gene expression regulation is still poorly documented in bacteria with highly reduced genomes. Understanding the evolution and mechanisms underlying the regulation of gene transcription in <it>Buchnera aphidicola</it>, the primary endosymbiont of aphids, is expected both to enhance our understanding of this nutritionally based association and to provide an intriguing case-study of the evolution of gene expression regulation in a reduced bacterial genome.</p> <p>Results</p> <p>A Bayesian predictor was defined to infer the <it>B. aphidicola </it>transcription units, which were further validated using transcriptomic data and RT-PCR experiments. The characteristics of <it>B. aphidicola </it>predicted transcription units (TUs) were analyzed in order to evaluate the impact of operon map organization on the regulation of gene transcription.</p> <p>On average, <it>B. aphidicola </it>TUs contain more genes than those of <it>E. coli</it>. The global layout of <it>B. aphidicola </it>operon map was mainly shaped by the big reduction and the rearrangements events, which occurred at the early stage of the symbiosis. Our analysis suggests that this operon map may evolve further only by small reorganizations around the frontiers of <it>B. aphidicola </it>TUs, through promoter and/or terminator sequence modifications and/or by pseudogenization events. We also found that the need for specific transcription regulation exerts some pressure on gene conservation, but not on gene assembling in the operon map in <it>Buchnera</it>. Our analysis of the TUs spacing pointed out that a selection pressure is maintained on the length of the intergenic regions between divergent adjacent gene pairs.</p> <p>Conclusions</p> <p><it>B. aphidicola </it>can seemingly only evolve towards a more polycistronic operon map. This implies that gene transcription regulation is probably subject to weak selection pressure in <it>Buchnera </it>conserving operons composed of genes with unrelated functions.</p

    Evaluation of Jackknife and Bootstrap for Defining Confidence Intervals for Pairwise Agreement Measures

    Get PDF
    Several research fields frequently deal with the analysis of diverse classification results of the same entities. This should imply an objective detection of overlaps and divergences between the formed clusters. The congruence between classifications can be quantified by clustering agreement measures, including pairwise agreement measures. Several measures have been proposed and the importance of obtaining confidence intervals for the point estimate in the comparison of these measures has been highlighted. A broad range of methods can be used for the estimation of confidence intervals. However, evidence is lacking about what are the appropriate methods for the calculation of confidence intervals for most clustering agreement measures. Here we evaluate the resampling techniques of bootstrap and jackknife for the calculation of the confidence intervals for clustering agreement measures. Contrary to what has been shown for some statistics, simulations showed that the jackknife performs better than the bootstrap at accurately estimating confidence intervals for pairwise agreement measures, especially when the agreement between partitions is low. The coverage of the jackknife confidence interval is robust to changes in cluster number and cluster size distribution

    Objective quantification of nanoscale protein distributions

    Get PDF
    Nanoscale distribution of molecules within small subcellular compartments of neurons critically influences their functional roles. Although, numerous ways of analyzing the spatial arrangement of proteins have been described, a thorough comparison of their effectiveness is missing. Here we present an open source software, GoldExt, with a plethora of measures for quantification of the nanoscale distribution of proteins in subcellular compartments (e.g. synapses) of nerve cells. First, we compared the ability of five different measures to distinguish artificial uniform and clustered patterns from random point patterns. Then, the performance of a set of clustering algorithms was evaluated on simulated datasets with predefined number of clusters. Finally, we applied the best performing methods to experimental data, and analyzed the nanoscale distribution of different pre- and postsynaptic proteins, revealing random, uniform and clustered sub-synaptic distribution patterns. Our results reveal that application of a single measure is sufficient to distinguish between different distributions

    CD133 Positive Embryonal Rhabdomyosarcoma Stem-Like Cell Population Is Enriched in Rhabdospheres

    Get PDF
    Cancer stem cells (CSCs) have been identified in a number of solid tumors, but not yet in rhabdomyosarcoma (RMS), the most frequently occurring soft tissue tumor in childhood. Hence, the aim of this study was to identify and characterize a CSC population in RMS using a functional approach. We found that embryonal rhabdomyosarcoma (eRMS) cell lines can form rhabdomyosarcoma spheres (short rhabdospheres) in stem cell medium containing defined growth factors over several passages. Using an orthotopic xenograft model, we demonstrate that a 100 fold less sphere cells result in faster tumor growth compared to the adherent population suggesting that CSCs were enriched in the sphere population. Furthermore, stem cell genes such as oct4, nanog, c-myc, pax3 and sox2 are significantly upregulated in rhabdospheres which can be differentiated into multiple lineages such as adipocytes, myocytes and neuronal cells. Surprisingly, gene expression profiles indicate that rhabdospheres show more similarities with neuronal than with hematopoietic or mesenchymal stem cells. Analysis of these profiles identified the known CSC marker CD133 as one of the genes upregulated in rhabdospheres, both on RNA and protein levels. CD133+ sorted cells were subsequently shown to be more tumorigenic and more resistant to commonly used chemotherapeutics. Using a tissue microarray (TMA) of eRMS patients, we found that high expression of CD133 correlates with poor overall survival. Hence, CD133 could be a prognostic marker for eRMS. These experiments indicate that a CD133+ CSC population can be enriched from eRMS which might help to develop novel targeted therapies against this pediatric tumor

    Overthrowing the dictator: a game-theoretic approach to revolutions and media

    Get PDF
    A distinctive feature of recent revolutions was the key role of social media (e.g. Facebook, Twitter and YouTube). In this paper, we study its role in mobilization. We assume that social media allow potential participants to observe the individual participation decisions of others, while traditional mass media allow potential participants to see only the total number of people who participated before them. We show that when individuals’ willingness to revolt is publicly known, then both sorts of media foster a successful revolution. However, when willingness to revolt is private information, only social media ensure that a revolt succeeds, with mass media multiple outcomes are possible, one of which has individuals not participating in the revolt. This suggests that social media enhance the likelihood that a revolution triumphs more than traditional mass media

    Prominent and Persistent Extraneural Infection in Human PrP Transgenic Mice Infected with Variant CJD

    Get PDF
    Background. The evolution of the variant Creutzfeldt-Jakob disease (vCJD) epidemic is hazardous to predict due to uncertainty in ascertaining the prevalence of infection and because the disease might remain asymptomatic or produce an alternate, sporadic-like phenotype. Methodology/Principal Findings. Transgenic mice were produced that overexpress human prion protein with methionine at codon 129, the only allele found so far in vCJD-affected patients. These mice were infected with prions derived from variant and sporadic CJD (sCJD) cases by intracerebral or intraperitoneal route, and transmission efficiency and strain phenotype were analyzed in brain and spleen. We showed that i) the main features of vCJD infection in humans, including a prominent involvement of the lymphoid tissues compared to that in sCJD infection were faithfully reproduced in such mice; ii) transmission of vCJD agent by intracerebral route could lead to the propagation of either vCJD or sCJD-like prion in the brain, whereas vCJD prion was invariably propagated in the spleen, iii) after peripheral exposure, inefficient neuroinvasion was observed, resulting in an asymptomatic infection with life-long persistence of vCJD prion in the spleen at stable and elevated levels. Conclusion/Significance. Our findings emphasize the possibility that human-to-human transmission of vCJD might produce alternative neuropathogical phenotypes and that lymphoid tissue examination of CJD cases classified as sporadic might reveal an infection by vCJD-type prions. They also provide evidence for the strong propensity of this agent to establish long-lasting, subclinical vCJD infection of lymphoreticular tissues, thus amplifying the risk for iatrogenic transmission
    • …
    corecore