7 research outputs found

    Fast lossless compression via cascading Bloom filters

    Full text link

    MGMR: leveraging RNA-Seq population data to optimize expression estimation

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>RNA-Seq is a technique that uses Next Generation Sequencing to identify transcripts and estimate transcription levels. When applying this technique for quantification, one must contend with reads that align to multiple positions in the genome (multireads). Previous efforts to resolve multireads have shown that RNA-Seq expression estimation can be improved using probabilistic allocation of reads to genes. These methods use a probabilistic generative model for data generation and resolve ambiguity using likelihood-based approaches. In many instances, RNA-seq experiments are performed in the context of a population. The generative models of current methods do not take into account such population information, and it is an open question whether this information can improve quantification of the individual samples</p> <p>Results</p> <p>In order to explore the contribution of population level information in RNA-seq quantification, we apply a hierarchical probabilistic generative model, which assumes that expression levels of different individuals are sampled from a Dirichlet distribution with parameters specific to the population, and reads are sampled from the distribution of expression levels. We introduce an optimization procedure for the estimation of the model parameters, and use HapMap data and simulated data to demonstrate that the model yields a significant improvement in the accuracy of expression levels of paralogous genes.</p> <p>Conclusions</p> <p>We provide a proof of principal of the benefit of drawing on population commonalities to estimate expression. The results of our experiments demonstrate this approach can be beneficial, primarily for estimation at the gene level.</p

    Systems-level dynamic analyses of fate change in murine embryonic stem cells

    Get PDF
    Molecular regulation of embryonic stem cell (ESC) fate involves a coordinated interaction between epigenetic, transcriptional and translational mechanisms. It is unclear how these different molecular regulatory mechanisms interact to regulate changes in stem cell fate. Here we present a dynamic systems-level study of cell fate change in murine ESCs following a well-defined perturbation. Global changes in histone acetylation, chromatin-bound RNA polymerase II, messenger RNA (mRNA), and nuclear protein levels were measured over 5 days after downregulation of Nanog, a key pluripotency regulator. Our data demonstrate how a single genetic perturbation leads to progressive widespread changes in several molecular regulatory layers, and provide a dynamic view of information flow in the epigenome, transcriptome and proteome. We observe that a large proportion of changes in nuclear protein levels are not accompanied by concordant changes in the expression of corresponding mRNAs, indicating important roles for translational and post-translational regulation of ESC fate. Gene-ontology analysis across different molecular layers indicates that although chromatin reconfiguration is important for altering cell fate, it is preceded by transcription-factor-mediated regulatory events. The temporal order of gene expression alterations shows the order of the regulatory network reconfiguration and offers further insight into the gene regulatory network. Our studies extend the conventional systems biology approach to include many molecular species, regulatory layers and temporal series, and underscore the complexity of the multilayer regulatory mechanisms responsible for changes in protein expression that determine stem cell fat

    Patient-specific induced pluripotent stem-cell-derived models of LEOPARD syndrome

    No full text
    Generation of reprogrammed induced pluripotent stem cells (iPSC) from patients with defined genetic disorders promises important avenues to understand the etiologies of complex diseases, and the development of novel therapeutic interventions. We have generated iPSC from patients with LEOPARD syndrome (LS; acronym of its main features: Lentigines, Electrocardiographic abnormalities, Ocular hypertelorism, Pulmonary valve stenosis, Abnormal genitalia, Retardation of growth and Deafness), an autosomal dominant developmental disorder belonging to a relatively prevalent class of inherited RAS-MAPK signaling diseases, which also includes Noonan syndrome (NS), with pleiomorphic effects on several tissues and organ systems1,2. The patient-derived cells have a mutation in the PTPN11 gene, which encodes the SHP2 phosphatase. The iPSC have been extensively characterized and produce multiple differentiated cell lineages. A major disease phenotype in patients with LEOPARD syndrome is hypertrophic cardiomyopathy. We show that in vitro-derived cardiomyocytes from LS-iPSC are larger, have a higher degree of sarcomeric organization and preferential localization of NFATc4 in the nucleus when compared to cardiomyocytes derived from human embryonic stem cells (HESC) or wild type (wt) iPSC derived from a healthy brother of one of the LS patients. These features correlate with a potential hypertrophic state. We also provide molecular insights into signaling pathways that may promote the disease phenotype
    corecore