38 research outputs found

    Finite Models of Splicing and Their Complexity

    Over the last two decades, a close collaboration has emerged between computer scientists, biochemists and molecular biologists, which has spurred research into an area known as DNA computing (also biomolecular computing). The work in this thesis belongs to this field and studies a computational model called the splicing system. Splicing is the formal model of the cutting and recombination of DNA molecules under the influence of restriction enzymes. This thesis presents original work in the field of splicing systems which, as the title indicates, can be roughly divided into two parts: 'Finite models of splicing' on the one hand and 'their complexity' on the other. The main contribution of the first part is that it challenges the general assumption that a finite, more realistic definition of splicing is necessarily weak from a computational point of view. We propose and study various alternative models and show that in most cases they have more computational power, often reaching computational completeness. The second part explores other territory. Splicing research has mainly focused on computational power, but complexity considerations have hardly been addressed. Here we introduce notions of time and space complexity for splicing systems. These definitions are used to characterize splicing complexity classes in terms of well-known Turing machine classes. Then, using a new accepting variant of splicing systems, we show that they can also be used as problem solvers. Finally, we study descriptional complexity. We define measures of descriptional complexity for splicing systems and show that, for representing regular languages, they have good properties with respect to finite automata, especially in the accepting variant.

    On the Concepts of Parallelism in Biomolecular Computing

    In this paper we consider DNA and membrane computing, both as theoretical models and as problem-solving devices. The basic motivation behind these models of natural computing is to use parallelism to make hard problems tractable. We analyze the concept of parallelism and show that it has very different meanings in these two models. We introduce the terms 'or-parallelism' and 'and-parallelism' for these two basic types of parallelism.

    Sall4 controls differentiation of pluripotent cells independently of the Nucleosome Remodelling and Deacetylation (NuRD) complex.

    Sall4 is an essential transcription factor for early mammalian development and is frequently overexpressed in cancer. Although it is reported to play an important role in embryonic stem cell (ESC) self-renewal, whether it is an essential pluripotency factor has been disputed. Here, we show that Sall4 is dispensable for mouse ESC pluripotency. Sall4 is an enhancer-binding protein that prevents precocious activation of the neural gene expression programme in ESCs but is not required for maintenance of the pluripotency gene regulatory network. Although a proportion of Sall4 protein physically associates with the Nucleosome Remodelling and Deacetylase (NuRD) complex, Sall4 neither recruits NuRD to chromatin nor influences transcription via NuRD; rather, free Sall4 protein regulates transcription independently of NuRD. We propose a model whereby enhancer binding by Sall4 and other pluripotency-associated transcription factors is responsible for maintaining the balance between transcriptional programmes in pluripotent cells. Funding: Wellcome Trust (PhD Studentship; Senior Fellowship in the Basic Biomedical Sciences [098021/Z/11/Z]), Wellcome Trust and UK Medical Research Council core funding to the Cambridge Stem Cell Institute [079249/Z/06/I], European Union Seventh Framework Programme (FP7) Project '4DCellFate'. This is the final version of the article. It first appeared from The Company of Biologists via http://dx.doi.org/10.1242/dev.13911

    Lineage-Specific Profiling Delineates the Emergence and Progression of Naive Pluripotency in Mammalian Embryogenesis.

    Naive pluripotency is manifest in the preimplantation mammalian embryo. Here we determine transcriptome dynamics of mouse development from the eight-cell stage to postimplantation using lineage-specific RNA sequencing. This method combines high sensitivity and reporter-based fate assignment to acquire the full spectrum of gene expression from discrete embryonic cell types. We define expression modules indicative of developmental state and temporal regulatory patterns marking the establishment and dissolution of naive pluripotency in vivo. Analysis of embryonic stem cells and diapaused embryos reveals near-complete conservation of the core transcriptional circuitry operative in the preimplantation epiblast. Comparison to inner cell masses of marmoset primate blastocysts identifies a similar complement of pluripotency factors but use of alternative signaling pathways. Embryo culture experiments further indicate that marmoset embryos utilize WNT signaling during early lineage segregation, unlike rodents. These findings support a conserved transcription factor foundation for naive pluripotency while revealing species-specific regulatory features of lineage segregation. We thank Peter Humphreys for assistance with imaging, and Samuel Jameson and staff for mouse husbandry. We are grateful to Charis Drummer, Ayako Sedohara, Akiko Shimada, Yuko Yamada, Ryo Oiwa, and Takeshi Kuge for technical support with marmoset embryo recovery. Illumina sequencing was provided by Bettina Haase and Dinko Pavlinic at the EMBL Genomics Core Facility. This work was supported by funding from the Wellcome Trust, the Genome Biology Unit of the European Molecular Biology Laboratory, BBSRC grants BB/G015678/1 and BB/M004023/1, an MRC Centenary Award, and the Louis Jeantet Foundation. A.S. is a Medical Research Council Professor. This is the final version of the article. It first appeared from Elsevier via http://dx.doi.org/10.1016/j.devcel.2015.10.01

    Small Universal Accepting Networks of Evolutionary Processors with Filtered Connections

    In this paper, we present some results regarding the size complexity of Accepting Networks of Evolutionary Processors with Filtered Connections (ANEPFCs). We show that there are universal ANEPFCs of size 10, by devising a method for simulating 2-tag systems. This result significantly improves the known upper bound for the size of universal ANEPFCs, which is 18. We also propose a new, computationally and descriptionally efficient simulation of nondeterministic Turing machines by ANEPFCs. More precisely, we describe (informally, due to space limitations) how ANEPFCs with 16 nodes can simulate in O(f(n)) time any nondeterministic Turing machine of time complexity f(n). Thus the known upper bound for the number of nodes in a network simulating an arbitrary Turing machine is decreased from 26 to 16.

    iNucs: Inter-nucleosome interactions

    [Motivation] Deciphering nucleosome–nucleosome interactions is an important step toward mesoscale description of chromatin organization but computational tools to perform such analyses are not publicly available. [Results] We developed iNucs, a user-friendly and efficient Python-based bioinformatics tool to compute and visualize nucleosome-resolved interactions using standard pairs format input generated from pairtools

    Resetting transcription factor control circuitry toward ground-state pluripotency in human.

    Current human pluripotent stem cells lack the transcription factor circuitry that governs the ground state of mouse embryonic stem cells (ESC). Here, we report that short-term expression of two components, NANOG and KLF2, is sufficient to ignite other elements of the network and reset the human pluripotent state. Inhibition of ERK and protein kinase C sustains a transgene-independent rewired state. Reset cells self-renew continuously without ERK signaling, are phenotypically stable, and are karyotypically intact. They differentiate in vitro and form teratomas in vivo. Metabolism is reprogrammed with activation of mitochondrial respiration as in ESC. DNA methylation is dramatically reduced and transcriptome state is globally realigned across multiple cell lines. Depletion of ground-state transcription factors, TFCP2L1 or KLF4, has marginal impact on conventional human pluripotent stem cells but collapses the reset state. These findings demonstrate feasibility of installing and propagating functional control circuitry for ground-state pluripotency in human cells. This research was supported by the UK Medical Research Council, the Japan Science and Technology agency (JST, PRESTO), the Genome Biology Unit of the European Molecular Biology Laboratory, European Commission projects PluriMes, BetaCellTherapy, EpiGeneSys, and Blueprint, and the Wellcome Trust. Y.T. was a University of Cambridge Herchel Smith Fellow. A.S. is a Medical Research Council Professor.

    Transcriptional diversity during lineage commitment of human blood progenitors.

    Blood cells derive from hematopoietic stem cells through stepwise fating events. To characterize gene expression programs driving lineage choice, we sequenced RNA from eight primary human hematopoietic progenitor populations representing the major myeloid commitment stages and the main lymphoid stage. We identified extensive cell type-specific expression changes: 6711 genes and 10,724 transcripts, enriched in non-protein-coding elements at early stages of differentiation. In addition, we found 7881 novel splice junctions and 2301 differentially used alternative splicing events, enriched in genes involved in regulatory processes. We demonstrated experimentally cell-specific isoform usage, identifying nuclear factor I/B (NFIB) as a regulator of the maturation of megakaryocytes, the platelet precursors. Our data highlight the complexity of fating events in closely related progenitor populations, the understanding of which is essential for the advancement of transplantation and regenerative medicine. The work described in this article was primarily supported by the European Commission Seventh Framework Program through the BLUEPRINT grant with code HEALTH-F5-2011-282510 (D.H., F.B., G.C., J.H.A.M., K.D., L.C., M.F., S.C., S.F., and S.P.G.). Research in the Ouwehand laboratory is further supported by program grants from the National Institute for Health Research (NIHR, www.nihr.ac.uk; to A.A., M.K., P.P., S.B.G.J., S.N., and W.H.O.) and the British Heart Foundation under nos. RP-PG-0310-1002 and RG/09/12/28096 (www.bhf.org.uk; to A.R. and W.J.A.). K.F. and M.K. were supported by Marie Curie funding from the NETSIM FP7 program funded by the European Commission. The laboratory receives funding from the NHS Blood and Transplant for facilities. 
The Cambridge BioResource (www.cambridgebioresource.org.uk), the Cell Phenotyping Hub, and the Cambridge Translational GenOmics laboratory (www.catgo.org.uk) are supported by an NIHR grant to the Cambridge NIHR Biomedical Research Centre (BRC). The BRIDGE-Bleeding and Platelet Disorders Consortium is supported by the NIHR BioResource - Rare Diseases (http://bioresource.nihr.ac.uk/; to E.T., N.F., and Whole Exome Sequencing effort). Research in the Soranzo laboratory (L.V., N.S., and S. Watt) is further supported by the Wellcome Trust (Grant Codes WT098051 and WT091310) and the EU FP7 EPIGENESYS initiative (Grant Code 257082). Research in the Cvejic laboratory (A. Cvejic and C.L.) is funded by Cancer Research UK under grant no. C45041/A14953. S.J.S. is funded by NIHR. M.E.F. is supported by a British Heart Foundation Clinical Research Training Fellowship, no. FS/12/27/29405. E.B.-M. is supported by a Wellcome Trust grant, no. 084183/Z/07/Z. Research in the Laffan laboratory is supported by Imperial College BRC. F.A.C., C.L., and S. Westbury are supported by Medical Research Council Clinical Training Fellowships, and T.B. by a British Society of Haematology/NHS Blood and Transplant grant. R.J.R. is a Principal Research Fellow of the Wellcome Trust, grant no. 082961/Z/07/Z. Research in the Flicek laboratory is also supported by the Wellcome Trust (grant no. 095908) and EMBL. Research in the Bertone laboratory is supported by EMBL. K.F. and C.v.G. are supported by FWO-Vlaanderen through grant G.0B17.13N. P.F. is a compensated member of the Omicia Inc. Scientific Advisory Board. This study made use of data generated by the UK10K Consortium, derived from samples from the Cohorts arm of the project. This is the author's version of the work. It is posted here by permission of the AAAS for personal use, not for redistribution. The definitive version was published in Science on 26/9/14 in volume 345, number 6204, DOI: 10.1126/science.1251033. 
This version will be under embargo until the 26th of March 2015

    Citrullination regulates pluripotency and histone H1 binding to chromatin.

    Citrullination is the post-translational conversion of an arginine residue within a protein to the non-coded amino acid citrulline. This modification leads to the loss of a positive charge and reduction in hydrogen-bonding ability. It is carried out by a small family of tissue-specific vertebrate enzymes called peptidylarginine deiminases (PADIs) and is associated with the development of diverse pathological states such as autoimmunity, cancer, neurodegenerative disorders, prion diseases and thrombosis. Nevertheless, the physiological functions of citrullination remain ill-defined, although citrullination of core histones has been linked to transcriptional regulation and the DNA damage response. PADI4 (also called PAD4 or PADV), the only PADI with a nuclear localization signal, was previously shown to act in myeloid cells where it mediates profound chromatin decondensation during the innate immune response to infection. Here we show that the expression and enzymatic activity of Padi4 are also induced under conditions of ground-state pluripotency and during reprogramming in mouse. Padi4 is part of the pluripotency transcriptional network, binding to regulatory elements of key stem-cell genes and activating their expression. Its inhibition lowers the percentage of pluripotent cells in the early mouse embryo and significantly reduces reprogramming efficiency. Using an unbiased proteomic approach we identify linker histone H1 variants, which are involved in the generation of compact chromatin, as novel PADI4 substrates. Citrullination of a single arginine residue within the DNA-binding site of H1 results in its displacement from chromatin and global chromatin decondensation. Together, these results uncover a role for citrullination in the regulation of pluripotency and provide new mechanistic insights into how citrullination regulates chromatin compaction. Funding: Cancer Research UK. This is the author accepted manuscript. 
The final version is available from the Nature Publishing Group via http://dx.doi.org/10.1038/nature1294

    Nuclear architecture organized by Rif1 underpins the replication-timing program

    DNA replication is temporally and spatially organized in all eukaryotes, yet the molecular control and biological function of the replication-timing program are unclear. Rif1 is required for normal genome-wide regulation of replication timing, but its molecular function is poorly understood. Here we show that in mouse embryonic stem cells, Rif1 coats late-replicating domains and, with Lamin B1, identifies most of the late-replicating genome. Rif1 is an essential determinant of replication timing of non-Lamin B1-bound late domains. We further demonstrate that Rif1 defines and restricts the interactions between replication-timing domains during the G1 phase, thereby revealing a function of Rif1 as organizer of nuclear architecture. Rif1 loss affects both number and replication-timing specificity of the interactions between replication-timing domains. In addition, during the S phase, Rif1 ensures that replication of interacting domains is temporally coordinated. In summary, our study identifies Rif1 as the molecular link between nuclear architecture and replication-timing establishment in mammals