117 research outputs found

    Non-perturbative topological recursion and knot invariants

    Full text link
    The goal of the present thesis is to study new examples, applications and com- putational aspects of the topological recursion formalism introduced by Eynard and Orantin. We develop efficient methods for the calculation of non-perturbative wave functions associated to spectral curves of genus one. Our results are used to test two conjectures. The first one relates perturbative knot invariants obtained from the AJ Conjecture and a state integral model to the wave function obtained from topological recursion. The second conjecture describes the structure of the quantum curve for the Weierstrass spectral curve. We are able to verify the conjectures up to some order in a formal parameter h and we state a stronger version of the conjecture in the case of the Weierstrass curve. Some of these results are based upon joint work with Greyson Potter

    PyHIST: a histological image segmentation tool

    Get PDF
    The development of increasingly sophisticated methods to acquire high resolution images has led to the generation of large collections of biomedical imaging data, including images of tissues and organs. Many of the current machine learning methods that aim to extract biological knowledge from histopathological images require several data preprocessing stages, creating an overhead before the proper analysis. Here we present PyHIST (https://github.com/manuel-munoz-aguirre/PyHIST), an easy-to-use, open source whole slide histological image tissue segmentation and preprocessing tool aimed at data preparation for machine learning applications.Peer ReviewedPreprin

    SelenoDB 1.0 : A Database of Selenoprotein Genes, Proteins and SECIS Elements

    Get PDF
    Selenoproteins are a diverse group of proteins usually misidentified and misannotated in sequence databases. The presence of an in-frame UGA (stop) codon in the coding sequence of selenoprotein genes precludes their identification and correct annotation. The in-frame UGA codons are recoded to cotranslationally incorporate selenocysteine, a rare selenium-containing amino acid. The development of ad hoc experimental and, more recently, computational approaches have allowed the efficient identification and characterization of the selenoproteomes of a growing number of species. Today, dozens of selenoprotein families have been described and more are being discovered in recently sequenced species, but the correct genomic annotation is not available for the majority of these genes. SelenoDB is a long-term project that aims to provide, through the collaborative effort of experimental and computational researchers, automatic and manually curated annotations of selenoprotein genes, proteins and SECIS elements. Version 1.0 of the database includes an initial set of eukaryotic genomic annotations, with special emphasis on the human selenoproteome, for immediate inspection by selenium researchers or incorporation into more general databases. SelenoDB is freely available at http://www.selenodb.org

    SECISearch3 and Seblastian: new tools for prediction of SECIS elements and selenoproteins

    Get PDF
    Selenoproteins are proteins containing an uncommon amino acid selenocysteine (Sec). Sec is inserted by a specific translational machinery that recognizes a stem-loop structure, the SECIS element, at the 3′ UTR of selenoprotein genes and recodes a UGA codon within the coding sequence. As UGA is normally a translational stop signal, selenoproteins are generally misannotated and designated tools have to be developed for this class of proteins. Here, we present two new computational methods for selenoprotein identification and analysis, which we provide publicly through the web servers at http://gladyshevlab.org/SelenoproteinPredictionServer or http://seblastian.crg.es. SECISearch3 replaces its predecessor SECISearch as a tool for prediction of eukaryotic SECIS elements. Seblastian is a new method for selenoprotein gene detection that uses SECISearch3 and then predicts selenoprotein sequences encoded upstream of SECIS elements. Seblastian is able to both identify known selenoproteins and predict new selenoproteins. By applying these tools to diverse eukaryotic genomes, we provide a ranked list of newly predicted selenoproteins together with their annotated cysteine-containing homologues. An analysis of a representative candidate belonging to the AhpC family shows how the use of Sec in this protein evolved in bacterial and eukaryotic lineages

    Needed for completion of the human genome: hypothesis driven experiments and biologically realistic mathematical models

    Get PDF
    With the sponsorship of ``Fundacio La Caixa'' we met in Barcelona, November 21st and 22nd, to analyze the reasons why, after the completion of the human genome sequence, the identification all protein coding genes and their variants remains a distant goal. Here we report on our discussions and summarize some of the major challenges that need to be overcome in order to complete the human gene catalog.Comment: Report and discussion resulting from the `Fundacio La Caixa' gene finding meeting held November 21 and 22 2003 in Barcelon

    Identification and Analysis of Genes and Pseudogenes within Duplicated Regions in the Human and Mouse Genomes

    Get PDF
    The identification and classification of genes and pseudogenes in duplicated regions still constitutes a challenge for standard automated genome annotation procedures. Using an integrated homology and orthology analysis independent of current gene annotation, we have identified 9,484 and 9,017 gene duplicates in human and mouse, respectively. On the basis of the integrity of their coding regions, we have classified them into functional and inactive duplicates, allowing us to define the first consistent and comprehensive collection of 1,811 human and 1,581 mouse unprocessed pseudogenes. Furthermore, of the total of 14,172 human and mouse duplicates predicted to be functional genes, as many as 420 are not included in current reference gene databases and therefore correspond to likely novel mammalian genes. Some of these correspond to partial duplicates with less than half of the length of the original source genes, yet they are conserved and syntenic among different mammalian lineages. The genes and unprocessed pseudogenes obtained here will enable further studies on the mechanisms involved in gene duplication as well as of the fate of duplicated genes

    Genetically predicted telomere length and its relationship with neurodegenerative diseases and life expectancy

    Full text link
    Telomere length (TL) is a biomarker of biological aging. Shorter telomeres have been associated with mortality and increased rates of age-related diseases. However, observational studies are unable to conclude whether TL is causally associated with those outcomes. Mendelian randomization (MR) was developed for assessing causality using genetic variants in epidemiological research. The objective of this study was to test the potential causal role of TL in neurodegenerative disorders and life expectancy through MR analysis. Summary level data were extracted from the most recent genome-wide association studies for TL, Alzheimer's disease (AD), Parkinson's disease, Frontotemporal dementia, Amyotrophic Lateral Sclerosis, Progressive Supranuclear Palsy and life expectancy. MR estimates revealed that longer telomeres inferred a protective effect on risk of AD (OR = 0.964; adjusted p-value = 0.039). Moreover, longer telomeres were significantly associated with increased life expectancy (beta(IVW) = 0.011; adjusted p-value = 0.039). Sensitivity analyses suggested evidence for directional pleiotropy in AD analyses. Our results showed that genetically predicted longer TL may increase life expectancy and play a protective causal effect on AD. We did not observe significant causal relationships between longer TL and other neurodegenerative diseases. This suggests that the involvement of TL on specific biological mechanisms might differ between AD and life expectancy, with respect to that in other neurodegenerative diseases. Moreover, the presence of pleiotropy may reflect the complex interplay between TL homeostasis and AD pathophysiology. Further observational studies are needed to confirm these results. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

    A short motif in Drosophila SECIS Binding Protein 2 provides differential binding affinity to SECIS RNA hairpins

    Get PDF
    Selenoproteins contain the amino acid selenocysteine which is encoded by a UGA Sec codon. Recoding UGA Sec requires a complex mechanism, comprising the cis-acting SECIS RNA hairpin in the 3′UTR of selenoprotein mRNAs, and trans-acting factors. Among these, the SECIS Binding Protein 2 (SBP2) is central to the mechanism. SBP2 has been so far functionally characterized only in rats and humans. In this work, we report the characterization of the Drosophila melanogaster SBP2 (dSBP2). Despite its shorter length, it retained the same selenoprotein synthesis-promoting capabilities as the mammalian counterpart. However, a major difference resides in the SECIS recognition pattern: while human SBP2 (hSBP2) binds the distinct form 1 and 2 SECIS RNAs with similar affinities, dSBP2 exhibits high affinity toward form 2 only. In addition, we report the identification of a K (lysine)-rich domain in all SBP2s, essential for SECIS and 60S ribosomal subunit binding, differing from the well-characterized L7Ae RNA-binding domain. Swapping only five amino acids between dSBP2 and hSBP2 in the K-rich domain conferred reversed SECIS-binding properties to the proteins, thus unveiling an important sequence for form 1 binding

    Convergent phenotypes but non-convergent genomes in simple social insect societies

    Get PDF
    Dissertação de Mestrado na área de especialização em Ciências Jurídico-Forenses apresentada à Faculdade de Direito da Universidade de Coimbr
    corecore