955 research outputs found

    Back-translation for discovering distant protein homologies

    Get PDF
    Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level. To cope with this situation, we propose a novel method to infer distant homology relations of two proteins, that accounts for frameshift and point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. This allows us to uncover evolutionary information that is not captured by traditional alignment methods, which is confirmed by biologically significant examples.Comment: The 9th International Workshop in Algorithms in Bioinformatics (WABI), Philadelphia : \'Etats-Unis d'Am\'erique (2009

    A genome search for primary vesicoureteral reflux shows further evidence for genetic heterogeneity

    Get PDF
    Vesicoureteral reflux (VUR) is the most common disease of the urinary tract in children. In order to identify gene(s) involved in this complex disorder, we performed a genome-wide search in a selected sample of 31 patients with primary VUR from eight families originating from southern Italy. Sixteen additional families with 41 patients were included in a second stage. Nonparametric, affected-only linkage analysis identified four genomic areas on chromosomes 1, 3, and 4 (p < 0.05); the best result corresponded to the D3S3681-D3S1569 interval on chromosome 3 (nonparametric linkage score, NPL = 2.75, p = 0.008). This region was then saturated with 26 additional markers, tested in the complete group of 72 patients from 24 families (NPL = 2.01, p = 0.01). We identified a genomic area on 3q22.2-23, where 26 patients from six multiplex families shared overlapping haplotypes. However, we did not find evidence for a common ancestral haplotype. The region on chromosome 1 was delimited to 1p36.2-34.3 (D1S228-D1S255, max. NPL = 1.70, p = 0.03), after additional fine typing. Furthermore, on chromosome 22q11.22-12.3, patients from a single family showed excess allele sharing (NPL = 3.35, p = 0.015). Only the chromosome 3q region has been previously reported in the single genome-wide screening available for primary VUR. Our results suggest the presence of several novel loci for primary VUR, giving further evidence for the genetic heterogeneity of this disorder

    Alternative splicing in the fragile X gene <i>FMR1</i>

    Get PDF
    Human Molecular Genetics 2 pp. 399-404 (1993)The authors wish to note a mistake which was incorporated in figure 3 where both Asp and Asn were given the letter code N. A correct version of the figure and its legend is printed below.</p

    Alternative splicing in the fragile X gene <i>FMR1</i>

    Get PDF
    Human Molecular Genetics 2 pp. 399-404 (1993)The authors wish to note a mistake which was incorporated in figure 3 where both Asp and Asn were given the letter code N. A correct version of the figure and its legend is printed below.</p

    A Novel RNA Transcript with Antiapoptotic Function Is Silenced in Fragile X Syndrome

    Get PDF
    Several genome-wide transcriptomics efforts have shown that a large percentage of the mammalian genome is transcribed into RNAs, however, only a small percentage (1–2%) of these RNAs is translated into proteins. Currently there is an intense interest in characterizing the function of the different classes of noncoding RNAs and their relevance to human disease. Using genomic approaches we discovered FMR4, a primate-specific noncoding RNA transcript (2.4 kb) that resides upstream and likely shares a bidirectional promoter with FMR1. FMR4 is a product of RNA polymerase II and has a similar half-life to FMR1. The CGG expansion in the 5′ UTR of FMR1 appears to affect transcription in both directions as we found FMR4, similar to FMR1, to be silenced in fragile X patients and up-regulated in premutation carriers. Knockdown of FMR4 by several siRNAs did not affect FMR1 expression, nor vice versa, suggesting that FMR4 is not a direct regulatory transcript for FMR1. However, FMR4 markedly affected human cell proliferation in vitro; siRNAs knockdown of FMR4 resulted in alterations in the cell cycle and increased apoptosis, while the overexpression of FMR4 caused an increase in cell proliferation. Collectively, our results demonstrate an antiapoptotic function of FMR4 and provide evidence that a well-studied genomic locus can show unexpected functional complexity. It cannot be excluded that altered FMR4 expression might contribute to aspects of the clinical presentation of fragile X syndrome and/or related disorders

    A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana

    Get PDF
    The mycalesine butterfly Bicyclus anynana, the “Squinting bush brown,” is a model organism in the study of lepidopteran ecology, development, and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species. Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology; 128 Gb of raw Illumina data was filtered to 124 Gb and assembled to a final size of 475 Mb (∼×260 assembly coverage). Contigs were scaffolded using mate-pair, transcriptome, and PacBio data into 10 800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements and encodes a total of 22 642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes. We report a high-quality draft genome sequence for Bicyclus anynana. The genome assembly and annotated gene models are available at LepBase (http://ensembl.lepbase.org/index.html).Peer reviewe
    corecore