6 research outputs found

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead

    Bidirectional nucleolar dysfunction in C9orf72 frontotemporal lobar degeneration

    Get PDF
    An intronic GGGGCC expansion in C9orf72 is the most common known cause of both frontotemporal lobar degeneration (FTLD) and amyotrophic lateral sclerosis (ALS). The repeat expansion leads to the generation of sense and antisense repeat RNA aggregates and dipeptide repeat (DPR) proteins, generated by repeat-associated non-ATG translation. The arginine-rich DPR proteins poly(glycine-arginine or GR) and poly(proline-arginine or PR) are potently neurotoxic and can localise to the nucleolus when expressed in cells, resulting in enlarged nucleoli with disrupted functionality. Furthermore, GGGGCC repeat RNA can bind nucleolar proteins in vitro. However, the relevance of nucleolar stress is unclear, as the arginine-rich DPR proteins do not localise to the nucleolus in C9orf72-associated FTLD/ALS (C9FTLD/ALS) patient brain. We measured nucleolar size in C9FTLD frontal cortex neurons using a three-dimensional, volumetric approach. Intriguingly, we found that C9FTLD brain exhibited bidirectional nucleolar stress. C9FTLD neuronal nucleoli were significantly smaller than control neuronal nucleoli. However, within C9FTLD brains, neurons containing poly(GR) inclusions had significantly larger nucleolar volumes than neurons without poly(GR) inclusions. In addition, expression of poly(GR) in adult Drosophila neurons led to significantly enlarged nucleoli. A small but significant increase in nucleolar volume was also observed in C9FTLD frontal cortex neurons containing GGGGCC repeat-containing RNA foci. These data show that nucleolar abnormalities are a consistent feature of C9FTLD brain, but that diverse pathomechanisms are at play, involving both DPR protein and repeat RNA toxicity

    Sense and antisense RNA are not toxic in <i>Drosophila </i>models of <i>C9orf72</i>-associated ALS/FTD

    Get PDF
    A GGGGCC hexanucleotide repeat expansion in the C9orf72 gene is the most common genetic cause of amyotrophic lateral sclerosis and frontotemporal dementia. Neurodegeneration may occur via transcription of the repeats into inherently toxic repetitive sense and antisense RNA species, or via repeat-associated non-ATG initiated translation (RANT) of sense and antisense RNA into toxic dipeptide repeat proteins. We have previously demonstrated that regular interspersion of repeat RNA with stop codons prevents RANT (RNA-only models), allowing us to study the role of repeat RNA in isolation. Here we have created novel RNA-only Drosophila models, including the first models of antisense repeat toxicity, and flies expressing extremely large repeats, within the range observed in patients. We generated flies expressing ~ 100 repeat sense or antisense RNA either as part of a processed polyadenylated transcript or intronic sequence. We additionally created Drosophila expressing > 1000 RNA-only repeats in the sense direction. When expressed in adult Drosophila neurons polyadenylated repeat RNA is largely cytoplasmic in localisation, whilst intronic repeat RNA forms intranuclear RNA foci, as does > 1000 repeat RNA, thus allowing us to investigate both nuclear and cytoplasmic RNA toxicity. We confirmed that these RNA foci are capable of sequestering endogenous Drosophila RNA-binding proteins, and that the production of dipeptide proteins (poly-glycine-proline, and poly-glycine-arginine) is suppressed in our models. We find that neither cytoplasmic nor nuclear sense or antisense RNA are toxic when expressed in adult Drosophila neurons, suggesting they have a limited role in disease pathogenesis

    C9orf72 repeat expansions cause neurodegeneration in Drosophila through arginine-rich proteins.

    Get PDF
    An expanded GGGGCC repeat in C9orf72 is the most common genetic cause of frontotemporal dementia and amyotrophic lateral sclerosis. A fundamental question is whether toxicity is driven by the repeat RNA itself and/or by dipeptide repeat proteins generated by repeat-associated, non-ATG translation. To address this question we developed in vitro and in vivo models to dissect repeat RNA and dipeptide repeat protein toxicity. Expression of pure repeats in Drosophila caused adult-onset neurodegeneration attributable to poly-(glycine-arginine) proteins. Thus, expanded repeats promoted neurodegeneration through neurotoxic proteins. Expression of individual dipeptide repeat proteins with a non-GGGGCC RNA sequence showed both poly-(glycine-arginine) and poly-(proline-arginine) proteins caused neurodegeneration. These findings are consistent with a dual toxicity mechanism, whereby both arginine-rich proteins and repeat RNA contribute to C9orf72-mediated neurodegeneration

    The DNA sequence of the human X chromosome

    No full text
    The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence
    corecore