47 research outputs found

    RNA G-quadruplexes are globally unfolded in eukaryotic cells and depleted in bacteria

    Get PDF
    INTRODUCTION: Many cellular RNAs contain regions that fold into stable structures required for function. Both Watson−Crick and noncanonical interactions can play important roles in forming these structures. An intriguing noncanonical structure is the RNA G-quadruplex (RG4), a four-stranded structure containing two or more layers of G-quartets, in which the Watson–Crick face of each of four G residues pairs to the Hoogsteen face of the neighboring G residues. RG4 regions can be very stable in vitro, particularly in the presence of K+, and thus they are generally assumed to be predominantly folded within cells, which have ample K+. Indeed, these structures have been implicated in mRNA processing and translation, with recently proposed roles in cancer and other human diseases. However, the number of cellular RNAs that can fold into RG4 structures has been unclear, as has been the extent to which these RG4 regions are folded in cells. RATIONALE: Enzymes and chemicals that act on RNA with structure-dependent preferences provide valuable tools for detecting and monitoring RNA folding. For example, dimethyl sulfate (DMS) treatment of RNA, either in vitro or in cells, coupled with high-throughput sequencing of abortive primer-extension products can monitor the folding states of many RNAs in one experiment. Analogous high-throughput methods use cell-permeable variants of SHAPE (selective 2′-hydroxyl acylation analyzed by primer extension) reagents. These methods reveal important differences between RNA structures formed in vivo and those formed in vitro. However, they are designed to detect Watson−Crick pairing and thus do not identify RG4 structures or provide information on their folding states. After recognizing that RG4 regions can block reverse transcriptase, we reasoned that this property, together with the known ability of RG4s to protect the N7 of participating G nucleotides from DMS modification, could be used to develop a suite of high-throughput methods to both identify endogenous RNAs that can fold into RG4s in vitro and determine whether these regions also fold in cells. RESULTS: We first developed a high-throughput method that identifies RG4 regions on the basis of their propensity to stall reverse transcriptase in a K+-dependent manner. Applying this method to RNA from mammalian cell lines and yeast, we identified >10,000 endogenous regions that form RG4s in vitro, thereby expanding by a factor of >100 the catalog of endogenous regions with experimentally supported propensity to fold into RG4 structures. To infer the folding state of these RG4 regions in vitro and in cells, DMS treatment was performed before profiling of reverse-transcriptase stops. These analyses showed that, in contrast to previous assumptions, regions that folded into RG4 structures in vitro were overwhelmingly unfolded in vivo, as indicated by their accessibility to DMS modification in cells. A complementary probing strategy using a SHAPE reagent confirmed the unfolded state of most RG4 regions in eukaryotic cells. Moreover, RG4 regions remained unfolded both in cells depleted of adenosine 5′-triphosphate and in cells lacking a helicase known to unfold RG4 regions in vitro. Applying our probing methods to bacteria revealed a different behavior, in that model RG4 regions that were unfolded in eukaryotic cells were folded when expressed in Escherichia coli. However, these ectopically expressed quadruplexes impaired mRNA translation and cell growth, which helps explain why very few endogenous sequences that could fold into RG4s were detected in the transcriptomes of E. coli and the two other eubacteria analyzed. CONCLUSION: In mammals, thousands of endogenous RNA sequences have regions that can fold into RG4s in vitro, but these regions are globally unfolded in eukaryotic cells, presumably by robust and effective machinery that remains to be fully characterized. In contrast, RG4 regions are permitted to fold in E. coli cells, but E. coli and other bacteria have undergone evolutionary depletion of endogenous RG4-forming sequences

    Generation of ESTs for Flowering Gene Discovery and SSR Marker Development in Upland Cotton

    Get PDF
    BACKGROUND: Upland cotton, Gossypium hirsutum L., is one of the world's most important economic crops. In the absence of the entire genomic sequence, a large number of expressed sequence tag (EST) resources of upland cotton have been generated and used in several studies. However, information about the flower development of this species is rare. METHODOLOGY/PRINCIPAL FINDINGS: To clarify the molecular mechanism of flower development in upland cotton, 22,915 high-quality ESTs were generated and assembled into 14,373 unique sequences consisting of 4,563 contigs and 9,810 singletons from a normalized and full-length cDNA library constructed from pooled RNA isolated from shoot apexes, squares, and flowers. Comparative analysis indicated that 5,352 unique sequences had no high-degree matches to the cotton public database. Functional annotation showed that several upland cotton homologs with flowering-related genes were identified in our library. The majority of these genes were specifically expressed in flowering-related tissues. Three GhSEP (G. hirsutum L. SEPALLATA) genes determining floral organ development were cloned, and quantitative real-time PCR (qRT-PCR) revealed that these genes were expressed preferentially in squares or flowers. Furthermore, 670 new putative microsatellites with flanking sequences sufficient for primer design were identified from the 645 unigenes. Twenty-five EST-simple sequence repeats were randomly selected for validation and transferability testing in 17 Gossypium species. Of these, 23 were identified as true-to-type simple sequence repeat loci and were highly transferable among Gossypium species. CONCLUSIONS/SIGNIFICANCE: A high-quality, normalized, full-length cDNA library with a total of 14,373 unique ESTs was generated to provide sequence information for gene discovery and marker development related to upland cotton flower development. These EST resources form a valuable foundation for gene expression profiling analysis, functional analysis of newly discovered genes, genetic linkage, and quantitative trait loci analysis

    Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019

    Get PDF
    Background: In an era of shifting global agendas and expanded emphasis on non-communicable diseases and injuries along with communicable diseases, sound evidence on trends by cause at the national level is essential. The Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) provides a systematic scientific assessment of published, publicly available, and contributed data on incidence, prevalence, and mortality for a mutually exclusive and collectively exhaustive list of diseases and injuries. Methods: GBD estimates incidence, prevalence, mortality, years of life lost (YLLs), years lived with disability (YLDs), and disability-adjusted life-years (DALYs) due to 369 diseases and injuries, for two sexes, and for 204 countries and territories. Input data were extracted from censuses, household surveys, civil registration and vital statistics, disease registries, health service use, air pollution monitors, satellite imaging, disease notifications, and other sources. Cause-specific death rates and cause fractions were calculated using the Cause of Death Ensemble model and spatiotemporal Gaussian process regression. Cause-specific deaths were adjusted to match the total all-cause deaths calculated as part of the GBD population, fertility, and mortality estimates. Deaths were multiplied by standard life expectancy at each age to calculate YLLs. A Bayesian meta-regression modelling tool, DisMod-MR 2.1, was used to ensure consistency between incidence, prevalence, remission, excess mortality, and cause-specific mortality for most causes. Prevalence estimates were multiplied by disability weights for mutually exclusive sequelae of diseases and injuries to calculate YLDs. We considered results in the context of the Socio-demographic Index (SDI), a composite indicator of income per capita, years of schooling, and fertility rate in females younger than 25 years. Uncertainty intervals (UIs) were generated for every metric using the 25th and 975th ordered 1000 draw values of the posterior distribution. Findings: Global health has steadily improved over the past 30 years as measured by age-standardised DALY rates. After taking into account population growth and ageing, the absolute number of DALYs has remained stable. Since 2010, the pace of decline in global age-standardised DALY rates has accelerated in age groups younger than 50 years compared with the 1990–2010 time period, with the greatest annualised rate of decline occurring in the 0–9-year age group. Six infectious diseases were among the top ten causes of DALYs in children younger than 10 years in 2019: lower respiratory infections (ranked second), diarrhoeal diseases (third), malaria (fifth), meningitis (sixth), whooping cough (ninth), and sexually transmitted infections (which, in this age group, is fully accounted for by congenital syphilis; ranked tenth). In adolescents aged 10–24 years, three injury causes were among the top causes of DALYs: road injuries (ranked first), self-harm (third), and interpersonal violence (fifth). Five of the causes that were in the top ten for ages 10–24 years were also in the top ten in the 25–49-year age group: road injuries (ranked first), HIV/AIDS (second), low back pain (fourth), headache disorders (fifth), and depressive disorders (sixth). In 2019, ischaemic heart disease and stroke were the top-ranked causes of DALYs in both the 50–74-year and 75-years-and-older age groups. Since 1990, there has been a marked shift towards a greater proportion of burden due to YLDs from non-communicable diseases and injuries. In 2019, there were 11 countries where non-communicable disease and injury YLDs constituted more than half of all disease burden. Decreases in age-standardised DALY rates have accelerated over the past decade in countries at the lower end of the SDI range, while improvements have started to stagnate or even reverse in countries with higher SDI. Interpretation: As disability becomes an increasingly large component of disease burden and a larger component of health expenditure, greater research and developm nt investment is needed to identify new, more effective intervention strategies. With a rapidly ageing global population, the demands on health services to deal with disabling outcomes, which increase with age, will require policy makers to anticipate these changes. The mix of universal and more geographically specific influences on health reinforces the need for regular reporting on population health in detail and by underlying cause to help decision makers to identify success stories of disease control to emulate, as well as opportunities to improve. Funding: Bill & Melinda Gates Foundation. © 2020 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 licens

    Global, regional, and national burden of disorders affecting the nervous system, 1990–2021: a systematic analysis for the Global Burden of Disease Study 2021

    Get PDF
    BackgroundDisorders affecting the nervous system are diverse and include neurodevelopmental disorders, late-life neurodegeneration, and newly emergent conditions, such as cognitive impairment following COVID-19. Previous publications from the Global Burden of Disease, Injuries, and Risk Factor Study estimated the burden of 15 neurological conditions in 2015 and 2016, but these analyses did not include neurodevelopmental disorders, as defined by the International Classification of Diseases (ICD)-11, or a subset of cases of congenital, neonatal, and infectious conditions that cause neurological damage. Here, we estimate nervous system health loss caused by 37 unique conditions and their associated risk factors globally, regionally, and nationally from 1990 to 2021.MethodsWe estimated mortality, prevalence, years lived with disability (YLDs), years of life lost (YLLs), and disability-adjusted life-years (DALYs), with corresponding 95% uncertainty intervals (UIs), by age and sex in 204 countries and territories, from 1990 to 2021. We included morbidity and deaths due to neurological conditions, for which health loss is directly due to damage to the CNS or peripheral nervous system. We also isolated neurological health loss from conditions for which nervous system morbidity is a consequence, but not the primary feature, including a subset of congenital conditions (ie, chromosomal anomalies and congenital birth defects), neonatal conditions (ie, jaundice, preterm birth, and sepsis), infectious diseases (ie, COVID-19, cystic echinococcosis, malaria, syphilis, and Zika virus disease), and diabetic neuropathy. By conducting a sequela-level analysis of the health outcomes for these conditions, only cases where nervous system damage occurred were included, and YLDs were recalculated to isolate the non-fatal burden directly attributable to nervous system health loss. A comorbidity correction was used to calculate total prevalence of all conditions that affect the nervous system combined.FindingsGlobally, the 37 conditions affecting the nervous system were collectively ranked as the leading group cause of DALYs in 2021 (443 million, 95% UI 378–521), affecting 3·40 billion (3·20–3·62) individuals (43·1%, 40·5–45·9 of the global population); global DALY counts attributed to these conditions increased by 18·2% (8·7–26·7) between 1990 and 2021. Age-standardised rates of deaths per 100 000 people attributed to these conditions decreased from 1990 to 2021 by 33·6% (27·6–38·8), and age-standardised rates of DALYs attributed to these conditions decreased by 27·0% (21·5–32·4). Age-standardised prevalence was almost stable, with a change of 1·5% (0·7–2·4). The ten conditions with the highest age-standardised DALYs in 2021 were stroke, neonatal encephalopathy, migraine, Alzheimer's disease and other dementias, diabetic neuropathy, meningitis, epilepsy, neurological complications due to preterm birth, autism spectrum disorder, and nervous system cancer.InterpretationAs the leading cause of overall disease burden in the world, with increasing global DALY counts, effective prevention, treatment, and rehabilitation strategies for disorders affecting the nervous system are needed

    Expanded identification and characterization of mammalian circular RNAs

    No full text
    Background: The recent reports of two circular RNAs (circRNAs) with strong potential to act as microRNA (miRNA) sponges suggest that circRNAs might play important roles in regulating gene expression. However, the global properties of circRNAs are not well understood.Results: We developed a computational pipeline to identify circRNAs and quantify their relative abundance from RNA-seq data. Applying this pipeline to a large set of non-poly(A)-selected RNA-seq data from the ENCODE project, we annotated 7,112 human circRNAs that were estimated to comprise at least 10% of the transcripts accumulating from their loci. Most circRNAs are expressed in only a few cell types and at low abundance, but they are no more cell-type-specific than are mRNAs with similar overall expression levels. Although most circRNAs overlap protein-coding sequences, ribosome profiling provides no evidence for their translation. We also annotated 635 mouse circRNAs, and although 20% of them are orthologous to human circRNAs, the sequence conservation of these circRNA orthologs is no higher than that of their neighboring linear exons. The previously proposed miR-7 sponge, CDR1as, is one of only two circRNAs with more miRNA sites than expected by chance, with the next best miRNA-sponge candidate deriving from a gene encoding a primate-specific zinc-finger protein, ZNF91. Conclusions: Our results provide a new framework for future investigation of this intriguing topological isoform while raising doubts regarding a biological function of most circRNAs.ASTAR (Agency for Sci., Tech. and Research, S’pore)Published versio

    Genome-wide CRISPR screens identify noncanonical translation factor eIF2A as an enhancer of SARS-CoV-2 programmed −1 ribosomal frameshifting

    No full text
    Summary: Many positive-strand RNA viruses, including all known coronaviruses, employ programmed −1 ribosomal frameshifting (−1 PRF) to regulate the translation of polycistronic viral RNAs. However, only a few host factors have been shown to regulate −1 PRF. Through a genome-wide CRISPR-Cas9 knockout screen, we have identified host factors that either suppress or enhance severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) –1 PRF. Among them, eukaryotic translation initiation factor 2A (eIF2A) specifically and directly enhances −1 PRF independent of changes in initiation. Consistent with the crucial role of efficient −1 PRF in transcriptase/replicase expression, loss of eIF2A reduces SARS-CoV-2 replication in cells. Furthermore, transcriptome-wide analysis shows that eIF2A preferentially binds CG-rich RNA motifs, including a region within 18S ribosomal RNA near the contacts between the SARS-CoV-2 frameshift-stimulatory element (FSE) and the ribosome. Thus, our results indicate a role for eIF2A in modulating the translation of specific RNAs independent of its role during initiation

    C9orf72 arginine-rich dipeptide repeats inhibit UPF1-mediated RNA decay via translational repression

    No full text
    C9orf72 repeat expansion is the major genetic cause of familial amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Here, the authors show that transcriptome aberrations commonly found in c9ALS/FTD are a result from defects in cellular RNA surveillance pathways that involve an RNA helicase UPF1
    corecore