58 research outputs found
Theoretical Analysis of the Stress Induced B-Z Transition in Superhelical DNA
We present a method to calculate the propensities of regions within a DNA molecule to transition from B-form to Z-form under negative superhelical stresses. We use statistical mechanics to analyze the competition that occurs among all susceptible Z-forming regions at thermodynamic equilibrium in a superhelically stressed DNA of specified sequence. This method, which we call SIBZ, is similar to the SIDD algorithm that was previously developed to analyze superhelical duplex destabilization. A state of the system is determined by assigning to each base pair either the B- or the Z-conformation, accounting for the dinucleotide repeat unit of Z-DNA. The free energy of a state is comprised of the nucleation energy, the sequence-dependent B-Z transition energy, and the energy associated with the residual superhelicity remaining after the change of twist due to transition. Using this information, SIBZ calculates the equilibrium B-Z transition probability of each base pair in the sequence. This can be done at any physiologically reasonable level of negative superhelicity. We use SIBZ to analyze a variety of representative genomic DNA sequences. We show that the dominant Z-DNA forming regions in a sequence can compete in highly complex ways as the superhelicity level changes. Despite having no tunable parameters, the predictions of SIBZ agree precisely with experimental results, both for the onset of transition in plasmids containing introduced Z-forming sequences and for the locations of Z-forming regions in genomic sequences. We calculate the transition profiles of 5 kb regions taken from each of 12,841 mouse genes and centered on the transcription start site (TSS). We find a substantial increase in the frequency of Z-forming regions immediately upstream from the TSS. The approach developed here has the potential to illuminate the occurrence of Z-form regions in vivo, and the possible roles this transition may play in biological processes
Stability of mRNA/DNA and DNA/DNA Duplexes Affects mRNA Transcription
Nucleic acids, due to their structural and chemical properties, can form double-stranded secondary structures that assist the transfer of genetic information and can modulate gene expression. However, the nucleotide sequence alone is insufficient in explaining phenomena like intron-exon recognition during RNA processing. This raises the question whether nucleic acids are endowed with other attributes that can contribute to their biological functions. In this work, we present a calculation of thermodynamic stability of DNA/DNA and mRNA/DNA duplexes across the genomes of four species in the genus Saccharomyces by nearest-neighbor method. The results show that coding regions are more thermodynamically stable than introns, 3′-untranslated regions and intergenic sequences. Furthermore, open reading frames have more stable sense mRNA/DNA duplexes than the potential antisense duplexes, a property that can aid gene discovery. The lower stability of the DNA/DNA and mRNA/DNA duplexes of 3′-untranslated regions and the higher stability of genes correlates with increased mRNA level. These results suggest that the thermodynamic stability of DNA/DNA and mRNA/DNA duplexes affects mRNA transcription
Negative Supercoiling Creates Single-Stranded Patches of DNA That Are Substrates for AID–Mediated Mutagenesis
Antibody diversification necessitates targeted mutation of regions within the immunoglobulin locus by activation-induced cytidine deaminase (AID). While AID is known to act on single-stranded DNA (ssDNA), the source, structure, and distribution of these substrates in vivo remain unclear. Using the technique of in situ bisulfite treatment, we characterized these substrates—which we found to be unique to actively transcribed genes—as short ssDNA regions, that are equally distributed on both DNA strands. We found that the frequencies of these ssDNA patches act as accurate predictors of AID activity at reporter genes in hypermutating and class switching B cells as well as in Escherichia coli. Importantly, these ssDNA patches rely on transcription, and we report that transcription-induced negative supercoiling enhances both ssDNA tract formation and AID mutagenesis. In addition, RNaseH1 expression does not impact the formation of these ssDNA tracts indicating that these structures are distinct from R-loops. These data emphasize the notion that these transcription-generated ssDNA tracts are one of many in vivo substrates for AID
Anthroponotic transmission of Cryptosporidium parvum predominates in countries with poorer sanitation - a systematic review and meta-analysis
Background: Globally cryptosporidiosis is one of the commonest causes of mortality in children under 24 months old and may be associated with important longterm health effects. Whilst most strains of Cryptosporidium parvum are zoonotic, C. parvum IIc is almost certainly anthroponotic. The global distribution of this potentially important emerging infection is not clear. Methods: We conducted a systematic review of papers identifying the subtype distribution of C. parvum infections globally. We searched PubMed and Scopus using the following key terms Cryptospor* AND parvum AND (genotyp* OR subtyp* OR gp60). Studies were eligible for inclusion if they had found C. parvum within their human study population and had subtyped some or all of these samples using standard gp60 subtyping. Pooled analyses of the proportion of strains being of the IIc subtype were determined using StatsDirect. Meta-regression analyses were run to determine any association between the relative prevalence of IIc and Gross Domestic Product, proportion of the population with access to improved drinking water and improved sanitation. Results: From an initial 843 studies, 85 were included in further analysis. Cryptosporidium parvum IIc was found in 43 of these 85 studies. Across all studies the pooled estimate of relative prevalence of IIc was 19.0% (95% CI: 12.9–25.9%), but there was substantial heterogeneity. In a meta-regression analysis, the relative proportion of all C. parvum infections being IIc decreased as the percentage of the population with access to improved sanitation increased and was some 3.4 times higher in those studies focussing on HIV-positive indivduals. Conclusions: The anthroponotic C. parvum IIc predominates primarily in lower-income countries with poor sanitation and in HIV-positive individuals. Given the apparent enhanced post-infectious virulence of the other main anthroponotic species of Cryptosporidium (C. hominis), it is important to learn about the impact of this subtype on human health
Anthroponotic transmission of Cryptosporidium parvum predominates in countries with poorer sanitation - a systematic review and meta-analysis
Background: Globally cryptosporidiosis is one of the commonest causes of mortality in children under 24 months old and may be associated with important longterm health effects. Whilst most strains of Cryptosporidium parvum are zoonotic, C. parvum IIc is almost certainly anthroponotic. The global distribution of this potentially important emerging infection is not clear. Methods: We conducted a systematic review of papers identifying the subtype distribution of C. parvum infections globally. We searched PubMed and Scopus using the following key terms Cryptospor* AND parvum AND (genotyp* OR subtyp* OR gp60). Studies were eligible for inclusion if they had found C. parvum within their human study population and had subtyped some or all of these samples using standard gp60 subtyping. Pooled analyses of the proportion of strains being of the IIc subtype were determined using StatsDirect. Meta-regression analyses were run to determine any association between the relative prevalence of IIc and Gross Domestic Product, proportion of the population with access to improved drinking water and improved sanitation. Results: From an initial 843 studies, 85 were included in further analysis. Cryptosporidium parvum IIc was found in 43 of these 85 studies. Across all studies the pooled estimate of relative prevalence of IIc was 19.0% (95% CI: 12.9–25.9%), but there was substantial heterogeneity. In a meta-regression analysis, the relative proportion of all C. parvum infections being IIc decreased as the percentage of the population with access to improved sanitation increased and was some 3.4 times higher in those studies focussing on HIV-positive indivduals. Conclusions: The anthroponotic C. parvum IIc predominates primarily in lower-income countries with poor sanitation and in HIV-positive individuals. Given the apparent enhanced post-infectious virulence of the other main anthroponotic species of Cryptosporidium (C. hominis), it is important to learn about the impact of this subtype on human health
Theoretical Analysis of Competing Conformational Transitions in Superhelical DNA
We develop a statistical mechanical model to analyze the competitive behavior of transitions to multiple alternate conformations in a negatively supercoiled DNA molecule of kilobase length and specified base sequence. Since DNA superhelicity topologically couples together the transition behaviors of all base pairs, a unified model is required to analyze all the transitions to which the DNA sequence is susceptible. Here we present a first model of this type. Our numerical approach generalizes the strategy of previously developed algorithms, which studied superhelical transitions to a single alternate conformation. We apply our multi-state model to study the competition between strand separation and B-Z transitions in superhelical DNA. We show this competition to be highly sensitive to temperature and to the imposed level of supercoiling. Comparison of our results with experimental data shows that, when the energetics appropriate to the experimental conditions are used, the competition between these two transitions is accurately captured by our algorithm. We analyze the superhelical competition between B-Z transitions and denaturation around the c-myc oncogene, where both transitions are known to occur when this gene is transcribing. We apply our model to explore the correlation between stress-induced transitions and transcriptional activity in various organisms. In higher eukaryotes we find a strong enhancement of Z-forming regions immediately 5′ to their transcription start sites (TSS), and a depletion of strand separating sites in a broad region around the TSS. The opposite patterns occur around transcript end locations. We also show that susceptibility to each type of transition is different in eukaryotes and prokaryotes. By analyzing a set of untranscribed pseudogenes we show that the Z-susceptibility just downstream of the TSS is not preserved, suggesting it may be under selection pressure
Mapping the human genetic architecture of COVID-19
Matters Arising to this article was published on 03 August 2022, available online at: https://doi.org/10.1038/s41586-022-04826-7 . A second Matters Arising to this article was published on 06 September 2023, available online at: https://doi.org/10.1038/s41586-023-06355-3 .Data availability:
Summary statistics generated by the COVID-19 HGI are available at https://www.covid19hg.org/results/r5/ and are available in the GWAS Catalog (study code GCST011074). The analyses described here include the freeze-5 data. COVID-19 HGI continues to regularly release new data freezes. Summary statistics for non-European ancestry samples are not currently available due to the small individual sample sizes of these groups, but results for lead variants of 13 loci are reported in Supplementary Table 3. Individual level data can be requested directly from contributing studies, listed in Supplementary Table 1. We used publicly available data from GTEx (https://gtexportal.org/home/), the Neale lab (https://www.nealelab.is/uk-biobank/), Finucane lab (https://www.finucanelab.org), the FinnGen Freeze 4 cohort (https://www.finngen.fi/en/access_results) and the eQTL catalogue release 3 (https://www.ebi.ac.uk/eqtl/).Code availability:
The code for summary statistics lift-over, the projection PCA pipeline including precomputed loadings and meta-analyses are available on GitHub (https://github.com/covid19-hg/) and the code for the Mendelian randomization and genetic correlation pipeline is available on GitHub at https://github.com/marcoralab/MRcovid.Reporting summary:
Further information on research design is available in the Nature Research Reporting Summary linked to this paper online at: https://www.nature.com/articles/s41586-021-03767-x#MOESM2 .Supplementary information is available onlne at: https://www.nature.com/articles/s41586-021-03767-x#Sec24 .Extended data figures and tables are available online at: https://www.nature.com/articles/s41586-021-03767-x#Sec23 .Copyright © The Author(s) 2021. The genetic make-up of an individual contributes to the susceptibility and response to viral infection. Although environmental, clinical and social factors have a role in the chance of exposure to SARS-CoV-2 and the severity of COVID-191,2, host genetics may also be important. Identifying host-specific genetic factors may reveal biological mechanisms of therapeutic relevance and clarify causal relationships of modifiable environmental risk factors for SARS-CoV-2 infection and outcomes. We formed a global network of researchers to investigate the role of human genetics in SARS-CoV-2 infection and COVID-19 severity. Here we describe the results of three genome-wide association meta-analyses that consist of up to 49,562 patients with COVID-19 from 46 studies across 19 countries. We report 13 genome-wide significant loci that are associated with SARS-CoV-2 infection or severe manifestations of COVID-19. Several of these loci correspond to previously documented associations to lung or autoimmune and inflammatory diseases3–7. They also represent potentially actionable mechanisms in response to infection. Mendelian randomization analyses support a causal role for smoking and body-mass index for severe COVID-19 although not for type II diabetes. The identification of novel host genetic factors associated with COVID-19 was made possible by the community of human genetics researchers coming together to prioritize the sharing of data, results, resources and analytical frameworks. This working model of international collaboration underscores what is possible for future genetic discoveries in emerging pandemics, or indeed for any complex human disease
- …