677 research outputs found

    Analysis of 3.5 Million SARS-CoV-2 Sequences Reveals Unique Mutational Trends with Consistent Nucleotide and Codon Frequencies

    Background: Since the onset of the SARS-CoV-2 pandemic, bioinformatic analyses have been performed to understand the nucleotide and synonymous codon usage features and mutational patterns of the virus. However, comparatively few have attempted to perform such analyses on a considerably large cohort of viral genomes while organizing the plethora of available sequence data for a month-by-month analysis to observe changes over time. Here, we aimed to perform sequence composition and mutation analysis of SARS-CoV-2, separating sequences by gene, clade, and timepoints, and contrast the mutational profile of SARS-CoV-2 to other comparable RNA viruses. Methods: Using a cleaned, filtered, and pre-aligned dataset of over 3.5 million sequences downloaded from the GISAID database, we computed nucleotide and codon usage statistics, including calculation of relative synonymous codon usage values. We then calculated codon adaptation index (CAI) changes and a nonsynonymous/synonymous mutation ratio (dN/dS) over time for our dataset. Finally, we compiled information on the types of mutations occurring for SARS-CoV-2 and other comparable RNA viruses, and generated heatmaps showing codon and nucleotide composition at high entropy positions along the Spike sequence. Results: We show that nucleotide and codon usage metrics remain relatively consistent over the 32-month span, though there are significant differences between clades within each gene at various timepoints. CAI and dN/dS values vary substantially between different timepoints and different genes, with Spike gene on average showing both the highest CAI and dN/dS values. Mutational analysis showed that SARS-CoV-2 Spike has a higher proportion of nonsynonymous mutations than analogous genes in other RNA viruses, with nonsynonymous mutations outnumbering synonymous ones by up to 20:1. However, at several specific positions, synonymous mutations were overwhelmingly predominant. Conclusions: Our multifaceted analysis covering both the composition and mutation signature of SARS-CoV-2 gives valuable insight into the nucleotide frequency and codon usage heterogeneity of SARS-CoV-2 over time, and its unique mutational profile compared to other RNA viruses.
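
The relative synonymous codon usage (RSCU) values mentioned in this abstract can be illustrated with a minimal sketch. It assumes the conventional definition (observed codon count divided by the count expected if all synonymous codons for that amino acid were used equally). The abstract does not give the authors' exact procedure, and the two-amino-acid codon table and the function name `rscu` below are illustrative assumptions, not taken from the paper.

```python
from collections import Counter, defaultdict

# Toy subset of the standard genetic code (assumption: a real analysis
# would use the full 64-codon table).
CODON_TO_AA = {
    "GCT": "Ala", "GCC": "Ala", "GCA": "Ala", "GCG": "Ala",
    "AAA": "Lys", "AAG": "Lys",
}


def rscu(seq):
    """Return RSCU per codon for the amino acids present in seq."""
    # Split the in-frame sequence into codons, dropping a trailing partial codon.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TO_AA)

    # Group synonymous codons by the amino acid they encode.
    families = defaultdict(list)
    for codon, aa in CODON_TO_AA.items():
        families[aa].append(codon)

    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from the sequence; RSCU undefined
        expected = total / len(family)  # count under uniform synonymous usage
        for c in family:
            values[c] = counts[c] / expected
    return values


if __name__ == "__main__":
    for codon, value in sorted(rscu("GCTGCTGCCAAA").items()):
        print(codon, round(value, 3))
```

An RSCU of 1 means a codon is used as often as expected under uniform usage; values above 1 indicate preferred codons and values below 1 indicate avoided ones.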

    Inhibition of Multidrug Resistance by SV40 Pseudovirion Delivery of an Antigene Peptide Nucleic Acid (PNA) in Cultured Cells

    Peptide nucleic acid (PNA) is known to bind with extraordinarily high affinity and sequence-specificity to complementary nucleic acid sequences and can be used to suppress gene expression. However, effective delivery into cells is a major obstacle to the development of PNA for gene therapy applications. Here, we present a novel method for the in vitro delivery of antigene PNA to cells. By using a nucleocapsid protein derived from Simian virus 40, we have been able to package PNA into pseudovirions, facilitating the delivery of the packaged PNA into cells. We demonstrate that this system can be used effectively to suppress gene expression associated with multidrug resistance in cancer cells, as shown by RT-PCR, flow cytometry, Western blotting, and cell viability under chemotherapy. The combination of PNA with the SV40-based delivery system is a method for suppressing a gene of interest that could be broadly applied to numerous targets

    Splicing Dysregulation Contributes to the Pathogenicity of Several F9 Exonic Point Variants

    Background: Pre‐mRNA splicing is a complex process requiring the identification of the donor site, acceptor site, and branch point site with an adjacent polypyrimidine tract sequence. Splicing is regulated by splicing regulatory elements (SREs) with both enhancer and suppressor functions. Variants located in exonic regions can impact splicing through dysregulation of native splice sites, SREs, and cryptic splice site activation. While splicing dysregulation is considered the primary disease‐inducing mechanism of synonymous variants, its contribution toward the disease phenotype of non‐synonymous variants is underappreciated. Methods: In this study, we analyzed 415 disease‐causing and 120 neutral F9 exonic point variants, including both synonymous and non‐synonymous variants, for their effect on splicing using a series of in silico splice site prediction tools, SRE prediction tools, and in vitro minigene assays. Results: Using splice site and SRE prediction tools in tandem provided better prediction, but the predictions were not always in agreement with the minigene assays. The net effect of splicing dysregulation caused by variants was context dependent. Minigene assays revealed that perturbed splicing can be found. Conclusion: Synonymous variants primarily cause the disease phenotype via splicing dysregulation, while additional mechanisms such as translation rate also play an important role. Splicing dysregulation is likely to contribute to the disease phenotype of several non‐synonymous variants.

    High Cooperativity of the SV40 Major Capsid Protein VP1 in Virus Assembly

    SV40 is a small, nonenveloped DNA virus with an icosahedral capsid of 45 nm. The outer shell is composed of pentamers of the major capsid protein, VP1, linked via their flexible carboxy-terminal arms. Its morphogenesis occurs by assembly of capsomers around the viral minichromosome. However, the steps leading to the formation of mature virus are poorly understood. Intermediates of the assembly reaction could not be isolated from cells infected with wt SV40. Here we have used recombinant VP1 produced in insect cells for in vitro assembly studies around supercoiled heterologous plasmid DNA carrying a reporter gene. This strategy yields infective nanoparticles, affording a simple quantitative transduction assay. We show that VP1 assembles under physiological conditions into uniform nanoparticles of the same shape, size and CsCl density as the wild type virus. The stoichiometry is one DNA molecule per capsid. VP1 deleted in the C-arm, which is unable to assemble but can bind DNA, was inactive, indicating genuine assembly rather than non-specific DNA-binding. The reaction requires host enzymatic activities, consistent with the participation of chaperones, as recently shown. Our results demonstrate dramatic cooperativity of VP1, with a Hill coefficient of ∼6. These findings suggest that assembly may be a concerted reaction. We propose that concerted assembly is facilitated by simultaneous binding of multiple capsomers to a single DNA molecule, as we have recently reported, thus increasing their local concentration. Emerging principles of SV40 assembly may help in understanding the assembly of other complex systems. In addition, the SV40-based nanoparticles described here are potential gene therapy vectors that combine efficient gene delivery with safety and flexibility.
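
To give a sense of what a Hill coefficient of about 6 implies, the following minimal sketch compares the standard Hill equation at n = 1 and n = 6. The abstract does not state how the coefficient was fitted, so the functional form, the half-maximal concentration `k_half`, and the concentrations used are illustrative assumptions only.

```python
def hill_fraction(conc, k_half, n):
    """Fraction of maximal response under the standard Hill equation.

    conc   : concentration of the assembling component (arbitrary units)
    k_half : concentration giving a half-maximal response (hypothetical)
    n      : Hill coefficient
    """
    return conc**n / (k_half**n + conc**n)


if __name__ == "__main__":
    # Compare a non-cooperative process (n = 1) with a highly cooperative one (n = 6).
    for n in (1, 6):
        below = hill_fraction(0.5, 1.0, n)
        above = hill_fraction(2.0, 1.0, n)
        print(f"n={n}: response at 0.5x k_half = {below:.3f}, at 2x k_half = {above:.3f}")
```

With a high coefficient the response switches from nearly off to nearly on over a narrow concentration range, which is the behavior expected of a concerted reaction.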

    DNA-Free Recombinant SV40 Capsids Protect Mice from Acute Renal Failure by Inducing Stress Response, Survival Pathway and Apoptotic Arrest

    Viruses induce signaling and host defense during infection. Employing these natural trigger mechanisms to combat organ or tissue failure is hampered by the harmful effects of most viruses. Here we demonstrate that SV40 empty capsids (virus-like particles, VLPs), with no DNA, induce host Hsp/c70 and Akt-1 survival pathways, key players in cellular survival mechanisms. We postulated that this signaling might protect against organ damage in vivo. Acute kidney injury (AKI) was chosen as the target. AKI is a critical, prevalent disorder in humans, caused by nephrotoxic agents, sepsis or ischemia, via apoptosis/necrosis of renal tubular cells, with high morbidity and mortality. Systemic administration of VLPs activated Akt-1 and upregulated Hsp/c70 in vivo. Experiments in a mercury-induced AKI mouse model demonstrated that apoptosis, oxidative stress and toxic renal failure were significantly attenuated by pretreatment with capsids prior to the mercury insult. Survival rate increased from 12% to >60%, with a wide dose response. This study demonstrates that SV40 VLPs, devoid of DNA, may potentially be used as a prophylactic agent for AKI. We anticipate that these findings may be extended to a wide range of organ failures, using empty capsids of SV40 as well as other viruses.

    Characterization of Coding Synonymous and Non-Synonymous Variants in ADAMTS13 Using Ex Vivo and In Silico Approaches

    Synonymous variations, which are defined as codon substitutions that do not change the encoded amino acid, were previously thought to have no effect on the properties of the synthesized protein(s). However, mounting evidence shows that these “silent” variations can have a significant impact on protein expression and function and should no longer be considered “silent”. Here, the effects of six synonymous and six non-synonymous variations, previously found in the gene of ADAMTS13, the von Willebrand Factor (VWF) cleaving hemostatic protease, have been investigated using a variety of approaches. The ADAMTS13 mRNA and protein expression levels, as well as the conformation and activity of the variants have been compared to that of wild-type ADAMTS13. Interestingly, not only the non-synonymous variants but also the synonymous variants have been found to change the protein expression levels, conformation and function. Bioinformatic analysis of ADAMTS13 mRNA structure, amino acid conservation and codon usage allowed us to establish correlations between mRNA stability, RSCU, and intracellular protein expression. This study demonstrates that variants and more specifically, synonymous variants can have a substantial and definite effect on ADAMTS13 function and that bioinformatic analysis may allow development of predictive tools to identify variants that will have significant effects on the encoded protein

    Correlation between nucleotide composition and folding energy of coding sequences with special attention to wobble bases

    Background: The secondary structure and complexity of mRNA influence its accessibility to regulatory molecules (proteins, micro-RNAs), its stability and its level of expression. The mobile elements of the RNA sequence, the wobble bases, are expected to regulate the formation of structures encompassing coding sequences. Results: The sequence/folding energy (FE) relationship was studied by statistical, bioinformatic methods in 90 CDS containing 26,370 codons. I found that the FE (dG) associated with coding sequences is significant and negative (407 kcal/1000 bases, mean +/- S.E.M.) indicating that these sequences are able to form structures. However, the FE has only a small free component, less than 10% of the total. The contribution of the 1st and 3rd codon bases to the FE is larger than the contribution of the 2nd (central) bases. It is possible to achieve a ~ 4-fold change in FE by altering the wobble bases in synonymous codons. The sequence/FE relationship can be described with a simple algorithm, and the total FE can be predicted solely from the sequence composition of the nucleic acid. The contributions of different synonymous codons to the FE are additive and one codon cannot replace another. The accumulated contribution of the synonymous codons of an amino acid to the total folding energy of an mRNA is strongly correlated to the relative amount of that amino acid in the translated protein. Conclusion: Synonymous codons are not interchangeable with regard to their role in determining the mRNA FE and the relative amounts of amino acids in the translated protein, even if they are indistinguishable in respect of amino acid coding. Comment: 14 pages including 6 figures and 1 table.

    Genetic polymorphisms of the endocannabinoid system in obesity and diabetes

    The endocannabinoid system (ECS) is involved in many physiological processes including fertility, pain and energy regulation. The aim of this systematic review was to examine the contribution of single nucleotide polymorphisms (SNPs) of the ECS to adiposity and glucose metabolism. Database searches identified 734 articles, of which 65 were included; these covered 70 SNPs in genes coding for cannabinoid receptors 1 and 2 (CB1, CB2), fatty acid amide hydrolase (FAAH) and N‐acyl phosphatidylethanolamine phospholipase D (NAPE‐PLD). No studies included SNPs relating to monoacylglycerol lipase or diacylglycerol lipase. The CB1 receptor SNP rs1049353 showed 17 associations with lower body mass index (BMI) and fat mass (five studies). It also showed three associations with lower insulin levels (one study). Conversely, the CB1 receptor SNP rs806368 was associated with increased BMI and waist circumference (two studies). The FAAH SNP rs324420 was associated with increased obesity (three studies). A haplotype of NAPE‐PLD was associated with decreased BMI (one study). A total of 60 SNPs showed no association with any measured outcome. This review suggests a complex but important role of ECS SNPs in energy and glucose metabolism

    Reduced mRNA Secondary-Structure Stability Near the Start Codon Indicates Functional Genes in Prokaryotes

    Several recent studies have found that selection acts on synonymous mutations at the beginning of genes to reduce mRNA secondary-structure stability, presumably to aid in translation initiation. This observation suggests that a metric of relative mRNA secondary-structure stability, ZΔG, could be used to test whether putative genes are likely to be functionally important. Using the Escherichia coli genome, we compared the mean ZΔG of genes with known functions, genes with known orthologs, genes where function and orthology are unknown, and pseudogenes. Genes in the first two categories demonstrated similar levels of selection for reduced stability (increased ZΔG), whereas for pseudogenes stability did not differ from our null expectation. Surprisingly, genes where function and orthology were unknown were also not different from the null expectation, suggesting that many of these open reading frames are not functionally important. We extended our analysis by constructing a Bayesian phylogenetic mixed model based on data from 145 prokaryotic genomes. As in E. coli, genes with no known function had consistently lower ZΔG, even though we expect that many of the currently unannotated genes will ultimately have their functional utility discovered. Our findings suggest that functional genes tend to evolve increased ZΔG, whereas nonfunctional ones do not. Therefore, ZΔG may be a useful metric for identifying genes of potentially important function and could be used to target genes for further functional study
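
The abstract does not define ZΔG. A minimal sketch, assuming it is a z-score of a gene's folding free energy against a null distribution (for example, energies of the same region after synonymous codons are permuted), might look like the following. The function name `z_delta_g` and all energy values are hypothetical, and the folding energies are taken as given rather than computed.

```python
from statistics import mean, stdev


def z_delta_g(native_dg, null_dgs):
    """Z-score of a native folding energy against a null set of energies.

    native_dg : folding free energy of the real sequence (kcal/mol)
    null_dgs  : folding energies of randomized control sequences (kcal/mol)
    A positive score means the native sequence folds less stably
    (less negative energy) than the null expectation.
    """
    if len(null_dgs) < 2:
        raise ValueError("need at least two null energies")
    sigma = stdev(null_dgs)  # sample standard deviation
    if sigma == 0:
        raise ValueError("null energies have zero variance")
    return (native_dg - mean(null_dgs)) / sigma


if __name__ == "__main__":
    # Hypothetical energies: the native start region is less stable than its controls.
    print(z_delta_g(-2.0, [-6.0, -5.0, -4.0]))
```

Under this assumed definition, a score near zero corresponds to the null expectation reported for pseudogenes, while positive scores correspond to selection for reduced stability.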

    Single nucleotide polymorphisms of 8 inflammation-related genes and their associations with smoking-related cancers

    Tobacco smoke and its metabolites are carcinogens that increase tissue oxidative stress and induce target tissue inflammation. We hypothesized that genetic variation of inflammatory pathway genes plays a role in tobacco-related carcinogenesis and is modified by tobacco smoking. We evaluated the association of 12 single nucleotide polymorphisms of 8 inflammation-related genes with tobacco-related cancers (lung, oropharynx, larynx, esophagus, stomach, liver, bladder, and kidney) using 3 case-control studies from: Los Angeles (population-based; 611 lung and 553 upper aero-digestive tract cancer cases and 1,040 controls), Taixing, China (population-based; 218 esophagus, 206 stomach, 204 liver cancer cases, and 415 controls), and Memorial Sloan-Kettering Cancer Center (hospital-based; 227 bladder cancer cases and 211 controls). After adjusting for age, education, ethnicity, gender, and tobacco smoking, IL10 rs1800871 was inversely associated with oropharyngeal cancer (CT+TT vs. CC adjusted odds ratio [aOR]: 0.69, 95% confidence interval [CI]: 0.50–0.95), and was positively associated with lung cancer among never smokers (TT vs. CT+CC aOR: 2.5, 95% CI: 1.3–5.1) and inversely with oropharyngeal cancer among ever smokers (CT+TT vs. CC aOR: 0.63, 95% CI: 0.41–0.95). Among all pooled never smokers (588 cases and 816 controls), TNF rs1799964 was inversely associated with smoking-related cancer (CC vs. CT+TT aOR: 0.36, 95% CI: 0.17–0.77). Bayesian correction for multiple comparisons suggests that chance is unlikely to explain our findings (although epigenetic mechanisms may be in effect), which support our hypotheses, suggesting that IL10 rs1800871 is a susceptibility marker for oropharyngeal and lung cancers, and that TNF rs1799964 is associated with smoking-related cancers among never smokers. Peer Reviewed. http://deepblue.lib.umich.edu/bitstream/2027.42/78055/1/25214_ftp.pd