578 research outputs found

    Genome landscapes and bacteriophage codon usage

    Across all kingdoms of biological life, protein-coding genes exhibit unequal usage of synonymous codons. Although alternative theories abound, translational selection has been accepted as an important mechanism that shapes the patterns of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns of codon usage across 74 diverse bacteriophages that infect E. coli, P. aeruginosa and L. lactis as their primary host. We introduce the concept of a "genome landscape," which helps reveal non-trivial, long-range patterns in codon usage across a genome. We develop a series of randomization tests that allow us to interrogate the significance of one aspect of codon usage, such as GC content, while controlling for another aspect, such as adaptation to host-preferred codons. We find that 33 phage genomes exhibit highly non-random patterns in their GC3-content, use of host-preferred codons, or both. We show that the head and tail proteins of these phages exhibit significant bias towards host-preferred codons, relative to the non-structural phage proteins. Our results support the hypothesis of translational selection on viral genes for host-preferred codons, over a broad range of bacteriophages. Comment: 9 Color Figures, 5 Tables, 53 References
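
The GC3-content mentioned in the abstract above is the fraction of codons whose third position is G or C. A minimal sketch of that calculation follows; the function name `gc3_content` is hypothetical and the code is not taken from the paper, which does not describe its implementation here.

```python
def gc3_content(cds: str) -> float:
    """Return the fraction of codons whose third base is G or C.

    Assumes `cds` is an in-frame coding sequence; trailing bases that do
    not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    third_bases = [cds[i + 2] for i in range(0, usable, 3)]
    if not third_bases:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)
```

For example, `gc3_content("ATGGCCAAA")` looks at the third bases G, C and A of the three codons, so two of three codons count towards GC3.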

    Translational selection on SHH genes

    Codon usage bias has been observed in various organisms. In this study, the correlation between SHH gene expression in some tissues and codon usage features was analyzed using bioinformatic methods. We found that translational selection may act on compositional features of this set of genes.

    The C-terminal cysteine annulus participates in auto-chaperone function for Salmonella phage P22 tailspike folding and assembly

    Elongated trimeric adhesins are a distinct class of proteins employed by phages and viruses to recognize and bind to their host cells, and by bacteria to bind to their target cells and tissues. The tailspikes of E. coli phage K1F and Bacillus phage Ø29 exhibit auto-chaperone activity in their trimeric C-terminal domains. The P22 tailspike is structurally homologous to those adhesins. Though there are no disulfide bonds or reactive cysteines in the native P22 tailspikes, a set of C-terminal cysteines are very reactive in partially folded intermediates, implying an unusual local conformation in the domain. This is likely to be involved in the auto-chaperone function. We examined the unusual reactivity of C-terminal tailspike cysteines during folding and assembly as a potential reporter of auto-chaperone function. Reaction with IAA blocked productive refolding in vitro, but not off-pathway aggregation. Two-dimensional PAGE revealed that the predominant intermediate exhibiting reactive cysteine side chains was a partially folded monomer. Treatment with reducing reagent promoted native trimer formation from these species, consistent with transient disulfide bonds in the auto-chaperone domain. Limited enzymatic digestion and mass spectrometry of folding and assembly intermediates indicated that the C-terminal domain was compact in the protrimer species. These results indicate that the C-terminal domain of the P22 tailspike folds itself and associates prior to formation of the protrimer intermediate, and not after, as previously proposed. The C-terminal cysteines and triple β-helix domains apparently provide the staging for the correct auto-chaperone domain formation, needed for alignment of P22 tailspike native trimer.

    A Novel Bioinformatics Strategy for Function Prediction of Poorly-Characterized Protein Genes Obtained from Metagenome Analyses

    As a result of remarkable progress in DNA sequencing technology, vast quantities of genomic sequences have been decoded. Homology search for amino acid sequences, such as BLAST, has become a basic tool for assigning functions of genes/proteins when genomic sequences are decoded. Although the homology search has clearly been a powerful and irreplaceable method, the functions of only 50% or fewer of genes can be predicted when a novel genome is decoded. A prediction method independent of the homology search is urgently needed. By analyzing oligonucleotide compositions in genomic sequences, we previously developed a modified Self-Organizing Map ‘BLSOM’ that clustered genomic fragments according to phylotype with no advance knowledge of phylotype. Using BLSOM for di-, tri- and tetrapeptide compositions, we developed a system to enable separation (self-organization) of proteins by function. Analyzing oligopeptide frequencies in proteins previously classified into COGs (clusters of orthologous groups of proteins), BLSOMs could faithfully reproduce the COG classifications. This indicated that proteins whose functions are unknown because of a lack of significant sequence similarity with function-known proteins can be related to function-known proteins based on similarity in oligopeptide composition. BLSOM was applied to predict functions of vast quantities of proteins derived from mixed genomes in environmental samples.

    Detecting Horizontally Transferred and Essential Genes Based on Dinucleotide Relative Abundance

    Various methods have been developed to detect horizontal gene transfer in bacteria, based on anomalous nucleotide composition, assuming that compositional features undergo amelioration in the host genome. Evolutionary theory predicts the inevitability of false positives when essential sequences are strongly conserved. Foreign genes could become more detectable on the basis of their higher order compositions if such features ameliorate more rapidly and uniformly than lower order features. This possibility is tested by comparing the heterogeneities of bacterial genomes with respect to strand-independent first- and second-order features, (i) G + C content and (ii) dinucleotide relative abundance, in 1 kb segments. Although statistical analysis confirms that (ii) is less inhomogeneous than (i) in all 12 species examined, extreme anomalies with respect to (ii) in the Escherichia coli K12 genome are typically co-located with essential genes.
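
As an illustration of feature (ii) in the abstract above, the sketch below computes a strand-independent dinucleotide relative abundance, taken here as the observed dinucleotide frequency divided by the product of the two mononucleotide frequencies, with counts pooled over a segment and its reverse complement. That definition is a common convention and an assumption on our part, since the abstract does not spell out the formula; the function names and the non-overlapping windowing are likewise hypothetical.

```python
from collections import Counter

BASES = "ACGT"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def windows(seq: str, size: int = 1000) -> list[str]:
    """Split a sequence into non-overlapping windows; a short tail is dropped."""
    return [seq[i:i + size] for i in range(0, len(seq) - size + 1, size)]


def dinucleotide_relative_abundance(seq: str) -> dict[str, float]:
    """Strand-independent rho_xy = f_xy / (f_x * f_y) for one segment.

    Counts are pooled over the segment and its reverse complement.
    Positions containing characters other than A, C, G, T are skipped.
    """
    seq = seq.upper()
    mono, di = Counter(), Counter()
    for strand in (seq, reverse_complement(seq)):
        mono.update(b for b in strand if b in BASES)
        for i in range(len(strand) - 1):
            pair = strand[i:i + 2]
            if pair[0] in BASES and pair[1] in BASES:
                di[pair] += 1
    n_mono, n_di = sum(mono.values()), sum(di.values())
    rho = {}
    for x in BASES:
        for y in BASES:
            fx, fy = mono[x] / n_mono, mono[y] / n_mono
            if fx > 0 and fy > 0 and n_di > 0:
                rho[x + y] = (di[x + y] / n_di) / (fx * fy)
    return rho
```

Applying `dinucleotide_relative_abundance` to each element of `windows(genome, 1000)` gives one profile per 1 kb segment. Segments whose profile lies far from the genome-wide profile would be candidate anomalies, though the distance measure and threshold used in the paper are not given in the abstract.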

    Determinants of translation efficiency and accuracy

    A given protein sequence can be encoded by an astronomical number of alternative nucleotide sequences. Recent research has revealed that this flexibility provides evolution with multiple ways to tune the efficiency and fidelity of protein translation and folding.

    Relationship between amino acid composition and gene expression in the mouse genome

    Background: Codon bias is a phenomenon that refers to the differences in the frequencies of synonymous codons among different genes. In many organisms, natural selection is considered to be a cause of codon bias because codon usage in highly expressed genes is biased toward optimal codons. Methods have previously been developed to predict the expression level of genes from their nucleotide sequences, which is based on the observation that synonymous codon usage shows an overall bias toward a few codons called major codons. However, the relationship between codon bias and gene expression level, as proposed by the translation-selection model, is less evident in mammals. Findings: We investigated the correlations between the expression levels of 1,182 mouse genes and amino acid composition, as well as between gene expression and codon preference. We found that a weak but significant correlation exists between gene expression levels and amino acid composition in mouse. In total, less than 10% of variation of expression levels is explained by amino acid components. We found the effect of codon preference on gene expression was weaker than the effect of amino acid composition, because no significant correlations were observed with respect to codon preference. Conclusion: These results suggest that it is difficult to predict expression level from amino acid components or from codon bias in mouse.

    Primary skin fibroblasts as a model of Parkinson's disease

    Parkinson's disease is the second most frequent neurodegenerative disorder. While most cases occur sporadically, mutations in a growing number of genes including Parkin (PARK2) and PINK1 (PARK6) have been associated with the disease. Different animal models and cell models like patient skin fibroblasts and recombinant cell lines can be used as model systems for Parkinson's disease. Skin fibroblasts present a system with defined mutations and the cumulative cellular damage of the patients. PINK1 and Parkin genes show relevant expression levels in human fibroblasts, and since both genes participate in stress response pathways, we believe fibroblasts are advantageous for assessing, e.g., the effect of stressors. Furthermore, since a bioenergetic deficit underlies early stage Parkinson's disease, while atrophy underlies later stages, the use of primary cells seems preferable over the use of tumor cell lines. The new option to use fibroblast-derived induced pluripotent stem cells redifferentiated into dopaminergic neurons is an additional benefit. However, the use of fibroblasts also has some drawbacks. We have investigated PARK6 fibroblasts and they mirror closely the respiratory alterations, the expression profiles, the mitochondrial dynamics pathology and the vulnerability to proteasomal stress that have been documented in other model systems. Fibroblasts from patients with PARK2, PARK6, idiopathic Parkinson's disease, Alzheimer's disease, and spinocerebellar ataxia type 2 demonstrated a distinct and unique mRNA expression pattern of key genes in neurodegeneration. Thus, primary skin fibroblasts are a useful Parkinson's disease model, able to serve as a complement to animal mutants, transformed cell lines and patient tissues.

    Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta

    Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequence tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns), thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems.
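
The relative synonymous codon usage (RSCU) referred to in the abstract above is conventionally the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. The sketch below assumes that conventional definition and the standard genetic code; the function name `rscu` and the handling of edge cases are our own choices, not the authors' pipeline.

```python
from collections import Counter, defaultdict

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def rscu(cds_list: list[str]) -> dict[str, float]:
    """RSCU per sense codon, pooled over a list of in-frame coding sequences.

    Amino acids that never occur in the input are omitted from the result.
    """
    counts = Counter()
    for cds in cds_list:
        cds = cds.upper()
        for i in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[i:i + 3]
            if codon in CODON_TABLE:  # skips codons with ambiguous bases
                counts[codon] += 1

    families = defaultdict(list)
    for codon, amino_acid in CODON_TABLE.items():
        if amino_acid != "*":  # stop codons are excluded
            families[amino_acid].append(codon)

    result = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue
        expected = total / len(codons)  # equal usage within the family
        for c in codons:
            result[c] = counts[c] / expected
    return result
```

Computing `rscu` separately on highly and lowly expressed gene sets and comparing the two values for each codon is one way to look for candidate optimal codons, in the spirit of the comparison the abstract describes. The statistical criteria the authors applied are not stated here.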