578 research outputs found
Genome landscapes and bacteriophage codon usage
Across all kingdoms of biological life, protein-coding genes exhibit unequal
usage of synonymous codons. Although alternative theories abound, translational
selection has been accepted as an important mechanism that shapes the patterns
of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns
of codon usage across 74 diverse bacteriophages that infect E. coli, P.
aeruginosa and L. lactis as their primary host. We introduce the concept of a
`genome landscape,' which helps reveal non-trivial, long-range patterns in
codon usage across a genome. We develop a series of randomization tests that
allow us to interrogate the significance of one aspect of codon usage, such as
GC content, while controlling for another aspect, such as adaptation to
host-preferred codons. We find that 33 phage genomes exhibit highly non-random
patterns in their GC3-content, use of host-preferred codons, or both. We show
that the head and tail proteins of these phages exhibit significant bias
towards host-preferred codons, relative to the non-structural phage proteins.
Our results support the hypothesis of translational selection on viral genes
for host-preferred codons, over a broad range of bacteriophages.
Comment: 9 color figures, 5 tables, 53 references
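The ideas in this abstract can be illustrated with a minimal Python sketch. It assumes that a genome landscape is the cumulative sum of a mean-centered per-gene quantity (here GC3 content) taken in genome order, and it uses a plain gene-order permutation test as a stand-in for the authors' randomization tests, which the abstract does not specify. All function names are hypothetical.

```python
import random
from itertools import accumulate


def gc3_content(cds):
    """Fraction of complete codons whose third base is G or C."""
    thirds = cds.upper()[2::3]  # third position of each complete codon
    if not thirds:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in thirds) / len(thirds)


def landscape(values):
    """Cumulative sum of mean-centered values, in genome order (assumed definition)."""
    mean = sum(values) / len(values)
    return list(accumulate(v - mean for v in values))


def landscape_range(values):
    """Vertical extent of the landscape; large values suggest long-range structure."""
    walk = landscape(values)
    return max(walk) - min(walk)


def permutation_p_value(values, n_perm=1000, seed=0):
    """Share of gene-order shuffles whose landscape range reaches the observed one."""
    rng = random.Random(seed)
    observed = landscape_range(values)
    shuffled = list(values)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if landscape_range(shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

A small p-value here only says that the ordering of genes matters for the chosen quantity. Controlling for a second aspect of codon usage, as the abstract describes, would require a more constrained shuffle than this one.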
Translational selection on SHH genes
Codon usage bias has been observed in various organisms. In this study, the correlation between SHH gene expression in several tissues and codon usage features was analyzed using bioinformatics methods. We found that translational selection may act on compositional features of this set of genes.
The C-terminal cysteine annulus participates in auto-chaperone function for Salmonella phage P22 tailspike folding and assembly
Elongated trimeric adhesins are a distinct class of proteins employed by phages and viruses to recognize and bind to their host cells, and by bacteria to bind to their target cells and tissues. The tailspikes of E. coli phage K1F and Bacillus phage Ø29 exhibit auto-chaperone activity in their trimeric C-terminal domains. The P22 tailspike is structurally homologous to those adhesins. Though there are no disulfide bonds or reactive cysteines in the native P22 tailspikes, a set of C-terminal cysteines are very reactive in partially folded intermediates, implying an unusual local conformation in the domain. This is likely to be involved in the auto-chaperone function. We examined the unusual reactivity of C-terminal tailspike cysteines during folding and assembly as a potential reporter of auto-chaperone function. Reaction with IAA blocked productive refolding in vitro, but not off-pathway aggregation. Two-dimensional PAGE revealed that the predominant intermediate exhibiting reactive cysteine side chains was a partially folded monomer. Treatment with reducing reagent promoted native trimer formation from these species, consistent with transient disulfide bonds in the auto-chaperone domain. Limited enzymatic digestion and mass spectrometry of folding and assembly intermediates indicated that the C-terminal domain was compact in the protrimer species. These results indicate that the C-terminal domain of the P22 tailspike folds itself and associates prior to formation of the protrimer intermediate, and not after, as previously proposed. The C-terminal cysteines and triple β-helix domains apparently provide the staging for the correct auto-chaperone domain formation, needed for alignment of P22 tailspike native trimer.
A Novel Bioinformatics Strategy for Function Prediction of Poorly-Characterized Protein Genes Obtained from Metagenome Analyses
As a result of remarkable progress in DNA sequencing technology, vast quantities of genomic sequences have been decoded. Homology search for amino acid sequences, such as BLAST, has become a basic tool for assigning functions of genes/proteins when genomic sequences are decoded. Although the homology search has clearly been a powerful and irreplaceable method, the functions of only 50% or fewer of genes can be predicted when a novel genome is decoded. A prediction method independent of the homology search is urgently needed. By analyzing oligonucleotide compositions in genomic sequences, we previously developed a modified Self-Organizing Map "BLSOM" that clustered genomic fragments according to phylotype with no advance knowledge of phylotype. Using BLSOM for di-, tri- and tetrapeptide compositions, we developed a system to enable separation (self-organization) of proteins by function. Analyzing oligopeptide frequencies in proteins previously classified into COGs (clusters of orthologous groups of proteins), BLSOMs could faithfully reproduce the COG classifications. This indicated that proteins, whose functions are unknown because of lack of significant sequence similarity with function-known proteins, can be related to function-known proteins based on similarity in oligopeptide composition. BLSOM was applied to predict functions of vast quantities of proteins derived from mixed genomes in environmental samples.
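The input to such a clustering, an oligopeptide composition vector, can be sketched in a few lines of Python. This only illustrates the feature extraction step; it is not the BLSOM algorithm, and the function name and the choice to normalize by the number of overlapping windows are assumptions.

```python
from collections import Counter


def oligopeptide_frequencies(protein, k=2):
    """Relative frequency of each overlapping k-residue peptide in a protein.

    k=2, 3 and 4 correspond to di-, tri- and tetrapeptide compositions.
    """
    protein = protein.upper()
    counts = Counter(protein[i:i + k] for i in range(len(protein) - k + 1))
    total = sum(counts.values())
    if total == 0:
        return {}  # protein shorter than k residues
    return {peptide: n / total for peptide, n in counts.items()}
```

Proteins with similar frequency vectors would then be placed close together by whatever clustering method is applied downstream.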
Detecting Horizontally Transferred and Essential Genes Based on Dinucleotide Relative Abundance
Various methods have been developed to detect horizontal gene transfer in bacteria, based on anomalous nucleotide composition, assuming that compositional features undergo amelioration in the host genome. Evolutionary theory predicts the inevitability of false positives when essential sequences are strongly conserved. Foreign genes could become more detectable on the basis of their higher order compositions if such features ameliorate more rapidly and uniformly than lower order features. This possibility is tested by comparing the heterogeneities of bacterial genomes with respect to strand-independent first- and second-order features, (i) G + C content and (ii) dinucleotide relative abundance, in 1 kb segments. Although statistical analysis confirms that (ii) is less inhomogeneous than (i) in all 12 species examined, extreme anomalies with respect to (ii) in the Escherichia coli K12 genome are typically co-located with essential genes.
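A strand-independent dinucleotide relative abundance can be sketched as follows. The sketch assumes the commonly used odds-ratio form, rho_XY = f_XY / (f_X f_Y), with frequencies pooled over a segment and its reverse complement; the abstract does not state the exact formula, so this is an illustration and the function names are hypothetical. It also assumes the segment contains only A, C, G and T.

```python
from collections import Counter

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def dinucleotide_relative_abundance(seq):
    """Odds ratio rho_XY = f_XY / (f_X * f_Y), pooled over both strands."""
    seq = seq.upper()
    mono, di = Counter(), Counter()
    for strand in (seq, reverse_complement(seq)):
        mono.update(strand)
        di.update(strand[i:i + 2] for i in range(len(strand) - 1))
    n_mono = sum(mono.values())
    n_di = sum(di.values())
    rho = {}
    for x in "ACGT":
        for y in "ACGT":
            expected = (mono[x] / n_mono) * (mono[y] / n_mono)
            if expected > 0:  # skip pairs involving a base absent from the segment
                rho[x + y] = (di[x + y] / n_di) / expected
    return rho
```

Applied to consecutive 1 kb segments, the distance between each segment's values and the genome-wide values gives one possible measure of the second-order heterogeneity the abstract refers to.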
Determinants of translation efficiency and accuracy
A given protein sequence can be encoded by an astronomical number of alternative nucleotide sequences. Recent research has revealed that this flexibility provides evolution with multiple ways to tune the efficiency and fidelity of protein translation and folding.
Relationship between amino acid composition and gene expression in the mouse genome
Background: Codon bias is a phenomenon that refers to the differences in the frequencies of synonymous codons among different genes. In many organisms, natural selection is considered to be a cause of codon bias because codon usage in highly expressed genes is biased toward optimal codons. Methods have previously been developed to predict the expression level of genes from their nucleotide sequences, which is based on the observation that synonymous codon usage shows an overall bias toward a few codons called major codons. However, the relationship between codon bias and gene expression level, as proposed by the translation-selection model, is less evident in mammals. Findings: We investigated the correlations between the expression levels of 1,182 mouse genes and amino acid composition, as well as between gene expression and codon preference. We found that a weak but significant correlation exists between gene expression levels and amino acid composition in mouse. In total, less than 10% of variation of expression levels is explained by amino acid components. We found the effect of codon preference on gene expression was weaker than the effect of amino acid composition, because no significant correlations were observed with respect to codon preference. Conclusion: These results suggest that it is difficult to predict expression level from amino acid components or from codon bias in mouse.
Primary skin fibroblasts as a model of Parkinson's disease
Parkinson's disease is the second most frequent neurodegenerative disorder. While most cases are sporadic, mutations in a growing number of genes including Parkin (PARK2) and PINK1 (PARK6) have been associated with the disease. Different animal models and cell models like patient skin fibroblasts and recombinant cell lines can be used as model systems for Parkinson's disease. Skin fibroblasts present a system with defined mutations and the cumulative cellular damage of the patients. PINK1 and Parkin genes show relevant expression levels in human fibroblasts, and since both genes participate in stress response pathways, we believe fibroblasts are advantageous for assessing, e.g., the effect of stressors. Furthermore, since a bioenergetic deficit underlies early stage Parkinson's disease, while atrophy underlies later stages, the use of primary cells seems preferable over the use of tumor cell lines. The new option to use fibroblast-derived induced pluripotent stem cells redifferentiated into dopaminergic neurons is an additional benefit. However, the use of fibroblasts also has some drawbacks. We have investigated PARK6 fibroblasts and they mirror closely the respiratory alterations, the expression profiles, the mitochondrial dynamics pathology and the vulnerability to proteasomal stress that have been documented in other model systems. Fibroblasts from patients with PARK2, PARK6, idiopathic Parkinson's disease, Alzheimer's disease, and spinocerebellar ataxia type 2 demonstrated a distinct and unique mRNA expression pattern of key genes in neurodegeneration. Thus, primary skin fibroblasts are a useful Parkinson's disease model, able to serve as a complement to animal mutants, transformed cell lines and patient tissues.
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequence tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns), thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems.
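Relative synonymous codon usage (RSCU) can be sketched as below. The sketch assumes the standard definition (observed count of a codon divided by the mean count across its synonymous family) and the standard genetic code. It is not the authors' pipeline, and the function name is hypothetical.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds_list):
    """RSCU per codon, pooled over a set of in-frame coding sequences."""
    counts = Counter()
    for cds in cds_list:
        cds = cds.upper()
        for i in range(0, len(cds) - 2, 3):
            codon = cds[i:i + 3]
            if codon in CODON_TABLE:  # ignore codons with ambiguous bases
                counts[codon] += 1

    # Group sense codons by the amino acid they encode.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue  # amino acid absent from the input
        for c in codons:
            # Observed count divided by the family's mean count.
            values[c] = counts[c] * len(codons) / total
    return values
```

Computing these values separately for high-expression and low-expression gene sets and comparing them per codon is one way to shortlist candidate optimal codons. The statistical criterion used in the study is not given in the abstract.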