198 research outputs found

    Phamerator: a bioinformatic tool for comparative bacteriophage genomics

    Background: Bacteriophage genomes have mosaic architectures and are replete with small open reading frames of unknown function, presenting challenges in their annotation, comparative analysis, and representation. Results: We describe here a bioinformatic tool, Phamerator, that assorts protein-coding genes into phamilies of related sequences using pairwise comparisons to generate a database of gene relationships. This database is used to generate genome maps of multiple phages that incorporate nucleotide and amino acid sequence relationships, as well as genes containing conserved domains. Phamerator also generates phamily circle representations of gene phamilies, facilitating analysis of the different evolutionary histories of individual genes that migrate through phage populations by horizontal genetic exchange. Conclusions: Phamerator represents a useful tool for comparative genomic analysis and comparative representations of bacteriophage genomes. © 2011 Cresawn et al; licensee BioMed Central Ltd.
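
The grouping step described in this abstract can be illustrated with a minimal sketch. It assumes the pairwise comparisons have already been reduced to a list of gene pairs judged "related", and it joins them by single linkage with a union-find structure. The function name `group_into_families` and the input format are hypothetical; the abstract does not specify Phamerator's actual comparison tools, thresholds, or clustering rule.

```python
from collections import defaultdict


def group_into_families(genes, related_pairs):
    """Group genes into families by single linkage over related pairs.

    Hypothetical helper: `related_pairs` stands in for the outcome of the
    pairwise comparisons; how relatedness is decided is not modeled here.
    """
    parent = {gene: gene for gene in genes}

    def find(gene):
        # Walk up to the representative, compressing the path as we go.
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]
            gene = parent[gene]
        return gene

    for a, b in related_pairs:
        parent[find(a)] = find(b)  # merge the two families

    families = defaultdict(set)
    for gene in genes:
        families[find(gene)].add(gene)
    return sorted(sorted(members) for members in families.values())


if __name__ == "__main__":
    genes = ["g1", "g2", "g3", "g4"]
    pairs = [("g1", "g2"), ("g2", "g3")]
    print(group_into_families(genes, pairs))
```

Under single linkage, `g1` and `g3` land in the same family through `g2` even without a direct hit, while `g4` remains a single-member family.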

    Genome landscapes and bacteriophage codon usage

    Across all kingdoms of biological life, protein-coding genes exhibit unequal usage of synonymous codons. Although alternative theories abound, translational selection has been accepted as an important mechanism that shapes the patterns of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns of codon usage across 74 diverse bacteriophages that infect E. coli, P. aeruginosa and L. lactis as their primary host. We introduce the concept of a "genome landscape," which helps reveal non-trivial, long-range patterns in codon usage across a genome. We develop a series of randomization tests that allow us to interrogate the significance of one aspect of codon usage, such as GC content, while controlling for another aspect, such as adaptation to host-preferred codons. We find that 33 phage genomes exhibit highly non-random patterns in their GC3-content, use of host-preferred codons, or both. We show that the head and tail proteins of these phages exhibit significant bias towards host-preferred codons, relative to the non-structural phage proteins. Our results support the hypothesis of translational selection on viral genes for host-preferred codons, over a broad range of bacteriophages. Comment: 9 Color Figures, 5 Tables, 53 References
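
As a small illustration of the GC3-content statistic named in this abstract, the sketch below computes the fraction of codons in a coding sequence whose third position is G or C, which is the usual definition of GC3. The function name `gc3_content` is hypothetical, and the sketch does not attempt the genome landscape construction or the randomization tests, whose details the abstract does not give.

```python
def gc3_content(cds: str) -> float:
    """Return the fraction of codons with G or C at the third position.

    Assumes `cds` is an in-frame coding sequence; trailing bases that do
    not complete a codon are ignored.
    """
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3      # drop any incomplete final codon
    third_bases = seq[2:usable:3]         # every third base, starting at index 2
    if not third_bases:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


if __name__ == "__main__":
    # Codons: ATG GCC AAA TTT, so the third positions are G, C, A, T.
    print(gc3_content("ATGGCCAAATTT"))
```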

    Cluster M Mycobacteriophages Bongo, PegLeg, and Rey with Unusually Large Repertoires of tRNA Isotypes

    Genomic analysis of a large set of phages infecting the common host Mycobacterium smegmatis mc2155 shows that they span considerable genetic diversity. There are more than 20 distinct types that lack nucleotide similarity with each other, and there is considerable diversity within most of the groups. Three newly isolated temperate mycobacteriophages, Bongo, PegLeg, and Rey, constitute a new group (cluster M), with the closely related phages Bongo and PegLeg forming subcluster M1 and the more distantly related Rey forming subcluster M2. The cluster M mycobacteriophages have siphoviral morphologies with unusually long tails, are homoimmune, and have larger than average genomes (80.2 to 83.7 kbp). They exhibit a variety of features not previously described in other mycobacteriophages, including noncanonical genome architectures and several unusual sets of conserved repeated sequences suggesting novel regulatory systems for both transcription and translation. In addition to containing transfer-messenger RNA and RtcB-like RNA ligase genes, their genomes encode 21 to 24 tRNA genes encompassing complete or nearly complete sets of isotypes. We predict that these tRNAs are used in late lytic growth, likely compensating for the degradation or inadequacy of host tRNAs. They may represent a complete set of tRNAs necessary for late lytic growth, especially when taken together with the apparent lack of codons in the same late genes that correspond to tRNAs that the genomes of the phages do not obviously encode.

    Exploring the mycobacteriophage metaproteome: Phage genomics as an educational platform

    Bacteriophages are the most abundant forms of life in the biosphere and carry genomes characterized by high genetic diversity and mosaic architectures. The complete sequences of 30 mycobacteriophage genomes show them collectively to encode 101 tRNAs, three tmRNAs, and 3,357 proteins belonging to 1,536 "phamilies" of related sequences, and a statistical analysis predicts that these represent approximately 50% of the total number of phamilies in the mycobacteriophage population. These phamilies contain 2.19 proteins on average; more than half (774) of them contain just a single protein sequence. Only six phamilies have representatives in more than half of the 30 genomes, and only three - encoding tape-measure proteins, lysins, and minor tail proteins - are present in all 30 phages, although these phamilies are themselves highly modular, such that no single amino acid sequence element is present in all 30 mycobacteriophage genomes. Of the 1,536 phamilies, only 230 (15%) have amino acid sequence similarity to previously reported proteins, reflecting the enormous genetic diversity of the entire phage population. The abundance and diversity of phages, the simplicity of phage isolation, and the relatively small size of phage genomes support bacteriophage isolation and comparative genomic analysis as a highly suitable platform for discovery-based education. © 2006 Hatfull et al
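
The summary figures quoted in this abstract are mutually consistent, which a short check makes explicit. The counts below are taken directly from the abstract; the variable names are illustrative only.

```python
# Counts as stated in the abstract.
proteins = 3357
phamilies = 1536
single_member_phamilies = 774
phamilies_with_known_similarity = 230

mean_phamily_size = proteins / phamilies
single_member_fraction = single_member_phamilies / phamilies
known_similarity_fraction = phamilies_with_known_similarity / phamilies

print(f"mean proteins per phamily: {mean_phamily_size:.2f}")
print(f"single-member phamilies:   {single_member_fraction:.1%}")
print(f"with known similarity:     {known_similarity_fraction:.0%}")
```

The mean works out to about 2.19 proteins per phamily, single-member phamilies are just over half of the total, and 230 of 1,536 is about 15%, matching the stated values.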

    The use of genomic signature distance between bacteriophages and their hosts displays evolutionary relationships and phage growth cycle determination

    Background: Bacteriophage classification is mainly based on morphological traits and genome characteristics combined with host information and in some cases on phage growth lifestyle. A lack of molecular tools can impede more precise studies on phylogenetic relationships or even a taxonomic classification. The use of methods to analyze genome sequences without the requirement for homology has allowed advances in classification. Results: Here, we propose to use genome sequence signature to characterize bacteriophages and to compare them to their host genome signature in order to obtain host-phage relationships and information on their lifestyle. We analyze the host-phage relationships in the four most representative groups of Caudoviridae, the dsDNA group of phages. We demonstrate that the use of phage genomic signature and its comparison with that of the host allows a grouping of phages and is also able to predict the host-phage relationships (lytic vs. temperate). Conclusions: We can thus condense, in relatively simple figures, this phage information dispersed over many publications.
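
One common way to realize a homology-free "genomic signature" is a vector of short-oligonucleotide (k-mer) frequencies, with a distance between vectors serving as the phage-host comparison. The sketch below takes that approach as an assumption: the abstract does not state the word length, normalization, or distance measure used, so `signature`, `signature_distance`, the default `k=4`, and the Euclidean metric are all illustrative choices.

```python
from collections import Counter
from itertools import product
from math import sqrt


def signature(seq: str, k: int = 4) -> list[float]:
    """Return k-mer frequencies over the ACGT alphabet, in a fixed order.

    Windows containing other symbols (e.g. N) are ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        raise ValueError("sequence has no valid k-mers")
    return [counts[kmer] / total for kmer in kmers]


def signature_distance(seq_a: str, seq_b: str, k: int = 4) -> float:
    """Euclidean distance between the k-mer signatures of two sequences."""
    sig_a, sig_b = signature(seq_a, k), signature(seq_b, k)
    return sqrt(sum((x - y) ** 2 for x, y in zip(sig_a, sig_b)))


if __name__ == "__main__":
    phage = "ATGCGCGATATCGCGCATATGCGC"   # toy sequences, not real genomes
    host = "ATGCGCGATTTCGCGCAAATGCGC"
    print(signature_distance(phage, host))
```

A small distance indicates similar oligonucleotide composition; how such distances relate to lifestyle is a result of the study and is not reproduced by this sketch.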

    Cluster K Mycobacteriophages: Insights into the Evolutionary Origins of Mycobacteriophage TM4

    Five newly isolated mycobacteriophages – Angelica, CrimD, Adephagia, Anaya, and Pixie – have similar genomic architectures to mycobacteriophage TM4, a previously characterized phage that is widely used in mycobacterial genetics. The nucleotide sequence similarities warrant grouping these into Cluster K, with subdivision into three subclusters: K1, K2, and K3. Although the overall genome architectures of these phages are similar, TM4 appears to have lost at least two segments of its genome, a central region containing the integration apparatus, and a segment at the right end. This suggests that TM4 is a recent derivative of a temperate parent, resolving a long-standing conundrum about its biology, in that it was reportedly recovered from a lysogenic strain of Mycobacterium avium, but it is not capable of forming lysogens in any mycobacterial host. Like TM4, all of the Cluster K phages infect both fast- and slow-growing mycobacteria, and all of them – with the exception of TM4 – form stable lysogens in both Mycobacterium smegmatis and Mycobacterium tuberculosis; immunity assays show that all five of these phages share the same immune specificity. TM4 infects these lysogens, suggesting that it was either derived from a heteroimmune temperate parent or that it has acquired a virulent phenotype. We have also characterized a widely-used conditionally replicating derivative of TM4 and identified mutations conferring the temperature-sensitive phenotype. All of the Cluster K phages contain a series of well conserved 13 bp repeats associated with the translation initiation sites of a subset of the genes; approximately one half of these contain an additional sequence feature composed of imperfectly conserved 17 bp inverted repeats separated by a variable spacer. The K1 phages integrate into the host tmRNA, and the Cluster K phages represent potential new tools for the genetics of M. tuberculosis and related species.

    The Structure of the Oligomerization Domain of Lsr2 from Mycobacterium tuberculosis Reveals a Mechanism for Chromosome Organization and Protection

    Lsr2 is a small DNA-binding protein present in mycobacteria and related actinobacteria that regulates gene expression and influences the organization of bacterial chromatin. Lsr2 is a dimer that binds to AT-rich regions of chromosomal DNA and physically protects DNA from damage by reactive oxygen intermediates (ROI). A recent structure of the C-terminal DNA-binding domain of Lsr2 provides a rationale for its interaction with the minor groove of DNA, its preference for AT-rich tracts, and its similarity to other bacterial nucleoid-associated DNA-binding domains. In contrast, the details of Lsr2 dimerization (and oligomerization) via its N-terminal domain, and the mechanism of Lsr2-mediated chromosomal cross-linking and protection, are unknown. We have solved the structure of the N-terminal domain of Lsr2 (N-Lsr2) at 1.73 Å resolution using crystallographic ab initio approaches. The structure shows an intimate dimer of two β–β–α motifs with no close homologues in the structural databases. The organization of individual N-Lsr2 dimers in the crystal also reveals a mechanism for oligomerization. Proteolytic removal of three N-terminal residues from Lsr2 results in the formation of an anti-parallel β-sheet between neighboring molecules and the formation of linear chains of N-Lsr2. Oligomerization can be artificially induced using low concentrations of trypsin and the arrangement of N-Lsr2 into long chains is observed in both monoclinic and hexagonal crystallographic space groups. In solution, oligomerization of N-Lsr2 is also observed following treatment with trypsin. A change in chromosomal topology after the addition of trypsin to full-length Lsr2-DNA complexes and protection of DNA towards DNase digestion can be observed using electron microscopy and electrophoresis. These results suggest a mechanism for oligomerization of Lsr2 via protease-activation leading to chromosome compaction and protection, and concomitant down-regulation of large numbers of genes. This mechanism is likely to be relevant under conditions of stress where cellular proteases are known to be upregulated.