185 research outputs found

    Molecular Characterization of a 21.4 Kilobase Antibiotic Resistance Plasmid from an α-Hemolytic Escherichia coli O108:H- Human Clinical Isolate

    Get PDF
    This study characterizes the 21.4 kilobase plasmid pECTm80 isolated from Escherichia coli strain 80, an α hemolytic human clinical diarrhoeal isolate (serotype O108:H-). DNA sequence analysis of pECTm80 revealed it belonged to incompatibility group X1, and contained plasmid partition and toxin-antitoxin systems, an R6K-like triple origin (ori) replication system, genes required for replication regulation, insertion sequences IS1R, ISEc37 and a truncated transposase gene (Tn3-like ΔtnpA) of the Tn3 family, and carried a class 2 integron. The class 2 integron of pECTm80 contains an intact cassette array dfrA1-sat2, encoding resistance to trimethoprim and streptothricin, and an aadA1 gene cassette truncated by the insertion of IS1R. The complex plasmid replication system includes α, β and γ origins of replication. Pairwise BLASTn comparison of pECTm80 with plasmid pE001 reveals a conserved plasmid backbone suggestive of a common ancestral lineage. Plasmid pECTm80 is of potential clinical importance, as it carries multiple genes to ensure its stable maintenance through successive bacterial cell divisions and multiple antibiotic resistance genes

    Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools.</p> <p>Results</p> <p>We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net).</p> <p>Conclusion</p> <p>The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.</p

    Gene prediction in metagenomic fragments: A large scale machine learning approach

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metagenomics is an approach to the characterization of microbial genomes via the direct isolation of genomic sequences from the environment without prior cultivation. The amount of metagenomic sequence data is growing fast while computational methods for metagenome analysis are still in their infancy. In contrast to genomic sequences of single species, which can usually be assembled and analyzed by many available methods, a large proportion of metagenome data remains as unassembled anonymous sequencing reads. One of the aims of all metagenomic sequencing projects is the identification of novel genes. Short length, for example, Sanger sequencing yields on average 700 bp fragments, and unknown phylogenetic origin of most fragments require approaches to gene prediction that are different from the currently available methods for genomes of single species. In particular, the large size of metagenomic samples requires fast and accurate methods with small numbers of false positive predictions.</p> <p>Results</p> <p>We introduce a novel gene prediction algorithm for metagenomic fragments based on a two-stage machine learning approach. In the first stage, we use linear discriminants for monocodon usage, dicodon usage and translation initiation sites to extract features from DNA sequences. In the second stage, an artificial neural network combines these features with open reading frame length and fragment GC-content to compute the probability that this open reading frame encodes a protein. This probability is used for the classification and scoring of gene candidates. With large scale training, our method provides fast single fragment predictions with good sensitivity and specificity on artificially fragmented genomic DNA. Additionally, this method is able to predict translation initiation sites accurately and distinguishes complete from incomplete genes with high reliability.</p> <p>Conclusion</p> <p>Large scale machine learning methods are well-suited for gene prediction in metagenomic DNA fragments. In particular, the combination of linear discriminants and neural networks is promising and should be considered for integration into metagenomic analysis pipelines. The data sets can be downloaded from the URL provided (see Availability and requirements section).</p

    Comparative Genomics Study of Multi-Drug-Resistance Mechanisms in the Antibiotic-Resistant Streptococcus suis R61 Strain

    Get PDF
    BACKGROUND: Streptococcus suis infections are a serious problem for both humans and pigs worldwide. The emergence and increasing prevalence of antibiotic-resistant S. suis strains pose significant clinical and societal challenges. RESULTS: In our study, we sequenced one multi-drug-resistant S. suis strain, R61, and one S. suis strain, A7, which is fully sensitive to all tested antibiotics. Comparative genomic analysis revealed that the R61 strain is phylogenetically distinct from other S. suis strains, and the genome of R61 exhibits extreme levels of evolutionary plasticity with high levels of gene gain and loss. Our results indicate that the multi-drug-resistant strain R61 has evolved three main categories of resistance. CONCLUSIONS: Comparative genomic analysis of S. suis strains with diverse drug-resistant phenotypes provided evidence that horizontal gene transfer is an important evolutionary force in shaping the genome of multi-drug-resistant strain R61. In this study, we discovered novel and previously unexamined mutations that are strong candidates for conferring drug resistance. We believe that these mutations will provide crucial clues for designing new drugs against this pathogen. In addition, our work provides a clear demonstration that the use of drugs has driven the emergence of the multi-drug-resistant strain R61

    Evidence for the Role of Horizontal Transfer in Generating pVT1, a Large Mosaic Conjugative Plasmid from the Clam Pathogen, Vibrio tapetis

    Get PDF
    The marine bacterium Vibrio tapetis is the causative agent of the brown ring disease, which affects the clam Ruditapes philippinarum and causes heavy economic losses in North of Europe and in Eastern Asia. Further characterization of V. tapetis isolates showed that all the investigated strains harbored at least one large plasmid. We determined the sequence of the 82,266 bp plasmid pVT1 from the CECT4600T reference strain and analyzed its genetic content. pVT1 is a mosaic plasmid closely related to several conjugative plasmids isolated from Vibrio vulnificus strains and was shown to be itself conjugative in Vibrios. In addition, it contains DNA regions that have similarity with several other plasmids from marine bacteria (Vibrio sp., Shewanella sp., Listonella anguillarum and Photobacterium profundum). pVT1 contains a number of mobile elements, including twelve Insertion Sequences or inactivated IS genes and an RS1 phage element related to the CTXphi phage of V. cholerae. The genetic organization of pVT1 underscores an important role of horizontal gene transfer through conjugative plasmid shuffling and transposition events in the acquisition of new genetic resources and in generating the pVT1 modular organization. In addition, pVT1 presents a copy number of 9, relatively high for a conjugative plasmid, and appears to belong to a new type of replicon, which may be specific to Vibrionaceae and Shewanelleacae
    corecore