261 research outputs found

    Genome-wide computational prediction of tandem gene arrays: application in yeasts

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>This paper describes an efficient <it>in silico </it>method for detecting tandem gene arrays (TGAs) in fully sequenced and compact genomes such as those of prokaryotes or unicellular eukaryotes. The originality of this method lies in the search of protein sequence similarities in the vicinity of each coding sequence, which allows the prediction of tandem duplicated gene copies independently of their functionality.</p> <p>Results</p> <p>Applied to nine hemiascomycete yeast genomes, this method predicts that 2% of the genes are involved in TGAs and gene relics are present in 11% of TGAs. The frequency of TGAs with degenerated gene copies means that a significant fraction of tandem duplicated genes follows the birth-and-death model of evolution. A comparison of sequence identity distributions between sets of homologous gene pairs shows that the different copies of tandem arrayed paralogs are less divergent than copies of dispersed paralogs in yeast genomes. It suggests that paralogs included in tandem structures are more recent or more subject to the gene conversion mechanism than other paralogs.</p> <p>Conclusion</p> <p>The method reported here is a useful computational tool to provide a database of TGAs composed of functional or nonfunctional gene copies. Such a database has obvious applications in the fields of structural and comparative genomics. Notably, a detailed study of the TGA catalog will make it possible to tackle the fundamental questions of the origin and evolution of tandem gene clusters.</p

    Growth of Yersinia pseudotuberculosis in human plasma: impacts on virulence and metabolic gene expression

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In man, infection by the Gram-negative enteropathogen <it>Yersinia pseudotuberculosis </it>is usually limited to the terminal ileum. However, in immunocompromised patients, the microorganism may disseminate from the digestive tract and thus cause a systemic infection with septicemia.</p> <p>Results</p> <p>To gain insight into the metabolic pathways and virulence factors expressed by the bacterium at the blood stage of pseudotuberculosis, we compared the overall gene transcription patterns (the transcriptome) of bacterial cells cultured in either human plasma or Luria-Bertani medium. The most marked plasma-triggered metabolic consequence in <it>Y. pseudotuberculosis </it>was the switch to high glucose consumption, which is reminiscent of the acetogenic pathway (known as "glucose overflow") in <it>Escherichia coli</it>. However, upregulation of the glyoxylate shunt enzymes suggests that (in contrast to <it>E. coli</it>) acetate may be further metabolized in <it>Y. pseudotuberculosis</it>. Our data also indicate that the bloodstream environment can regulate major virulence genes (positively or negatively); the <it>yadA </it>adhesin gene and most of the transcriptional units of the pYV-encoded type III secretion apparatus were found to be upregulated, whereas transcription of the pH6 antigen locus was strongly repressed.</p> <p>Conclusion</p> <p>Our results suggest that plasma growth of <it>Y. pseudotuberculosis </it>is responsible for major transcriptional regulatory events and prompts key metabolic reorientations within the bacterium, which may in turn have an impact on virulence.</p

    A new pheromone trail-based genetic algorithm for comparative genome assembly

    Get PDF
    Gap closing is considered one of the most challenging and time-consuming tasks in bacterial genome sequencing projects, especially with the emergence of new sequencing technologies, such as pyrosequencing, which may result in large amounts of data without the benefit of large insert libraries for contig scaffolding. We propose a novel algorithm to align contigs with more than one reference genome at a time. This approach can successfully overcome the limitations of low degrees of conserved gene order for the reference and target genomes. A pheromone trail-based genetic algorithm (PGA) was used to search globally for the optimal placement for each contig. Extensive testing on simulated and real data sets shows that PGA significantly outperforms previous methods, especially when assembling genomes that are only moderately related. An extended version of PGA can predict additional candidate connections for each contig and can thus increase the likelihood of identifying the correct arrangement of each contig. The software and test data sets can be accessed at http://sourceforge.net/projects/pga4genomics/

    Highly plastic genome of Microcystis aeruginosa PCC 7806, a ubiquitous toxic freshwater cyanobacterium

    Get PDF
    Background The colonial cyanobacterium Microcystis proliferates in a wide range of freshwater ecosystems and is exposed to changing environmental factors during its life cycle. Microcystis blooms are often toxic, potentially fatal to animals and humans, and may cause environmental problems. There has been little investigation of the genomics of these cyanobacteria. Results Deciphering the 5,172,804 bp sequence of Microcystis aeruginosa PCC 7806 has revealed the high plasticity of its genome: 11.7% DNA repeats containing more than 1,000 bases, 6.8% putative transposases and 21 putative restriction enzymes. Compared to the genomes of other cyanobacterial lineages, strain PCC 7806 contains a large number of atypical genes that may have been acquired by lateral transfers. Metabolic pathways, such as fermentation and a methionine salvage pathway, have been identified, Conclusion Microcystis aeruginosa PCC 7806 appears to have adopted an evolutionary strategy relying on unusual genome plasticity to adapt to eutrophic freshwater ecosystems, a property shared by another strain of M. aeruginosa (NIES-843). Comparisons of the genomes of PCC 7806 and other cyanobacterial strains indicate that a similar strategy may have also been used by the marine strain Crocosphaera watsonii WH8501 to adapt to other ecological niches, such as oligotrophic open oceans.

    PCR-Based Identification of Klebsiella pneumoniae subsp. rhinoscleromatis, the Agent of Rhinoscleroma

    Get PDF
    Rhinoscleroma is a chronic granulomatous infection of the upper airways caused by the bacterium Klebsiella pneumoniae subsp. rhinoscleromatis. The disease is endemic in tropical and subtropical areas, but its diagnosis remains difficult. As a consequence, and despite available antibiotherapy, some patients evolve advanced stages that can lead to disfiguration, severe respiratory impairment and death by anoxia. Because identification of the etiologic agent is crucial for the definitive diagnosis of the disease, the aim of this study was to develop two simple PCR assays. We took advantage of the fact that all Klebsiella pneumoniae subsp. rhinoscleromatis isolates are (i) of capsular serotype K3; and (ii) belong to a single clone with diagnostic single nucleotide polymorphisms (SNP). The complete sequence of the genomic region comprising the capsular polysaccharide synthesis (cps) gene cluster was determined. Putative functions of the 21 genes identified were consistent with the structure of the K3 antigen. The K3-specific sequence of gene Kr11509 (wzy) was exploited to set up a PCR test, which was positive for 40 K3 strains but negative when assayed on the 76 other Klebsiella capsular types. Further, to discriminate Klebsiella pneumoniae subsp. rhinoscleromatis from other K3 Klebsiella strains, a specific PCR assay was developed based on diagnostic SNPs in the phosphate porin gene phoE. This work provides rapid and simple molecular tools to confirm the diagnostic of rhinoscleroma, which should improve patient care as well as knowledge on the prevalence and epidemiology of rhinoscleroma

    A comparison of random sequence reads versus 16S rDNA sequences for estimating the biodiversity of a metagenomic library

    Get PDF
    The construction of metagenomic libraries has permitted the study of microorganisms resistant to isolation and the analysis of 16S rDNA sequences has been used for over two decades to examine bacterial biodiversity. Here, we show that the analysis of random sequence reads (RSRs) instead of 16S is a suitable shortcut to estimate the biodiversity of a bacterial community from metagenomic libraries. We generated 10 010 RSRs from a metagenomic library of microorganisms found in human faecal samples. Then searched them using the program BLASTN against a prokaryotic sequence database to assign a taxon to each RSR. The results were compared with those obtained by screening and analysing the clones containing 16S rDNA sequences in the whole library. We found that the biodiversity observed by RSR analysis is consistent with that obtained by 16S rDNA. We also show that RSRs are suitable to compare the biodiversity between different metagenomic libraries. RSRs can thus provide a good estimate of the biodiversity of a metagenomic library and, as an alternative to 16S, this approach is both faster and cheaper

    A Human-Curated Annotation of the Candida albicans Genome

    Get PDF
    Recent sequencing and assembly of the genome for the fungal pathogen Candida albicans used simple automated procedures for the identification of putative genes. We have reviewed the entire assembly, both by hand and with additional bioinformatic resources, to accurately map and describe 6,354 genes and to identify 246 genes whose original database entries contained sequencing errors (or possibly mutations) that affect their reading frame. Comparison with other fungal genomes permitted the identification of numerous fungus-specific genes that might be targeted for antifungal therapy. We also observed that, compared to other fungi, the protein-coding sequences in the C. albicans genome are especially rich in short sequence repeats. Finally, our improved annotation permitted a detailed analysis of several multigene families, and comparative genomic studies showed that C. albicans has a far greater catabolic range, encoding respiratory Complex 1, several novel oxidoreductases and ketone body degrading enzymes, malonyl-CoA and enoyl-CoA carriers, several novel amino acid degrading enzymes, a variety of secreted catabolic lipases and proteases, and numerous transporters to assimilate the resulting nutrients. The results of these efforts will ensure that the Candida research community has uniform and comprehensive genomic information for medical research as well as for future diagnostic and therapeutic applications

    A multi-proxy approach to exploring Homo sapiensā€™ arrival, environments and adaptations in Southeast Asia

    Get PDF
    The capability of Pleistocene hominins to successfully adapt to different types of tropical forested environments has long been debated. In order to investigate environmental changes in Southeast Asia during a critical period for the turnover of hominin species, we analysed palaeoenvironmental proxies from five late Middle to Late Pleistocene faunas. Human teeth discoveries have been reported at Duoi Uā€™Oi, Vietnam (70ā€“60Ā ka) and Nam Lot, Laos (86ā€“72Ā ka). However, the use of palaeoproteomics allowed us to discard the latter, and, to date, no human remains older thanā€‰~ā€‰70Ā ka are documented in the area. Our findings indicate that tropical rainforests were highly sensitive to climatic changes over that period, with significant fluctuations of the canopy forests. Locally, large-bodied faunas were resilient to these fluctuations until the cooling period of the Marine Isotope Stage 4 (MIS 4; 74ā€“59Ā ka) that transformed the overall biotope. Then, under strong selective pressures, populations with new phenotypic characteristics emerged while some other species disappeared. We argue that this climate-driven shift offered new foraging opportunities for hominins in a novel rainforest environment and was most likely a key factor in the settlement and dispersal of our species during MIS 4 in SE Asia

    Halogenase Genes in Nonribosomal Peptide Synthetase Gene Clusters of Microcystis (Cyanobacteria): Sporadic Distribution and Evolution

    Get PDF
    Cyanobacteria of the genus Microcystis are known to produce secondary metabolites of large structural diversity by nonribosomal peptide synthetase (NRPS) pathways. For a number of such compounds, halogenated congeners have been reported along with nonhalogenated ones. In the present study, chlorinated cyanopeptolin- and/or aeruginosin-type peptides were detected by mass spectrometry in 17 out of 28 axenic strains of Microcystis. In these strains, a halogenase gene was identified between 2 genes coding for NRPS modules in respective gene clusters, whereas it was consistently absent when the strains produced only nonchlorinated corresponding congeners. Nucleotide sequences were obtained for 12 complete halogenase genes and 14 intermodule regions of gene clusters lacking a halogenase gene or containing only fragments of it. When a halogenase gene was found absent, a specific, identical excision pattern was observed for both synthetase gene clusters in most strains. A phylogenetic analysis including other bacterial halogenases showed that the NRPS-related halogenases of Microcystis form a monophyletic group divided into 2 subgroups, corresponding to either the cyanopeptolin or the aeruginosin peptide synthetases. The distribution of these peptide synthetase gene clusters, among the tested Microcystis strains, was found in relative agreement with their phylogeny reconstructed from 16Sā€“23S rDNA intergenic spacer sequences, whereas the distribution of the associated halogenase genes appears to be sporadic. The presented data suggest that in cyanobacteria these prevalent halogenase genes originated from an ancient horizontal gene transfer followed by duplication in the cyanobacterial lineage. We propose an evolutionary scenario implying repeated gene losses to explain the distribution of halogenase genes in 2 NRPS gene clusters that subsequently defines the seemingly erratic production of halogenated and nonhalogenated aeruginosins and cyanopeptolins among Microcystis strains
    • ā€¦
    corecore