47 research outputs found

    Identification of Prophages in Bacterial Genomes by Dinucleotide Relative Abundance Difference

    Get PDF
    BACKGROUND: Prophages are integrated viral forms in bacterial genomes that have been found to contribute to interstrain genetic variability. Many virulence-associated genes are reported to be prophage encoded. Present computational methods to detect prophages are either by identifying possible essential proteins such as integrases or by an extension of this technique, which involves identifying a region containing proteins similar to those occurring in prophages. These methods suffer due to the problem of low sequence similarity at the protein level, which suggests that a nucleotide based approach could be useful. METHODOLOGY: Earlier dinucleotide relative abundance (DRA) have been used to identify regions, which deviate from the neighborhood areas, in genomes. We have used the difference in the dinucleotide relative abundance (DRAD) between the bacterial and prophage DNA to aid location of DNA stretches that could be of prophage origin in bacterial genomes. Prophage sequences which deviate from bacterial regions in their dinucleotide frequencies are detected by scanning bacterial genome sequences. The method was validated using a subset of genomes with prophage data from literature reports. A web interface for prophage scan based on this method is available at http://bicmku.in:8082/prophagedb/dra.html. Two hundred bacterial genomes which do not have annotated prophages have been scanned for prophage regions using this method. CONCLUSIONS: The relative dinucleotide distribution difference helps detect prophage regions in genome sequences. The usefulness of this method is seen in the identification of 461 highly probable loci pertaining to prophages which have not been annotated so earlier. This work emphasizes the need to extend the efforts to detect and annotate prophage elements in genome sequences

    Repetitive Elements May Comprise Over Two-Thirds of the Human Genome

    Get PDF
    Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo “clouds”). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%–69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed “element-specific” P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed

    Functionally Stable and Phylogenetically Diverse Microbial Enrichments from Microbial Fuel Cells during Wastewater Treatment

    Get PDF
    Microbial fuel cells (MFCs) are devices that exploit microorganisms as biocatalysts to recover energy from organic matter in the form of electricity. One of the goals of MFC research is to develop the technology for cost-effective wastewater treatment. However, before practical MFC applications are implemented it is important to gain fundamental knowledge about long-term system performance, reproducibility, and the formation and maintenance of functionally-stable microbial communities. Here we report findings from a MFC operated for over 300 days using only primary clarifier effluent collected from a municipal wastewater treatment plant as the microbial resource and substrate. The system was operated in a repeat-batch mode, where the reactor solution was replaced once every two weeks with new primary effluent that consisted of different microbial and chemical compositions with every batch exchange. The turbidity of the primary clarifier effluent solution notably decreased, and 97% of biological oxygen demand (BOD) was removed after an 8–13 day residence time for each batch cycle. On average, the limiting current density was 1000 mA/m2, the maximum power density was 13 mW/m2, and coulombic efficiency was 25%. Interestingly, the electrochemical performance and BOD removal rates were very reproducible throughout MFC operation regardless of the sample variability associated with each wastewater exchange. While MFC performance was very reproducible, the phylogenetic analyses of anode-associated electricity-generating biofilms showed that the microbial populations temporally fluctuated and maintained a high biodiversity throughout the year-long experiment. These results suggest that MFC communities are both self-selecting and self-optimizing, thereby able to develop and maintain functional stability regardless of fluctuations in carbon source(s) and regular introduction of microbial competitors. These results contribute significantly toward the practical application of MFC systems for long-term wastewater treatment as well as demonstrating MFC technology as a useful device to enrich for functionally stable microbial populations

    Achievements and new knowledge unraveled by metagenomic approaches

    Get PDF
    Metagenomics has paved the way for cultivation-independent assessment and exploitation of microbial communities present in complex ecosystems. In recent years, significant progress has been made in this research area. A major breakthrough was the improvement and development of high-throughput next-generation sequencing technologies. The application of these technologies resulted in the generation of large datasets derived from various environments such as soil and ocean water. The analyses of these datasets opened a window into the enormous phylogenetic and metabolic diversity of microbial communities living in a variety of ecosystems. In this way, structure, functions, and interactions of microbial communities were elucidated. Metagenomics has proven to be a powerful tool for the recovery of novel biomolecules. In most cases, functional metagenomics comprising construction and screening of complex metagenomic DNA libraries has been applied to isolate new enzymes and drugs of industrial importance. For this purpose, several novel and improved screening strategies that allow efficient screening of large collections of clones harboring metagenomes have been introduced

    Chlamydia trachomatis Infection and Anti-Hsp60 Immunity: The Two Sides of the Coin

    Get PDF
    Chlamydia trachomatis (CT) infection is one of the most common causes of reproductive tract diseases and infertility. CT-Hsp60 is synthesized during infection and is released in the bloodstream. As a consequence, immune cells will produce anti-CT-Hsp60 antibodies. Hsp60, a ubiquitous and evolutionarily conserved chaperonin, is normally sequestered inside the cell, particularly into mitochondria. However, upon cell stress, as well as during carcinogenesis, the chaperonin becomes exposed on the cell surface (sf-Hsp60) and/or is secreted from cells into the extracellular space and circulation. Reports in the literature on circulating Hsp and anti-Hsp antibodies are in many cases short on details about Hsp60 concentrations, and about the specificity spectra of the antibodies, their titers, and their true, direct, pathogenetic effects. Thus, more studies are still needed to obtain a definitive picture on these matters. Nevertheless, the information already available indicates that the concurrence of persistent CT infection and appearance of sf-Hsp60 can promote an autoimmune aggression towards stressed cells and the development of diseases such as autoimmune arthritis, multiple sclerosis, atherosclerosis, vasculitis, diabetes, and thyroiditis, among others. At the same time, immunocomplexes composed of anti-CT-Hsp60 antibodies and circulating Hsp60 (both CT and human) may form deposits in several anatomical locations, e.g., at the glomerular basal membrane. The opposite side of the coin is that pre-tumor and tumor cells with sf-Hsp60 can be destroyed with participation of the anti-Hsp60 antibody, thus stopping cancer progression before it is even noticed by the patient or physician

    The Genome Sequence of the Leaf-Cutter Ant Atta cephalotes Reveals Insights into Its Obligate Symbiotic Lifestyle

    Get PDF
    Leaf-cutter ants are one of the most important herbivorous insects in the Neotropics, harvesting vast quantities of fresh leaf material. The ants use leaves to cultivate a fungus that serves as the colony's primary food source. This obligate ant-fungus mutualism is one of the few occurrences of farming by non-humans and likely facilitated the formation of their massive colonies. Mature leaf-cutter ant colonies contain millions of workers ranging in size from small garden tenders to large soldiers, resulting in one of the most complex polymorphic caste systems within ants. To begin uncovering the genomic underpinnings of this system, we sequenced the genome of Atta cephalotes using 454 pyrosequencing. One prediction from this ant's lifestyle is that it has undergone genetic modifications that reflect its obligate dependence on the fungus for nutrients. Analysis of this genome sequence is consistent with this hypothesis, as we find evidence for reductions in genes related to nutrient acquisition. These include extensive reductions in serine proteases (which are likely unnecessary because proteolysis is not a primary mechanism used to process nutrients obtained from the fungus), a loss of genes involved in arginine biosynthesis (suggesting that this amino acid is obtained from the fungus), and the absence of a hexamerin (which sequesters amino acids during larval development in other insects). Following recent reports of genome sequences from other insects that engage in symbioses with beneficial microbes, the A. cephalotes genome provides new insights into the symbiotic lifestyle of this ant and advances our understanding of host–microbe symbioses

    New Insights into the Phylogeny and Molecular Classification of Nicotinamide Mononucleotide Deamidases

    Get PDF
    Nicotinamide mononucleotide (NMN) deamidase is one of the key enzymes of the bacterial pyridine nucleotide cycle (PNC). It catalyzes the conversion of NMN to nicotinic acid mononucleotide, which is later converted to NAD+ by entering the Preiss-Handler pathway. However, very few biochemical data are available regarding this enzyme. This paper represents the first complete molecular characterization of a novel NMN deamidase from the halotolerant and alkaliphilic bacterium Oceanobacillus iheyensis (OiPncC). The enzyme was active over a broad pH range, with an optimum at pH 7.4, whilst maintaining 90 % activity at pH 10.0. Surprisingly, the enzyme was quite stable at such basic pH, maintaining 61 % activity after 21 days. As regard temperature, it had an optimum at 65 °C but its stability was better below 50 °C. OiPncC was a Michaelian enzyme towards its only substrate NMN, with a Km value of 0.18 mM and a kcat/Km of 2.1 mM-1 s-1. To further our understanding of these enzymes, a complete phylogenetic and structural analysis was carried out taking into account the two Pfam domains usually associated with them (MocF and CinA). This analysis sheds light on the evolution of NMN deamidases, and enables the classification of NMN deamidases into 12 different subgroups, pointing to a novel domain architecture never before described. Using a Logo representation, conserved blocks were determined, providing new insights on the crucial residues involved in the binding and catalysis of both CinA and MocF domains. The analysis of these conserved blocks within new protein sequences could permit the more efficient data curation of incoming NMN deamidases
    corecore