46 research outputs found

    Streamlining and Large Ancestral Genomes in Archaea Inferred with a Phylogenetic Birth-and-Death Model

    Get PDF
    Homologous genes originate from a common ancestor through vertical inheritance, duplication, or horizontal gene transfer. Entire homolog families spawned by a single ancestral gene can be identified across multiple genomes based on protein sequence similarity. The sequences, however, do not always reveal conclusively the history of large families. To study the evolution of complete gene repertoires, we propose here a mathematical framework that does not rely on resolved gene family histories. We show that so-called phylogenetic profiles, formed by family sizes across multiple genomes, are sufficient to infer principal evolutionary trends. The main novelty in our approach is an efficient algorithm to compute the likelihood of a phylogenetic profile in a model of birth-and-death processes acting on a phylogeny

    Archaic chaos: intrinsically disordered proteins in Archaea

    Get PDF
    Background: Many proteins or their regions known as intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) lack unique 3D structure in their native states under physiological conditions yet fulfill key biological functions. Earlier bioinformatics studies showed that IDPs and IDRs are highly abundant in different proteomes and carry out mostly regulatory functions related to molecular recognition and signal transduction. Archaea belong to an intriguing domain of life whose members, being microbes, are characterized by a unique mosaic-like combination of bacterial and eukaryotic properties and include inhabitants of some of the most extreme environments on the planet. With the expansion of the archaea genome data (more than fifty archaea species from five different phyla are known now), and with recent improvements in the accuracy of intrinsic disorder prediction, it is time to re-examine the abundance of IDPs and IDRs in the archaea domain.Results: The abundance of IDPs and IDRs in 53 archaea species is analyzed. The amino acid composition profiles of these species are generally quite different from each other. The disordered content is highly species-dependent. Thermoproteales proteomes have 14% of disordered residues, while in Halobacteria, this value increases to 34%. In proteomes of these two phyla, proteins containing long disordered regions account for 12% and 46%, whereas 4% and 26% their proteins are wholly disordered. These three measures of disorder content are linearly correlated with each other at the genome level. There is a weak correlation between the environmental factors (such as salinity, pH and temperature of the habitats) and the abundance of intrinsic disorder in Archaea, with various environmental factors possessing different disorder-promoting strengths. Harsh environmental conditions, especially those combining several hostile factors, clearly favor increased disorder content. Intrinsic disorder is highly abundant in functional Pfam domains of the archaea origin. The analysis based on the disordered content and phylogenetic tree indicated diverse evolution of intrinsic disorder among various classes and species of Archaea.Conclusions: Archaea proteins are rich in intrinsic disorder. Some of these IDPs and IDRs likely evolve to help archaea to accommodate to their hostile habitats. Other archaean IDPs and IDRs possess crucial biological functions similar to those of the bacterial and eukaryotic IDPs/IDRs

    Molecular Diversity of New Thermococcales Isolates from a Single Area of Hydrothermal Deep-Sea Vents as Revealed by Randomly Amplified Polymorphic DNA Fingerprinting and 16S rRNA Gene Sequence Analysis

    No full text
    Members of the Thermococcales are anaerobic Archaea belonging to the kingdom Euryarchaea that are studied in many laboratories as model organisms for hyperthermophiles. We describe here a molecular analysis of 86 new Thermococcales isolates collected from six different chimneys of a single hydrothermal field located in the 13°N 104°W segment of the East Pacific ridge at a depth of 2,330 m. These isolates were sorted by randomly amplified polymorphic DNA (RAPD) fingerprinting into nine groups, and nine unique RAPD profiles were obtained. One RAPD group corresponds to new isolates of Thermococcus hydrothermalis, whereas all other groups and isolates with unique profiles are different from the 22 reference strains included in this study. Analysis of 16S rRNA gene sequences of representatives of each RAPD group and unique profiles showed that one group corresponds to Pyrococcus strains, whereas all the other isolates are Thermococcus strains. We estimated that our collection may contain at least 11 new species. These putative species, isolated from a single area of hydrothermal deep-sea vents, are dispersed in the 16S rRNA tree among the reference strains previously isolated from diverse hot environments (terrestrial, shallow water, hydrothermal vents) located around the world, suggesting that there is a high degree of dispersal of Thermococcales. About one-half of our isolates contain extrachromosomal elements that could be used to search for novel replication proteins and to develop genetic tools for hyperthermophiles
    corecore