134 research outputs found

    Resolution among major placental mammal interordinal relationships with genome data imply that speciation influenced their earliest radiations

    Get PDF
    Background: A number of the deeper divergences in the placental mammal tree are still inconclusively resolved despite extensive phylogenomic analyses. A recent analysis of 200 kbp of protein coding sequences yielded only limited support for the relationships among Laurasiatheria (cow, dog, bat and shrew), probably because the divergences occurred only within a few million years from each other. It is generally expected that increasing the amount of data and improving the taxon sampling enhance the resolution of narrow divergences. Therefore these and other difficult splits were examined by phylogenomic analysis of the hitherto largest sequence alignment. The increasingly complete genome data of placental mammals also allowed developing a novel and stringent data search method. Results: The rigorous data handling, recursive BLAST, successfully removed the sequences from gene families, including those from well-known families hemoglobin, olfactory, myosin and HOX genes, thus avoiding alignment of possibly paralogous sequences. The current phylogenomic analysis of 3,012 genes (2,844,615 nucleotides) from a total of 22 species yielded statistically significant support for most relationships. While some major clades were confirmed using genomic sequence data, the placement of the treeshrew, bat and the relationship between Boreoeutheria, Xenarthra and Afrotheria remained problematic to resolve despite the size of the alignment. Phylogenomic analysis of divergence times dated the basal placental mammal splits at 95–100 million years ago. Many of the following divergences occurred only a few (2–4) million years later. Relationships with narrow divergence time intervals received unexpectedly limited support even from the phylogenomic analyses. Conclusion: The narrow temporal window within which some placental divergences took place suggests that inconsistencies and limited resolution of the mammalian tree may have their natural explanation in speciation processes such as lineage sorting, introgression from species hybridization or hybrid speciation. These processes obscure phylogenetic analysis, making some parts of the tree difficult to resolve even with genome data

    A genomic approach to examine the complex evolution of laurasiatherian mammals

    Get PDF
    Recent phylogenomic studies have failed to conclusively resolve certain branches of the placental mammalian tree, despite the evolutionary analysis of genomic data from 32 species. Previous analyses of single genes and retroposon insertion data yielded support for different phylogenetic scenarios for the most basal divergences. The results indicated that some mammalian divergences were best interpreted not as a single bifurcating tree, but as an evolutionary network. In these studies the relationships among some orders of the super-clade Laurasiatheria were poorly supported, albeit not studied in detail. Therefore, 4775 protein-coding genes (6,196,263 nucleotides) were collected and aligned in order to analyze the evolution of this clade. Additionally, over 200,000 introns were screened in silico, resulting in 32 phylogenetically informative long interspersed nuclear elements (LINE) insertion events. The present study shows that the genome evolution of Laurasiatheria may best be understood as an evolutionary network. Thus, contrary to the common expectation to resolve major evolutionary events as a bifurcating tree, genome analyses unveil complex speciation processes even in deep mammalian divergences. We exemplify this on a subset of 1159 suitable genes that have individual histories, most likely due to incomplete lineage sorting or introgression, processes that can make the genealogy of mammalian genomes complex. These unexpected results have major implications for the understanding of evolution in general, because the evolution of even some higher level taxa such as mammalian orders may sometimes not be interpreted as a simple bifurcating pattern

    Mammalian Evolution May not Be Strictly Bifurcating

    Get PDF
    The massive amount of genomic sequence data that is now available for analyzing evolutionary relationships among 31 placental mammals reduces the stochastic error in phylogenetic analyses to virtually zero. One would expect that this would make it possible to finally resolve controversial branches in the placental mammalian tree. We analyzed a 2,863,797 nucleotide-long alignment (3,364 genes) from 31 placental mammals for reconstructing their evolution. Most placental mammalian relationships were resolved, and a consensus of their evolution is emerging. However, certain branches remain difficult or virtually impossible to resolve. These branches are characterized by short divergence times in the order of 1–4 million years. Computer simulations based on parameters from the real data show that as little as about 12,500 amino acid sites could be sufficient to confidently resolve short branches as old as about 90 million years ago (Ma). Thus, the amount of sequence data should no longer be a limiting factor in resolving the relationships among placental mammals. The timing of the early radiation of placental mammals coincides with a period of climate warming some 100–80 Ma and with continental fragmentation. These global processes may have triggered the rapid diversification of placental mammals. However, the rapid radiations of certain mammalian groups complicate phylogenetic analyses, possibly due to incomplete lineage sorting and introgression. These speciation-related processes led to a mosaic genome and conflicting phylogenetic signals. Split network methods are ideal for visualizing these problematic branches and can therefore depict data conflict and possibly the true evolutionary history better than strictly bifurcating trees. Given the timing of tectonics, of placental mammalian divergences, and the fossil record, a Laurasian rather than Gondwanan origin of placental mammals seems the most parsimonious explanation

    Resolution among major placental mammal interordinal relationships with genome data imply that speciation influenced their earliest radiations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>A number of the deeper divergences in the placental mammal tree are still inconclusively resolved despite extensive phylogenomic analyses. A recent analysis of 200 kbp of protein coding sequences yielded only limited support for the relationships among Laurasiatheria (cow, dog, bat and shrew), probably because the divergences occurred only within a few million years from each other. It is generally expected that increasing the amount of data and improving the taxon sampling enhance the resolution of narrow divergences. Therefore these and other difficult splits were examined by phylogenomic analysis of the hitherto largest sequence alignment. The increasingly complete genome data of placental mammals also allowed developing a novel and stringent data search method.</p> <p>Results</p> <p>The rigorous data handling, recursive BLAST, successfully removed the sequences from gene families, including those from well-known families hemoglobin, olfactory, myosin and HOX genes, thus avoiding alignment of possibly paralogous sequences. The current phylogenomic analysis of 3,012 genes (2,844,615 nucleotides) from a total of 22 species yielded statistically significant support for most relationships. While some major clades were confirmed using genomic sequence data, the placement of the treeshrew, bat and the relationship between Boreoeutheria, Xenarthra and Afrotheria remained problematic to resolve despite the size of the alignment. Phylogenomic analysis of divergence times dated the basal placental mammal splits at 95–100 million years ago. Many of the following divergences occurred only a few (2–4) million years later. Relationships with narrow divergence time intervals received unexpectedly limited support even from the phylogenomic analyses.</p> <p>Conclusion</p> <p>The narrow temporal window within which some placental divergences took place suggests that inconsistencies and limited resolution of the mammalian tree may have their natural explanation in speciation processes such as lineage sorting, introgression from species hybridization or hybrid speciation. These processes obscure phylogenetic analysis, making some parts of the tree difficult to resolve even with genome data.</p

    Expansion of CORE-SINEs in the genome of the Tasmanian devil

    Get PDF
    Background: The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species’ survival. To better understand the Tasmanian devil genome, we screened it for transposable elements and investigated the dynamics of short interspersed element (SINE) retroposons. Results: The temporal history of Tasmanian devil SINEs, elucidated using a transposition in transposition analysis, indicates that WSINE1, a CORE-SINE present in around 200,000 copies, is the most recently active element. Moreover, we discovered a new subtype of WSINE1 (WSINE1b) that comprises at least 90% of all Tasmanian devil WSINE1s. The frequencies of WSINE1 subtypes differ in the genomes of two of the other Australian marsupial orders. A co-segregation analysis indicated that at least 66 subfamilies of WSINE1 evolved during the evolution of Dasyuromorphia. Using a substitution rate derived from WSINE1 insertions, the ages of the subfamilies were estimated and correlated with a newly established phylogeny of Dasyuromorphia. Phylogenetic analyses and divergence time estimates of mitochondrial genome data indicate a rapid radiation of the Tasmanian devil and the closest relative the quolls (Dasyurus) around 14 million years ago. Conclusions: The radiation and abundance of CORE-SINEs in marsupial genomes indicates that they may be a major player in the evolution of marsupials. It is evident that the early phases of evolution of the carnivorous marsupial order Dasyuromorphia was characterized by a burst of SINE activity. A correlation between a speciation event and a major burst of retroposon activity is for the first time shown in a marsupial genome

    Expressed Sequence Tags as a Tool for Phylogenetic Analysis of Placental Mammal Evolution

    Get PDF
    BACKGROUND: We investigate the usefulness of expressed sequence tags, ESTs, for establishing divergences within the tree of placental mammals. This is done on the example of the established relationships among primates (human), lagomorphs (rabbit), rodents (rat and mouse), artiodactyls (cow), carnivorans (dog) and proboscideans (elephant). METHODOLOGY/PRINCIPAL FINDINGS: We have produced 2000 ESTs (1.2 mega bases) from a marsupial mouse and characterized the data for their use in phylogenetic analysis. The sequences were used to identify putative orthologous sequences from whole genome projects. Although most ESTs stem from single sequence reads, the frequency of potential sequencing errors was found to be lower than allelic variation. Most of the sequences represented slowly evolving housekeeping-type genes, with an average amino acid distance of 6.6% between human and mouse. Positive Darwinian selection was identified at only a few single sites. Phylogenetic analyses of the EST data yielded trees that were consistent with those established from whole genome projects. CONCLUSIONS: The general quality of EST sequences and the general absence of positive selection in these sequences make ESTs an attractive tool for phylogenetic analysis. The EST approach allows, at reasonable costs, a fast extension of data sampling from species outside the genome projects

    Transcriptomics resources of human tissues and organs

    Get PDF
    Quantifying the differential expression of genes in various human organs, tissues, and cell types is vital to understand human physiology and disease. Recently, several large-scale transcriptomics studies have analyzed the expression of protein-coding genes across tissues. These datasets provide a framework for defining the molecular constituents of the human body as well as for generating comprehensive lists of proteins expressed across tissues or in a tissue-restricted manner. Here, we review publicly available human transcriptome resources and discuss body-wide data from independent genome-wide transcriptome analyses of different tissues. Gene expression measurements from these independent datasets, generated using samples from fresh frozen surgical specimens and postmortem tissues, are consistent. Overall, the different genome-wide analyses support a distribution in which many proteins are found in all tissues and relatively few in a tissue-restricted manner. Moreover, we discuss the applications of publicly available omics data for building genome-scale metabolic models, used for analyzing cell and tissue functions both in physiological and in disease contexts

    Identification of two abundant Aerococcus urinae cell wall-anchored proteins

    Get PDF
    Aerococcus urinae is an emerging pathogen that causes urinary tract infections, bacteremia and infective endocarditis. The mechanisms through which A. urinae cause infection are largely unknown. The aims of this study were to describe the surface proteome of A. urinae and to analyse A. urinae genomes in search for genes encoding surface proteins. Two proteins, denoted Aerococcal surface protein (Asp) 1 and 2, were through the use of mass spectrometry based proteomics found to quantitatively dominate the aerococcal surface. The presence of these proteins on the surface was also shown using ELISA with serum from rabbits immunized with the recombinant Asp. These proteins had a signal sequence in the amino-terminal end and a cell wall-sorting region in the carboxy-terminal end, which contained an LPATG-motif, a hydrophobic domain and a positively charged tail. Twenty-three additional A. urinae genomes were sequenced using Illumina HiSeq technology. Six different variants of asp genes were found (denoted asp1-6). All isolates had either one or two of these asp-genes located in a conserved locus, designated Locus encoding Aerococcal Surface Proteins (LASP). The 25 genomes had in median 13 genes encoding LPXTG-proteins (range 6-24). For other Gram-positive bacteria, cell wall-anchored surface proteins with an LPXTG-motif play a key role for virulence. Thus, it will be of great interest to explore the function of the Asp proteins of A. urinae to establish a better understanding of the molecular mechanisms by which A. urinae cause disease

    Coalescent-based genome analyses resolve the early branches of the euarchontoglires

    Get PDF
    Despite numerous large-scale phylogenomic studies, certain parts of the mammalian tree are extraordinarily difficult to resolve. We used the coding regions from 19 completely sequenced genomes to study the relationships within the super-clade Euarchontoglires (Primates, Rodentia, Lagomorpha, Dermoptera and Scandentia) because the placement of Scandentia within this clade is controversial. The difficulty in resolving this issue is due to the short time spans between the early divergences of Euarchontoglires, which may cause incongruent gene trees. The conflict in the data can be depicted by network analyses and the contentious relationships are best reconstructed by coalescent-based analyses. This method is expected to be superior to analyses of concatenated data in reconstructing a species tree from numerous gene trees. The total concatenated dataset used to study the relationships in this group comprises 5,875 protein-coding genes (9,799,170 nucleotides) from all orders except Dermoptera (flying lemurs). Reconstruction of the species tree from 1,006 gene trees using coalescent models placed Scandentia as sister group to the primates, which is in agreement with maximum likelihood analyses of concatenated nucleotide sequence data. Additionally, both analytical approaches favoured the Tarsier to be sister taxon to Anthropoidea, thus belonging to the Haplorrhine clade. When divergence times are short such as in radiations over periods of a few million years, even genome scale analyses struggle to resolve phylogenetic relationships. On these short branches processes such as incomplete lineage sorting and possibly hybridization occur and make it preferable to base phylogenomic analyses on coalescent methods
    corecore