17 research outputs found

    Characterising the genetic architecture of changes in adiposity during adulthood using electronic health records

    Get PDF
    Obesity is a heritable disease, characterised by excess adiposity that is measured by body mass index (BMI). While over 1,000 genetic loci are associated with BMI, less is known about the genetic contribution to adiposity trajectories over adulthood. We derive adiposity-change phenotypes from 24.5 million primary-care health records in over 740,000 individuals in the UK Biobank, Million Veteran Program USA, and Estonian Biobank, to discover and validate the genetic architecture of adiposity trajectories. Using multiple BMI measurements over time increases power to identify genetic factors affecting baseline BMI by 14%. In the largest reported genome-wide study of adiposity-change in adulthood, we identify novel associations with BMI-change at six independent loci, including rs429358 (APOE missense variant). The SNP-based heritability of BMI-change (1.98%) is 9-fold lower than that of BMI. The modest genetic correlation between BMI-change and BMI (45.2%) indicates that genetic studies of longitudinal trajectories could uncover novel biology of quantitative traits in adulthood

    Phylogeny of Echinoderm Hemoglobins

    Get PDF
    Recent genomic information has revealed that neuroglobin and cytoglobin are the two principal lineages of vertebrate hemoglobins, with the latter encompassing the familiar myoglobin and α-globin/ÎČ-globin tetramer hemoglobin, and several minor groups. In contrast, very little is known about hemoglobins in echinoderms, a phylum of exclusively marine organisms closely related to vertebrates, beyond the presence of coelomic hemoglobins in sea cucumbers and brittle stars. We identified about 50 hemoglobins in sea urchin, starfish and sea cucumber genomes and transcriptomes, and used Bayesian inference to carry out a molecular phylogenetic analysis of their relationship to vertebrate sequences, specifically, to assess the hypothesis that the neuroglobin and cytoglobin lineages are also present in echinoderms.The genome of the sea urchin Strongylocentrotus purpuratus encodes several hemoglobins, including a unique chimeric 14-domain globin, 2 androglobin isoforms and a unique single androglobin domain protein. Other strongylocentrotid genomes appear to have similar repertoires of globin genes. We carried out molecular phylogenetic analyses of 52 hemoglobins identified in sea urchin, brittle star and sea cucumber genomes and transcriptomes, using different multiple sequence alignment methods coupled with Bayesian and maximum likelihood approaches. The results demonstrate that there are two major globin lineages in echinoderms, which are related to the vertebrate neuroglobin and cytoglobin lineages. Furthermore, the brittle star and sea cucumber coelomic hemoglobins appear to have evolved independently from the cytoglobin lineage, similar to the evolution of erythroid oxygen binding globins in cyclostomes and vertebrates.The presence of echinoderm globins related to the vertebrate neuroglobin and cytoglobin lineages suggests that the split between neuroglobins and cytoglobins occurred in the deuterostome ancestor shared by echinoderms and vertebrates

    Data from: A comparison of supermatrix and supertree methods for multilocus phylogenetics using organismal datasets

    No full text
    It has been proposed that supertree approaches should be applied to large multilocus sequence datasets to achieve computational tractability. Large datasets such as those derived from phylogenomics studies can be broken into many locus-specific tree searches and the resulting trees can be stitched together via a supertree method. Using simulated data, workers have reported that they can rapidly construct a supertree that is comparable to the results of heuristic tree search on the entire dataset. To test this assertion with organismal data, we compared tree length under the parsimony criterion and computational time for twenty multilocus datasets using supertree (SuperFine and SuperTriplets) and supermatrix (heuristic search in TNT) approaches. Tree length and computational times were compared among methods using the Wilcoxon matched-pairs signed rank test. Supermatrix searches produce significantly shorter trees than either supertree approach (SuperFine or SuperTriplets; p 0.4, not significant). In conclusion, we show by using real rather than simulated data, that there is no basis, either in time tractability or tree length, for use of supertrees over heuristic tree search using a supermatrix for phylogenomics

    RawData

    No full text
    Twenty multilocus sequence datasets used in supertree versus supermatrix comparison

    Data from: Phylotranscriptomic analysis uncovers a wealth of tissue inhibitor of metalloproteinases variants in echinoderms

    No full text
    Tissue inhibitors of metalloproteinases (TIMPs) help regulate the extracellular matrix (ECM) in animals, mostly by inhibiting matrix metalloproteinases (MMPs). They are important activators of mutable collagenous tissue (MCT), which have been extensively studied in echinoderms, and the four TIMP copies in humans have been studied for their role in cancer. To understand the evolution of TIMPs, we combined 405 TIMPs from an echinoderm transcriptome dataset built from 41 specimens representing all five classes of echinoderms with variants from protostomes and chordates. We used multiple sequence alignment with various stringencies of alignment quality to cull highly divergent sequences and then conducted phylogenetic analyses using both nucleotide and amino acid sequences. Phylogenetic hypotheses consistently recovered TIMPs as diversifying in the ancestral deuterostome and these early lineages continuing to diversify in echinoderms. The four vertebrate TIMPs diversified from a single copy in the ancestral chordate, all other copies being lost. Consistent with greater MCT needs owing to body wall liquefaction, evisceration, autotomy and reproduction by fission, holothuroids had significantly more TIMPs and higher read depths per contig. Ten cysteine residues, an HPQ binding site and several other residues were conserved in at least 70% of all TIMPs. The conservation of binding sites and the placement of echinoderm TIMPs involved in MCT modification suggest that ECM regulation remains the primary function of TIMP genes, although within this role there are a large number of specialized copies

    Echinoderm TIMP Sequences

    No full text
    The file contains sequences that belong to the TIMP gene family (tissue inhibitors of metalloproteinases). Different alignments represent the results of successive culling procedures to remove possible non-homologues. The sequences were obtained from an echinoderm transcriptome data set

    The phylogeny of extant starfish (Asteroidea: Echinodermata) including Xyloplax, based on comparative transcriptomics

    No full text
    Multi-locus phylogenetic studies of echinoderms based on Sanger and RNA-seq technologies and the fossil record have provided evidence for the Asterozoa-Echinozoa hypothesis. This hypothesis posits a sister relationship between asterozoan classes (Asteroidea and Ophiuroidea) and a similar relationship between echinozoan classes (Echinoidea and Holothuroidea). Despite this consensus around Asterozoa-Echinozoa, phylogenetic relationships within the class Asteroidea (sea stars or starfish) have been controversial for over a century. Open questions include relationships within asteroids and the status of the enigmatic taxon Xyloplax. Xyloplax is thought by some to represent a newly discovered sixth class of echinoderms - and by others to be an asteroid. To address these questions, we applied a novel workflow to a large RNA-seq dataset that encompassed a broad taxonomic and genomic sample. This study included 15 species sampled from all extant orders and 13 families, plus four ophiuroid species as an outgroup. To expand the taxonomic coverage, the study also incorporated five previously published transcriptomes and one previously published expressed sequence tags (EST) dataset. We developed and applied methods that used a range of alignment parameters with increasing permissiveness in terms of gap characters present within an alignment. This procedure facilitated the selection of phylogenomic data subsets from large amounts of transcriptome data. The results included 19 nested data subsets that ranged from 37 to 4,281loci. Tree searches on all data subsets reconstructed Xyloplax as a velatid asteroid rather than a new class. This result implies that asteroid morphology remains labile well beyond the establishment of the body plan of the group. In the phylogenetic tree with the highest average asteroid nodal support several monophyletic groups were recovered. In this tree, Forcipulatida and Velatida are monophyletic and form a clade that includes Brisingida as sister to Forcipulatida. Xyloplax is consistently recovered as sister to Pteraster. Paxillosida and Spinulosida are each monophyletic, with Notomyotida as sister to the Paxillosida. Valvatida is recovered as paraphyletic. The results from other data subsets are largely consistent with these results. Our results support the hypothesis that the earliest divergence event among extant asteroids separated Velatida and Forcipulatacea from Valvatacea and Spinulosida

    EchinoDB, an application for comparative transcriptomics of deeply-sampled clades of echinoderms

    Get PDF
    BACKGROUND: One of our goals for the echinoderm tree of life project (http://echinotol.org) is to identify orthologs suitable for phylogenetic analysis from next-generation transcriptome data. The current dataset is the largest assembled for echinoderm phylogeny and transcriptomics. We used RNA-Seq to profile adult tissues from 42 echinoderm specimens from 24 orders and 37 families. In order to achieve sampling members of clades that span key evolutionary divergence, many of our exemplars were collected from deep and polar seas. DESCRIPTION: A small fraction of the transcriptome data we produced is being used for phylogenetic reconstruction. Thus to make a larger dataset available to researchers with a wide variety of interests, we made a web-based application, EchinoDB (http://echinodb.uncc.edu). EchinoDB is a repository of orthologous transcripts from echinoderms that is searchable via keywords and sequence similarity. CONCLUSIONS: From transcripts we identified 749,397 clusters of orthologous loci. We have developed the information technology to manage and search the loci their annotations with respect to the Sea Urchin (Strongylocentrotus purpuratus) genome. Several users have already taken advantage of these data for spin-off projects in developmental biology, gene family studies, and neuroscience. We hope others will search EchinoDB to discover datasets relevant to a variety of additional questions in comparative biology

    Additional file 1: of EchinoDB, an application for comparative transcriptomics of deeply-sampled clades of echinoderms

    No full text
    Table of 42 echinoderm specimens used for RNA-seq data that are contained in http://echinodb.uncc.edu . The BJ number is an internal reference code. The voucher number represents where any residual tissues and metadata are stored. RAW indicates the number of raw reads produced by Illumina sequencing. Quality filter and adapter removal indicates the number of reads remaining following fastxtoolkit quality filter of Q score > 20 and removal of adapter regions. Percent reads remaining indicates the fraction of raw reads retained after quality filtering and adapter removal. Percentage Reads removed indicates the fraction of reads removed by quality filtering and adapter removal from the raw reads. Number of Amino Acid Sequences Participating in Orthologous Clusters indicates number of contigs for each species that participated in orthoclusters. Note that contigs may be partially overlapping and redundant. NCBI BioProject Accession number indicates where the contigs have been submitted to NCBI (note the orthoclusters only exist on http://echinodb.uncc.edu ). (XLSX 33 kb
    corecore