Search CORE

57 research outputs found

PhylomeDB: a database for genome-wide collections of gene phylogenies

Author: A. Bueno
Birney
Comas
Duret
Edgar
Gabaldon
Gascuel
Guindon
Huerta-Cepas
J. Dopazo
J. Huerta-Cepas
Leebens-Mack
Li
Ronquist
Sicheritz-Ponten
Smith
T. Gabaldon
Publication venue: Oxford University Press
Publication date
Field of study

The complete collection of evolutionary histories of all genes in a genome, also known as phylome, constitutes a valuable source of information. The reconstruction of phylomes has been previously prevented by large demands of time and computer power, but is now feasible thanks to recent developments in computers and algorithms. To provide a publicly available repository of complete phylomes that allows researchers to access and store large-scale phylogenomic analyses, we have developed PhylomeDB. PhylomeDB is a database of complete phylomes derived for different genomes within a specific taxonomic range. All phylomes in the database are built using a high-quality phylogenetic pipeline that includes evolutionary model testing and alignment trimming phases. For each genome, PhylomeDB provides the alignments, phylogentic trees and tree-based orthology predictions for every single encoded protein. The current version of PhylomeDB includes the phylomes of Human, the yeast Saccharomyces cerevisiae and the bacterium Escherichia coli, comprising a total of 32 289 seed sequences with their corresponding alignments and 172 324 phylogenetic trees. PhylomeDB can be publicly accessed at http://phylomedb.bioinfo.cipf.e

Crossref

PubMed Central

A draft genome sequence of the elusive giant squid, Architeuthis dux

Author: Albertin C. B.
Alexander G. C.
Antunes A.
Baril T.
Barrio-Hernandez I.
Blagoev B.
Brejova B.
Campos A.
Castro L. F. C.
Chu C.
Couto A.
Da Fonseca R. R.
Fedrigo O.
Frazao B.
Gardner P.
Gilbert M. T. P.
Hayward A.
Hoving H. -J.
Jarvis E.
Li Q.
Ma B.
Machado A. M.
Musacchia F.
Nielsen R.
Osorio H.
Patricio M.
Penaloza F.
Petersen B.
Pisani D.
Rahman M. Z.
Rasmussen S.
Ribeiro A. M.
Rocha S.
Sanges R.
Sicheritz-Ponten T.
Silva F.
Simakov O.
Strugnell J. M.
Tafur-Jimenez R.
Vinar T.
Vinther J.
Winkelmann I.
Wu Y.
Zhang G.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 21/11/2019
Field of study

Background: The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked. Findings: We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome. Conclusions: This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments

OceanRep

Investigo

Woods Hole Open Access Server

ResearchOnline at James Cook University

Copenhagen University Research Information System

eScholarship - University of California

Sissa Digital Library

Open Research Exeter

Repositório Aberto da Universidade do Porto

NORA - Norwegian Open Research Archives

Explore Bristol Research

Dense sampling of bird diversity increases power of comparative genomics

© 2020, The Author(s). Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity1–4. Sparse taxon sampling has previously been proposed to confound phylogenetic inference5, and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families—including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species

Louisiana State University

The population genomic legacy of the second plague pandemic

Author: Alfredsson L.
Balloux F.
Campos P.
Cavalleri G.
Cheung C.
Christophersen A.
de-Dios T.
Denham S.
Ebenesersdóttir S.
Ellegaard M.
Fotakis A.
Gelabert P.
Gilbert E.
Gilbert M.
Gopalakrishnan S.
Guðmundsdóttir V.
Günther T.
Halgunset J.
Hansen T.
Helgason A.
Hovig E.
Iraeta-Orbegozo M.
Juan D.
Kivisild T.
Kockum I.
Laffoon J.
Lalueza-Fox C.
Liu S.
Liu X.
Luisi P.
Lundstrøm I.
Magnúsdóttir D.
Magnússon Ó.
Margaryan A.
Marques-Bonet T.
Martin M.
Moltke I.
Moore K.
Moseng O.
Nielsen R.
Olsson T.
Petersen B.
Rasmussen S.
Sandoval-Velasco M.
Schraiber J.
Schroeder H.
Sicheritz-Ponten T.
Sigurðsson Á.
Skar B.
Snorradóttir S.
Stefánsson K.
Stenøien H.
Turner-Walker G.
van Dorp L.
Vieira F.
Vågene Å.
Wales N.
Werge T.
Willerslev E.
Ávila-Arcos M.
Publication venue: 'Elsevier BV'
Publication date: 01/10/2022
Field of study

SummaryHuman populations have been shaped by catastrophes that may have left long-lasting signatures in their genomes. One notable example is the second plague pandemic that entered Europe in ca. 1,347 CE and repeatedly returned for over 300 years, with typical village and town mortality estimated at 10%–40%.1 It is assumed that this high mortality affected the gene pools of these populations. First, local population crashes reduced genetic diversity. Second, a change in frequency is expected for sequence variants that may have affected survival or susceptibility to the etiologic agent (Yersinia pestis).2 Third, mass mortality might alter the local gene pools through its impact on subsequent migration patterns. We explored these factors using the Norwegian city of Trondheim as a model, by sequencing 54 genomes spanning three time periods: (1) prior to the plague striking Trondheim in 1,349 CE, (2) the 17th–19th century, and (3) the present. We find that the pandemic period shaped the gene pool by reducing long distance immigration, in particular from the British Isles, and inducing a bottleneck that reduced genetic diversity. Although we also observe an excess of large FST values at multiple loci in the genome, these are shaped by reference biases introduced by mapping our relatively low genome coverage degraded DNA to the reference genome. This implies that attempts to detect selection using ancient DNA (aDNA) datasets that vary by read length and depth of sequencing coverage may be particularly challenging until methods have been developed to account for the impact of differential reference bias on test statistics.Results and discussion STAR★Method

Leiden University Scholary Publications

MPG.PuRe

The human phylome

Author: A Meyer
A Rokas
A Rokas
AC Berglund-Sonnhammer
AM Aguinaldo
C Roth
C Seoighe
C Vogel
CG Kurland
CG Kurland
CM Zmasek
CM Zmasek
D Penny
DT Jones
ES Lander
EV Koonin
F Delsuc
F Ronquist
FD Ciccarelli
G Panopoulou
G Ricard
H Akaike
H Dopazo
H Philippe
Hernán Dopazo
I Humphery-Smith
J Adachi
J Nielsen
J Zhang
JA Bailey
JA Eisen
Jaime Huerta-Cepas
JC Chiu
JC Venter
JD McPherson
JE Blair
JO Andersson
Joaquín Dopazo
JW Thomas
K Misawa
L Arvestad
L Bromham
L Duret
L Li
M Hallet
M Kullberg
M Pruess
MA Huynen
MA Huynen
MR Goldsmith
N Alvarez
NW Blackstone
O Gascuel
O Jeffroy
PJ Keeling
PS Dehal
RC Edgar
RL Tatusov
S Guindon
S Henikoff
S Ohno
S Whelan
SA Benner
SE Fisher
SL Salzberg
T Blomme
T Cavalier-Smith
T Dagan
T Gabaldón
T Gabaldón
T Gabaldón
T Gabaldón
T Gabaldón
T Hulsen
T Müller
T Ohta
T Sicheritz-Ponten
TF Smith
TK Gandhi
TM Keane
Toni Gabaldón
TR Buckley
U Bergthorsson
V van Noort
WJ Bruno
WJ Murphy
WM Fitch
Y Suzuki
YI Wolf
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

The human phylome, which includes evolutionary relationships of all human proteins and their homologs among thirty-nine fully sequenced eukaryotes, is reconstructed

CiteSeerX

Crossref

PubMed Central

The chemical interactome space between the human host and the genetically defined gut metabotypes

Author: A Gaulton
A Waldram
AL Kau
C Ainsworth
C Jernberg
C Palmer
CV Knox
F Backhed
F Backhed
Falk Hildebrand
FPJ Martin
G Le Gall
Gianni Panagiotou
H Yabuuchi
HE Jakobsson
Henrik Bjørn Nielsen
Huijun Wang
I Laux
IKS Yap.
Irene Kouskoumvekaki
J Qin
J Schellenberger
Jeroen Raes
JH Bae
JH Ward
JK Nicholson
JL Sonnenburg
JP Overington
JY Yang
K Lage
K Senthilkumar
KJ Maloy
LV Hooper
M Arumugam
M Iborra
M Li
NC Duarte
NM O'Boyle
PJ Turnbaugh
PJ Turnbaugh
R Caspi
R Zhang
RE Ley
RL Chang
S Egert
S Kent
S Matzno
SO Jonsdottir
T Yamada
Thomas Sicheritz-Ponten
Ulrik Plesner Jacobsen
X Chen
X Zheng
Z Ji
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 22/11/2012
Field of study

The bacteria that colonize the gastrointestinal tracts of mammals represent a highly selected microbiome that has a profound influence on human physiology by shaping the host's metabolic and immune system activity. Despite the recent advances on the biological principles that underlie microbial symbiosis in the gut of mammals, mechanistic understanding of the contributions of the gut microbiome and how variations in the metabotypes are linked to the host health are obscure. Here, we mapped the entire metabolic potential of the gut microbiome based solely on metagenomics sequencing data derived from fecal samples of 124 Europeans (healthy, obese and with inflammatory bowel disease). Interestingly, three distinct clusters of individuals with high, medium and low metabolic potential were observed. By illustrating these results in the context of bacterial population, we concluded that the abundance of the Prevotella genera is a key factor indicating a low metabolic potential. These metagenome-based metabolic signatures were used to study the interaction networks between bacteria-specific metabolites and human proteins. We found that thirty-three such metabolites interact with disease-relevant protein complexes several of which are highly expressed in cells and tissues involved in the signaling and shaping of the adaptive immune system and associated with squamous cell carcinoma and bladder cancer. From this set of metabolites, eighteen are present in DrugBank providing evidence that we carry a natural pharmacy in our guts. Furthermore, we established connections between the systemic effects of non-antibiotic drugs and the gut microbiome of relevance to drug side effects and health-care solutions.link_to_subscribed_fulltex

Crossref

PubMed Central

Online Research Database In Technology

HKU Scholars Hub

Bootstrap, Bayesian probability and maximum likelihood mapping: exploring new tools for comparative genome analyses

Author: A Drummond
AH Schinkel
B Rannala
B Snel
C Brochier
C Brochier
CR Woese
CR Woese
CR Woese
DE Graham
DH Huson
DR Walker
DT Jones
E Denamur
EV Koonin
EV Koonin
F Tekaia
J Felsenstein
J Felsenstein
J Lin
J Xiong
JD Thompson
JG Lawrence
JP Gogarten
JP Gogarten
JP Huelsenbeck
K Strimmer
K Strimmer
KG Karol
KS Makarova
L Olendzenski
MG Montague
MR Goddard
N Cermakian
RF Doolittle
RL Tatusov
RL Tatusov
RL Tatusov
RS Gupta
RS Gupta
RS Gupta
S Ribeiro
S Rousvoal
SF Altschul
SF Altschul
ST Fitz-Gibbon
T Sicheritz-Ponten
W Hennig
W Ludwig
WF Doolittle
WJ Murphy
WR Pearson
Y Hasegawa
YI Wolf
YI Wolf
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2002
Field of study

BACKGROUND: Horizontal gene transfer (HGT) played an important role in shaping microbial genomes. In addition to genes under sporadic selection, HGT also affects housekeeping genes and those involved in information processing, even ribosomal RNA encoding genes. Here we describe tools that provide an assessment and graphic illustration of the mosaic nature of microbial genomes. RESULTS: We adapted the Maximum Likelihood (ML) mapping to the analyses of all detected quartets of orthologous genes found in four genomes. We have automated the assembly and analyses of these quartets of orthologs given the selection of four genomes. We compared the ML-mapping approach to more rigorous Bayesian probability and Bootstrap mapping techniques. The latter two approaches appear to be more conservative than the ML-mapping approach, but qualitatively all three approaches give equivalent results. All three tools were tested on mitochondrial genomes, which presumably were inherited as a single linkage group. CONCLUSIONS: In some instances of interphylum relationships we find nearly equal numbers of quartets strongly supporting the three possible topologies. In contrast, our analyses of genome quartets containing the cyanobacterium Synechocystis sp. indicate that a large part of the cyanobacterial genome is related to that of low GC Gram positives. Other groups that had been suggested as sister groups to the cyanobacteria contain many fewer genes that group with the Synechocystis orthologs. Interdomain comparisons of genome quartets containing the archaeon Halobacterium sp. revealed that Halobacterium sp. shares more genes with Bacteria that live in the same environment than with Bacteria that are more closely related based on rRNA phylogeny . Many of these genes encode proteins involved in substrate transport and metabolism and in information storage and processing. The performed analyses demonstrate that relationships among prokaryotes cannot be accurately depicted by or inferred from the tree-like evolution of a core of rarely transferred genes; rather prokaryotic genomes are mosaics in which different parts have different evolutionary histories. Probability mapping is a valuable tool to explore the mosaic nature of genomes

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Isolation of Hox Cluster Genes from Insects Reveals an Accelerated Sequence Evolution Rate

Among gene families it is the Hox genes and among metazoan animals it is the insects (Hexapoda) that have attracted particular attention for studying the evolution of development. Surprisingly though, no Hox genes have been isolated from 26 out of 35 insect orders yet, and the existing sequences derive mainly from only two orders (61% from Hymenoptera and 22% from Diptera). We have designed insect specific primers and isolated 37 new partial homeobox sequences of Hox cluster genes (lab, pb, Hox3, ftz, Antp, Scr, abd-a, Abd-B, Dfd, and Ubx) from six insect orders, which are crucial to insect phylogenetics. These new gene sequences provide a first step towards comparative Hox gene studies in insects. Furthermore, comparative distance analyses of homeobox sequences reveal a correlation between gene divergence rate and species radiation success with insects showing the highest rate of homeobox sequence evolution

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

The population genomic legacy of the second plague pandemic

Author: Alfredsson Lars
Avila-Arcos María C.
Balloux Francois
Campos Paula F.
Cavalleri Gianpiero L.
Cheung Christina
Christophersen Axel
De-Dios Toni
Denham Sean Dexter
Ebenesersdottir S.Sunna
Ellegaard Martin Rene
Fotakis Anna K.
Galabert Pere
Gilbert Edmund
Gilbert Thomas P.
Gopalakrishnan Shyam
Gudmundsdottir Valdis B
Günther Torsten
Halgunset Jostein
Hansen Thomas F.
Helgason Agnar
Hovig Eivind
Iraeta-Orbegozo Miren
Juan David
Kivisild Toomas
Kockum Ingrid
Laffoon Jason E.
Lalueza-Fox Carles
Liu Shanlin
Liu Xiaodong
Luisi Pierre
Lundstrøm Inge K.C.
Magaryan Ashot
Magnusson Olafur T.
Magnúsdóttir Droplaug N.
Marquès-Bonet Tomás
Martin Michael D.
Moltke Ida
Moore Kristjan H.S.
Moseng Ole Georg
Nielsen Rasmus
Olsson Tomas
Petersen Bent
Rasmussen Simon
Sandoval-Velasco Marcela
Schraiber Joshua G.
Schroeder Hannes
Sicheritz - Ponten Thomas
Sigurdsson Asgeir
Skar Birgitte
Snorradóttir Steinunn
Stefánsson Kári
Stenøien Hans K.
Turner-Walker Gordon
van Dorp Lucy
Vieira Filipe G.
Vågene Åshild J.
Wales Nathan
Werge Thomas
Willerslev Eske
Publication venue: 'Elsevier BV'
Publication date: 01/01/2022
Field of study

Human populations have been shaped by catastrophes that may have left long-lasting signatures in their genomes. One notable example is the second plague pandemic that entered Europe in ca. 1,347 CE and repeatedly returned for over 300 years, with typical village and town mortality estimated at 10%–40%.1 It is assumed that this high mortality affected the gene pools of these populations. First, local population crashes reduced genetic diversity. Second, a change in frequency is expected for sequence variants that may have affected survival or susceptibility to the etiologic agent (Yersinia pestis).2 Third, mass mortality might alter the local gene pools through its impact on subsequent migration patterns. We explored these factors using the Norwegian city of Trondheim as a model, by sequencing 54 genomes spanning three time periods: (1) prior to the plague striking Trondheim in 1,349 CE, (2) the 17th–19th century, and (3) the present. We find that the pandemic period shaped the gene pool by reducing long distance immigration, in particular from the British Isles, and inducing a bottleneck that reduced genetic diversity. Although we also observe an excess of large FST values at multiple loci in the genome, these are shaped by reference biases introduced by mapping our relatively low genome coverage degraded DNA to the reference genome. This implies that attempts to detect selection using ancient DNA (aDNA) datasets that vary by read length and depth of sequencing coverage may be particularly challenging until methods have been developed to account for the impact of differential reference bias on test statistics.publishedVersio

UiS Brage