Search CORE

3 research outputs found

Advances and Applications in the Quest for Orthologs

Author: Dessimoz C
Ebersberger I
Forslund SK
Gabaldón T
Glover N
Huerta-Cepas J
Martin M-J
Muffato M
Patricio M
Pereira C
Quest for Orthologs Consortium
Sonnhammer E
Sousa da Silva A
Thomas PD
Wang Y
Publication venue
Publication date: 01/10/2019
Field of study

Gene families evolve by the processes of speciation (creating orthologs), gene duplication (paralogs) and horizontal gene transfer (xenologs), in addition to sequence divergence and gene loss. Orthologs in particular play an essential role in comparative genomics and phylogenomic analyses. With the continued sequencing of organisms across the tree of life, the data are available to reconstruct the unique evolutionary histories of tens of thousands of gene families. Accurate reconstruction of these histories, however, is a challenging computational problem, and the focus of the Quest for Orthologs Consortium. We review the recent advances and outstanding challenges in this field, as revealed at a symposium and meeting held at the University of Southern California in 2017. Key advances have been made both at the level of orthology algorithm development and with respect to coordination across the community of algorithm developers and orthology end-users. Applications spanned a broad range, including gene function prediction, phylostratigraphy, genome evolution, and phylogenomics. The meetings highlighted the increasing use of meta-analyses integrating results from multiple different algorithms, and discussed ongoing challenges in orthology inference as well as the next steps toward improvement and integration of orthology resources

UCL Discovery

A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL [version 1; peer review: 1 approved, 2 approved with reservations]

Author: Dessimoz C
Mendes de Farias T
Sima AC
Stockinger K
Zahn-Zabal M
Publication venue: 'F1000 Research Ltd'
Publication date: 29/10/2019
Field of study

The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the SPARQL query language. In this article, we provide a hands-on introduction to querying evolutionary data across multiple sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different sources can be compared, through the use of federated SPARQL queries

UCL Discovery

MBGD update 2018: microbial genome database based on hierarchical orthology relations covering closely related and distantly related comparisons

Author: Altenhoff
Altschul
Chiba
Chiba
Eddy
Edgar
Fernandez-Breis
Haft
Hirokazu Chiba
Hiroyo Nishide
Huerta-Cepas
Ikuo Uchiyama
Jothi
Kriventseva
Masaki Kato
Minarro-Gimenez
Motohiro Mihara
Nakaya
Price
Schreiber
Sievers
Smith
Steinegger
Tettelin
Uchiyama
Uchiyama
Uchiyama
Uchiyama
Uchiyama
Uchiyama
van der Heijden
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref