Search CORE

119 research outputs found

Functional clues for hypothetical proteins based on genomic context analysis in prokaryotes

Author: Bork P.
Doerks T.
von Mering C.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2004
Field of study

Three integrated genomic context methods were used to annotate uncharacterized proteins in 102 bacterial genomes. Of 7853 orthologous groups with unknown function containing 45,110 proteins, 1738 groups could be linked to functionally associated partners. In many cases, those partners are uncharacterized themselves (hinting at newly identified modules) or have been described in general terms only. However, we were able to assign pathways, cellular processes or physical complexes for 273 groups (encompassing 3624 previously functionally uncharacterized proteins)

CiteSeerX

PubMed Central

MDC Repository

Orthology prediction methods: a quality assessment using curated protein families

Author: Bork P.
Chen W.H.
Doerks T.
Larsson T.A.
Muller J.
Powell S.
Trachana K.
Publication venue: 'Wiley'
Publication date: 01/10/2011
Field of study

The increasing number of sequenced genomes has prompted the development of several automated orthology prediction methods. Tests to evaluate the accuracy of predictions and to explore biases caused by biological and technical factors are therefore required. We used 70 manually curated families to analyze the performance of five public methods in Metazoa. We analyzed the strengths and weaknesses of the methods and quantified the impact of biological and technical challenges. From the latter part of the analysis, genome annotation emerged as the largest single influencer, affecting up to 30% of the performance. Generally, most methods did well in assigning orthologous group but they failed to assign the exact number of genes for half of the groups. The publicly available benchmark set (http://eggnog.embl.de/orthobench/) should facilitate the improvement of current orthology assignment protocols, which is of utmost importance for many fields of biology and should be tackled by a broad scientific community

PubMed Central

MDC Repository

eggNOG: automated construction and annotation of orthologous groups of genes

Author: Bork P.
Doerks T.
Jensen L J.
Julien P.
Kuhn M.
Muller J.
von Mering C.
Publication venue
Publication date: 02/08/2017
Field of study

The identification of orthologous genes forms the basis for most comparative genomics studies. Existing approaches either lack functional annotation of the identified orthologous groups, hampering the interpretation of subsequent results, or are manually annotated and thus lag behind the rapid sequencing of new genomes. Here we present the eggNOG database ('evolutionary genealogy of genes: Non-supervised Orthologous Groups'), which contains orthologous groups constructed from Smith-Waterman alignments through identification of reciprocal best matches and triangular linkage clustering. Applying this procedure to 312 bacterial, 26 archaeal and 35 eukaryotic genomes yielded 43 582 course-grained orthologous groups of which 9724 are extended versions of those from the original COG/KOG database. We also constructed more fine-grained groups for selected subsets of organisms, such as the 19 914 mammalian orthologous groups. We automatically annotated our non-supervised orthologous groups with functional descriptions, which were derived by identifying common denominators for the genes based on their individual textual descriptions, annotated functional categories, and predicted protein domains. The orthologous groups in eggNOG contain 1 241 751 genes and provide at least a broad functional description for 77% of them. Users can query the resource for individual genes via a web interface or download the complete set of orthologous groups at http://eggnog.embl.d

RERO DOC Digital Library

eggNOG: automated construction and annotation of orthologous groups of genes

Author: Ashburner
C. von Mering
Finn
J. Muller
Kanehisa
L. J. Jensen
Lee
Letunic
Li
Li
M. Kuhn
O'Brien
P. Bork
P. Julien
Sonnhammer
T. Doerks
Tatusov
Tatusov
van der Heijden
Wapinski
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The identification of orthologous genes forms the basis for most comparative genomics studies. Existing approaches either lack functional annotation of the identified orthologous groups, hampering the interpretation of subsequent results, or are manually annotated and thus lag behind the rapid sequencing of new genomes. Here we present the eggNOG database (‘evolutionary genealogy of genes: Non-supervised Orthologous Groups’), which contains orthologous groups constructed from Smith–Waterman alignments through identification of reciprocal best matches and triangular linkage clustering. Applying this procedure to 312 bacterial, 26 archaeal and 35 eukaryotic genomes yielded 43 582 course-grained orthologous groups of which 9724 are extended versions of those from the original COG/KOG database. We also constructed more fine-grained groups for selected subsets of organisms, such as the 19 914 mammalian orthologous groups. We automatically annotated our non-supervised orthologous groups with functional descriptions, which were derived by identifying common denominators for the genes based on their individual textual descriptions, annotated functional categories, and predicted protein domains. The orthologous groups in eggNOG contain 1 241 751 genes and provide at least a broad functional description for 77% of them. Users can query the resource for individual genes via a web interface or download the complete set of orthologous groups at http://eggnog.embl.de

PLoS Comput Biol

Author: Arendt D. (D)
Bork P. (P)
Creevey C. (C) J. (J)
Doerks T. (T)
Muller J. (Jean)
Thompson J. (J) D. (D)
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/12/2011
Field of study

The identification of single copy (1-to-1) orthologs in any group of organisms is important for functional classification and phylogenetic studies. The Metazoa are no exception, but only recently has there been a wide-enough distribution of taxa with sufficiently high quality sequenced genomes to gain confidence in the wide-spread single copy status of a gene.Here, we present a phylogenetic approach for identifying overlooked single copy orthologs from multigene families and apply it to the Metazoa. Using 18 sequenced metazoan genomes of high quality we identified a robust set of 1,126 orthologous groups that have been retained in single copy since the last common ancestor of Metazoa. We found that the use of the phylogenetic procedure increased the number of single copy orthologs found by over a third more than standard taxon-count approaches. The orthologs represented a wide range of functional categories, expression profiles and levels of divergence.To demonstrate the value of our set of single copy orthologs, we used them to assess the completeness of 24 currently published metazoan genomes and 62 EST datasets. We found that the annotated genes in published genomes vary in coverage from 79% (Ciona intestinalis) to 99.8% (human) with an average of 92%, suggesting a value for the underlying error rate in genome annotation, and a strategy for identifying single copy orthologs in larger datasets. In contrast, the vast majority of EST datasets with no corresponding genome sequence available are largely under-sampled and probably do not accurately represent the actual genomic complement of the organisms from which they are derived

univOAK

DAS Writeback: A Collaborative Annotation System

Author: T Doerks
UniProt Consortium
U Bhatia
B Mons
IJW Huss
R Dowell
A Grzibovska
H Kilov
C Pautasso
S Vinoski
J Gregorio
Google
P Jones
C Bauer
A Jenkinson
RC Jimenez
N Miyake
G Salazar
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Centralised resources such as GenBank and UniProt are perfect examples of the major international efforts that have been made to integrate and share biological information. However, additional data that adds value to these resources needs a simple and rapid route to public access. The Distributed Annotation System (DAS) provides an adequate environment to integrate genomic and proteomic information from multiple sources, making this information accessible to the community. DAS offers a way to distribute and access information but it does not provide domain experts with the mechanisms to participate in the curation process of the available biological entities and their annotations. Results We designed and developed a Collaborative Annotation System for proteins called DAS Writeback. DAS writeback is a protocol extension of DAS to provide the functionalities of adding, editing and deleting annotations. We implemented this new specification as extensions of both a DAS server and a DAS client. The architecture was designed with the involvement of the DAS community and it was improved after performing usability experiments emulating a real annotation task. Conclusions We demonstrate that DAS Writeback is effective, usable and will provide the appropriate environment for the creation and evolution of community protein annotation.</p

Cape Town University OpenUCT

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Online Research Database In Technology

eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges

Author: A. Roth
Altenhoff
C. von Mering
Chen
Chen
Ciccarelli
Creevey
D. Szklarczyk
Eisen
Gabaldon
Hulsen
I. Letunic
J. Muller
K. Trachana
Koonin
Kuzniar
L. J. Jensen
Linard
M. Kuhn
Makarova
Milinkovitch
P. Bork
Pearson
R. Arnold
S. Powell
T. Doerks
T. Rattei
Tatusov
Tatusov
Trachana
van der Heijden
von Mering
Wapinski
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Orthologous relationships form the basis of most comparative genomic and metagenomic studies and are essential for proper phylogenetic and functional analyses. The third version of the eggNOG database (http://eggnog.embl.de) contains non-supervised orthologous groups constructed from 1133 organisms, doubling the number of genes with orthology assignment compared to eggNOG v2. The new release is the result of a number of improvements and expansions: (i) the underlying homology searches are now based on the SIMAP database; (ii) the orthologous groups have been extended to 41 levels of selected taxonomic ranges enabling much more fine-grained orthology assignments; and (iii) the newly designed web page is considerably faster with more functionality. In total, eggNOG v3 contains 721 801 orthologous groups, encompassing a total of 4 396 591 genes. Additionally, we updated 4873 and 4850 original COGs and KOGs, respectively, to include all 1133 organisms. At the universal level, covering all three domains of life, 101 208 orthologous groups are available, while the others are applicable at 40 more limited taxonomic ranges. Each group is amended by multiple sequence alignments and maximum-likelihood trees and broad functional descriptions are provided for 450 904 orthologous groups (62.5%)

Crossref

University of Birmingham Research Portal

PubMed Central

Copenhagen University Research Information System

ZORA

MDC Repository

DAS Writeback: A Collaborative Annotation System

Author: A Grzibovska
A Jenkinson
Alexander Garcia
B Mons
C Bauer
C Pautasso
Edwin Blake
G Salazar
Google
Gustavo A Salazar
H Kilov
Henning Hermjakob
IJW Huss
J Gregorio
N Miyake
Nicola Mulder
P Jones
R Dowell
Rafael C Jimenez
RC Jimenez
S Vinoski
T Doerks
U Bhatia
UniProt Consortium
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Cape Town University OpenUCT

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The genomes of two key bumblebee species with primitive eusocial organization

Author: A Buttstedt
A Conesa
A Fauser-Misslin
A Garcia-Bellido
A Marchler-Bauer
A Sewer
A Stamatakis
AA Lazareva
Aarti Venkat
AD Chipman
AF Bourke
AJ Vanbergen
Ajay Nair
AK Hassani El
AK Jones
Alvaro G Hernandez
Amy Osborne
Andrew FG Bourke
Andrew G Cridge
Andrew K Jones
Anna K Bennett
AP Lourenco
Arian Köhler
Ariel D Chipman
AV Lobanov
B Boerjan
B Dauwalder
B Langmead
Bart Devreese
Ben M Sadd
Bertrand Fouks
Björn D Schmitt
BM Sadd
BR Herb
BY Kim
C Claudianos
C Elsik
C Elsik
C Fontaine
C Grüter
Carolina G Santos
CD Smith
CD Smith
CE Chapple
Christie Kovar
Christina Schulte
Christine G Elsik
Christopher Pham
Claire Asher
Claire E Johnson
Cornelis JP Grimmelikhuijzen
CR Smith
CW Whitfield
D Goulson
Daniel G Pinheiro
Daniela Puiu
David F Clarke
David H Collins
David S Marco Antonio
DB Weaver
DeNard Simmons
DF Simola
Didac Santesmasses
Dirk C de Graaf
Donna M Muzny
DR Kelley
DR Nassel
E Clare
E Duncan
E Geuverink
E Keibler
E Privman
E Stolle
E Stolle
E Stolle
Eamonn B Mallon
EB Rubin
Eckart Stolle
EJ Duncan
EJ Duncan
Elizabeth Duncan
EM Zdobnov
Erich Bornberg-Bauer
Eva C Winnebeck
Evgeny M Zdobnov
F Bernhard Kraus
F Graeve De
F Hauser
F Hauser
F Liu
F Lyko
Fernanda C Humann
Florian Wolschin
Flávia CP Freitas
FM Nunes
Francis MF Nunes
Francisco Câmara
Frank Hauser
Frano Irvine
G Bloch
G Bloch
G Parra
G Parra
G Suen
GA Lockett
Gabrielle A Lockett
GE Robinson
Gene E Robinson
Geoffrey Okwuonu
Griet Debyser
Gro V Amdam
GS Slater
Guy Bloch
Guy Smagghe
H Li
H Matsuura
H Michael G Lattorff
H Ono
H Shpigler
HE Amarasinghe
HHW Velthuis
HM Hines
HM Robertson
HM Robertson
HM Robertson
HM Robertson
Hugh M Robertson
Inga Nissen
Irene F Newsham
Ivan Meeus
J Debski
J Jurka
J Lu
J Maynard Smith
J Schultz
J-H Xiao
JA Campos-Ortega
JA Lynch
JA Vizcaino
James C Carolan
Jay Evans
JD Evans
JD Lozier
JD Lozier
JD Thompson
Jeffrey D Lozier
JG Oakeshott
JH Werren
JH Willis
Jiaxin Qu
Jinzhi Niu
Jireh Santibanez
Jisheng Liu
JK Colbourne
JK Greenberg
JL Kelley
JM Jandt
John G Oakeshott
Jonathan H Kidner
Joy Jayaseelan
JR Martins
Julie Blommaert
JY Kwon
Jürgen Gadau
K Chen
K Sorefan
K Touhara
K Venkatachalam
Kaat Cappelle
Kate L Ciborowski
Katharina Hoff
Kathrin Näpflin
KB Flores
Kerstin P Blankenburg
Kevin Flores
Kim C Worley
Kimberly KO Walden
KJ Emerson
KK Ingram
Klaus Hartfelder
KS Delaplane
KW Wanner
L Wilfert
L Wilfert
LA Garibaldi
LA Weiss
LaRonda Jackson
Lars Chittka
Lars S Jermiin
LB Kent
LB Vosshall
Liezl Francisco
Ling-Ling Pu
Louis du Plessis
LRS Zanette
Luc Swevers
M Beye
M Hasselmann
M Jinek
M Maibeche-Coisne
M Mariotti
M Mariotti
M Punta
M Stanke
M Stauber
MA Furst
MA Larkin
Marco Mariotti
Marcus Coyle
Marianne Otte
Mark JF Brown
Mark L Blaxter
Martin Beye
Martin Hasselmann
Matthew Beckers
Matthew E Hudson
Matthias Biewer
Matthias Van Vaerenbergh
MC Munoz-Torres
MC Otterstatter
Meaghan P O’Neill
Megan Leask
Michael Holder
Michelle PM Soares
Monica Munoz-Torres
Monika Marxer
Márcia MG Bitondi
N Kapan
NA Baird
Na Yu
Nehad Saada
Nina Rossié
NT Dittmer
O Kohany
Olav Rueppell
Olivier Christiaens
P Danecek
P Schmid-Hempel
P Schmid-Hempel
P Skorupski
PA Hohenlohe
Paul Schmid-Hempel
PD Etter
Peshtewani K Aqrawi
Peter Dearden
PH Williams
PH Williams
PK Dearden
QWT Chan
R Bommarco
R Bonasio
R Crozier
R Feyereisen
R Kucharski
R Nielsen
R Schlatter
R Schmid-Hempel
R Winfree
RC Edgar
RD Finn
Rebecca Thornton
Reed M Johnson
Regula Schmid-Hempel
RG Côté
RG Hatfield
Richard A Gibbs
RJ Gegear
RJ Gill
RM Waterhouse
Robert M Waterhouse
Robert Mata
Robin FA Moritz
Robin Ngo
Roderic Guigó
Rossanah Cameron
S Brown
S Cameron
S Capella-Gutierrez
S Cardinal
S Foret
S Gotz
S Griffiths-Jones
S Hunter
S Kocher
S Moxon
S Nygaard
S Richards
S Schmieder
S Yerushalmi
S Zou
SA Cameron
SA Cameron
SA West
Sandra L Lee
Seirian Sumner
Seth M Barribeau
Severine D Buechel
SH Woodard
SI Ashraf
Silvio Erler
SJ Marygold
SK Behura
Sophie Helbing
Steffen Klasberg
Stephanie Dreier
Stephen Richards
Steven E Scherer
Steven L Salzberg
T Conrad
T Doerks
T Flutre
T Hagai
T Ings
T Louis
T Miyamoto
T Wicker
Tamas Dalmay
Tanja Gempe
Taro Fuchikawa
Tatsuhiko Kadowaki
TD Tayler
Terence Murphy
The Honeybee Genome Sequencing Consortium
Thomas J Colgan
Tiago Falcon
Tittu Mathew
TJ Colgan
TS Korneliussen
UP Consortium
V Croset
V Koch
V Mommaerts
V Raymond-Delpech
V Solovyev
Vandita Joshi
Vasco Koch
VV Kapitonov
VV Kapitonov
W Mao
W Mao
WD Hamilton
WJ Kent
WO Hughes
X Zhou
Y Qiu
Y Wurm
Y Xiong
YL Dupont
Yuan-Qing Wu
YW Yuan
Zilá LP Simões
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background: The shift from solitary to social behavior is one of the major evolutionary transitions. Primitively eusocial bumblebees are uniquely placed to illuminate the evolution of highly eusocial insect societies. Bumblebees are also invaluable natural and agricultural pollinators, and there is widespread concern over recent population declines in some species. High-quality genomic data will inform key aspects of bumblebee biology, including susceptibility to implicated population viability threats. Results: We report the high quality draft genome sequences of Bombus terrestris and Bombus impatiens, two ecologically dominant bumblebees and widely utilized study species. Comparing these new genomes to those of the highly eusocial honeybee Apis mellifera and other Hymenoptera, we identify deeply conserved similarities, as well as novelties key to the biology of these organisms. Some honeybee genome features thought to underpin advanced eusociality are also present in bumblebees, indicating an earlier evolution in the bee lineage. Xenobiotic detoxification and immune genes are similarly depauperate in bumblebees and honeybees, and multiple categories of genes linked to social organization, including development and behavior, show high conservation. Key differences identified include a bias in bumblebee chemoreception towards gustation from olfaction, and striking differences in microRNAs, potentially responsible for gene regulation underlying social and other traits. Conclusions: These two bumblebee genomes provide a foundation for post-genomic research on these key pollinators and insect societies. Overall, gene repertoires suggest that the route to advanced eusociality in bees was mediated by many small changes in many genes and processes, and not by notable expansion or depauperation

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Open Access LMU

Edinburgh Research Explorer

RCAAP - Repositório Científico de Acesso Aberto de Portugal

UPF Digital Repository

White Rose Research Online

Brage NMBU

Royal Holloway - Pure

Ghent University Academic Bibliography

PubMed Central

Copenhagen University Research Information System

Universidade de São Paulo