Search CORE

64 research outputs found

I-TASSER server for protein 3D structure prediction

Author: A Zemla
AA Canutescu
AG Murzin
B Wallner
CS Pettitt
D Baker
D Cozzetto
D Fischer
HM Berman
J Skolnick
JN Battey
K Ginalski
K Karplus
LE Reichl
M Feig
MR Betancourt
SB Needleman
SC Tosatto
SF Altschul
ST Wu
ST Wu
TF Smith
W Kabsch
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Yang Zhang
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Prediction of 3-dimensional protein structures from amino acid sequences represents one of the most important problems in computational structural biology. The community-wide Critical Assessment of Structure Prediction (CASP) experiments have been designed to obtain an objective assessment of the state-of-the-art of the field, where I-TASSER was ranked as the best method in the server section of the recent 7th CASP experiment. Our laboratory has since then received numerous requests about the public availability of the I-TASSER algorithm and the usage of the I-TASSER predictions. Results An on-line version of I-TASSER is developed at the KU Center for Bioinformatics which has generated protein structure predictions for thousands of modeling requests from more than 35 countries. A scoring function (C-score) based on the relative clustering structural density and the consensus significance score of multiple threading templates is introduced to estimate the accuracy of the I-TASSER predictions. A large-scale benchmark test demonstrates a strong correlation between the C-score and the TM-score (a structural similarity measurement with values in [0, 1]) of the first models with a correlation coefficient of 0.91. Using a C-score cutoff > -1.5 for the models of correct topology, both false positive and false negative rates are below 0.1. Combining C-score and protein length, the accuracy of the I-TASSER models can be predicted with an average error of 0.08 for TM-score and 2 Å for RMSD. Conclusion The I-TASSER server has been developed to generate automated full-length 3D protein structural predictions where the benchmarked scoring system helps users to obtain quantitative assessments of the I-TASSER models. The output of the I-TASSER server for each query includes up to five full-length models, the confidence score, the estimated TM-score and RMSD, and the standard deviation of the estimations. The I-TASSER server is freely available to the academic community at <url>http://zhang.bioinformatics.ku.edu/I-TASSER</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

KU ScholarWorks

PubMed Central

Cytokine responsive networks in human colonic epithelial organoids unveil a molecular classification of inflammatory bowel disease

Author: Bewick G
Cozzetto D
Friedman J
Hayee B
Korcsmaros TD
Li K
Niazi U
Pavlidis P
Powell N
Saqi M
Treveil A
Tsakmaki A
Yang F
Publication venue: 'Elsevier BV'
Publication date: 09/09/2022
Field of study

Interactions between the epithelium and the immune system are critical in the pathogenesis of inflammatory bowel disease (IBD). In this study, we mapped the transcriptional landscape of human colonic epithelial organoids in response to different cytokines responsible for mediating canonical mucosal immune responses. By profiling the transcriptome of human colonic organoids treated with the canonical cytokines interferon gamma, interleukin-13, -17A, and tumor necrosis factor alpha with next-generation sequencing, we unveil shared and distinct regulation patterns of epithelial function by different cytokines. An integrative analysis of cytokine responses in diseased tissue from patients with IBD (n = 1,009) reveals a molecular classification of mucosal inflammation defined by gradients of cytokine-responsive transcriptional signatures. Our systems biology approach detected signaling bottlenecks in cytokine-responsive networks and highlighted their translational potential as theragnostic targets in intestinal inflammation

Spiral - Imperial College Digital Repository

Structural Annotation of Mycobacterium tuberculosis Proteome

Of the ∼4000 ORFs identified through the genome sequence of Mycobacterium tuberculosis (TB) H37Rv, experimentally determined structures are available for 312. Since knowledge of protein structures is essential to obtain a high-resolution understanding of the underlying biology, we seek to obtain a structural annotation for the genome, using computational methods. Structural models were obtained and validated for ∼2877 ORFs, covering ∼70% of the genome. Functional annotation of each protein was based on fold-based functional assignments and a novel binding site based ligand association. New algorithms for binding site detection and genome scale binding site comparison at the structural level, recently reported from the laboratory, were utilized. Besides these, the annotation covers detection of various sequence and sub-structural motifs and quaternary structure predictions based on the corresponding templates. The study provides an opportunity to obtain a global perspective of the fold distribution in the genome. The annotation indicates that cellular metabolism can be achieved with only 219 folds. New insights about the folds that predominate in the genome, as well as the fold-combinations that make up multi-domain proteins are also obtained. 1728 binding pockets have been associated with ligands through binding site identification and sub-structure similarity analyses. The resource (http://proline.physics.iisc.ernet.in/Tbstructuralannotation), being one of the first to be based on structure-derived functional annotations at a genome scale, is expected to be useful for better understanding of TB and for application in drug discovery. The reported annotation pipeline is fairly generic and can be applied to other genomes as well

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Open Access Repository of IISc Research Publications

A quality metric for homology modeling: the H-factor

Author: A Berglund
A Ilari
A Kolinski
A Sali
A Tramontano
A Wlodawer
AC Paiva
AE Keating
AE Torda
AG Murzin
AR Subramanian
AT Brunger
AT Brunger
B Wallner
BW Matthews
C Chothia
C Venclovas
C Venclovas
CG Roessler
CM Summa
CM Summa
D Baker
D Cozzetto
D Frishman
D Petrey
DH Ohlendorf
DT Jones
E di Luccio
E Di Luccio
E Saccenti
EL Sonnhammer
EN Brown
Eric di Luccio
G Chopra
G Vriend
GJ Kleywegt
GJ Kleywegt
H Yang
HW van Vlijmen
I Friedberg
IY Koh
J Kopp
J Moult
J Moult
J Moult
J Warringer
J Zhu
JC Kendrew
JD Thompson
JW Ponder
K Fidelis
K Wuthrich
K Wuthrich
K Wuthrich
KM Misura
KR Acharya
KR Acharya
LJ McGuffin
M Levitt
M Levitt
M Levitt
M Levitt
M Tress
M Tress
M Vasquez
M Wiederstein
MA Hanson
MA Olson
MJ Sippl
MY Shen
N Eswar
N Guex
N Siew
NV Buchete
ON Jensen
P Benkert
P Koehl
P Koehl
P Koehl
P Koehl
PA Alexander
Patrice Koehl
Q Fang
RA Laskowski
RC Edgar
RL Dunbrack Jr
RL Dunbrack Jr
RL Dunbrack Jr
S Grzesiek
SC Lovell
SC Lovell
SR Eddy
T Schwede
WJ Browne
X Yu
X Zhang
Publication venue: BioMed Central
Publication date: 01/02/2011
Field of study

Abstract Background The analysis of protein structures provides fundamental insight into most biochemical functions and consequently into the cause and possible treatment of diseases. As the structures of most known proteins cannot be solved experimentally for technical or sometimes simply for time constraints, <it>in silico </it>protein structure prediction is expected to step in and generate a more complete picture of the protein structure universe. Molecular modeling of protein structures is a fast growing field and tremendous works have been done since the publication of the very first model. The growth of modeling techniques and more specifically of those that rely on the existing experimental knowledge of protein structures is intimately linked to the developments of high resolution, experimental techniques such as NMR, X-ray crystallography and electron microscopy. This strong connection between experimental and <it>in silico </it>methods is however not devoid of criticisms and concerns among modelers as well as among experimentalists. Results In this paper, we focus on homology-modeling and more specifically, we review how it is perceived by the structural biology community and what can be done to impress on the experimentalists that it can be a valuable resource to them. We review the common practices and provide a set of guidelines for building better models. For that purpose, we introduce the H-factor, a new indicator for assessing the quality of homology models, mimicking the R-factor in X-ray crystallography. The methods for computing the H-factor is fully described and validated on a series of test cases. Conclusions We have developed a web service for computing the H-factor for models of a protein structure. This service is freely accessible at <url>http://koehllab.genomecenter.ucdavis.edu/toolkit/h-factor</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

MACSIMS : multiple alignment of complete sequences information management system

BACKGROUND: In the post-genomic era, systems-level studies are being performed that seek to explain complex biological systems by integrating diverse resources from fields such as genomics, proteomics or transcriptomics. New information management systems are now needed for the collection, validation and analysis of the vast amount of heterogeneous data available. Multiple alignments of complete sequences provide an ideal environment for the integration of this information in the context of the protein family. RESULTS: MACSIMS is a multiple alignment-based information management program that combines the advantages of both knowledge-based and ab initio sequence analysis methods. Structural and functional information is retrieved automatically from the public databases. In the multiple alignment, homologous regions are identified and the retrieved data is evaluated and propagated from known to unknown sequences with these reliable regions. In a large-scale evaluation, the specificity of the propagated sequence features is estimated to be >99%, i.e. very few false positive predictions are made. MACSIMS is then used to characterise mutations in a test set of 100 proteins that are known to be involved in human genetic diseases. The number of sequence features associated with these proteins was increased by 60%, compared to the features available in the public databases. An XML format output file allows automatic parsing of the MACSIM results, while a graphical display using the JalView program allows manual analysis. CONCLUSION: MACSIMS is a new information management system that incorporates detailed analyses of protein families at the structural, functional and evolutionary levels. MACSIMS thus provides a unique environment that facilitates knowledge extraction and the presentation of the most pertinent information to the biologist. A web server and the source code are available at

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Dundee Online Publications

3D Profile-Based Approach to Proteome-Wide Discovery of Novel Human Chemokines

Author: A Bateman
A Gerber
A Tomczak
A Zlotnik
A Zlotnik
A Zlotnik
AA Maghazachi
Andrej Shevchenko
Aurelie Tomczak
B Rost
C Boshoff
C Gille
C Pasquier
CH Wu
CJ Sigrist
D Cozzetto
D Van Der Spoel
D Wan
David Drechsel
DT Jones
E Lindahl
EL Sonnhammer
F Cocchi
Frank Buchholz
G Magistrelli
G Wang
G Wang
HH de Jongh
I Letunic
I Poser
I Prudovsky
IW Chong
J Cheng
J Gough
J Schultz
J Wang
Jana Sontheimer
JD Bendtsen
JE Pease
JE Tabaska
JG Luz
JT Stine
K Hiller
K Ottersbach
KA Roebuck
Karim Fahmy
KY Blain
LN Kinch
M. Teresa Pisabarro
MA Marti-Renom
Marc Gentzel
MJ Betts
MJ Sippl
MJ Sippl
MJ Sippl
MT Pisabarro
O Shmueli
P Flicek
P Genin
P Horton
P Puntervoll
P Ruggiero
Paul Wrede
R Colobran
Rainer Hausdorf
RJ Nibbs
S Hunter
S Kumar
S Lata
SF Altschul
Stefanie Eichler
T Fujita
TT Murooka
U Widmer
W Humphrey
WF Van Gunsteren
Y Ueda
Z Johnson
Z Zhang
Publication venue: Public Library of Science
Publication date: 07/05/2012
Field of study

Chemokines are small secreted proteins with important roles in immune responses. They consist of a conserved three-dimensional (3D) structure, so-called IL8-like chemokine fold, which is supported by disulfide bridges characteristic of this protein family. Sequence- and profile-based computational methods have been proficient in discovering novel chemokines by making use of their sequence-conserved cysteine patterns. However, it has been recently shown that some chemokines escaped annotation by these methods due to low sequence similarity to known chemokines and to different arrangement of cysteines in sequence and in 3D. Innovative methods overcoming the limitations of current techniques may allow the discovery of new remote homologs in the still functionally uncharacterized fraction of the human genome. We report a novel computational approach for proteome-wide identification of remote homologs of the chemokine family that uses fold recognition techniques in combination with a scaffold-based automatic mapping of disulfide bonds to define a 3D profile of the chemokine protein family. By applying our methodology to all currently uncharacterized human protein sequences, we have discovered two novel proteins that, without having significant sequence similarity to known chemokines or characteristic cysteine patterns, show strong structural resemblance to known anti-HIV chemokines. Detailed computational analysis and experimental structural investigations based on mass spectrometry and circular dichroism support our structural predictions and highlight several other chemokine-like features. The results obtained support their functional annotation as putative novel chemokines and encourage further experimental characterization. The identification of remote homologs of human chemokines may provide new insights into the molecular mechanisms causing pathologies such as cancer or AIDS, and may contribute to the development of novel treatments. Besides, the genome-wide applicability of our methodology based on 3D protein family profiles may open up new possibilities for improving and accelerating protein function annotation processes

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Author: Alborzi S. Z.
Altenhoff A.
Amezola M.
Antczak M.
Aridhi S.
Asgari E.
Atalay V.
Babbitt P. C.
Barot M.
Ben-Hur A.
Benso A.
Bergquist T. R.
Berselli M.
Bhat P.
Bjorne J.
Black G. S.
Boecker F.
Bonneau R.
Borukhov I.
Bosco G.
Boudellioua I.
Brackenridge D. A.
Brenner S. E.
Cao R.
Carraro M.
Casadio R.
Cetin Atalay R.
Chandler C.
Chang J. -M.
Cheng J.
Chi P. -H.
Cozzetto D.
Crocker A. W.
Dai S.
Dalklran A.
Das S.
Davidovic R. S.
Davis L.
Dayton J. B.
Dessimoz C.
Devignes M. -D.
Di Carlo S.
Dogan T.
Dzeroski S.
Fa R.
Fabris F.
Falda M.
Fang H.
Fernandez J. M.
Fontana P.
Frank Y.
Frasca M.
Freddolino P. L.
Freitas A. A.
Friedberg I.
Gemovic B.
Georghiou G.
Ginter F.
Gligorijevic V.
Goldberg T.
Gough J.
Greene C. S.
Grossi G.
Hakala K.
Hamid M. N.
Hoehndorf R.
Hogan D. A.
Holm L.
Hou J.
Hurto R. L.
Jain A.
Jeffery C. J.
Jiang Y.
Jo D.
Johnson D.
Jones D. T.
Kacsoh B. Z.
Kaewphan S.
Kahanda I.
Kihara D.
Koo D. C. E.
Kulmanov M.
Larsen D. J.
Lavezzo E.
Lee A. J.
Lees J. G.
Lewis K. A.
Liao W. -H.
Lichtarge O.
Linial M.
Liu Y. -W.
Mao Q.
Martelli P. L.
Martin M. J.
McGuffin L. J.
McHardy A. C.
Medlar A. J.
Mehryary F.
Mesiti M.
Moen H.
Mofrad M. R. K.
Mooney S. D.
Nguyen H. N.
Notaro M.
Novikov I.
O'Donovan C.
Omdahl A. R.
Orengo C. A.
Paccanaro A.
Pascarelli S.
Perovic V. R.
Petrini A.
Piovesan D.
Politano G.
Profiti G.
Radivojac P.
Re M.
Reeb J.
Renaux A.
Rifaioglu A. S.
Ritchie D. W.
Roche D. B.
Rodriguez J. M.
Romero A. E.
Rose P. W.
Rost B.
Saidi R.
Salakoski T.
Savojardo C.
Schoof H.
Sillitoe I.
Smuc T.
Suh E.
Sumonja N.
Supek F.
Thurlby N.
Tian W.
Tolvanen M. E. E.
Toppo S.
Toronen P.
Torres M.
Tosatto S. C. E.
Tress M. L.
Tseng W. -C.
Ur Rehman H.
Valentini G.
Veljkovic N.
Vidulin V.
Vucetic S.
Wan C.
Wang Z.
Warwick Vesztrocy A.
Wass M. N.
Wilkins A.
Yang H.
Yao S.
You R.
Yunes J. M.
Zhang C.
Zhang F.
Zhang S.
Zhang Y.
Zhang Z.
Zhao C.
Zhou N.
Zhu S.
Zosa E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole genome mutation screening in Candida albicans and aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Sigma-RF: prediction of the variability of spatial restraints in template-based modeling by random forest

Author: A Fiser
A Fiser
A Hildebrand
A Kryshtafovych
A Kryshtafovych
A Liwo
A Pastore
A Sali
A Ziegler
B Manavalan
Bernard R Brooks
D Cozzetto
D Kihara
E Krieger
F Armougom
G Wang
InSuk Joung
J Kopp
J Lee
J Lee
J Moult
J Pei
J Peng
J Peng
J Söding
J Thompson
J Xu
Jooyoung Lee
JR Quinlan
Juyong Lee
K Joo
K Joo
K Joo
K Joo
K Joo
Keehyoung Joo
Kiho Lee
L Breiman
L Breiman
R Caruana
S Wu
TN Petersen
V Mariani
V Mariani
Y Yang
Y Zhang
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Quantifying protein interface footprinting by hydroxyl radical oxidation and molecular dynamics simulation: Application to galectin-1

Author: B. Fitzner
B. N. Stillman
B. S. Berlett
C. Berens
D. A. Case
D. Cozzetto
D. Hambly
D. King
D. King
D. L. Baker
D. L. Tabb
D. M. Hambly
D. N. Perkins
D. P. Greiner
E. Heyduk
E. S. Boja
F. T. Liu
G. Xu
G. Xu
G. Xu
H. B. Bull
H. Chen
H. M. Berman
H. Schagger
H. Walzel
I. E. Platis
J. D. Dyekjaer
J. Hirabayashi
J. J. Englander
J. K. Eng
J. K. Kamal
J. L. Wang
J. S. Sharp
J. S. Sharp
J. S. Sharp
J. S. Sharp
J. S. Sharp
J. W. Wong
J. Wang
J.-Q. Guan
J.-Q. Guan
K. Ginalski
K. J. Hampel
K. Kasai
K. Kubota
K. Takamoto
L. N. Mueller
M. Bern
M. C. Sullards
M. Cho
M. F. Lopez-Lucendo
M. Fouillit
M. R. Brenowitz
N. Fernandez-Fuentes
N. Shkriabai
N. T. Seyfried
P. G. A. Pedrioli
S. D. Maleknia
S. D. Maleknia
S. D. Maleknia
S. H. Barondes
S. J. Hubbard
S. L. Zhou
S. M. Shell
T. D. Tullius
T. T. Aye
V. L. Woods
W. L. Jorgensen
W. M. Garrison
X. Zheng
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

An Expanded Evaluation of Protein Function Prediction Methods Shows an Improvement In Accuracy

Author: Almeida-e-Silva Danillo C.
Altenhoff Adrian
Babbitt Patricia C.
Bankapur Asma R.
Bargsten Joachim W.
Ben-Hur Asa
Benso Alfredo
Bhat Prajwal
BKC Dukka
Bonneau Richard
Brenner Steven E.
Bryson Kevin
Cao Renzhi
Casadio Rita
Cejuela Juan M.
Chapan Samuel
Chen Ching-Tai
Cheng Jianlin
Cibrian-Uhalte Elenia
Clark Wyatt T.
Cozzetto Domenico
D\u27Andrea Daniel
Das Sayoni
Dawson Natalie L.
del Pozo Angela
Denny Paul
Dessimoz Christophe
Di Carlo Stefano
Dogan Tunca
ElShal Sarah
Falda Marco
Fang Hai
Feng Shou
Fernández José M.
Ferrari Carlo
Fontana Paolo
Foulger Rebecca E.
Friedberg Iddo
Funk Christopher S.
Gabaldon Toni
Gemovic Branislava
Gillis Jesse
Ginter Filip
Giollo Manuel
Glisic Sanja
Goldberg Tatyana
Gong Qingtian
Gough Julian
Greene Casey S.
Hakala Kai
Hamp Tobias
Hieta Reija
Holm Liisa
Hsu Wen-Lian
Huntley Rachael P.
Jiang Yuxiang
Jones David T.
Kaewphan Suwisa
Kahanda Indika
Kansakar Lakesh
Khan Ishita K.
Kihara Daisuke
Koo Da Chen Emily
Koskinen Patrik
Lavezzo Enrico
Lee David
Lees Jonathan G.
Legge Duncan
Lepore Rosalba
Li Biao
Lin Alexandra
Linial Michal
Lovering Ruth C.
Magrane Michele
Maietta Paolo
Marcet-Houben Marina
Martelli Pier Luigi
Martin Maria J.
Mehryar Farrokh
Melidoni Anna N.
Mesiti Marco
Minneci Federico
Mooney Sean D.
Moreau Yves
Mutowo-Meullenet Prudence
Nepusz Tamás
Ning Wei
O\u27Donovan Claire
Oates Matt
Ofer Dan
Orengo Christine A.
Oron Tal Ronnen
Paccanaro Alberto
Pavlidis Paul
Penfold-Brown Duncan
Perovic Vladmir
Pichler Klemens
Piovesan Damiano
Politano Gianfranco
Profiti Giuseppe
Radivojac Predrag
Rappoport Nadav
Re Matteo
Rehman Hafeez Ur
Richter Lothar
Robinson Peter N.
Romero Alfonso E.
Rost Burkhard
Sahraeian Sayed M.E.
Salakoski Tapio
Salamov Asaf
Sasidharan Rajkumar
Savino Alessandro
Sedeño-Cortés Adriana E.
Sharan Malvika
Shasha Dennis
Shypitsyna Aleksandra
Skunca Nives
Smithers Ben
Stern Amos
Sternberg Michael J.E.
Stilltoe Ian
Supek Fran
Tian Weidong
Toppo Stefano
Tosatto Silvio C.E.
Tramontano Anna
Tranchevent Léon-Charles
Tress Michael L.
Törönen Petri
Valencia Alfonso
Valentini Giorgio
van Dijk Aalt D.J.
Veljkovic Nevena
Veljkovic Veljko
Vencio Ricardo Z.N.
Verspoor Karin M.
Vogel Jörg
Vucetic Slobodan
Wang Zheng
Wass Mark N.
Yang Haixuan
Youngs Noah
Zakeri Pooya
Zhang Shanshan
Zhong Zhaolong
Zhou Yuanpeng
Publication venue: The Aquila Digital Community
Publication date: 07/09/2016
Field of study

Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent

Aquila Digital Community