The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.

Alborzi, Seyed Ziaeddin; Antczak, Magdalena; Aridhi, Sabeur; Asgari, Ehsaneddin; Atalay, Volkan; Barot, Meet; Bergquist, Timothy R; Bhat, Prajwal; Boecker, Florian; Bonneau, Richard; Borukhov, Itamar; Casadio, Rita; Cetin Atalay, Rengul; Cheng, Jianlin; Chi, Po-Han; Cozzetto, Domenico; Crocker, Alex W; Dalkıran, Alperen; Das, Sayoni; Davidović, Radoslav S; Davis, Larry; Dessimoz, Christophe; Devignes, Marie-Dominique; Dogan, Tunca; Dzeroski, Saso; Fa, Rui; Fabris, Fabio; Fang, Hai; Fernández, José M; Frasca, Marco; Freddolino, Peter L; Freitas, Alex A; Gemovic, Branislava; Georghiou, George; Gligorijević, Vladimir; Goldberg, Tatyana; Gough, Julian; Grossi, Giuliano; Hamid, Md Nafiz; Holm, Liisa; Hou, Jie; Hurto, Rebecca L; Jiang, Yuxiang; Jones, David T; Kacsoh, Balint Z; Kahanda, Indika; Koo, Da Chen Emily; Lavezzo, Enrico; Lee, Alexandra J; Lees, Jonathan Gill; Lewis, Kimberley A; Lichtarge, Olivier; Linial, Michal; Martelli, Pier Luigi; McHardy, Alice C; Medlar, Alan J; Mesiti, Marco; Mofrad, Mohammad RK; Nguyen, Huy N; Notaro, Marco; Novikov, Ilya; Paccanaro, Alberto; Perovic, Vladimir R; Petrini, Alessandro; Profiti, Giuseppe; Re, Matteo; Reeb, Jonas; Renaux, Alexandre; Rifaioglu, Ahmet S; Ritchie, David W; Roche, Daniel B; Rodriguez, Jose Manuel; Romero, Alfonso E; Rose, Peter W; Saidi, Rabie; Savojardo, Castrense; Schoof, Heiko; Sillitoe, Ian; Sumonja, Neven; Supek, Fran; Thurlby, Natalie; Toppo, Stefano; Torres, Mateo; Tress, Michael L; Tseng, Wei-Cheng; Törönen, Petri; Valentini, Giorgio; Veljkovic, Nevena; Vidulin, Vedrana; Wan, Cen; Wang, Zheng; Warwick Vesztrocy, Alex; Wass, Mark N; Wilkins, Angela; Yang, Haixuan; Zhang, Chengxin; Zhang, Yang; Zhao, Chenguang; Zhou, Naihui; Zosa, Elaine

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.

Authors: Seyed Ziaeddin Alborzi
Magdalena Antczak
Sabeur Aridhi
Ehsaneddin Asgari
Volkan Atalay
Meet Barot
Timothy R Bergquist
Prajwal Bhat
Florian Boecker
Richard Bonneau
Itamar Borukhov
Rita Casadio
Rengul Cetin Atalay
Jianlin Cheng
Po-Han Chi
Domenico Cozzetto
Alex W Crocker
Alperen Dalkıran
Sayoni Das
Radoslav S Davidović
Larry Davis
Christophe Dessimoz
Marie-Dominique Devignes
Tunca Dogan
Saso Dzeroski
Rui Fa
Fabio Fabris
Hai Fang
José M Fernández
Marco Frasca
Peter L Freddolino
Alex A Freitas
Branislava Gemovic
George Georghiou
Vladimir Gligorijević
Tatyana Goldberg
Julian Gough
Giuliano Grossi
Md Nafiz Hamid
Liisa Holm
Jie Hou
Rebecca L Hurto
Yuxiang Jiang
David T Jones
Balint Z Kacsoh
Indika Kahanda
Da Chen Emily Koo
Enrico Lavezzo
Alexandra J Lee
Jonathan Gill Lees
Kimberley A Lewis
Olivier Lichtarge
Michal Linial
Pier Luigi Martelli
Alice C McHardy
Alan J Medlar
Marco Mesiti
Mohammad RK Mofrad
Huy N Nguyen
Marco Notaro
Ilya Novikov
Alberto Paccanaro
Vladimir R Perovic
Alessandro Petrini
Giuseppe Profiti
Matteo Re
Jonas Reeb
Alexandre Renaux
Ahmet S Rifaioglu
David W Ritchie
Daniel B Roche
Jose Manuel Rodriguez
Alfonso E Romero
Peter W Rose
Rabie Saidi
Castrense Savojardo
Heiko Schoof
Ian Sillitoe
Neven Sumonja
Fran Supek
Natalie Thurlby
Stefano Toppo
Mateo Torres
Michael L Tress
Wei-Cheng Tseng
Petri Törönen
Giorgio Valentini
Nevena Veljkovic
Vedrana Vidulin
Cen Wan
Zheng Wang
Alex Warwick Vesztrocy
Mark N Wass
Angela Wilkins
Haixuan Yang
Chengxin Zhang
Yang Zhang
Chenguang Zhao
Naihui Zhou
Elaine Zosa
Publication date: 1 November 2019
Publisher: eScholarship, University of California

Abstract

BackgroundThe Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function.ResultsHere, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory.ConclusionWe conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Sustaining member

eScholarship - University of California

oai:escholarship.org:ark:/1303...

Last time updated on 25/12/2021