The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Zhou, N.; Jiang, Y.; Bergquist, T. R.; Lee, A. J.; Kacsoh, B. Z.; Crocker, A. W.; Lewis, K. A.; Georghiou, G.; Nguyen, H. N.; Hamid, M. N.; Davis, L.; Dogan, T.; Atalay, V.; Rifaioglu, A. S.; Dalklran, A.; Cetin Atalay, R.; Zhang, C.; Hurto, R. L.; Freddolino, P. L.; Zhang, Y.; Bhat, P.; Supek, F.; Fernandez, J. M.; Gemovic, B.; Perovic, V. R.; Davidovic, R. S.; Sumonja, N.; Veljkovic, N.; Asgari, E.; Mofrad, M. R. K.; Profiti, G.; Savojardo, C.; Martelli, P. L.; Casadio, R.; Boecker, F.; Schoof, H.; Kahanda, I.; Thurlby, N.; Mchardy, A. C.; Renaux, A.; Saidi, R.; Gough, J.; Freitas, A. A.; Antczak, M.; Fabris, F.; Wass, M. N.; Hou, J.; Cheng, J.; Wang, Z.; Romero, A. E.; Paccanaro, A.; Yang, H.; Goldberg, T.; Zhao, C.; Holm, L.; Toronen, P.; Medlar, A. J.; Zosa, E.; Borukhov, I.; Novikov, I.; Wilkins, A.; Lichtarge, O.; Chi, P. -H.; Tseng, W. -C.; Linial, M.; Rose, P. W.; Dessimoz, C.; Vidulin, V.; Dzeroski, S.; Sillitoe, I.; Das, S.; Lees, J. G.; Jones, D. T.; Wan, C.; Cozzetto, D.; Fa, R.; Torres, M.; Warwick Vesztrocy, A.; Rodriguez, J. M.; Tress, M. L.; Frasca, M.; Notaro, M.; Grossi, G.; Petrini, A.; Re, M.; Valentini, G.; Mesiti, M.; Roche, D. B.; Reeb, J.; Ritchie, D. W.; Aridhi, S.; Alborzi, S. Z.; Devignes, M. -D.; Koo, D. C. E.; Bonneau, R.; Gligorijevic, V.; Barot, M.; Fang, H.; Toppo, S.; Lavezzo, E.; Falda, M.; Berselli, M.; Tosatto, S. C. E.; Carraro, M.; Piovesan, D.; Ur Rehman, H.; Mao, Q.; Zhang, S.; Vucetic, S.; Black, G. S.; Jo, D.; Suh, E.; Dayton, J. B.; Larsen, D. J.; Omdahl, A. R.; Mcguffin, L. J.; Brackenridge, D. A.; Babbitt, P. C.; Yunes, J. M.; Fontana, P.; Zhang, F.; Zhu, S.; You, R.; Zhang, Z.; Dai, S.; Yao, S.; Tian, W.; Cao, R.; Chandler, C.; Amezola, M.; Johnson, D.; Chang, J. -M.; Liao, W. -H.; Liu, Y. -W.; Pascarelli, S.; Frank, Y.; Hoehndorf, R.; Kulmanov, M.; Boudellioua, I.; Politano, G.; Di Carlo, S.; Benso, A.; Hakala, K.; Ginter, F.; Mehryary, F.; Kaewphan, S.; Bjorne, J.; Moen, H.; Tolvanen, M. E. E.; Salakoski, T.; Kihara, D.; Jain, A.; Smuc, T.; Altenhoff, A.; Ben-Hur, A.; Rost, B.; Brenner, S. E.; Orengo, C. A.; Jeffery, C. J.; Bosco, G.; Hogan, D. A.; Martin, M. J.; O'Donovan, C.; Mooney, S. D.; Greene, C. S.; Radivojac, P.; Friedberg, I.

doi:10.1186/s13059-019-1835-8

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole genome mutation screening in Candida albicans and aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens / Zhou, N.; Jiang, Y.; Bergquist, T. R.; Lee, A. J.; Kacsoh, B. Z.; Crocker, A. W.; Lewis, K. A.; Georghiou, G.; Nguyen, H. N.; Hamid, M. N.; Davis, L.; Dogan, T.; Atalay, V.; Rifaioglu, A. S.; Dalklran, A.; Cetin Atalay, R.; Zhang, C.; Hurto, R. L.; Freddolino, P. L.; Zhang, Y.; Bhat, P.; Supek, F.; Fernandez, J. M.; Gemovic, B.; Perovic, V. R.; Davidovic, R. S.; Sumonja, N.; Veljkovic, N.; Asgari, E.; Mofrad, M. R. K.; Profiti, G.; Savojardo, C.; Martelli, P. L.; Casadio, R.; Boecker, F.; Schoof, H.; Kahanda, I.; Thurlby, N.; Mchardy, A. C.; Renaux, A.; Saidi, R.; Gough, J.; Freitas, A. A.; Antczak, M.; Fabris, F.; Wass, M. N.; Hou, J.; Cheng, J.; Wang, Z.; Romero, A. E.; Paccanaro, A.; Yang, H.; Goldberg, T.; Zhao, C.; Holm, L.; Toronen, P.; Medlar, A. J.; Zosa, E.; Borukhov, I.; Novikov, I.; Wilkins, A.; Lichtarge, O.; Chi, P. -H.; Tseng, W. -C.; Linial, M.; Rose, P. W.; Dessimoz, C.; Vidulin, V.; Dzeroski, S.; Sillitoe, I.; Das, S.; Lees, J. G.; Jones, D. T.; Wan, C.; Cozzetto, D.; Fa, R.; Torres, M.; Warwick Vesztrocy, A.; Rodriguez, J. M.; Tress, M. L.; Frasca, M.; Notaro, M.; Grossi, G.; Petrini, A.; Re, M.; Valentini, G.; Mesiti, M.; Roche, D. B.; Reeb, J.; Ritchie, D. W.; Aridhi, S.; Alborzi, S. Z.; Devignes, M. -D.; Koo, D. C. E.; Bonneau, R.; Gligorijevic, V.; Barot, M.; Fang, H.; Toppo, S.; Lavezzo, E.; Falda, M.; Berselli, M.; Tosatto, S. C. E.; Carraro, M.; Piovesan, D.; Ur Rehman, H.; Mao, Q.; Zhang, S.; Vucetic, S.; Black, G. S.; Jo, D.; Suh, E.; Dayton, J. B.; Larsen, D. J.; Omdahl, A. R.; Mcguffin, L. J.; Brackenridge, D. A.; Babbitt, P. C.; Yunes, J. M.; Fontana, P.; Zhang, F.; Zhu, S.; You, R.; Zhang, Z.; Dai, S.; Yao, S.; Tian, W.; Cao, R.; Chandler, C.; Amezola, M.; Johnson, D.; Chang, J. -M.; Liao, W. -H.; Liu, Y. -W.; Pascarelli, S.; Frank, Y.; Hoehndorf, R.; Kulmanov, M.; Boudellioua, I.; Politano, G.; Di Carlo, S.; Benso, A.; Hakala, K.; Ginter, F.; Mehryary, F.; Kaewphan, S.; Bjorne, J.; Moen, H.; Tolvanen, M. E. E.; Salakoski, T.; Kihara, D.; Jain, A.; Smuc, T.; Altenhoff, A.; Ben-Hur, A.; Rost, B.; Brenner, S. E.; Orengo, C. A.; Jeffery, C. J.; Bosco, G.; Hogan, D. A.; Martin, M. J.; O'Donovan, C.; Mooney, S. D.; Greene, C. S.; Radivojac, P.; Friedberg, I.. - In: GENOME BIOLOGY. - ISSN 1474-760X. - ELETTRONICO. - 20:Article number: 244 (2019)(2019), pp. 1-23. [10.1186/s13059-019-1835-8]

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Zhou N.;Jiang Y.;Bergquist T. R.;Lee A. J.;Kacsoh B. Z.;Crocker A. W.;Lewis K. A.;Georghiou G.;Nguyen H. N.;Hamid M. N.;Davis L.;Dogan T.;Atalay V.;Rifaioglu A. S.;Dalklran A.;Cetin Atalay R.;Zhang C.;Hurto R. L.;Freddolino P. L.;Zhang Y.;Bhat P.;Supek F.;Fernandez J. M.;Gemovic B.;Perovic V. R.;Davidovic R. S.;Sumonja N.;Veljkovic N.;Asgari E.;Mofrad M. R. K.;Profiti G.;Savojardo C.;Martelli P. L.;Casadio R.;Boecker F.;Schoof H.;Kahanda I.;Thurlby N.;McHardy A. C.;Renaux A.;Saidi R.;Gough J.;Freitas A. A.;Antczak M.;Fabris F.;Wass M. N.;Hou J.;Cheng J.;Wang Z.;Romero A. E.;Paccanaro A.;Yang H.;Goldberg T.;Zhao C.;Holm L.;Toronen P.;Medlar A. J.;Zosa E.;Borukhov I.;Novikov I.;Wilkins A.;Lichtarge O.;Chi P. -H.;Tseng W. -C.;Linial M.;Rose P. W.;Dessimoz C.;Vidulin V.;Dzeroski S.;Sillitoe I.;Das S.;Lees J. G.;Jones D. T.;Wan C.;Cozzetto D.;Fa R.;Torres M.;Warwick Vesztrocy A.;Rodriguez J. M.;Tress M. L.;Frasca M.;Notaro M.;Grossi G.;Petrini A.;Re M.;Valentini G.;Mesiti M.;Roche D. B.;Reeb J.;Ritchie D. W.;Aridhi S.;Alborzi S. Z.;Devignes M. -D.;Koo D. C. E.;Bonneau R.;Gligorijevic V.;Barot M.;Fang H.;Toppo S.;Lavezzo E.;Falda M.;Berselli M.;Tosatto S. C. E.;Carraro M.;Piovesan D.;Ur Rehman H.;Mao Q.;Zhang S.;Vucetic S.;Black G. S.;Jo D.;Suh E.;Dayton J. B.;Larsen D. J.;Omdahl A. R.;McGuffin L. J.;Brackenridge D. A.;Babbitt P. C.;Yunes J. M.;Fontana P.;Zhang F.;Zhu S.;You R.;Zhang Z.;Dai S.;Yao S.;Tian W.;Cao R.;Chandler C.;Amezola M.;Johnson D.;Chang J. -M.;Liao W. -H.;Liu Y. -W.;Pascarelli S.;Frank Y.;Hoehndorf R.;Kulmanov M.;Boudellioua I.;Politano G.;Di Carlo S.;Benso A.;Hakala K.;Ginter F.;Mehryary F.;Kaewphan S.;Bjorne J.;Moen H.;Tolvanen M. E. E.;Salakoski T.;Kihara D.;Jain A.;Smuc T.;Altenhoff A.;Ben-Hur A.;Rost B.;Brenner S. E.;Orengo C. A.;Jeffery C. J.;Bosco G.;Hogan D. A.;Martin M. J.;O'Donovan C.;Mooney S. D.;Greene C. S.;Radivojac P.;Friedberg I.

2019

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2019
			
	Codice DOI
	
				https://dx.doi.org/10.1186/s13059-019-1835-8
			
	Titolo della Rivista
	
				GENOME BIOLOGY
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
s13059-019-1835-8.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 8.39 MB Formato Adobe PDF Visualizza/Apri	8.39 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2776153

PORTO @ Archivio Istituzionale della Ricerca

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

2019

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)