Inference of annealed protein fitness landscapes with AnnealDCA

Sesta, Luca; Pagnani, Andrea; Fernandez-de-Cossio-Diaz, Jorge; Uguzzoni, Guido

doi:10.1371/journal.pcbi.1011812

The design of proteins with specific tasks is a major challenge in molecular biology with important diagnostic and therapeutic applications. High-throughput screening methods have been developed to systematically evaluate protein activity, but only a small fraction of possible protein variants can be tested using these techniques. Computational models that explore the sequence space in-silico to identify the fittest molecules for a given function are needed to overcome this limitation. In this article, we propose AnnealDCA, a machine-learning framework to learn the protein fitness landscape from sequencing data derived from a broad range of experiments that use selection and sequencing to quantify protein activity. We demonstrate the effectiveness of our method by applying it to antibody Rep-Seq data of immunized mice and screening experiments, assessing the quality of the fitness landscape reconstructions. Our method can be applied to several experimental cases where a population of protein variants undergoes various rounds of selection and sequencing, without relying on the computation of variants enrichment ratios, and thus can be used even in cases of disjoint sequence samples.

Inference of annealed protein fitness landscapes with AnnealDCA / Sesta, L., Pagnani, A., Fernandez-de-Cossio-Diaz, J., Uguzzoni, G.. - In: PLOS COMPUTATIONAL BIOLOGY. - ISSN 1553-7358. - ELETTRONICO. - 20:2(2024). [10.1371/journal.pcbi.1011812]

Inference of annealed protein fitness landscapes with AnnealDCA

Sesta, Luca;Pagnani, Andrea;Fernandez-de-Cossio-Diaz, Jorge;Uguzzoni, Guido

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2024
			
	Codice DOI
	
				https://dx.doi.org/10.1371/journal.pcbi.1011812
			
	Titolo della Rivista
	
				PLOS COMPUTATIONAL BIOLOGY
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
journal.pcbi.1011812.pdf accesso aperto Descrizione: paper Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Creative commons Dimensione 1.15 MB Formato Adobe PDF Visualizza/Apri	1.15 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2995448

PORTO @ Archivio Istituzionale della Ricerca

Inference of annealed protein fitness landscapes with AnnealDCA

Sesta, Luca;Pagnani, Andrea;Fernandez-de-Cossio-Diaz, Jorge;Uguzzoni, Guido

2024

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)