Interpretable pairwise distillations for generative protein sequence models

Feinauer, C.; Meynard-Piganeau, B.; Lucibello, C.

doi:10.1371/journal.pcbi.1010219

Many different types of generative models for protein sequences have been proposed in the literature. Their uses include the prediction of mutational effects, protein design, and the prediction of structural properties. Neural network (NN) architectures have shown strong performance, commonly attributed to their capacity to extract non-trivial higher-order interactions from the data. In this work, we analyze two different NN models and assess how close they are to simple pairwise distributions, which have been used in the past for similar problems. We present an approach for extracting pairwise models from more complex ones using an energy-based modeling framework. We show that, for the tested models, the extracted pairwise models can replicate the energies of the original models and are also close in performance on tasks such as mutational effect prediction. In addition, we show that even simpler, factorized models often come close in performance to the original models.
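For readers unfamiliar with the terminology, a pairwise model over sequences is commonly written as an energy function with single-site fields and two-site couplings, and a mutational effect can then be scored as an energy difference between the mutant and the wild type. The following is a minimal sketch under that assumption, not the authors' code: the function names, the alphabet size of 21, the sign convention, and the random parameters are all illustrative choices that do not come from this record.

```python
import random

Q = 21  # assumed alphabet size: 20 amino acids plus a gap symbol


def random_pairwise_model(length, q=Q, seed=0):
    """Build toy fields h[i][a] and couplings J[(i, j)][a][b] for i < j."""
    rng = random.Random(seed)
    h = [[rng.gauss(0.0, 1.0) for _ in range(q)] for _ in range(length)]
    J = {
        (i, j): [[rng.gauss(0.0, 0.1) for _ in range(q)] for _ in range(q)]
        for i in range(length)
        for j in range(i + 1, length)
    }
    return h, J


def pairwise_energy(seq, h, J):
    """E(s) = -sum_i h_i(s_i) - sum_{i<j} J_ij(s_i, s_j) (assumed convention)."""
    energy = -sum(h[i][a] for i, a in enumerate(seq))
    for (i, j), block in J.items():
        energy -= block[seq[i]][seq[j]]
    return energy


def mutation_effect(wild_type, position, new_symbol, h, J):
    """Energy difference E(mutant) - E(wild type) for a single substitution."""
    mutant = list(wild_type)
    mutant[position] = new_symbol
    return pairwise_energy(mutant, h, J) - pairwise_energy(wild_type, h, J)


# Toy usage: sequences are lists of integer symbols in range(Q).
h, J = random_pairwise_model(length=6)
wild_type = [0, 3, 7, 7, 12, 20]
delta = mutation_effect(wild_type, position=2, new_symbol=5, h=h, J=J)
print(f"E(wild type) = {pairwise_energy(wild_type, h, J):.3f}")
print(f"delta E for mutating site 2 to symbol 5 = {delta:.3f}")
```

In this toy setting, a factorized (site-independent) model corresponds to setting every coupling in `J` to zero, so that only the fields `h` contribute to the energy.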

Interpretable pairwise distillations for generative protein sequence models / Feinauer, C.; Meynard-Piganeau, B.; Lucibello, C. - In: PLOS COMPUTATIONAL BIOLOGY. - ISSN 1553-734X. - 18:6(2022). [10.1371/journal.pcbi.1010219]


Year of publication: 2022
DOI: https://dx.doi.org/10.1371/journal.pcbi.1010219
Journal: PLOS COMPUTATIONAL BIOLOGY
Publication type: 1.1 Journal article

Files in this record:

File	Size	Format
journal.pcbi.1010219 (3).pdf (open access; description: Paper; type: 2a Post-print, publisher's version / Version of Record; license: Creative Commons)	3.32 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2988023

PORTO @ Archivio Istituzionale della Ricerca
