Gradient-Based Competitive Learning: Theory / Cirrincione, Giansalvo; Randazzo, Vincenzo; Barbiero, Pietro; Ciravegna, Gabriele; Pasero, Eros. - In: COGNITIVE COMPUTATION. - ISSN 1866-9956. - Electronic. - (2023). [10.1007/s12559-023-10225-5]

Gradient-Based Competitive Learning: Theory

Randazzo, Vincenzo; Ciravegna, Gabriele; Pasero, Eros
2023

Abstract

Deep learning has recently been used to extract the relevant features for representing input data, even in the unsupervised setting. However, state-of-the-art techniques focus mostly on algorithmic efficiency and accuracy rather than on mimicking the input manifold. By contrast, competitive learning is a powerful tool for replicating the topology of the input distribution. It is cognitively and biologically inspired, as it is founded on Hebbian learning, a neuropsychological theory claiming that neurons can increase their specialization by competing for the right to respond to, and represent, a subset of the input data. This paper introduces a novel perspective by combining these two techniques: unsupervised gradient-based learning and competitive learning. The theory is based on the intuition that neural networks can learn topological structures by working directly on the transpose of the input matrix. To this end, the vanilla competitive layer and its dual are presented. The former is representative of a standard competitive layer for deep clustering, while the latter is trained on the transposed matrix. The equivalence of the two layers is proven both theoretically and experimentally. The dual competitive layer has better properties: unlike the vanilla layer, it directly outputs the prototypes of the input data, while still allowing learning by backpropagation. More importantly, this paper proves theoretically that the dual layer is better suited to handling high-dimensional data (e.g., in biological applications), because the estimation of its weights is driven by a constraining subspace whose dimension depends not on the input dimensionality but only on the dataset cardinality. The resulting approach is very promising both for small datasets of high-dimensional data and for better exploiting the advantages of a deep architecture, since the dual layer integrates seamlessly with deep layers. A theoretical justification is also given through an analysis of the gradient flow for both the vanilla and dual layers.
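To make the transpose idea in the abstract concrete, the following is a minimal, hypothetical PyTorch sketch (not the authors' code): for a dataset X of shape (n, d), the dual layer learns a coefficient matrix A of shape (n, k), so that the k prototypes P = X^T A always lie in the span of the n samples, independently of the input dimensionality d. The names DualCompetitiveLayer and quantization_loss, and the choice of a winner-take-all quantization loss, are assumptions for illustration only.

import torch

# X: (n, d) dataset with n samples of dimensionality d.
# A vanilla competitive layer stores k prototypes W of shape (k, d),
# so its size grows with d. The dual layer below stores coefficients
# A of shape (n, k) instead, so its size grows with n only; prototypes
# are recovered as P = X^T A, i.e., combinations of the samples.

class DualCompetitiveLayer(torch.nn.Module):  # hypothetical name
    def __init__(self, n, k):
        super().__init__()
        # random init breaks the symmetry between prototypes
        self.A = torch.nn.Parameter(torch.rand(n, k) / n)

    def forward(self, X):
        return X.t() @ self.A  # prototypes P, shape (d, k)

def quantization_loss(X, P):
    # winner-take-all: each sample is charged to its closest prototype
    d2 = torch.cdist(X, P.t())  # (n, k) sample-prototype distances
    return d2.min(dim=1).values.pow(2).mean()

n, d, k = 100, 1000, 5          # few samples, high dimensionality
X = torch.randn(n, d)
layer = DualCompetitiveLayer(n, k)
opt = torch.optim.SGD(layer.parameters(), lr=0.05)
for _ in range(200):            # plain gradient-based training
    opt.zero_grad()
    quantization_loss(X, layer(X)).backward()
    opt.step()

In this sketch the learnable parameters number n * k = 500 regardless of d, which illustrates the abstract's claim that the dual layer's weight estimation depends on the dataset cardinality rather than on the input dimensionality, while training remains ordinary backpropagation.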
Files in this product:

s12559-023-10225-5.pdf (open access)
Description: Open Access article
Type: 2a Post-print editorial version / Version of Record
License: Creative Commons
Size: 3.52 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2984047