Relative Norm Alignment for Tackling Domain Shift in Deep Multi-modal Classification / Planamente, Mirco; Plizzari, Chiara; Peirone, Simone Alberto; Caputo, Barbara; Bottino, Andrea. - In: INTERNATIONAL JOURNAL OF COMPUTER VISION. - ISSN 0920-5691. - Print. - 132:(2024), pp. 2618-2638. [10.1007/s11263-024-01998-9]
Relative Norm Alignment for Tackling Domain Shift in Deep Multi-modal Classification
Planamente, Mirco; Plizzari, Chiara; Peirone, Simone Alberto; Caputo, Barbara; Bottino, Andrea
2024
Abstract
Multi-modal learning has gained significant attention due to its ability to enhance machine learning algorithms. However, it brings challenges related to modality heterogeneity and domain shift. In this work, we address these challenges by proposing a new approach called Relative Norm Alignment (RNA) loss. RNA loss exploits the observation that variations in marginal distributions between modalities manifest as discrepancies in their mean feature norms, and rebalances feature norms across domains, modalities, and classes. This rebalancing improves the accuracy of models on test data from unseen ("target") distributions. In the context of Unsupervised Domain Adaptation (UDA), we use unlabeled target data to enhance feature transferability. We achieve this by combining RNA loss with an adversarial domain loss and an Information Maximization term that regularizes predictions on target data. We present a comprehensive analysis and ablation of our method for both Domain Generalization and UDA settings, testing our approach on different modalities for tasks such as first- and third-person action recognition, object recognition, and fatigue detection. Experimental results show that our approach achieves competitive or state-of-the-art performance on the proposed benchmarks, demonstrating the versatility and effectiveness of our method across a wide range of applications.
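The abstract describes the losses only at a high level; the sketch below illustrates, in PyTorch, one plausible reading of the two terms it mentions. It assumes that, for two modalities, RNA penalizes the deviation of the ratio of their mean feature norms from 1, and that the Information Maximization term takes the common form of per-sample entropy minimization plus marginal-entropy maximization. The function and variable names (`rna_loss`, `info_max_loss`, `feat_a`, `feat_b`) are illustrative and not taken from the authors' code.

```python
# Minimal sketch of an RNA-style loss and an Information Maximization term,
# under the assumptions stated above. Not the authors' implementation.
import torch


def rna_loss(feat_a: torch.Tensor, feat_b: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Penalize the imbalance between the mean L2 feature norms of two modalities.

    feat_a, feat_b: (batch, dim) features from the two modality encoders.
    Returns a scalar that is zero when the mean norms are equal.
    """
    mean_norm_a = feat_a.norm(p=2, dim=1).mean()
    mean_norm_b = feat_b.norm(p=2, dim=1).mean()
    return (mean_norm_a / (mean_norm_b + eps) - 1.0) ** 2


def info_max_loss(target_logits: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Standard Information Maximization regularizer on unlabeled target predictions:
    low entropy for each sample, high entropy for the batch-average prediction."""
    probs = torch.softmax(target_logits, dim=1)
    per_sample_entropy = -(probs * torch.log(probs + eps)).sum(dim=1).mean()
    marginal = probs.mean(dim=0)
    marginal_entropy = -(marginal * torch.log(marginal + eps)).sum()
    return per_sample_entropy - marginal_entropy


if __name__ == "__main__":
    # Random tensors stand in for encoder outputs and classifier logits.
    rgb_feat = torch.randn(32, 1024) * 3.0   # stronger-norm modality
    audio_feat = torch.randn(32, 1024)       # weaker-norm modality
    target_logits = torch.randn(32, 8)       # unlabeled target predictions
    print(rna_loss(rgb_feat, audio_feat), info_max_loss(target_logits))
```

In practice, such terms would be added with suitable weights to the supervised classification loss on source data, alongside the adversarial domain loss mentioned in the abstract.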
| File | Description | Type | License | Size | Format |
|---|---|---|---|---|---|
| RNA_for_IJCV__FLAT_.pdf (open access) | post print | 2. Post-print / Author's Accepted Manuscript | Creative Commons | 5.71 MB | Adobe PDF |
| s11263-024-01998-9.pdf (open access) | | 2a Post-print publisher's version / Version of Record | Creative Commons | 2.46 MB | Adobe PDF |
https://hdl.handle.net/11583/2984895