Classifier-dependent feature selection via greedy methods

Camattari, Fabiana; Guastavino, Sabrina; Marchetti, Francesco; Piana, Michele; Perracchione, Emma

doi:10.1007/s11222-024-10460-2

This study introduces a new approach to feature ranking for classification tasks, hereafter called greedy feature selection. In statistical learning, feature selection is usually carried out by methods that are independent of the classifier that later performs the prediction on the reduced set of features. Greedy feature selection instead identifies, at each step, the most important feature according to the selected classifier. The benefits of such a scheme are investigated in terms of model capacity indicators, such as the Vapnik-Chervonenkis dimension or the kernel alignment. The theoretical analysis proves that the iterative greedy algorithm is able to construct classifiers whose complexity capacity grows at each step. The proposed method is then tested numerically on various datasets and compared with state-of-the-art techniques. The results show that the iterative scheme truly captures only a few relevant features and may improve on the accuracy scores of other techniques, especially for real and noisy data. The greedy scheme is also applied to the challenging problem of predicting geo-effective manifestations of the active Sun.
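The idea of selecting, at each step, the feature that the chosen classifier finds most useful can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the selection criterion, so the sketch assumes plain validation accuracy as the score, uses a simple nearest-centroid classifier as a stand-in for "the selected classifier", and runs on synthetic data. All function names (`fit_centroids`, `greedy_feature_ranking`, and so on) are hypothetical.

```python
import random

def fit_centroids(X, y, feats):
    """Stand-in classifier: per-class mean over the chosen feature indices."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(feats))
        for k, j in enumerate(feats):
            acc[k] += row[j]
        counts[label] = counts.get(label, 0) + 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}

def predict(centroids, row, feats):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    def dist2(c):
        return sum((row[j] - m) ** 2 for j, m in zip(feats, centroids[c]))
    return min(sorted(centroids), key=dist2)

def accuracy(X_tr, y_tr, X_va, y_va, feats):
    """Assumed score: validation accuracy of the classifier on `feats` only."""
    centroids = fit_centroids(X_tr, y_tr, feats)
    hits = sum(predict(centroids, r, feats) == t for r, t in zip(X_va, y_va))
    return hits / len(y_va)

def greedy_feature_ranking(X_tr, y_tr, X_va, y_va, n_select):
    """At each step, add the feature that maximizes the classifier's score."""
    selected, history = [], []
    remaining = list(range(len(X_tr[0])))
    for _ in range(n_select):
        scores = {j: accuracy(X_tr, y_tr, X_va, y_va, selected + [j])
                  for j in remaining}
        best = max(remaining, key=lambda j: scores[j])  # ties: lowest index
        selected.append(best)
        remaining.remove(best)
        history.append(scores[best])
    return selected, history

# Synthetic data (an assumption for illustration): only feature 2 is informative.
rng = random.Random(0)

def make_data(n):
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0, 1) for _ in range(4)]
        row[2] += 3.0 if label else -3.0
        X.append(row)
        y.append(label)
    return X, y

X_tr, y_tr = make_data(200)
X_va, y_va = make_data(200)
ranking, history = greedy_feature_ranking(X_tr, y_tr, X_va, y_va, n_select=2)
print("ranking:", ranking)
print("score after each step:", history)
```

Because the score is computed by the classifier itself, swapping in a different classifier can change the ranking, which is the sense in which such a scheme is classifier-dependent. A classifier-independent filter would instead rank the features once, without training any model.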

Classifier-dependent feature selection via greedy methods / Camattari, Fabiana; Guastavino, Sabrina; Marchetti, Francesco; Piana, Michele; Perracchione, Emma. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 34:5(2024), pp. 1-12. [10.1007/s11222-024-10460-2]

Year: 2024
DOI: https://dx.doi.org/10.1007/s11222-024-10460-2
Journal title: STATISTICS AND COMPUTING
Appears in types: 1.1 Journal article

Files in this record:

File	Access	Type	License	Size	Format
STATCOMP_2024.pdf	Open access	2a Post-print, publisher's version / Version of Record	Creative Commons	330.21 kB	Adobe PDF
2403.05138v1.pdf	Open access	1. Preprint / submitted version [pre-review]	Public - All rights reserved	238.49 kB	Adobe PDF


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2991265

PORTO @ Archivio Istituzionale della Ricerca
