Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data such as images. In this paper, we propose a new approach for detecting data drift in black-box models, based on the Hellinger distance and feature extraction methods. The approach aims to detect data drift without knowledge of the monitored model's architecture, the dataset on which it was trained, or both. We evaluate its effectiveness on three use cases spanning a variety of tasks: document segmentation, classification, and handwriting recognition. The sources of drift considered are adversarial attacks, domain shifts, and dataset biases. The experimental results show that our approach identifies changes in distribution under various training settings.
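The abstract names the Hellinger distance as the basis of the drift measure but does not describe how it is applied. As an illustration only, the sketch below computes the Hellinger distance between per-feature histograms of a reference batch and an incoming batch. It assumes that feature vectors have already been extracted by some external feature extractor; the function names `hellinger_distance` and `feature_drift_score`, the histogram binning, and the averaging over features are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def hellinger_distance(p, q):
    """Hellinger distance between two discrete distributions.

    Inputs are non-negative weights (e.g. histogram counts); they are
    normalised to sum to 1. The result lies in [0, 1].
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    p = p / p.sum()
    q = q / q.sum()
    return float(np.sqrt(0.5 * np.sum((np.sqrt(p) - np.sqrt(q)) ** 2)))


def feature_drift_score(reference, incoming, bins=20):
    """Mean per-feature Hellinger distance between two feature batches.

    Hypothetical helper: both inputs are (n_samples, n_features) arrays of
    features produced by the same extractor. Averaging over features is an
    assumption of this sketch, not the paper's aggregation rule.
    """
    reference = np.asarray(reference, dtype=float)
    incoming = np.asarray(incoming, dtype=float)
    scores = []
    for j in range(reference.shape[1]):
        lo = min(reference[:, j].min(), incoming[:, j].min())
        hi = max(reference[:, j].max(), incoming[:, j].max())
        if hi == lo:
            # Constant feature in both batches: no measurable difference.
            scores.append(0.0)
            continue
        # Shared bin edges so the two histograms are directly comparable.
        edges = np.linspace(lo, hi, bins + 1)
        ref_hist, _ = np.histogram(reference[:, j], bins=edges)
        new_hist, _ = np.histogram(incoming[:, j], bins=edges)
        scores.append(hellinger_distance(ref_hist, new_hist))
    return float(np.mean(scores))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    reference = rng.normal(0.0, 1.0, size=(2000, 4))
    same = rng.normal(0.0, 1.0, size=(2000, 4))
    shifted = rng.normal(2.0, 1.0, size=(2000, 4))
    print("no drift:", feature_drift_score(reference, same))
    print("drift:   ", feature_drift_score(reference, shifted))
```

In such a setup, a score close to 0 suggests that the two batches follow similar distributions, while a score approaching 1 suggests drift. Any decision threshold would have to be calibrated on data from the monitored system.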

Drift Detection for Black Box Deep Learning Models / Piano, Luca; Garcea, Fabio; Cavallone, Andrea; Aparicio Vazquez, Ignacio; Morra, Lia; Lamberti, Fabrizio. - In: IT PROFESSIONAL. - ISSN 1520-9202. - (2023).

Drift Detection for Black Box Deep Learning Models

Luca Piano; Fabio Garcea; Lia Morra; Fabrizio Lamberti
2023

Type: 2. Post-print / Author's Accepted Manuscript
License: Non-public - Private/restricted access

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2987595