Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this paper, we propose a new approach for the detection of data drift in black box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The paper analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.

Drift Detection for Black Box Deep Learning Models / Piano, Luca; Garcea, Fabio; Cavallone, Andrea; Aparicio Vazquez, Ignacio; Morra, Lia; Lamberti, Fabrizio. - In: IT PROFESSIONAL. - ISSN 1520-9202. - 26:2(2024), pp. 24-31. [10.1109/MITP.2023.3338007]

Drift Detection for Black Box Deep Learning Models

Luca Piano;Fabio Garcea;Lia Morra;Fabrizio Lamberti
2024

Abstract

Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this paper, we propose a new approach for the detection of data drift in black box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The paper analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.
File in questo prodotto:
File Dimensione Formato  
ITPro_Drift_Detection_Extension_Last (1).pdf

accesso aperto

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: PUBBLICO - Tutti i diritti riservati
Dimensione 837.46 kB
Formato Adobe PDF
837.46 kB Adobe PDF Visualizza/Apri
Drift_Detection_for_Black-Box_Deep_Learning_Models.pdf

non disponibili

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 639.3 kB
Formato Adobe PDF
639.3 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2987595