Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this paper, we propose a new approach for the detection of data drift in black box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The paper analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.
Drift Detection for Black Box Deep Learning Models / Piano, Luca; Garcea, Fabio; Cavallone, Andrea; Aparicio Vazquez, Ignacio; Morra, Lia; Lamberti, Fabrizio. - In: IT PROFESSIONAL. - ISSN 1520-9202. - 26:2(2024), pp. 24-31. [10.1109/MITP.2023.3338007]
Drift Detection for Black Box Deep Learning Models
Luca Piano;Fabio Garcea;Lia Morra;Fabrizio Lamberti
2024
Abstract
Dataset drift is a common challenge in machine learning, especially for models trained on unstructured data, such as images. In this paper, we propose a new approach for the detection of data drift in black box models, which is based on Hellinger distance and feature extraction methods. The proposed approach is aimed at detecting data drift without knowing the architecture of the model to monitor, the dataset on which it was trained, or both. The paper analyzes three different use cases to evaluate the effectiveness of the proposed approach, encompassing a variety of tasks including document segmentation, classification, and handwriting recognition. The use cases considered for the drift are adversarial assaults, domain shifts, and dataset biases. The experimental results show the efficacy of our drift detection approach in identifying changes in distribution under various training settings.File | Dimensione | Formato | |
---|---|---|---|
ITPro_Drift_Detection_Extension_Last (1).pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
PUBBLICO - Tutti i diritti riservati
Dimensione
837.46 kB
Formato
Adobe PDF
|
837.46 kB | Adobe PDF | Visualizza/Apri |
Drift_Detection_for_Black-Box_Deep_Learning_Models.pdf
non disponibili
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
639.3 kB
Formato
Adobe PDF
|
639.3 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2987595