Robust Ensemble of Computational Techniques for the Detection of Outliers in the Inverse Uncertainty Quantification with Limited Data / Pedroni, Nicola. - Electronic. - (2023), pp. 19647-19647. (Paper presented at UNCECOMP 2023, 5th ECCOMAS Thematic Conference on Uncertainty Quantification in Computational Sciences and Engineering, held in Athens, Greece, 12-14 June 2023).

Robust Ensemble of Computational Techniques for the Detection of Outliers in the Inverse Uncertainty Quantification with Limited Data

Pedroni, Nicola
2023

Abstract

In the analysis of safety-critical systems, the outputs of simulation models can diverge substantially from the actual system dynamics. This discrepancy is due to the aleatory (randomness) and epistemic (lack-of-knowledge) uncertainties affecting the system behavior and its modelling, respectively. In this view, Model Calibration (MC), or Inverse Uncertainty Quantification (IUQ), is of paramount importance. However, in real engineering applications data collection can be prohibitively expensive, so the available data may be insufficient to accurately represent the true system response; in such situations, IUQ methods may lead to biased results. In addition, owing to malfunctioning sensors and measurement devices, human errors, external contamination or unexpected deviations in the system behavior, outliers can appear in the collected samples. In these cases, the outcomes of MC or IUQ may be misleading or even erroneous, especially in the presence of limited data [1]. To address this problem, the paper proposes a framework for the robust detection of outliers in the IUQ of dynamic models of safety-critical systems in the presence of scarce experimental data. The approach is based on an ensemble of three statistical and artificial-intelligence techniques relying on diverse principles: (i) the Isolation Forest (IF) [2], which detects outliers by isolating anomalies from normal points through an ensemble of random trees; (ii) Finite Mixture Models (FMMs) [3], which provide an unsupervised probabilistic clustering of the data and identify outliers as the observations with the smallest likelihood; and (iii) Sparse Stacked AutoEncoders (SSAEs) [4], which are trained to reconstruct multivariate data and flag abnormal samples as those with the largest reconstruction error. Each algorithm is tailored to compute anomaly scores for all the available observations, quantifying their "degree of outlyingness" and inducing a ranking. The three (possibly different) rankings are finally combined by the Borda count into a unique, robust and reliable ranking. The proposed ensemble-based approach is embedded within an inverse quantification of mixed probabilistic (aleatory) and set-based (epistemic) uncertainties in the presence of scarce functional (time-series) data. It is tested on two case studies: (i) a toy problem, where the true outliers and uncertainty models are known; and (ii) the NASA Langley Uncertainty Quantification Challenge on Optimization Under Uncertainty [5]. The method is applied to data sets with different degrees of outlier contamination (1%, 5% and 10%), producing calibration results that are strongly robust to outliers.

[1] L.G. Crespo, B.K. Colbert, S.P. Kenny, D.P. Giesy, On the quantification of aleatory and epistemic uncertainty using Sliced-Normal distributions, Systems & Control Letters 134 (2019) 104560.
[2] F.T. Liu, K.M. Ting, Z.-H. Zhou, Isolation-based anomaly detection, ACM Transactions on Knowledge Discovery from Data 6 (2012) 3.
[3] M.A.T. Figueiredo, A.K. Jain, Unsupervised learning of finite mixture models, IEEE Transactions on Pattern Analysis and Machine Intelligence 24 (3) (2002) 381-396.
[4] R. Zhao, R. Yan, Z. Chen, K. Mao, P. Wang, R.X. Gao, Deep learning and its applications to machine health monitoring, Mechanical Systems and Signal Processing 115 (2019) 213-237.
[5] L.G. Crespo, S.P. Kenny, The NASA Langley challenge on optimization under uncertainty, Mechanical Systems and Signal Processing 152 (2021) 107405.
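To make the scoring step concrete, the following minimal Python sketch computes per-observation anomaly scores with the first two ensemble members, an Isolation Forest and a Gaussian finite mixture model. It uses scikit-learn on synthetic data; the hyperparameters (number of trees, number of mixture components) and the injected contamination are illustrative assumptions, not the settings used in the paper.

```python
import numpy as np
from sklearn.ensemble import IsolationForest
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))   # stand-in for the observed multivariate data
X[:5] += 6.0                    # inject a few artificial outliers

# (i) Isolation Forest: score_samples() is high for normal points,
# so it is negated to obtain a "degree of outlyingness".
iforest = IsolationForest(n_estimators=100, random_state=0).fit(X)
if_scores = -iforest.score_samples(X)

# (ii) Finite mixture model: outliers are the observations with the
# smallest likelihood, i.e. the largest negative log-likelihood.
fmm = GaussianMixture(n_components=2, random_state=0).fit(X)
fmm_scores = -fmm.score_samples(X)

print("top-5 IF suspects: ", np.argsort(if_scores)[::-1][:5])
print("top-5 FMM suspects:", np.argsort(fmm_scores)[::-1][:5])
```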
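The third ensemble member scores observations by reconstruction error. The paper employs Sparse Stacked AutoEncoders; the sketch below uses a plain bottleneck MLP trained to reproduce its standardized input as a minimal stand-in, so the architecture, sparsity penalty and training schedule here are assumptions rather than the paper's actual configuration.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
X[:5] += 6.0                                    # same artificial outliers

Xs = StandardScaler().fit_transform(X)

# Bottleneck MLP trained to reconstruct its own input (hypothetical
# architecture; the paper's SSAE sparsity term is not reproduced).
ae = MLPRegressor(hidden_layer_sizes=(8, 2, 8), activation="tanh",
                  max_iter=5000, random_state=0)
ae.fit(Xs, Xs)                                  # target = input
recon = ae.predict(Xs)
ae_scores = np.mean((Xs - recon) ** 2, axis=1)  # per-sample reconstruction error

print("top-5 SSAE suspects:", np.argsort(ae_scores)[::-1][:5])
```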
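Finally, the three score vectors are turned into rankings and merged by the Borda count: each observation earns points equal to its ascending rank in every list, and the summed points give the consensus "degree of outlyingness". The sketch below is self-contained and uses random stand-in score vectors; in practice the inputs would be the IF, FMM and SSAE scores from the previous sketches.

```python
import numpy as np

def borda_count(score_lists):
    """Aggregate rankings: each observation earns points equal to its
    ascending rank (0 = least anomalous) in every score list; the summed
    points give the consensus anomaly ranking."""
    points = np.zeros(len(score_lists[0]))
    for s in score_lists:
        points += np.argsort(np.argsort(s))   # ascending ranks 0 .. n-1
    return points

# Random stand-ins for the IF, FMM and SSAE score vectors computed above.
rng = np.random.default_rng(1)
scores = [rng.normal(size=200) for _ in range(3)]

consensus = borda_count(scores)
flagged = np.argsort(consensus)[::-1][:5]     # top-5 consensus outliers
print("consensus suspects:", flagged)
```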
Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2986313