The Principal Component Analysis (PCA) is the simplest eigenvector-based multivariate data analysis tool and dates back to 1901 when Karl Pearson proposed it as a way for finding the best fitting d-1 hyperplane of a system of points in a d-dimensional (Euclidean) space. Over time, the PCA evolved in different fields with several different names and with different scopes, but, in its essence, it is always an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. Generalizing Pearson’s purpose, the knowledge derived by such an analysis is mostly used to find a subspace which effectively and efficiently summarizes the original system of points by losing a minimum amount of information. In the field of Diagnostics, the fundamental task of detecting damage is basically a binary classification problem which is in many cases tackled via Novelty Detection: an observation is classified as novel if it differs significantly from other observations. Novelty can, in principle, be assessed directly in the original space, but the effectiveness of the estimated novelty can be improved by taking advantage of the PCA. In this work, the traditional PCA will be compared to a robust modification that is commonly used in the field of diagnostics to face the issue of confounding influences which could affect the novelty-damage correspondence. Comparisons will be made to shed light on the main misleading aspects of PCA, and finally, define a unique, theoretically justified procedure for Diagnostics via Novelty Detection.
ON THE USE OF PCA FOR DIAGNOSTICS VIA NOVELTY DETECTION: INTERPRETATION, PRACTICAL APPLICATION NOTES AND RECOMMENDATION FOR USE / Daga, ALESSANDRO PAOLO; Fasana, Alessandro; Garibaldi, Luigi; Marchesiello, Stefano. - 5:(2020). [10.36001/phme.2020.v5i1.1241]
ON THE USE OF PCA FOR DIAGNOSTICS VIA NOVELTY DETECTION: INTERPRETATION, PRACTICAL APPLICATION NOTES AND RECOMMENDATION FOR USE
Alessandro Paolo Daga;Alessandro Fasana;Luigi Garibaldi;Stefano Marchesiello
2020
Abstract
The Principal Component Analysis (PCA) is the simplest eigenvector-based multivariate data analysis tool and dates back to 1901 when Karl Pearson proposed it as a way for finding the best fitting d-1 hyperplane of a system of points in a d-dimensional (Euclidean) space. Over time, the PCA evolved in different fields with several different names and with different scopes, but, in its essence, it is always an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. Generalizing Pearson’s purpose, the knowledge derived by such an analysis is mostly used to find a subspace which effectively and efficiently summarizes the original system of points by losing a minimum amount of information. In the field of Diagnostics, the fundamental task of detecting damage is basically a binary classification problem which is in many cases tackled via Novelty Detection: an observation is classified as novel if it differs significantly from other observations. Novelty can, in principle, be assessed directly in the original space, but the effectiveness of the estimated novelty can be improved by taking advantage of the PCA. In this work, the traditional PCA will be compared to a robust modification that is commonly used in the field of diagnostics to face the issue of confounding influences which could affect the novelty-damage correspondence. Comparisons will be made to shed light on the main misleading aspects of PCA, and finally, define a unique, theoretically justified procedure for Diagnostics via Novelty Detection.File | Dimensione | Formato | |
---|---|---|---|
PHME_20_PCA_r1.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
PUBBLICO - Tutti i diritti riservati
Dimensione
1.6 MB
Formato
Adobe PDF
|
1.6 MB | Adobe PDF | Visualizza/Apri |
1241-Document Upload-4644-1-10-20200719.pdf
accesso aperto
Descrizione: Articolo principale
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Creative commons
Dimensione
1.6 MB
Formato
Adobe PDF
|
1.6 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2916174