The Principal Component Analysis (PCA) is the simplest eigenvector-based multivariate data analysis tool and dates back to 1901 when Karl Pearson proposed it as a way for finding the best fitting d-1 hyperplane of a system of points in a d-dimensional (Euclidean) space. Over time, the PCA evolved in different fields with several different names and with different scopes, but, in its essence, it is always an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. Generalizing Pearson’s purpose, the knowledge derived by such an analysis is mostly used to find a subspace which effectively and efficiently summarizes the original system of points by losing a minimum amount of information. In the field of Diagnostics, the fundamental task of detecting damage is basically a binary classification problem which is in many cases tackled via Novelty Detection: an observation is classified as novel if it differs significantly from other observations. Novelty can, in principle, be assessed directly in the original space, but the effectiveness of the estimated novelty can be improved by taking advantage of the PCA. In this work, the traditional PCA will be compared to a robust modification that is commonly used in the field of diagnostics to face the issue of confounding influences which could affect the novelty-damage correspondence. Comparisons will be made to shed light on the main misleading aspects of PCA, and finally, define a unique, theoretically justified procedure for Diagnostics via Novelty Detection.

ON THE USE OF PCA FOR DIAGNOSTICS VIA NOVELTY DETECTION: INTERPRETATION, PRACTICAL APPLICATION NOTES AND RECOMMENDATION FOR USE / Daga, ALESSANDRO PAOLO; Fasana, Alessandro; Garibaldi, Luigi; Marchesiello, Stefano. - 5:(2020). [10.36001/phme.2020.v5i1.1241]

ON THE USE OF PCA FOR DIAGNOSTICS VIA NOVELTY DETECTION: INTERPRETATION, PRACTICAL APPLICATION NOTES AND RECOMMENDATION FOR USE

Alessandro Paolo Daga;Alessandro Fasana;Luigi Garibaldi;Stefano Marchesiello
2020

Abstract

The Principal Component Analysis (PCA) is the simplest eigenvector-based multivariate data analysis tool and dates back to 1901 when Karl Pearson proposed it as a way for finding the best fitting d-1 hyperplane of a system of points in a d-dimensional (Euclidean) space. Over time, the PCA evolved in different fields with several different names and with different scopes, but, in its essence, it is always an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. Generalizing Pearson’s purpose, the knowledge derived by such an analysis is mostly used to find a subspace which effectively and efficiently summarizes the original system of points by losing a minimum amount of information. In the field of Diagnostics, the fundamental task of detecting damage is basically a binary classification problem which is in many cases tackled via Novelty Detection: an observation is classified as novel if it differs significantly from other observations. Novelty can, in principle, be assessed directly in the original space, but the effectiveness of the estimated novelty can be improved by taking advantage of the PCA. In this work, the traditional PCA will be compared to a robust modification that is commonly used in the field of diagnostics to face the issue of confounding influences which could affect the novelty-damage correspondence. Comparisons will be made to shed light on the main misleading aspects of PCA, and finally, define a unique, theoretically justified procedure for Diagnostics via Novelty Detection.
2020
978-1-936263-32-5
File in questo prodotto:
File Dimensione Formato  
PHME_20_PCA_r1.pdf

accesso aperto

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: PUBBLICO - Tutti i diritti riservati
Dimensione 1.6 MB
Formato Adobe PDF
1.6 MB Adobe PDF Visualizza/Apri
1241-Document Upload-4644-1-10-20200719.pdf

accesso aperto

Descrizione: Articolo principale
Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 1.6 MB
Formato Adobe PDF
1.6 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2916174