Interval based chemometric algorithms have proven to be very powerful for spectral alignments, spectral regressions and spectral classifications. The interval-based methods may not only improve the performance, but also reduce model complexity and enhance the spectral interpretation. Extended Canonical Variate Analysis (ECVA) is a powerful method for multiple group classifications of multivariate data and can easily be extended to an interval approach, iECVA. This study outlines the iECVA method and compares its performance to interval Partial Least Squares Discriminant Analysis (iPLS-DA) on three spectroscopic datasets from Nuclear Magnetic Resonance (NMR), Near Infrared (NIR) and Infrared (IR) spectroscopy, respectively. The results invariantly show that the interval-based classification methods greatly enhance the interpretability of the models by identifying important spectral regions, which facilitate interpretation and biomarker discovery. Although the results for the two methods are similar regarding the number of misclassifications and identified important regions, the model complexity of the PLS-DA proved to consistently lower than the ECVA. The Matlab source codes for both iECVA and iPLS-DA are made freely available at www.models.life.ku.dk.

Simultaneous classification of multiple classes in NMR metabolomics and vibrational spectroscopy using interval-based classification methods: iECVA vs iPLS-DA / Rinnan, A.; Savorani, F.; Engelsen, S. B.. - In: ANALYTICA CHIMICA ACTA. - ISSN 0003-2670. - ELETTRONICO. - 1021:(2018), pp. 20-27. [10.1016/j.aca.2018.03.020]

Simultaneous classification of multiple classes in NMR metabolomics and vibrational spectroscopy using interval-based classification methods: iECVA vs iPLS-DA

Savorani F.;
2018

Abstract

Interval based chemometric algorithms have proven to be very powerful for spectral alignments, spectral regressions and spectral classifications. The interval-based methods may not only improve the performance, but also reduce model complexity and enhance the spectral interpretation. Extended Canonical Variate Analysis (ECVA) is a powerful method for multiple group classifications of multivariate data and can easily be extended to an interval approach, iECVA. This study outlines the iECVA method and compares its performance to interval Partial Least Squares Discriminant Analysis (iPLS-DA) on three spectroscopic datasets from Nuclear Magnetic Resonance (NMR), Near Infrared (NIR) and Infrared (IR) spectroscopy, respectively. The results invariantly show that the interval-based classification methods greatly enhance the interpretability of the models by identifying important spectral regions, which facilitate interpretation and biomarker discovery. Although the results for the two methods are similar regarding the number of misclassifications and identified important regions, the model complexity of the PLS-DA proved to consistently lower than the ECVA. The Matlab source codes for both iECVA and iPLS-DA are made freely available at www.models.life.ku.dk.
File in questo prodotto:
File Dimensione Formato  
S0003267018303994.pdf

non disponibili

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 1.56 MB
Formato Adobe PDF
1.56 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2815373