Simultaneous classification of multiple classes in NMR metabolomics and vibrational spectroscopy using interval-based classification methods: iECVA vs iPLS-DA / Rinnan, A.; Savorani, F.; Engelsen, S. B. - In: ANALYTICA CHIMICA ACTA. - ISSN 0003-2670. - ELECTRONIC. - 1021:(2018), pp. 20-27. [10.1016/j.aca.2018.03.020]
Simultaneous classification of multiple classes in NMR metabolomics and vibrational spectroscopy using interval-based classification methods: iECVA vs iPLS-DA
Savorani F.
2018
Abstract
Interval-based chemometric algorithms have proven to be very powerful for spectral alignment, spectral regression and spectral classification. Interval-based methods may not only improve performance, but also reduce model complexity and enhance spectral interpretation. Extended Canonical Variate Analysis (ECVA) is a powerful method for multiple-group classification of multivariate data and can easily be extended to an interval approach, iECVA. This study outlines the iECVA method and compares its performance to interval Partial Least Squares Discriminant Analysis (iPLS-DA) on three spectroscopic datasets from Nuclear Magnetic Resonance (NMR), Near Infrared (NIR) and Infrared (IR) spectroscopy, respectively. The results consistently show that the interval-based classification methods greatly enhance the interpretability of the models by identifying important spectral regions, which facilitate interpretation and biomarker discovery. Although the results for the two methods are similar regarding the number of misclassifications and the important regions identified, the model complexity of PLS-DA proved to be consistently lower than that of ECVA. The Matlab source code for both iECVA and iPLS-DA is made freely available at www.models.life.ku.dk.

File | Size | Format
---|---|---
S0003267018303994.pdf (not available; type: 2a Post-print, publisher's version / Version of Record; license: non-public, private/restricted access) | 1.56 MB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/2815373