In classical empirical research a model requires that the number of variables must be less than the number of observations, but developments in chemometrics and modern analytical platforms have pushed people beyond the classical model. Typical "omics" data sets will include 100-1000 samples and often more than 10,000 variables and the advantage of using chemometrics to large data structures is the ability to efficiently deal with collinear data sets with many more variables than samples. However, the trend with ever more variables also pushes the chemometric tools to the limit as they will also increase the extent of spurious correlations and interferences. This chapter advocates for a systematic breakdown of the variable space in intervals in order to improve the interpretability and performance of chemometric methods. The term ". i-chemometrics" is here introduced to encompass the whole class of interval-based chemometric methods. This chapter will describe the advantages of using the generic i-chemometric methods for data preprocessing, data exploration, regression, and sample classification/discrimination using examples from NMR foodomics. The main advantages are more parsimonious models, improved interpretability and, in many cases, improved performance.

Interval-Based Chemometric Methods in NMR Foodomics / Savorani, Francesco; Rasmussen, Morten Arendt; Rinnan, Åsmund; Engelsen, Søren Balling - In: Data Handling in Science and Technology: Chemometrics in Food Chemistry / Marini F.. - STAMPA. - [s.l] : Elsevier, 2013. - ISBN 9780444595287. - pp. 449-486 [10.1016/B978-0-444-59528-7.00012-0]

Interval-Based Chemometric Methods in NMR Foodomics

SAVORANI, FRANCESCO;
2013

Abstract

In classical empirical research a model requires that the number of variables must be less than the number of observations, but developments in chemometrics and modern analytical platforms have pushed people beyond the classical model. Typical "omics" data sets will include 100-1000 samples and often more than 10,000 variables and the advantage of using chemometrics to large data structures is the ability to efficiently deal with collinear data sets with many more variables than samples. However, the trend with ever more variables also pushes the chemometric tools to the limit as they will also increase the extent of spurious correlations and interferences. This chapter advocates for a systematic breakdown of the variable space in intervals in order to improve the interpretability and performance of chemometric methods. The term ". i-chemometrics" is here introduced to encompass the whole class of interval-based chemometric methods. This chapter will describe the advantages of using the generic i-chemometric methods for data preprocessing, data exploration, regression, and sample classification/discrimination using examples from NMR foodomics. The main advantages are more parsimonious models, improved interpretability and, in many cases, improved performance.
2013
9780444595287
Data Handling in Science and Technology: Chemometrics in Food Chemistry
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2628255
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo