Variable selection in visible/near infrared spectra for linear and nonlinear calibrations: a case study to determine soluble solids content of beer

Anal Chim Acta. 2009 Mar 2;635(1):45-52. doi: 10.1016/j.aca.2009.01.017. Epub 2009 Jan 17.

Abstract

Three effective wavelength (EW) selection methods combined with visible/near infrared (Vis/NIR) spectroscopy were investigated to determine the soluble solids content (SSC) of beer, including successive projections algorithm (SPA), regression coefficient analysis (RCA) and independent component analysis (ICA). A total of 360 samples were prepared for the calibration (n=180), validation (n=90) and prediction (n=90) sets. The performance of different preprocessing was compared. Three calibrations using EWs selected by SPA, RCA and ICA were developed, including linear regression of partial least squares analysis (PLS) and multiple linear regression (MLR), and nonlinear regression of least squares-support vector machine (LS-SVM). Ten EWs selected by SPA achieved the optimal linear SPA-MLR model compared with SPA-PLS, RCA-MLR, RCA-PLS, ICA-MLR and ICA-PLS. The correlation coefficient (r) and root mean square error of prediction (RMSEP) by SPA-MLR were 0.9762 and 0.1808, respectively. Moreover, the newly proposed SPA-LS-SVM model obtained almost the same excellent performance with RCA-LS-SVM and ICA-LS-SVM models, and the r value and RMSEP were 0.9818 and 0.1628, respectively. The nonlinear model SPA-LS-SVM outperformed SPA-MLR model. The overall results indicated that SPA was a powerful way for the selection of EWs, and Vis/NIR spectroscopy incorporated to SPA-LS-SVM was successful for the accurate determination of SSC of beer.