Issue 5, 2015

Impact of data reduction on multivariate classification models built on spectral data from bio-samples

Abstract

Multivariate data analysis methods have been used to evaluate single shot spectral data, obtained by laser induced breakdown spectroscopy (LIBS), from ten different biological samples (simulants and possible interferents in Biological Warfare Agent (BWA) detection applications). Spectral data as echellograms (2D CCD images) and extracted 1D spectra were used and the classification performance was studied as the number of input variables was altered. Principal component analysis (PCA) indicated a possibility to separate the samples due to spectral differences, and partial least squares discriminant analysis (PLS-DA) was applied to study the predictability in more detail. For full resolution 1D spectra, a normalization of the data mainly resulted in visual effects in the PCA score-plots without significant effect in predictability by the PLS-DA models, however, normalization improved the predictability if the amount of variables were heavily reduced. A quite strong data (variable) reduction could be performed on both the 1D and 2D data without losing significant predictability. Using similar amounts of variables, the prediction models performed better using the echellograms directly compared to the extracted 1D spectra. The problem of spectral data shift (relative ‘database’ spectra) was also investigated, where already small shifts cause the models to fail. However, after a selection of important variables and allowing certain regions for these variables, the impact of shift on predictability could be reduced.

Graphical abstract: Impact of data reduction on multivariate classification models built on spectral data from bio-samples

Article information

Article type
Paper
Submitted
03 Dec 2014
Accepted
11 Feb 2015
First published
19 Feb 2015

J. Anal. At. Spectrom., 2015,30, 1117-1127

Author version available

Impact of data reduction on multivariate classification models built on spectral data from bio-samples

A. Larsson, H. Andersson and L. Landström, J. Anal. At. Spectrom., 2015, 30, 1117 DOI: 10.1039/C4JA00467A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements