David
Perez-Guaita
a,
Angel
Sanchez-Illana
a,
Josep
Ventura-Gayete
b,
Salvador
Garrigues
*a and
Miguel
de la Guardia
a
aDepartment of Analytical Chemistry, University of Valencia, 50 Dr Moliner Street, Research Building, 46100 Burjassot, Valencia, Spain. E-mail: salvador.garrigues@uv.es
bUniversity Hospital Doctor Peset Aleixandre, Av. Gaspar Aguilar, 90, 46017 Valencia, Spain
First published on 25th October 2013
Attenuated total reflectance (ATR) infrared spectroscopy of dried organic extracts of serum samples has been evaluated as a fast method for the determination of triglycerides, cholesterol, high density lipoprotein (HDL) and low density lipoprotein (LDL). After careful selection of solvents based on green parameters, serum samples were extracted using hexane–isopropanol and ethyl acetate–ethanol mixtures. Microscopy studies and comparison with standard spectra were performed in order to investigate whether the proposed methodology is suitable for the quantification of lipids in serum samples. The results of these preliminary studies confirmed that the variations in the IR spectra of sample extracts could be related quantitatively to variations in the concentrations of the target analytes. Then, ATR spectra of the dried sample extracts were obtained and direct measurement of the spectra were carried out and modelled using partial least squares (PLS) and reference concentrations. PLS models obtained from the extracts of the two mixtures were compared with those obtained from direct measurement of sera samples. The prediction errors obtained using the proposed approach were considerably (between 27 and 72%) better than those obtained by the direct measurements of sera. For triglycerides and cholesterol relative errors below 9% and 12% respectively were obtained with this method, which are comparable to the tolerance for the errors of the control analysis established at the hospital. For HDL and LDL, the errors found were between 18 and 20%. The incorporation of a preprocessing extraction step, involves time and solvent consumption. However, the results obtained provide evidence that the proposed method provides, in a few minutes and using simple instrumentation and with minimum cost, important information about the lipidic profile of patients sera at a good screening confidence level.
Since the aforementioned analyses are often carried out using enzymatic methods through high-performance auto-analyzers, the associated cost can be too high for performing screening studies or monitoring analyses. In addition, samples must be transported to a fully-equipped laboratory, making the information available after a time delay, which may hamper timely diagnosis.5 Thus, in the last decades different analytical strategies have been applied for moving testing closer to the patient, so-called point-of-care testing (PoCT), which has several economic and clinical benefits.6
Fourier transformed infrared spectroscopy (FTIR) has been evidenced as one of the most promising techniques for obtaining concentration values of the main components, including the aforementioned lipids,7,8 in the analysis of several bio-fluids; such as serum,9 plasma,10 whole blood or urine.11 Since almost every bioorganic compound presents absorbance in the infrared region, infrared spectra provide a ‘snapshot’ of the main composition of the biological samples. However, the overlapping of the bands of the target analytes with those of several compounds contained in the samples makes mandatory the use of chemometrics, commonly partial least squares (PLS). For spectra acquisition, samples can be dried and measured by transmission12 or can be placed directly into an attenuated total reflectance (ATR) unit.11,13,14 ATR-FTIR permits a straightforward measurement of several parameters for the analysis of sera, which can be obtained in a few minutes and involves no or minimal pretreatment of samples or use of reagents. Besides, the equipment required is not complex and nowadays there are available compact and portable instruments suitable to be used for PoCT. Nevertheless, this technique also presents important drawbacks. First of all, the lack of sensitivity of the ATR technique implies that only analytes present at percentage levels in samples can be properly quantified.15 In addition, the protein bands overshadow most of the bands of the less concentrated compounds. There are also errors of precision and accuracy associated with reference data and finally, due to the high complexity and heterogeneity of the samples, several reference standards must be included in the calibration set in order to obtain a representative and robust calibration.9
Recent advances in ATR-FTIR have been focused on improving the prediction errors. E.g., quantum cascade lasers have been used as an infrared source in the determination of several clinical parameters in the plasma of critical patients, thus improving the signal-to-noise ratio of the measurements16 and compacting considerably the instrumentation needed. In addition, locally weighted regression-PLS has been used instead of PLS in order to improve the representativeness of the calibration set.17 Other approaches have been applied to eliminate the interference caused by the important contribution of proteins to the spectra. Proteins make up approximately 90% of the non-aqueous fraction of plasma and, because of that, proteins are easily determined using FTIR spectroscopy.18 However, the strong amide bands can hinder the determination of less concentrated molecules. Hence, serum creatinine was successfully quantified by using a laminar fluid diffusion interface which isolates small molecules as creatinine from proteins.19,20
To our knowledge, the analysis of serum components by FTIR after deproteinization with organic solvents has still not been investigated. From serum samples, organic solvents should extract the major lipids and related compounds, i.e. those involved in important clinical parameters (triglycerides, cholesterol and lipoproteins). Since lipids include a wide range of nonpolar and highly nonpolar molecules, normally a mixture of solvents is used for their extraction.21 Folch reagent is the most commonly used for this propose. Nevertheless, due to environmental issues,22 other mixtures of solvents such as hexane–isopropanol23 and ethyl acetate–ethanol24 have been proposed as alternatives.
Besides, drying samples or their organic components in an ATR crystal is an easy and rapid way of obtaining an infrared spectrum free from the contribution of solvent bands and thus with an enhanced signal-to-noise ratio. It has already been used for discriminating red wine cultivars25 and for detecting the presence of polyphenol cocoa metabolites in urine.26 However, due to the possible irreproducibility of the drying step and the low penetration depth of the beam in ATR measurements, the size and shape of the sample deposits obtained after a drying step should be investigated if this strategy is selected for performing a quantitative measurement.
The aim of this work has been to evaluate the use of ATR infrared spectra of serum organic extracts as a fast and PoC approach to determine the concentration of triglycerides, cholesterol, HDL and LDL. First, since the method is designed for the screening and control of several samples on a large scale, a careful selection of the solvents to be used for sample pretreatment was realized considering ‘green’ factors.22 The ATR spectra of dried sample extracts were obtained and modelled using PLS based on reference concentrations obtained by enzymatic methods. PLS models built from the extracts of the two solvent mixtures were compared with those obtained from direct measurements of sera samples. Results evidenced that the proposed approach is suitable as a fast tool for the determination of important lipidic parameters in sera, thus being interesting for PoCT.
Serum samples with reference data of some of the studied parameters were obtained from the Hospital Dr Peset Alexandre (Valencia, Spain), being that the reason because we have not used a fixed number of samples to build all the models assayed. Reference concentrations of HDL, LDL, triglycerides and cholesterol were determined using an Abbot Architect c16000 auto-analyzer (Libertyville, IL, USA) as described elsewhere.9 The error of the reference data was considered as the tolerance in the validation analysis of controls established in the analysis protocol and performed daily at the hospital. The tolerance was two times the standard deviation (SD). However, since in most of the cases the values were inside the SD, a declared accuracy value of ±10% of the reported results, corresponding to the SD was considered. Table 1 summarizes the main descriptive statistics of the sample reference data used throughout this study.
Analyte | Set | Hexane–isopropanol 3:1 | Ethyl acetate–ethanol 3:1 | Direct measurement | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Mean | SD | Min | Max | N | Mean | SD | Min | Max | N | Mean | SD | Min | Max | N | ||
a All values are in mg dL−1. Cal and val indicate the sets used for calibration and validation respectively. SD, min, max and N indicate respectively standard deviation, minimum and maximum concentration value and the number of samples included on each set. | ||||||||||||||||
HDL | Cal | 36 | 8 | 47 | 24 | 28 | 35 | 10 | 58 | 18 | 32 | 35 | 10 | 58 | 13 | 29 |
Val | 34 | 11 | 58 | 15 | 21 | 34 | 9 | 50 | 15 | 24 | 36 | 8 | 50 | 21 | 16 | |
LDL | Cal | 91 | 34 | 160 | 39 | 24 | 86 | 34 | 178 | 35 | 32 | 78 | 24 | 143 | 29 | 29 |
Val | 82 | 28 | 159 | 46 | 21 | 86 | 30 | 160 | 39 | 24 | 77 | 28 | 124 | 35 | 14 | |
Cholesterol | Cal | 158 | 41 | 247 | 83 | 28 | 148 | 44 | 259 | 53 | 38 | 139 | 28 | 200 | 87 | 29 |
Val | 141 | 31 | 214 | 87 | 21 | 153 | 42 | 247 | 83 | 27 | 145 | 42 | 235 | 83 | 16 | |
Triglycerides | Cal | 138 | 79 | 397 | 62 | 28 | 116 | 67 | 306 | 41 | 37 | 125 | 72 | 299 | 46 | 28 |
Val | 137 | 84 | 345 | 46 | 21 | 153 | 90 | 397 | 65 | 26 | 124 | 61 | 227 | 41 | 13 |
Direct measurements were performed by depositing 150 μL of serum samples directly on the ATR cell and a spectrum was acquired averaging 300 scans, using a spectrum of the clean empty ATR cell as background. Spectra were acquired in the range between 600 and 4000 cm−1 with a resolution of 4 cm−1. A blank of Milli-Q water was obtained after each three samples measured and was subtracted from the samples in order to eliminate water contribution. After each measurement cleaning of the ATR cell was performed using soft cellulose and a 0.9% (w/v) sodium chloride aqueous solution in order to avoid protein precipitation and, finally the cell was rinsed with Milli-Q water.
PLS models were built using Matlab 7.7.0 (Mathworks Inc. Natik, MA, USA). Interval wavenumber ranges used for modeling were 810–1757 cm−1 for spectra obtained from direct measurements and 3161–2440 cm−1 and 1815–844 cm−1 for spectra obtained from the dried organic extracts. The root means square error of cross validation (RMSCV) calculated using the leave-one-out method was used for selecting the number of latent variables for performing the calibration for each model (see Tables 2–5). The spectra matrix was preprocessed by using a Savisky–Golatz 1st derivate (polynomial order 2 and filter width 15) and mean centering. Concentration data were mean centered as well.
Parameter | Direct measurement | Hexane–isopropanol (3:1) | Ethyl acetate–ethanol (3:1) |
---|---|---|---|
a RMSCV: root mean square error of cross validation, RMSEP: root mean square error of prediction, RRMSEP: relative root mean square error of prediction, RPD: residual predictive deviation. | |||
RMSCV/mg dL−1 | 68.82 | 25.82 | 19.7 |
RMSEP/mg dL−1 | 31.36 | 23.00 | 18.52 |
RRMSEP (%) | 24.89 | 16.91 | 12.10 |
RPD | 1.945 | 3.761 | 4.877 |
Latent variables | 4 | 4 | 6 |
Repeatability/mg dL−1 | 6.29 | 6.73 | |
Relative repeatability (%) | 8.7 | 9.3 |
For comparing the models obtained in the direct measurement to those obtained after protein precipitation and lipid extraction, the following parameters were considered: for evaluating the prediction capability of the models, root mean square error of prediction (RMSEP) and ratio of performance to deviation (RPD), defined as the ratio between RMSEP and the standard deviation of the prediction set reference values, were used. Besides, for quantifying the relative accuracy of the models, the relative root mean square error of prediction (RRMSEP) defined as the RMSEP divided by the average concentration value of the prediction set was used.
Since the step which involved the deposition and the drying of the organic extract was suspected to be an important source of imprecision, a study of the effect of these operations on the measurement repeatability and its influence on the precision of the predictions of analyte concentrations was performed. For this purpose, samples included in a new set of 5 samples that were not integrated in the calibration, nor in the validation set were extracted in triplicate using both assayed solvent mixtures and extracts were measured and predicted using the models obtained with the calibration set. The mean of the SD of the prediction results obtained for each sample was used for evaluating the precision of each measurement method.
Images were obtained using an OLYMPUS (Barcelona, Spain) FV1000 confocal microscope operated in the reflection mode and equipped with a dry objective UPLSAPO (20×) with a numeric aperture of 0.75. The resolution of the measurements in the z-axis was 1.39 μm. Open source software ImageJ was used for handling the images.27
The type of molecules extracted was strongly related to the size, shape and position of the IR bands. Antisymmetric νas(P–O) and symmetric νs(P–O) stretching vibrations, characteristics of phospholipids, were found in the interval regions 900–1100 and 1100–1300 respectively.28 Regarding the alkyl groups, found in every lipid, the δas(CH3), δas(CH2), δs(CH3), δs(CH2) bending (scissoring) were found in the 1400–1450 cm−1 (ref. 10 and 29) range and the ν(CH3), δas(CH2), δs(CH3), δs(CH2) stretching vibrations between 2800 and 3100 cm−1. 10,29 Finally, the ν(CO) stretching vibration of the phospholipids and the esters of fatty acids were found in the region between 1700 and 1760 cm−1.10,29
A figure (SM1) available in the ESI† shows the ATR-FTIR spectra acquired from the different extracts of the same sample obtained using the solvents considered in the preliminary study. In SM1 there are also indicated the regions assigned to the major chemical groups found in the lipids present in sera.
A visual inspection of the spectra reveals that each solvent mixture extracted different amounts and types of lipids. Due to the low absorbance of the hexane extract, this solvent was discarded. Concerning solvent mixtures, ethyl acetate–ethanol 3:1 extracted the largest amount of lipids as indicated by the big absorption bands in the 2800–3200 cm−1 interval. Hence, this mixture was selected for further studies. Besides, the extracts obtained from the Folch reagent and the mixture composed of hexane–isopropanol 3:1 presented similar spectra, with an unidentified wide band at 1500–1700 cm−1. Since both mixtures presented similar behavior, only one was selected for performing the study. As the proposed method is intended for its large-scale use as a screening tool, mixtures were compared in terms of their green parameters (e.g. toxicity of the waste, safety for the operator, etc.30), using hazard statements available in the MERK index.31 Thus, the use of Folch reagent, which includes chloroform, a potentially carcinogen solvent, was excluded.23 An overview of the comparison of solvents employed for sample extraction is available in the ESI (SM2).†
Spectra obtained from the direct measurements were dominated by the protein amide bands, which hampered the specific bands of lipids. In the case of the C–H region (2800–3050 cm−1), water absorption was too high and the subtraction of the water blank was not effective enough to show clearly the C–H bands. In contrast, in the spectra of serum extracts strong differences between the samples can be seen in the absorption regions of the standards, providing evidence that changes in the spectra are caused by the different amount of cholesterol, triglycerides, HDL and LDL in the samples. For example, the maximum position and intensity of the CH2 and CH3 bands located in the 2800–3050 cm−1 range varied widely among the samples, specifically the maximum of the band assigned to the νas(CH2). Standard spectra provide evidence that the mentioned variation can be explained taking into account the different size and shape of the bands of the different components under study. Something similar occurs in the fingerprint region, where strong variations in the size and shape of the ν(CO) and the ν(P–O) bands were observed. In this region bands of triolein and L-α-lysophosphatidylcholine standard were also located. In short, the visual comparison of spectra shown in Fig. 2 reveals that the differences in the bands found in sample spectra are presumably due to differences in the concentration of the compounds under study.
The main conclusion that can be obtained from this study is that irregular films were obtained by drying aqueous diluted sera. However, in the case of sample extracts obtained with organic solvents, the measurement area was covered uniformly by drops of regular size and their thickness in the main part of the crystal was smaller than the penetration depth of the beam, thus allowing the acquisition of quantitative information of the components present in the samples in a reproducible way.
Parameter | Direct measurement | Hexane–isopropanol (3:1) | Ethyl acetate–ethanol (3:1) |
---|---|---|---|
a RMSCV: root mean square error of cross validation, RMSEP: root mean square error of prediction, RRMSEP: relative root mean square error of prediction, RPD: residual predictive deviation. | |||
RMSCV/mg dL−1 | 22.19 | 16.93 | 17.25 |
RMSEP/mg dL−1 | 32.65 | 9.27 | 14.55 |
RRMSEP (%) | 22.53 | 6.57 | 9.48 |
RPD | 1.28 | 3.35 | 2.85 |
Latent variables | 4 | 5 | 5 |
Repeatability/mg dL−1 | 14.5 | 7.54 | |
Relative repeatability (%) | 8.9 | 4.6 |
Fig. 4 Radial plot of ratio of performance to deviation values obtained for all the models built for PLS-ATR-FTIR determination of lipidic compounds in sera. |
Nevertheless, the goodness of the validation was strongly dependent on the analyte considered. Fig. 4 represents the RPD calculated for the validation of the analytes under study. RPD values for cholesterol and triglycerides were found to be two times higher than those obtained for the lipoproteins. A similar profile of RPD can also be seen for models created using the spectra of extracts, in all the cases the RPD values obtained were better than those obtained by direct measurement. Comparing the two extraction procedures, the best prediction capability for triglycerides was found for the extraction of sera with ethyl acetate–ethanol mixture, with a RPD value 30% higher than the RPD obtained for the models obtained using the hexane–isopropanol mixture. In contrast, for cholesterol, the RPD obtained was 18% higher for the extraction with hexane–isopropanol than in the case of using ethyl acetate–ethanol mixture, thus providing evidence of the relative capability of each solvent mixture to extract well the different lipidic compounds present in sera.
Parameter | Direct measurement | Hexane–isopropanol (3:1) | Ethyl acetate–ethanol (3:1) |
---|---|---|---|
a RMSCV: root mean square error of cross validation, RMSEP: root mean square error of prediction, RRMSEP: relative root mean square error of prediction, RPD: residual predictive deviation. | |||
RMSCV/mg dL−1 | 8.15 | 7.05 | 9.95 |
RMSEP/mg dL−1 | 10.25 | 6.55 | 6.56 |
RRMSEP (%) | 28.02 | 19.19 | 20.21 |
RPD | 0.80 | 1.64 | 1.56 |
Latent variables | 5 | 4 | 4 |
Repeatability/mg dL−1 | 2.59 | 1.26 | |
Relative repeatability (%) | 5.8 | 2.8 |
The repeatability of the drying step was found to be an important source of irreproducibility. Relative standard deviations of the measurements of three replicates obtained by drying the same sample extracts (see the repeatability data in the last line of Tables 2–5) were higher than half of the relative accuracy in the case of triglycerides, cholesterol and HDL. Hence, an automation of the deposition of the samples could improve the prediction results. However, this automation also increases the complexity and cost of the instrumentation, thus limiting its application as a PoC methodology.
Parameter | Direct measurement | Hexane–isopropanol (3:1) | Ethyl acetate–ethanol (3:1) |
---|---|---|---|
a RMSCV: root mean square error of cross validation, RMSEP: root mean square error of prediction, RRMSEP: relative root mean square error of prediction, RPD: residual predictive deviation. | |||
RMSCV/mg dL−1 | 20.11 | 15.58 | 16.00 |
RMSEP/mg dL−1 | 24.64 | 17.13 | 15.82 |
RRMSEP (%) | 32.00 | 20.90 | 18.38 |
RPD | 1.139 | 1.607 | 1.912 |
Latent variables | 5 | 4 | 4 |
Repeatability/mg dL−1 | 14.51 | 7.6 | |
Relative repeatability (%) | 14.7 | 7.7 |
In the ESI†is available a table (SM6) which compares the errors obtained, number of samples used for calibration and number of latent variables selected in this study compared with those performed by Hosafçi et al.11 and Liu et al.7 It must be indicated that different approaches for obtaining the spectra were employed in each one of the three compared studies that were namely, direct measurement of whole blood in the case of Hoisafi et al., transmission measurement of dry films of sera in the case of Liu et al. and ATR measurement of organic extracts of sera in the case of the methodology proposed in this paper. Although the best prediction capability was obtained for the models built following the proposed methodology, it must be noticed that the validation sets used were different. However, it seems clear that the proposed methodology, using smaller calibration sets than previous ones and simpler PLS models (latent variables) is able to obtain better or comparable results than those obtained in previous works. This fact can be explained by the extraction step which eliminates the strong bands of proteins from the spectra. Nevertheless, this preprocessing step also implies the use of organic solvents and a preprocessing time.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c3an01057k |
This journal is © The Royal Society of Chemistry 2014 |