Kateřina
Aubrechtová Dragounová‡
ab,
Oleg
Ryabchykov
bc,
Daniel
Steinbach
d,
Vincent
Recla
e,
Nora
Lindig
e,
María José
González Vázquez
ab,
Susan
Foller
d,
Michael
Bauer
a,
Thomas W.
Bocklitz
bfg,
Jürgen
Popp
bf,
Jürgen
Rödel
e and
Ute
Neugebauer
*abf
aDepartment of Anaesthesiology and Intensive Care Medicine and Center for Sepsis Control and Care (CSCC), Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany. E-mail: ute.neugebauer@leibniz-ipht.de
bLeibniz Institute of Photonic Technology (Leibniz-IPHT), a member of the Leibniz Centre for Photonics in Infection Research (LPI), Albert-Einstein-Straße 9, 07745 Jena, Germany
cBiophotonics Diagnostics GmbH, Am Wiesenbach 30, 07751 Jena, Germany
dDepartment of Urology, Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany
eInstitute of Medical Microbiology, Jena University Hospital, Am Klinikum 1, 07747 Jena, Germany
fInstitute of Physical Chemistry and Abbe School of Photonics, Friedrich Schiller University Jena, Helmholtzweg 4, 07743 Jena, Germany
gInstitute of Computer Science, Faculty of Mathematics, Physics & Computer Science, University Bayreuth, Universitätsstraße 30, 95447 Bayreuth, Germany
First published on 17th July 2023
Urinary tract infections (UTI) are among the most frequent nosocomial infections. A fast identification of the pathogen and assignment of Gram type could help to prescribe most suitable treatments. Raman spectroscopy holds high potential for fast and reliable bacterial pathogens identification. While most studies so far have focused on individual pathogens or artificial mixtures, this contribution aims to translate the analysis to primary urine samples from patients with suspected UTIs. For this, we have included 59 primary urine samples out of which 29 were diagnosed as mixed infections. For Raman analysis, we first trained two classification models based on principal component analysis – linear discriminant analysis (PCA-LDA) with more than 3500 Raman spectra of 85 clinical isolates from 23 species in order to (1) identify the Gram type of the bacteria and (2) assign family membership to one of the six most abundant bacterial families in urinary tract infections (Enterobacteriaceae, Morganellaceae, Pseudomonadaceae, Enterococcaceae, Staphylococcaceae and Streptococcaceae). The classification models were applied to artificial mixtures of Gram positive and Gram negative bacteria to correctly predict mixed infections with an accuracy of 75%. Raman scans of dried droplets did not yet yield optimal classification results on family level. When translating the method to primary urine samples, we observed a strong bias towards Gram negative bacteria, on family level towards Morganellaceae, which reduced prediction accuracy. Spectral differences were observed between isolates grown on standard growth medium and bacteria of the same strain when characterized directly from the patient. Thus, improvement of the classification accuracy is expected with a larger data base containing also bacteria measured directly from the urine sample.
Despite the increasing interest on the clinical relevance of mixed UTIs in the last years,8,10–12 question how to deal with them was not yet addressed. In routine diagnostics, the presence of further pathogen in the culture is connected with complications. In one hand, if there is insufficient second pathogen concentration, such a culture is considered as contamination or colonization, and usually it may not be reported by laboratories. So, the piece of information relevant for long-term catheterization, elderly and high-risk patients is lost.12,15 On the other hand, when second pathogen counts are enough, there is no established routine how to identify responsible pathogen for particular UTI episode11 resulting in time-consuming, expensive, and labour-intensive identification and susceptibility testing of most present organism. Finally, both mentioned situations lead to broad-spectrum antibiotic treatment and the risk of bacterial resistance increase11 and may lead to potential life-threatening complications described above. Current conventional diagnostic methods, which are used as gold standard, require at least one cultivation step, so that the identity of the bacteria is available after 24 hours or more. In addition, the routine, based on monobacterial growth with significant count,8 is inherently designed for single pathogen UTIs. To obtain particular information about sample until then, some urine screen tests are applied before culturing, like Gram staining followed by microscopy, or tests on leukocyte esterase or nitrite via dipsticks indirectly indicating infection.5,7,16 Nevertheless, this approach suffers relatively low sensitivity and high false positive rate.5 On the other hand, other precise techniques able to shorten time of diagnosis, like PCR-based techniques, electrochemical DNA biosensors, mass spectrometry (MALDI-TOF) are expensive or have special requirements on preparation or personnel. On research level, Raman spectroscopy in combination with multivariate data analysis proved to be a powerful alternative to identify bacteria at the species level1,15,17 and their resistance pattern18–21 in real time with high accuracy, in a cultivation-independent manner and no special requirements on sample preparation.
However, to the best of our knowledge, most work on this topic focused on laboratory bacterial strains under optimal growth conditions (medium broth, agar plates) or selected clinical isolates. When translating the technique to primary urine samples, additional complexity is introduced as the exact chemical composition of the patient's urine is not known and might even change with disease state, but also with nutrition. Bacteria are likely to adapt their metabolic state and thus might be in different states than after cultivation under defined laboratory conditions.17,22 Furthermore, in case of real patients' samples studies, UTIs caused by single pathogens were included23 or if the mixture, the attention was focused only on one pathogen on purpose.17,20,24 Only few works15,25 were devoted to the Raman spectroscopic analysis of mixed infections in the form of artificial mixtures, i.e. again under optimal growth conditions.
The aim of this study is to present the potential of Raman spectroscopy for cultivation-independent bacteria identification directly from patient's urine samples based on a large Raman database covering the clinical spectrum of pathogens and also to shed more light onto spectroscopy-based diagnostic of polymicrobial infections. For this, bacteria are first classified according to Gram type as this information is valuable for selecting preliminary patient treatment. A second classification model evaluates the potential to also differentiate bacterial families to provide a more detailed picture on the infection.
One monovette was subjected to routine microbiological analysis (gold standard), while the other monovette was subjected to Raman measurements at the same time. The maximal delay between sample collection and Raman sample preparation was 2 days, meanwhile urine was stored in the fridge at 4 °C.
Isolated bacterial cultures on blood or Drigalski agar plates were included in this study as clinical isolates, see ESI Table S2.†
It total, 85 bacterial strains belonging to 23 species were included in this study: 11 Escherichia coli strains, 14 Klebsiella spp. strains, 13 Pseudomonas spp. strains, 13 Enterococcus spp. strains, 6 Enterobacter cloacae strains, 7 Proteus mirabilis strains, 5 Streptococcus spp. strains, 9 Staphylococcus spp. strains, 2 Providencia rettgeri strains, 2 Citrobacter koseri strains, 1 Morganella morganii strain, 1 Acinetobacter ursingii strain, and 1 Corynebacterium amycolatum strain (ESI Table S2†).
Mixture name | Strain 1 | Strain 2 | Volume ratio strain 1:strain 2 |
---|---|---|---|
Mix_1 | E. coli urRP41 | E. faecalis urRP41 | 1:3 |
Mix_2 | E. coli urRP59 | E. faecalis urRP56 | 1:3 |
Mix_3 | E. coli urRP65 | S. warneri urRP20 | 1:1 |
Mix_4 | E. coli urRP18 | S. aureus urRP022 | 1:3 |
Bacteria were cultivated overnight in 20 ml of AT2 medium in separate flasks. The optical density (OD) at 600 nm of the overnight culture was adjusted to yield 20 ml suspension with OD between 0.08–0.1 (cell Density Meter, Fisher Scientific, Fisherbrand), corresponding to McFarland standard 0.5. After that, each suspension was centrifugated at 4190 rcf for 10 min, supernatants were removed, and both pellets were resuspended into 1000 μl of diluted Raman medium (AT2 + 0.5 PBS). A total volume of 1000 μl of different artificial mixtures were created as outlined in Table 1. Different volume ratios were necessary to ensure that both bacterial strains were present in sufficient quantities. To verify the presence of strains in the mixture, 100 μl of the suspension was plated on MH2 agar plate. Afterwards, cells were washed twice in sterile deionized water (for 1.5 minutes at rcf 13500g Eppendorf centrifuge 5418), resuspend in 1000 μl of sterile deionized water.
Raman measurement were performed using an upright CRM 300 WiTec micro-Raman system, equipped with UHTS spectrometer with 600 lines per mm grating, and air-cooled back-illuminated CCD camera (DV401 BV, ANDOR, 1024 × 127 pixels, cooled to −60 °C). Raman scattering was excited with 532 nm line of Nd:YAG laser with the power of 15 mW before passing the objective. The laser light was focused onto the sample using a 63× objective (Zeiss LD Plan-Neofluar Korr M27, NA 0.75), allowing a maximum spatial resolution of 355 nm under optimal conditions when using Abbe's formula (d = λ/2NA). Back-scattered Raman signal was collected and forwarded to the spectrograph by a multimode optical fibre with 50 μm core diameter. Performance and alignment check of the device was performed using silicon and 4-acetamidophenol on each measurement day.
Dried droplets of clinical isolates were measured in at least 3 independent batches per strain resulting in more than 3400 Raman spectra from the 23 different species. For each batch, at least 10 single spectra from different locations on the sample were recorded with acquisition time of 10 s.
Dried droplets of artificial mixtures and patient samples were measured as image scans in automated scanning mode covering an area of at least 20 μm × 20 μm, with a step size of 0.333 μm in XY directions with 5 s acquisition time per spectrum. Thus, at least 3600 spectra per sample were collected.
After preprocessing and quality check using all spectral data, classification models were built using only spectra of clinical isolates. Principal component analysis (PCA) was carried out to reduce the dimensionality of the dataset, then, linear discriminant analysis (LDA) was utilized as a binary (Gram positive vs. Gram negative) classification model and a 6-class (bacterial families) model. In both cases, balanced model weights were used to get the optimal trade-off between the sensitivity and specificity. The number of principal components used in the LDA was optimized in the leave-one-replicate-out cross-validation.30 In such validation scheme for the total of N replicates, the model trained on N − 1 replicates is applied to the replicate excluded from the training. The procedure is repeated N times to obtain predictions for all replicates but avoids the situation when the spectra from the predicted replicate are included in training. Nine PCs were used in the Gram-model and 39 PCs were used in the Family-model. These two models were then used to predict bacteria identity in the artificial mixtures as well as the clinical urine samples.
Spectra of artificial mixtures and the patients’ samples were not utilized in training and were only used for result evaluation. In the case that after quality check, less than 10 spectra were kept from a patient, this patient was excluded from final evaluation (true for 1 patient in our set). Within each spectral scan, the number of spectra assigned by the model to each class were investigated. The predictions within each scan are normalized to the maximal value, thus limiting the normalized predictions between 0 and 1. All classes with more than 0.1 normalized predictions were considered present in the sample. Thus, each scan could be predicted as a member of a single class or multiple classes. Prediction accuracy was calculated by comparing with microbiological findings.
a Two more families (Moraxellaceae and Corynebacteriaceae) were included in training the Gram type model, however, they were left out for training the family model. |
---|
Raman mean spectra together with their standard deviation of Gram negative and positive bacteria included in the study are depicted in Fig. 1a. Typical spectral features of bacteria can be identified, e.g., 783 cm−1 (cytosine, thymine ring breathing31,32), 1005 cm−1 (phenylalanine ring breathing33), 1080 cm−1 (C–N stretch of proteins31), 1097 cm−1 (PO2− stretching in DNA33), 1250 cm−1 (amide III32,33), 1340 cm−1 (adenine, guanine, CN-stretching in purine nucleobases33), 1450 cm−1 (deformations of CH2 scissoring31,34), 1578 cm−1 (ring stretching of guanine, adenine24), 1670 cm−1 (amide I, lipids31,34), and overlapping bands at 2850 cm−1 and 2935 cm−1 (CH3 and CH2 stretching35). A detailed assignment of the Raman bands is given in ESI Table S4 and Fig. S2.† Clear spectral differences are visible between the Raman spectra of Gram positive and Gram negative bacteria. Fig. 2 shows the computed difference spectrum of the Raman mean spectra. The most visible difference is found around 748 cm−1 (position 7 in Fig. S2, Table S4†), range 900–1000 cm−1 (bands 14–18, Fig. S2, Table S4†), 1312 cm−1 (position 34) and 1578 cm−1 (position 45, Fig. S2, Table S4†). These differences can be mainly explained with the different cell wall compositions of Gram positive and negative bacteria. Similar results have been reported in previous studies.23
Fig. 1 (a) Normalized mean Raman spectra of Gram negative (black) and Gram positive bacteria (red) together with standard deviation (shown as shadow). Raman band assignment is given in ESI Table S2.† (b) Mean preprocessed Raman spectra with standard deviations for families used for training the model (spectra are shifted on y axis for clarity). Genus and strains included per family are found in Table 2 and ESI Table S2,† respectively. |
Fig. 2 PCA-LDA loading plot of LD1 (pink) along with computed difference spectrum (Gram positive minus Gram negative) of the Raman mean spectra (blue). |
It has to be noted, that the standard deviation of Raman spectra from Gram negative species is higher than the standard deviation of Gram positive bacteria. This can be explained with the large variety of Gram negative bacteria included in this study (9 different bacterial genera compared to 4 Gram positive genera, see Table 2 and ESI Table S2†).
Quantitative results leave-one-out prediction are summarized in the confusion matrix in Table 3. High sensitivity and specificity for both classification levels are reached and exceed 90%, giving a balanced accuracy of 93.6%.
Cross-validation | Prediction | Sensitivity (%) | ||
---|---|---|---|---|
neg. | pos. | |||
True | neg. | 2256 | 103 | 95.6 |
pos. | 99 | 1074 | 91.6 |
Overall less spectra are included here as spectra of Moraxellaceae and Corynebacteriaceae were not used.
A six-class PCA-LDA model was trained to predict membership to respective bacterial family. Results of leave-one-replicate cross-validation are summarized in the confusion matrix (Table 4). An overall balanced accuracy of around 87% was reached. Best sensitivities are reached for Morganellaceae (>93%) and Staphylococcaceae (>92%). Lowest sensitivities were observed for Pseudomonadaceae (>79%) and Enterobacteriaceae (>82%) families. However, it has to be noted, that most mispredictions occurred within the same Gram type, namely between Pseudomonadaceae and Enterobacteriaceae as well as between Enterobacteriaceae and Morganellaceae. High similarity between Enterobacteriaceae and Morganellaceae is also seen in Fig. 1b. Both families belong to the order of Enterobacterales. High similarity of Raman spectra of bacteria from Enterobacteriaceae and Morganellaceae have been also reported previously,15 where it was not possible to separate spectra of E. coli and P. mirabilis using simple PCA clustering. The low sensitivity for predicting Pseudomonadaceae family might be due to the fact that within this family, a high standard deviation among individual strains is observed.
Prediction results for the artificial mixtures using the family model trained with the bacterial isolates are listed in Table 6. Except for artificial mixture 2 (Mix_2) and two batches of artificial mixture 1 (Mix_1), the truly present bacteria were always correctly predicted to be present in the mixture. However, a fully correct prediction was only achieved for batch 5 in artificial mixture 1. In all other cases (except batch 2 of Mix_1), also other families were predicted to be present in the mixture. In most cases, these other families contributed only to a minor proportion. However, in a very few cases they made up the majority (e.g., batch 2 in Mix_3 and batch 4 in Mix_4). Thus, it can be concluded that the current model did not proof yet powerful enough to predict correct family membership in mixed samples. One reason could be that image scans were recorded of dried droplets. We have chosen the step size (i.e. the pixel size of one point) with 0.333 μm rather small and also smaller than the average size of a bacterium (0.5–1 μm in diameter). However, it cannot be excluded that in a dried sample more than one bacterium was contributing to the spectrum and therefore making precise family assignment difficult. Nevertheless, as in most cases truly present bacteria were correctly predicted to be present, we aggregated the family predictions (Table 6) according to the Gram type and used this prediction to identify mixtures. The results are shown in the last column of Table 5. The same overall accuracy of 75% is reached as for the Gram type model. However, different batches were wrongly predicted to contain no mixtures.
The achieved accuracies for mixture predictions with our PCA-LDA models are comparable to previously reported results of mixture analysis where prediction accuracies of up to 73 and 89% were achieved with PLS-DA and SVM, respectively.15
Results of Raman and microbiological analysis are provided for each of the 59 samples in ESI Table S5.† Microbiological analysis revealed that ∼50% of samples (29 out of 59 samples) showed mixed (polymicrobial) infections. In 18 samples with mixed infections, two pathogens were identified, in 11 samples three or more pathogens were identified (ESI Table S5†). Mixed infections were slightly more likely from catheter samples (11 out of 17 samples (∼65%) were mixed infections) than from midstream urine (18 out of 42 samples (∼43%) were mixed infections).
These findings are in line with results from other studies, where 30–86% of UTIs were reported to be true mixed infections.7,8 With an average age of 69 ± 15.2 years our cohort includes also a significant portion of elderly patients, which are more susceptible for mixed infections.12 Higher prevalence of mixed infections in samples originating from catheterized patients has also been reported.8,10–12
In very few cases, bacterial species were identified (always as part of mixed infections) which were not included in the training data set, e.g., Serratia marcescens in patient 21. Corynebacterium amycolatum (from patient 27) and Acinetobacter ursingii (from patient 29) were included in the Gram model, but not in the family model. In one patient, also fungi were identified as pathogen in mixed infections. However, with the currently applied sample preparation protocol, fungi are not expected in the Raman sample due to their size (>5 μm) and the filtration step at the beginning.
Table 7 summarized the results of the PCA-LDA classification analysis of the Raman data. At first, we applied the 2-class Gram type model to identify patients’ samples with mixed infections of Gram positive and Gram negative bacteria from infections with only Gram positive or Gram negative bacteria (Table 7, top). The model achieved an overall balanced accuracy of 49%, when considering correct predicting to one of the three options. The achieved accuracy is significantly lower than for the artificial mixtures where the three options could be predicted with an accuracy of 75%. When analysing the wrong predictions, a strong bias towards Gram negative bacteria is observed which is reflected in the high sensitivity and low specificity for this class in Table 7. This means, almost all Gram negative bacteria were correctly predicted to be Gram negative, while also many mixtures and Gram positive bacteria were wrongly predicted to be Gram negative bacteria. High specificity and low sensitivity were reached for Gram positive bacteria. This means, that no Gram negative bacteria or mixtures were predicted to be Gram positive bacteria.
Bal. acc. 49% | Pred. Gram model | Sensitivity (%) | Specificity (%) | |||
---|---|---|---|---|---|---|
Neg. | mix. | Pos. | ||||
True | Neg. | 26 | 5 | 0 | 83.9 | 28.6 |
Mixture | 12 | 3 | 0 | 20 | 77.3 | |
Pos. | 8 | 5 | 0 | 0.0 | 100.0 |
Bal. acc. 44% | Pred. family model | Sensitivity (%) | Specificity (%) | |||
---|---|---|---|---|---|---|
Neg. | mix. | Pos. | ||||
True | Neg. | 21 | 10 | 0 | 67.7 | 39.3 |
Mixture | 11 | 4 | 0 | 26.7 | 63.6 | |
Pos. | 6 | 6 | 1 | 7.7 | 100.0 |
A similar trend is observed when predicting the family membership with the 6-class family model and aggregating the family predictions according to the Gram type (Table 7, top). Here, an overall balanced accuracy of 44% is achieved. A similar bias towards predicting the presence of Gram negative bacteria in the sample as for the Gram model is observed. Upon closer investigation of the prediction on family level (ESI Table S5†), it can be seen, that in all, but two patients’ samples (patient 13 and patient 24), Gram negative Morganellaceae are predicted to be present in the sample. However, this is correct only for 8 patients’ samples and wrong for 47 patients’ samples.
For seven patients’ samples, two independent dried droplets were prepared and measured. In the analysis above, the samples with the most remaining spectra after automated Raman quality filtration were included. ESI Table S6† shows the prediction results of both batches for each of the seven urine samples. In most cases, a good overall agreement is found between the replicate samples.
It has to be noted that the classification models were trained with clinical isolates that were cultivated under ideal microbiological conditions, i.e. on Müller-Hinton 2 agar plates, while patients’ urine samples were directly analysed without any cultivation step. An influence of growth medium on Raman spectra has been reported in earlier studies22 and explained with a slightly changed overall chemical composition due to different nutrients in the medium. Storage of the samples prepared for Raman measurement might also play a role, as was shown in ref. 37.
In order to explore, if the growth medium effect could be relevant for our samples, we performed unsupervised principal component analysis with the Raman spectra of the pathogens when measured directly from the urine sample and after isolation and cultivation on Müller-Hinton 2 agar plates. For this, all 14 pairs of pathogens which were measured directly from urine and after isolation and cultivated were included in the analysis. Selected individual scores plots and the combined scores plot of all samples are presented in ESI Fig. S4.† Exemplarily, the PCA scores plot is shown for patient's sample number 16 in Fig. 4a. Raman spectra of the very same strain of Klebsiella pneumoniae measured directly from the urine sample (black dots) and after isolation and cultivation on Müller-Hinton 2 agar plates (red dots) show prominent differences already in the first principal component which describes 27% of the variation in the data set. Different measurement parameters of the isolates, such as single spectra (green dots) vs. image scan (red dots) do not have an effect on spectral variation as is clearly visible from coinciding green and red clusters in Fig. 4a. Differences in growth condition (in urine vs. Mueller-Hinton 2 agar plates) seem to be larger than differences between species and families as significant separation of growth conditions is visible already in PC1 in an unsupervised PCA model containing strains from different families (Fig. 4b and ESI Fig. S4†). The loading plot of PC1 (Fig. 4b) shows large contributions from the spectral region of CH-stretching. In addition, changes in bands corresponding to phenylalanine (1005 cm−1, 1590 cm−1), amide III (1250 cm−1), guanine (1310 cm−1), CH modes of glucosamines and proteins (1440 cm−1), and peptidoglycan (1590 cm−1) are visible in the fingerprint region. For all three selected bacterial strains, PC1 captures most differences.
In order to avoid medium-induced effects on the classification model, it could be recommended to train the classification model with Raman spectra of bacteria directly from urine samples. In our small study we had only 30 urine samples with single pathogen infections (20 Gram negative and 10 Gram positive strains). For reliable classification models it is suggested to expand the data set. This will be within the scope of future work.
There are further factors that could affect the accuracy of the classification model to predict mixed infections. One would be the presence of bacterial species and families that were not included in the training model. In our case, we found Lactobacillus jensenii (Gram positive bacilli of family Lactobacillaceae) or Serratia marcescens (Gram negative rods of family Yersiniaceae). We assume that if the spectral data base is sufficiently large, also unknown species are assigned to the right Gram type or a closely related family.23
Furthermore, in literature, the application of different classification models has been discussed. In our study, we have chosen PCA-LDA, which is one of the most common approaches for classification tasks of isolated bacterial strains.24,33 However, other studies could demonstrate higher performance of other models, such as partial least squares-discriminant analysis (PLS-DA)15,38, k-Nearest Neighbours methods,23,31,36 or support vector machines (SVM),23 or deep learning methods39 to just name a few. A detailed comparison of different methods is beyond the scope of the manuscript and will be subject of further studies.
It has to be noted, that routine microbiological diagnostics relies on viable bacteria in the urine sample as it is a cultivation-based method that yields number of bacteria as colony forming units (CFU) per milliliter. Thus, bacteria that are already killed by a successful antibiotic treatment cannot be cultivated anymore, but might be still present (if not fully lysed yet) in the urine sample.
Furthermore, the current sample preparation workflow excludes fungal pathogens. If they should be included in future work, the filtration step needs to be modified.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3an00679d |
‡ Present address of KAD: Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Brehova 7, 11519 Prague, Czech Republic. |
This journal is © The Royal Society of Chemistry 2023 |