Combining MALDI-MS with machine learning for metabolomic characterization of lung cancer patient sera†
Abstract
An increasing amount of evidence has proven that serum metabolites can instantly reflect disease states. Therefore, sensitive and reproducible detection of serum metabolites in a high-throughput manner is urgently needed for clinical diagnosis. Matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) is a high-throughput platform for metabolite detection, but it is hindered by significant signal fluctuations because of the “sweet spot” effect of organic matrices. Here, by screening two transformation methods and four normalization techniques to reduce the significant signal fluctuations of the DHB matrix, an integrated MALDI-MS data processing approach combined with machine learning methods was established to reveal metabolic biomarkers of lung cancer. In our study, 13 distinctive features with statistically significant differences (p < 0.001) between 34 lung cancer patients and 26 healthy controls were selected as significant potential biomarkers of lung cancer. 6 out of the 13 distinctive features were identified as intact metabolites. Our results demonstrate the potential for clinical application of MALDI-MS in serum metabolomics for biomarker screening in lung cancer.