Ran Wang‡
a,
Yufei Liu‡a,
Chang Wang‡a,
Henghui Lia,
Xin Liu*a,
Liming Cheng*b and
Yanhong Zhou*a
aBritton Chance Center for Biomedical Photonics at Wuhan National Laboratory for Optoelectronics – Hubei Bioinformatics & Molecular Imaging Key Laboratory, Systems Biology Theme, Department of Biomedical Engineering, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China. E-mail: yhzhou@hust.edu.cn; xliu@mail.hust.edu.cn
bDepartment of Laboratory Medicine, Tongji Hospital, Wuhan 430074, China. E-mail: chengliming2002@163.com
First published on 20th July 2018
Monitoring serum glycomics is one of the most important emerging approaches for diagnosis of various cancers, and the majority of previous studies were based on MALDI-MS or HPLC analysis. Considering the difference of these analytical methods employed for serum glycomics, it is necessary to compare the effectiveness of different analytical methods for monitoring the aberrant changes in serum glycomics. In this study, a strategy based on machine learning was firstly applied for comparing the analysis results of MALDI-MS and HPLC on the same serum glycomics of hepatocellular carcinoma (HCC) samples. The capability of these two analytical methods for identifying HCC is demonstrated by the classification results obtained from MALDI-MS and HPLC data. In addition, by comparing glycomics which were significantly correlated with HCC based on MALDI-MS and HPLC, some N-glycans which may be the potential biomarkers for HCC were identified, validating the capability of these two analytical methods for the differentiated identification in the analysis of glycomics. Meanwhile, it is noteworthy that various physiological and environmental factors may cause the aberrant changes in glycosylation, and all these interference factors may be minimized by analyzing the same sample sets of HCC. Overall, these results showed that MALDI-MS and HPLC are complementary in qualitative and quantitative analysis of serum glycomics.
Various approaches for qualitative and quantitative analysis of subtle changes in glycomics mainly rely on several different analytical techniques, including high performance liquid chromatography (HPLC), capillary electrophoresis (CE), and mass spectrometry (MS) etc.10 Among those techniques, HPLC-fluorescent detection (HPLC-FLD) has become an effective means to analyze glycans, coupling with different fluorescent labelling reagents, such as 2-aminobenzoic acid (2-AA), 2-aminobenzamide (2-AB), and 2-amino pyridine (PA) etc..11 Additionally, matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) has also widely been applied to identify N-glycan biomarkers for cancer due to its low sample consumption, high throughput capacity and ease of operation.12–14
It is noteworthy that the N-glycomics of some cancers have been studied by MALDI-MS and HPLC, respectively.1 In these studies, both MALDI-MS and HPLC have identified several N-glycans as potential biomarkers for these cancers. Kyselova et al. reported that 8 N-glycans have been identified with highly accurate diagnostic potential in breast cancer by MALDI-MS,15 while in the study of Saldova et al., another 4 different types of N-glycans have been identified as potential biomarkers for breast cancer by HPLC, and there was no overlap between these two studies in terms of the potential biomarkers.16 Additionally, Wu et al. have reported that 6 N-glycans were highly in correlation with lung cancer by MALDI-MS analysis,17 while in the study of Rudd et al., 20 N-glycans have been identified as potential biomarkers for lung cancer by HPLC,18 and it should be noted that 3 common N-glycans were identified as significantly changed in lung cancer by both MALDI-MS and HPLC. However, those differences of N-glycans as potential biomarkers for these cancers by both two analytical methods hasn't been investigated in the same sample set. In addition, some common lifestyle parameters such as age, diet, smoking, body fat and plasma lipid status may cause the changes in glycosylation.19–21 And it also should be noted that some other factors including mutations of genes, different levels of cholesterol and insulin may also have effects on the aberrant glycosylation even for normal plasma glycomic profiles.22 Therefore, it is necessary to eliminate these interference factors between different sample sets which may lead to the variances in glycosylation, in prior to the appropriate evaluation of the identified N-glycans by these two analytical methods.
In our study, HCC which is a common malignant disease with five-year relative survival rates less than 15%10,11 was selected to further investigate the internal factors. In order to eliminate the interference caused by different samples, two sets of same samples including HCC cases and healthy controls were derivatized and then analyzed by MALDI-MS and HPLC respectively. The workflow of the analytical process was shown in Fig. 1 and some N-glycans which were highly in correlation with HCC were identified. By statistical analysis, the difference in identification of glycoforms by MALDI-MS and HPLC were evaluated, further revealing the difference of these two analytical methods in biomarker discovery for HCC. Meanwhile, the relevance in biomarker discovery for HCC by MALDI-MS and HPLC was also explored, suggesting the complementary of these two analytical methods in qualitative and quantitative analysis of serum glycomics.
Dried sample was dissolved in 800 μL of solution including 400 μL chloroform and 400 μL NaCl solution, then the extraction solution was mixed and incubated at room temperature for 20 min. Centrifuged the extraction solution for 1 min at 10000 rpm and removed the supernatant. Extra 400 μL of pure water was added. After mixing for 1 min, remove the supernatant again and dried the sample in a vacuum concentrator.
The reaction mixture was purified by MCC cartridges as follows: MCC cartridges were equilibrated by 3.0 mL of 1-butanol/ethanol/H2O (4:1:1, v/v/v). After equilibrium, the derivatives were loaded on MCCs and washed with 3.0 mL of equilibrium solution. Finally, glycans were eluted by 1.0 mL of ethanol/H2O (1:1, v/v), and then dried by concentrator under vacuum.
In order to confirm the chemical compositions of 2-AA derivatized N-glycans, the collections of each peak from HPLC were further analyzed by nanoLC-ESI-MS (AB SCIEX, USA) with C18 as solid phase (75 μm i.d. × 100 mm long, 5 μm; Proteomics Front, China). Solvent A was consisted of 5% ACN solution containing 0.1% FA (v/v), and solvent B was consisted of 95% ACN solution containing 0.1% FA (v/v). The injection volume for each collection was 2 μL. The solutes were eluted at a flow rate of 300 nL min−1 with gradient profile as follows: 95% to 95% A, 0 min; 95% to 90% A, 0 to 2 min; 90% to 70% A, 2 to 10 min; 70% to 40% A, 10 to 15 min; 40% to 5% A, 15 to 18 min; 5% to 5% A, 18 to 23 min; 5% to 95% A, 23 to 25 min; 95% to 95% A, 25 to 40 min. Data acquisition was conducted using an ion source gas of 3 PSI, a curtain gas of 35 PSI, an ion spray voltage of 2.3 kV, an interface heater temperature of 150 °C, and a collision energy of 10 eV for collision-induced dissociation (CID). MS was operated in the positive-ion mode with a mass range of 100–3000 m/z, and MS/MS was acquired in the information dependent acquisition (IDA) mode with a mass range of 20–2000 m/z. All the N-glycan species detected were summarized in Table S2 (ESI†).
It should be noted that the limits of detection (LOD) and quantitation (LOQ) of glycans were measured as the dosages of a standard N-glycan of [2-3-0-1-0] giving a signal-to-noise ratio of 3 and 10 respectively, and the detailed parameters were listed in Table S5 (ESI†), and 10 μL of human serum has been used in our study, which met the requirement of detection and quantitation.
Fig. 3 HPLC analysis of 2-AA derivatized N-glycans derived from human serum of healthy controls (a) and HCC cases (b). |
In order to determine which components made more contribution to the differences between sample sets, K–S tests for the normalized peak of N-glycans were performed. N-glycans with significant difference (p-values < 0.05) between HCC cases and healthy controls were listed in Table 1. By MALDI-MS, 19 N-glycans have been identified between HCC cases and healthy controls, with p-values < 0.05. Meanwhile, 20 specific N-glycans were identified with p-values < 0.05 by HPLC. It is noteworthy that 10 common N-glycans were detected with significant difference whether by MALDI-MS or HPLC. Although the other 9 N-glycans detected by MALDI-MS also showed difference, 3 of which showed no difference in HPLC analysis with p-values > 0.05, and 6 of which cannot be detected by HPLC. In addition, for the other 10 N-glycans detected by HPLC with p-values < 0.05, 8 of which showed no difference between HCC cases and healthy controls by MALDI-MS with p-values > 0.05, and the signal response of the other 2 N-glycans were too weak, which is not suitable for quantitative study.
m/z | Compositiona | Analytical approach | p-Value of K–S test | AUC | Average change in HCCb |
---|---|---|---|---|---|
a The compositions of the N-glycans were abbreviated by [a-b-c-d-e]: a indicates the number of HexNAc, b indicates the number of mannose, c indicates the number of galactose, d indicates the number of fucose and e indicates the number of N-acetylneuraminic acid.b The average changes in HCC were calculated by the difference of average intensity of glycans between HCC cases and healthy controls, in which the positive and negative denote the up-regulation and down-regulation respectively. | |||||
1579.8 | [2-5-0-0-0] | HPLC (with [4-3-0-1-0]) | 3.554 × 10−7 | 0.931 | 0.816 |
1661.7 | [4-3-0-0-0] | HPLC (with [5-3-0-0-0]) | 1.006 × 10−2 | 0.793 | 0.954 |
1783.9 | [2-6-0-0-0] | MALDI-MS | 7.291 × 10−3 | 0.756 | 0.948 |
1835.8 | [4-3-0-1-0] | MALDI-MS | 6.308 × 10−6 | 0.897 | 3.425 |
HPLC (with [2-5-0-0-0]) | 3.554 × 10−7 | 0.931 | 0.816 | ||
1865.9 | [4-3-1-0-0] | MALDI-MS | 1.741 × 10−3 | 0.739 | −0.235 |
HPLC (with [5-3-0-1-0]) | 1.707 × 10−2 | 0.738 | 0.335 | ||
1906.9 | [5-3-0-0-0] | HPLC (with [4-3-0-0-0]) | 1.006 × 10−2 | 0.793 | 0.954 |
2040 | [4-3-1-1-0] | MALDI-MS | 2.496 × 10−3 | 0.847 | 2.858 |
2070 | [4-3-2-0-0] | MALDI-MS | 5.762 × 10−3 | 0.774 | −0.495 |
2081.1 | [5-3-0-1-0] | MALDI-MS | 5.269 × 10−4 | 0.811 | 0.641 |
HPLC (with [4-3-1-0-0]) | 1.707 × 10−2 | 0.738 | 0.335 | ||
2186.1 | [3-4-1-0-1] | MALDI-MS | 1.740 × 10−3 | 0.777 | 0.291 |
HPLC (with [5-3-2-1-0]) | 6.236 × 10−4 | 0.848 | 0.447 | ||
2192.1 | [2-8-0-0-0] | MALDI-MS | 2.301 × 10−5 | 0.857 | 0.732 |
2244.1 | [4-3-2-1-0] | MALDI-MS | 5.268 × 10−4 | 0.798 | 1.156 |
2285.2 | [5-3-1-1-0] | MALDI-MS | 3.103 × 10−6 | 0.922 | 1.179 |
HPLC | 1.025 × 10−5 | 0.884 | 0.288 | ||
2390.2 | [3-5-1-0-1] | MALDI-MS | 4.431 × 10−5 | 0.870 | 0.361 |
HPLC (with [4-3-2-1-1]) | 6.308 × 10−6 | 0.919 | 1.422 | ||
2396.2 | [2-9-0-0-0] | MALDI-MS | 2.596 × 10−2 | 0.737 | 0.638 |
2401.2 | [4-3-1-1-1] | MALDI-MS | 4.532 × 10−3 | 0.723 | −0.385 |
2417.2 | [4-3-2-2-0] | MALDI-MS | 3.103 × 10−6 | 0.868 | −3.042 |
2472.2 | [5-3-1-0-1] | HPLC (with [5-3-2-2-0]) | 1.286 × 10−4 | 0.842 | 0.707 |
2489.3 | [5-3-2-1-0] | MALDI-MS | 2.278 × 10−8 | 0.956 | 0.850 |
HPLC (with [3-4-1-0-1]) | 6.236 × 10−4 | 0.848 | 0.447 | ||
2605.3 | [4-3-2-1-1] | MALDI-MS | 1.338 × 10−3 | 0.811 | 1.281 |
HPLC (with [3-5-1-0-1]) | 6.308 × 10−6 | 0.919 | 1.422 | ||
2663.2 | [5-3-2-2-0] | HPLC (with [5-3-1-0-1]) | 1.286 × 10−4 | 0.842 | 0.707 |
2676.3 | [5-3-2-0-1] | HPLC | 1.566 × 10−2 | 0.619 | 0.019 |
2792.4 | [4-3-2-0-2] | MALDI-MS | 1.781 × 10−2 | 0.777 | −5.734 |
HPLC | 6.308 × 10−6 | 0.750 | −6.368 | ||
2850 | [5-3-2-1-1] | HPLC | 1.741 × 10−4 | 0.807 | −0.950 |
3054 | [5-3-3-1-1] | HPLC | 2.933 × 10−2 | 0.677 | −0.181 |
3241.6 | [5-3-3-0-2] | MALDI-MS | 2.303 × 10−5 | 0.828 | −0.883 |
3211.6 | [5-3-2-1-2] | HPLC | 4.697 × 10−2 | 0.692 | −0.394 |
3602.8 | [5-3-3-0-3] | MALDI-MS | 1.566 × 10−2 | 0.770 | −0.191 |
HPLC | 2.863 × 10−9 | 0.976 | −3.186 | ||
3864.9 | [6-3-4-1-2] | HPLC | 1.006 × 10−2 | 0.793 | −3.784 |
Fig. 4 Principal component analysis (PCA) scores plot for HCC cases and healthy controls analyzed by HPLC. |
Fig. 5 Principal component analysis (PCA) scores plot for HCC cases and healthy controls analyzed by MALDI-MS. |
In our study, 16 N-glycans were identified with AUC over 0.80 by MALDI-MS and HPLC. Among which, 5 common N-glycans were both identified by these two analytical methods.
As listed in Table 1, N-glycans analyzed by MALDI-MS with AUC over 0.80 were as following: [4-3-0-1-0], [4-3-1-1-0], [5-3-0-1-0], [2-8-0-0-0], [5-3-1-1-0], [3-5-1-0-1], [4-3-2-2-0], [5-3-2-1-0], [4-3-2-1-1] and [5-3-3-0-2]. Fig. 6 presented these significantly changed N-glycans, of which 8 were up-regulated, especially for glycans with fucosylated moieties, and that 2 glycans were down-regulated in patients with HCC.
In addition, N-glycans with AUC over 0.80 have been identified by HPLC, including [5-3-3-0-3], [2-5-0-0-0]/[4-3-0-1-0], [4-3-2-1-1]/[3-5-1-0-1], [5-3-1-1-0], [5-3-2-1-1], [5-3-2-1-0]/[3-4-1-0-1] and [5-3-1-0-1]/[5-3-2-2-0]. The changes of these peaks for HCC compared to healthy controls were shown in Fig. 7, of which 5 were up-regulated and 2 were down-regulated. It also should be noted that four of these peaks contained co-elution with two glycan structures. And the comparison of the results analyzed by these two analytical methods will be performed in following process.
Furthermore, each of these N-glycans presented highly significant level (AUC over 0.9) in one analytical method, however showed significant level (0.8 < AUC < 0.9) in another method. For example, [5-3-1-1-0] reflected the optimal diagnostic capability for distinguishing HCC cases from healthy controls with AUC of 0.922 by MALDI-MS, while with AUC of 0.884 by HPLC. The results described above showed that these five N-glycans were all highly correlated with HCC. More interestingly, [5-3-2-1-0] and [5-3-1-1-0] have been reported to be potential biomarkers for HCC by some previous studies,12,13,28 while [4-3-0-1-0], [3-5-1-0-1] and [4-3-2-1-1] haven't been reported, suggesting that these N-glycans may be the potential biomarkers for HCC.
Although these 5 N-glycans presented significant difference between HCC cases and healthy controls by MALDI-MS, they showed low significant levels in HPLC analysis. For example, [4-3-1-1-0] and [5-3-3-0-2] showed no significance with p-value > 0.05. Additionally, [2-8-0-0-0] and [4-3-2-2-0] have not been detected by HPLC analysis. It is noteworthy that [5-3-0-1-0] and [4-3-1-0-0] are involved in the same chromatographic peak with moderately significant difference with AUC of 0.738 between HCC cases and healthy controls. However, [5-3-0-1-0] showed significant difference with AUC of 0.811 in the analysis of MALDI-MS, which is higher than that in HPLC analysis. The lower significant difference of the chromatographic peak containing [5-3-0-1-0] and [4-3-1-0-0] may be caused by different regulation trends of these two glycans, of which [4-3-1-0-0] has been identified with down-regulation in MALDI-MS, while [5-3-0-1-0] presented up-regulation in MALDI-MS, the difference of the chromatographic peak has been reduced by these two glycans, suggesting the complementarity of these two analytical methods in analysis of serum glycomics.
Although these 6 N-glycans showed significant difference by HPLC, they showed lower significant levels in MALDI-MS analysis. For example, there is no significant difference between HCC cases and healthy controls for [5-3-1-0-1]/[5-3-2-2-0], [2-5-0-0-0] and [5-3-2-1-1] by MALDI-MS. Meanwhile, [3-4-1-0-1] and [5-3-3-0-3] presented moderate significance (0.7 < AUC<0.8) between HCC cases and healthy controls by MALDI-MS, which were lower than HPLC analysis.
In addition, it should be noted that [3-4-1-0-1] and [5-3-2-1-0] were co-eluted in the same chromatographic peak with significant difference (AUC of 0.848) by HPLC. However, [3-4-1-0-1] showed moderate significance with AUC of 0.777 and [5-3-2-1-0] presented high significance with AUC of 0.956 by MALDI-MS. The significant difference of the co-eluted chromatographic peak in HPLC may be caused by the mixing of [3-4-1-0-1] and [5-3-2-1-0], suggesting these two analytical methods were complementary in potential biomarker discovery for HCC.
It should be noted that among these 11 N-glycans, almost half (5/11) were co-eluted with other N-glycans. In addition, a previous study has reported that the co-elution of chromatographic peaks in HPLC analysis might limit the quantitation of N-glycans.29 In our study, the separation capability of HPLC is limited for analysing N-glycans with bisecting structures, which may interfere the identification of biomarker, and may be one of the major causes that lead to the differences of glycomics between HPLC and MALDI-MS analysis, and the co-eluted glycans associated with the detection limitation need to be further validated. Interestingly, MALDI-MS analysis in our study present capability of validation the effectiveness of co-eluted N-glycans in HPLC. For example, [5-3-0-1-0] performs significant statistical difference between HCC cases and healthy controls by MALDI-MS, while it showed lower significance by HPLC analysis, which might be interfered by [4-3-1-0-0] in the co-eluting chromatographic peak. In addition, the chromatographic peak containing [2-5-0-0-0] and [4-3-0-1-0] is identified with high significance by HPLC analysis, meanwhile [4-3-0-1-0] showed significant difference by MALDI-MS, while [2-5-0-0-0] presented no significance between HCC cases and healthy controls by MALDI-MS, suggesting the significant difference of the chromatographic peak containing [2-5-0-0-0] and [4-3-0-1-0] may be caused by [4-3-0-1-0], further indicated the complementarity of these two analytical methods in analysis of serum glycomics.
Meanwhile, 2 of these 11 are tri-antennary N-glycans. [5-3-3-0-2] has been identified as significant by MALDI-MS, while [5-3-3-0-3] has been identified by HPLC. Wada etc. had expressed that the levels of some tri-antennary glycans determined by MALDI-MS were also different from chromatographic analysis, but it is not possible to decide which approach is more precise.30 As described above, both these 2 tri-antennary N-glycans have been reported in previous studies,12,13 indicating that the tri-antennary as potential biomarker could be promising whether it identified by MALDI-MS or HPLC, and the complementarity of these two analytical methods in analysis of serum glycomics.
Certainly, besides the causes described above, further studies for other factors which may lead to the differences of glycomics between HPLC and MALDI-MS analysis is indeed required. Interestingly, the deep study for the differences may help expanding the biomarker library for HCC to a certain extent. For example, [4-3-1-1-0] has been only identified by MALDI-MS with AUC over 0.8, which was related to HCC in a previous study.12
Additionally, differences also existed between MALDI-MS and HPLC in terms of these analytical platforms. MALDI-MS presented excellent reproducibility, high throughput and low consumption of samples.30 While, HPLC showed precise intensities and reproducibility in detection of N-glycans.31 Overall, all these results described above indicated that these two analytical methods are complementary for identifying biomarkers of HCC.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8ra02542h |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2018 |