Yoshinari
Suzuki
*a,
Midori
Kondo
a,
Masae
Harimoto
a,
Yusuke
Okamoto
a,
Yu-ki
Tanaka
b,
Yasumitsu
Ogra
b and
Hiroshi
Akiyama
ac
aDivision of Foods, National Institute of Health Sciences, 3-25-26 Tonomachi, Kawasaki-ku, Kawasaki-shi, Kanagawa 210-9501, Japan. E-mail: szk-yoshi@nihs.go.jp
bGraduate School of Pharmaceutical Sciences, Chiba University, 1-8-1 Inohana, Chuo-ku, Chiba-shi, Chiba 260-8675, Japan
cDepartment of Analytical Chemistry, Hoshi University, 2-4-41 Ebara, Shinagawa-ku, Tokyo 142-8501, Japan
First published on 30th November 2023
Single-particle ICP-MS (spICP-MS) is an established method for the determination of inorganic nanoparticle (NP) mass distributions and particle number concentrations. However, spICP-MS is not applicable to some cases, especially cases that require distinguishing signals from dissolved ions and signals from relatively small NPs. To deconvolute spICP-MS data, which is obtained by setting the dwell time similar to the particle event duration time, a Bayesian estimation method was developed for spICP-MS analysis using silver (Ag) and silica (SiO2) NPs. The signal distributions of the spICP-MS data were parameterised using a Bayesian estimation method on the assumption that they could be described by mixed Poisson distributions. Analytical results were then compared to results obtained with conventional criteria. When the instrument parameters were set so that the particle-event duration was within 2 readings and hence did not deviate from the assumptions of the current Bayesian model, better estimation results could be obtained with the Bayesian estimation method than with methods based on conventional criteria, especially for a sample with high particle number concentration. Furthermore, applying the specific informative prior distribution enabled us to obtain reasonable estimation results, even when the signal counts were 6 or less and the background counts were high. Because appropriate NP information was obtained for Ag-NP and SiO2-NP, the Bayesian estimation method can be universally adopted with inorganic NPs detectable by ICP-MS.
An improved data acquisition system could help to mitigate this problem. If particle events are recorded as one-reading events, the use of a shorter DT makes it possible to increase the signal-to-background (S/B) ratio. In 2002, Nomizu et al.6 developed a high-speed, digital, signal-processing system that enabled analyses with a DT of 20 μs, and they subsequently used the system to analyse individual airborne particles. The performance of the ultrafast data acquisition system for analysis of NPs was first reported by Hineman and Stephan.7 They analysed Au-NPs with 60 nm diameters using DTs ranging from 10 μs to 10 ms, and they discussed the effects of varying the DT on particle integration, particle counting, particle sizing, and the background signal. Miyashita et al.8 assembled their own home-built data acquisition unit, which was capable of communicating with a manufactured ICP-MS, and they applied the unit to single-cell analysis at a DT of 50 μs or 100 μs. Several attempts have subsequently been made to improve the time resolution in spICP-MS.4,9,10 A new generation of ICP-MS spectrometers was released around 2011, allowing the minimum DT to be decreased to as low as 10 μs.11 Use of shorter DTs, as well as diluting the particle number concentration, also reduces overlapping of particle events and thus leads to valid estimation of particle number concentrations. However, using shorter DTs increases the possibility that a particle event does not completely coincide with a DT and thus broadens the particle signal distribution and lowers signal intensities.11,12
Applying advanced statistical methods could be another solution. Researchers typically distinguish NP signals from background signals by using three times the standard deviation (σ) of the background signals as a criterion,2 although different threshold values, such as the mean plus 5σ, have been applied in some cases.13,14 Bi et al.15 applied a K-means clustering algorithm to signal processing of spICP-MS raw data. Tuoriniemi et al.16 introduced an algorithm based on outlier detection for data with time resolution of 100 ms. An approach reported by Donard et al.17,18 was based on the detection of peak maxima. Bolea-Fernandez et al.5 used a Gaussian model followed by a deconvolution approach to characterize SiO2-NPs in the presence of a background signal. Gundlach-Graham et al.19 used a Monte-Carlo simulation to obtain a new detection criterion after fitting to a compound Poisson distribution. Hadioui et al.20 applied repeated signal smoothing (3–4 iterations) and then detected NPs if the associated signals were more than 3σ above the mean. Mozhayeva and Engelhard21 applied a Poisson distribution and set the criterion for detection based on a false positive probability of 5%.
Even if an optimal criterion has been determined and a distinction between particle event and background has been made, there are still other difficulties. It is necessary to confirm the particle-event shapes for all NPs. If a bimodal particle event is detected, it must be divided, and some particle event may be excluded based on their shapes. spICP-MS measurements often yield more than 1000 particle events, and confirming the identification of that many particle events can be a time-consuming task that requires perseverance. Although automated data processing software for spICP-MS has been developed by major ICP-MS suppliers,22–25 there is still room for improvement and the development of algorithms for routine analysis that enable processing of multiple samples.
We hypothesized that describing a data generation process as a statistical model would be a useful approach that would not use criteria and would not require taking the sum of intensities for particle-event. A statistical model is a probability distribution constructed to enable inferences to be drawn or decisions to be made based on data.26 This concept is quite similar to the approach reported by Cornelis and Hassellöv.27 They used a method of maximum likelihood estimation (MLE: actually, they used an algorithm that minimized negative likelihood) to deconvolute the background signals from the particle signals under the assumption of a Pólya-Gaussian distribution.27 (n.b., a Pólya distribution is a special case of a negative binomial distribution.) Their approach was to analyse only the probability distribution of the obtained signal, but it is better to assume a state-space model for the signal obtained by spICP-MS. In other words, when the particle event is observed at the ith reading, it is more natural to consider the relationship of that signal with the signals at the (i − 1)th and/or (i + 1)th readings. Furthermore, there is a drawback to using the MLE method. Because the recorded signals (counts) are positive integer numbers, the data should be described with a discrete distribution such as a Poisson distribution.21 The spICP-MS data are considered to be a mixture of a Poisson distribution associated with the background counts and another Poisson distribution due to the counts associated with the NPs. That is, the signal is considered to follow a mixed Poisson distribution. The generalization error is known to be lower for Bayesian inference versus MLE in singular models such as a mixture of distributions.28
In this study, we investigated a method to extract information about NPs from spICP-MS data by describing the data generation process with Bayesian estimation (BE) based on a mixed Poisson distribution. First, a numerical simulation was performed to evaluate the validity of a Bayesian model. Then, the proposed BE method was used to interpret the actual spICP-MS results, and the performance of the BE method was compared with that of a method based on conventional criteria (CC).
Pomix(Y|α, λbkg, λbkg + λNP) = (1 − α)Po(Y|λbkg) + αPo(Y|λbkg + λNP), | (1) |
Eqn (1) assumes that all signals associated with NPs are detected within one reading, but normally NP signals are observed over multiple readings. Thus, extension to a mixture of q Poisson distributions leads to a generalized mixed Poisson distribution given by
It was reported that a typical NP pulse lasts for 0.3–0.5 ms.29 A previous study has reported that the duration of the ion signal for one 60 nm Ag-NP is approximately 0.5 ms.30 Thus, when the DT is set to 0.5 ms and a 60 nm Ag-NP particle is measured, if the signal Yi associated with the ith reading is the peak of the particle event, then the signal is split and is expected to be observed at the (i − 1)th or (i + 1)th reading. However, when Yi is the background signal and Yi−1 and/or Yi+1 are the peaks of the particle event(s), the split signal(s) may overlap the background Yi. The probabilities of whether the split signal from Yi overlaps at Yi−1 or Yi+1 are considered to be equal. If δ (0.5 ≤ δ ≤ 1) is the dimensionless split coefficient of the particle-event intensity, then the split patterns and their PMF can be summarized as shown in Fig. 1. In the pattern 7, although the expected signal is λbkg + δλNP, we estimate λNP, which is equivalent to particle-event intensity. In the pattern 6, the tails of the particle-event overlap. In the pattern 8 and 9, the particle events are merged, and it may be difficult to divide them into individual particle-event. Although these split patterns do not satisfy the condition unless the probability of NP's being observed three times in a row is realistically taken into consideration, the probability of this event's occurring (α3; if α = 0.01, the probability of this event's occurring is 10−6) is very low. That case was therefore intentionally excluded to simplify the model. Briefly, this model is a method to calculate the expected particle-event intensity without calculating the intensity of each particle event.
Fig. 1 Signal split patterns and probability mass function based on mixed Poisson distribution for spICP-MS. Symbols are explained in the list of symbols and throughout the text. |
(3) |
fpost(θ|Y) ∝ L(Y|θ)fpri(θ). | (5) |
Dissolved ion standards for Ag (1000 mg L−1, for atomic absorption spectrometry analysis, Kanto Chemical, Tokyo, Japan) and Si (1000 mg L−1, for ICP analysis, Fujifilm-Wako Chemical, Tokyo, Japan) were used for external calibration.
Dwell time was set to 0.5 ms, which is similar to the particle-event duration, and time resolved analysis was performed for 60 s. In this study, the particle-event duration is defined as the product of the dwell time and the number of consecutive readings of the signal above the baseline. Ag (m/z = 107) was measured under a no-gas mode, and Si (m/z = 28) was measured under an H2 reaction mode (flow rate was 5.5 mL min−1). To reduce background signal of Si, a PLUS torch (Thermo Fisher Scientific) and a sapphire injector (Thermo Fisher Scientific) were used for SiO2-NP analysis. Other main instrumental parameters of the ICP-MS are presented in Table S2.† The probe was rinsed with 2% HNO3 between sample measurements.
Bayesian estimation method Fig. 2 shows a schematic overview of the proposed BE method and data conversion to NP information. Because the signals were saved as counts per second, the data were converted to integers (counts) using the DT.
To obtain posterior distribution(s), the probabilistic programming language Stan, developed by Gelman et al.,32 was used in this study. A key feature of Stan is that it uses NUTS (No-U-Turn Sampler), which is an implementation of Hamiltonian Monte Carlo, one of the algorithms of the Markov chain Monte Carlo (MCMC) method. Various Bayesian modelling sources using Stan have been published, and documentation is abundantly available online (see S2†).33
Four parallel Markov chains were calculated using MCMC to check convergence. The number of iterations, warm-up, and thinning were set to 1 × 103, 0.5 × 103, and 1, respectively. After we confirmed convergence of the parameters (see S3 and Fig. S3† for more details), we used an expected a posteriori (EAP), which is the mean of the posterior distribution, as a point estimator (see S4† for more details).
Statistical analysis was performed with R (ver. 3.4.0) via RStudio (ver. 1.3.1093). The R package rstan (ver. 2.16.2) or cmdstanr (ver. 0.4.0) was used for the BE method. To eliminate agglomerated NP signals, the Bayesian analysis was performed after selecting a duration that produced stable transient signals in some cases.
Conversion to NP information As shown in Fig. 2, NP information was calculated from a posterior distribution. As described by Pace et al.,2 a mass flux curve could be obtained from measurements of dissolved ion standards and the NP standard. Thus, the mass of each NP, mNP (fg per particle), was calculated with the following equation:
(6) |
The dimensionless transport efficiency η was calculated from the measurement results for 25 pg mL−1 Ag-NPs with 60 nm diameters. That efficiency was also applied to other Ag-NPs with different diameters. The number-based (ηN) and mass-based (ηM) transport efficiencies were calculated, and the mean of ηN and ηM was used as η. Let *CPN (particles per mL) be the prepared particle number concentration of Ag-NPs with 60 nm diameters. The dimensionless parameter ηN was then calculated from the ratio of the number of detected NPs to the number concentration of standard NPs as follows:
(7) |
Let *mNP (fg per particle) be the theoretical mass per particle of Ag-NPs with diameters of 60 nm. The dimensionless parameter ηM was then calculated with following equation:
(8) |
The transportation efficiency was calculated on a daily basis and was about 10% through the analysis.
The NP was assumed to be spherical, and its diameter ϕNP (nm) was calculated from mNP as follows:
(9) |
(10) |
(11) |
Fig. 3C shows a histogram of the particle-event intensity, and Fig. 3D shows a histogram of all the signals in Fig. 3A. Unlike the approach of subtracting background,5,27 we used the probability approach. Using the probabilistic model, it is possible to calculate the PMF that gives the signal k as the background [PMFbkg(k)] and the PMF that gives the signal k of the NPs [PMFNP(k)]. That is, the probability that Y is a NP when the value of Y is k [PNP(k)] can be calculated with the following equation:
(12) |
The red curve in Fig. 3D shows the estimated PNP(k). Let HF(k) be the histogram frequency of the signal k. The histogram frequency of the NP-derived signal can then be calculated as follows:
HFNP(k) = HF(k) × PNP(k). | (13) |
Fig. 3E shows the histogram of the particle event intensity restored by using the BE method. The histogram was calculated by substituting the EAP of the posterior distribution into eqn (13). The similarity of the histograms in Fig. 3C and E indicates that estimation with the BE method is appropriate (see S2† for more detailed information about the simulation study).
When this simulation data was estimated using the CC method, the average value of the particle-event intensity was 104 ± 11 counts and the particle-event frequency was 0.91%. For the data generated according to the assumed model, the CC method and the BE method gave similar estimation results. Another example for small signal (λNP =10 and λbkg = 1) was shown in Fig. S4.† We could obtain reliable estimates for small signals (S6†).
Application to Ag-NPs and comparison with conventional criteria method
Fig. 4A and B shows the results of a time-resolved analysis of Ag-NPs with 60 nm diameters. Fig. 4C and D shows the results of estimation with the CC method, and Fig. 4E–G shows the results of estimation with the BE method. 789 particle events were obtained for 1-minute measurement within particle-event intensity of 90–500 counts, and average particle-event intensity was 199 counts. The median particle-event duration was 1.0 ms (Fig. S5A†). Using F of 0.31 mL min−1 and Rion of 1.07 × 105 cps ng−1 mL, the CC was calculated to be 12.2% (CCN = 12.1%, CCM = 12.3%). When this data was calculated using the BE method, was estimated to (7.04 ± 0.24) × 10−3 and NP was estimated to 209 ± 1, consequently the BE was estimated to be 12.2% (BEN = 12.7%, BEM = 11.8%). The both mean CCNP of 59.9 nm and mean BENP of 60.9 nm showed good agreement with the nominal particle diameter of 60 nm. The 95%CrI of BENP (60.7–61.0 nm)‡ was within the uncertainty of the calibrated diameter (60 ± 8 nm).
The both ĈCCPN of 2.09 × 104 particles per mL and the ĈBEPN of 2.19 × 104 particles per mL showed good agreements with the prepared concentration of 2.11 × 104 particles per mL. Moreover, the prepared concentration was within the 95% CrI of the ĈBEPN (2.04 × 104–2.34 × 104 particles per mL.
For the analytical results for Ag-NPs with diameters of 20–60 nm, both methods showed good agreement with the nominal particle size (Fig. S6†). When ideal spICP-MS data are obtained, the BE method and the CC method gave similar estimation results.
Fig. 5A and B show the results of a time-resolved analysis of SiO2-NPs with 500 nm diameters. Fig. 5C and D show the results of estimation with the CC method, and Fig. 5E–G show the results of estimation using the BE method. The median particle-event duration was 1.0 ms (Fig. S5B†). The both mean CCNP (496 nm) and BENP (494 nm) show good agreements with the nominal diameter of 492 nm. Moreover, the both ĈCCPN (1.96 × 104 particles per mL) and ĈBEPN (1.95 × 104 particles per mL) showed similar values to the prepared concentration of 1.96 × 104 particles per mL. Similar results were obtained for 1000 nm SiO2-NP with median particle-event duration of 1.0 ms (see S7, S5C, and Fig. S7† for more detail).
Condition | Estimation method | Concentration (×104 particle per mL) | Ion concentration (ng L−1) | Estimated diameter (nm) |
---|---|---|---|---|
a Values indicate mean ± combined uncertainly. CC: conventional criteria method, BE: Bayesian estimation method, WIP: weak informative prior distribution, SIP: specific informative prior distribution. | ||||
Ag-NP | CC | 2.3 ± 0.2 | — | 21.3 ± 0.5 |
BE (WIP) | 2.3 ± 0.2 | — | 21.4 ± 0.5 | |
Ag-NP + Ag ion (100 ng L−1) | CC | 0.92 ± 0.13 | 107 ± 3 | 23.8 ± 0.2 |
BE (WIP) | 0.82 ± 0.05 | 104 ± 1 | 33.3 ± 2.4 | |
BE (SIP) | 2.6 ± 0.2 | 105 ± 1 | 22.5 ± 1.2 |
To improve the estimation results, we applied the specific informative prior (SIP) distribution of α ∼ N(0.005, 0.001) and λNP ∼ N(22, 0.4) based on the BE result for Ag-NPs with 20 nm diameters without dissolved Ag ions. The designation X ∼ N(μ, σ) indicates that the variable X follows a normal distribution with a mean of μ and standard deviation of σ. We could consequently obtain similar results [ĈBEPN: (2.6 ± 0.2) × 104 particle per mL,BENP: 22.5 ± 1.2 nm] from data with high backgrounds and data obtained without dissolved Ag ions (Table 1).
To evaluate the estimation performance under the various S/B ratios, we executed a numerical simulation with the following parameters; α = 0.01, λblk = (0.3, 1, 3, 10, 30), and λNP = 30. Prior distributions used in this simulation were listed in Table S3 (see S8† for more detail). For the CC method, the 95% range of Po(λ = 30) (20–41) was set as the criterion for histogram.
Regarding the particle-event frequency, remarkable differences were observed between the BE and the CC method (Fig. S9A†). Although the true value was distributed within the uncertainty range using the BE method, when the S/B ratio was 3 or less, the BE method provided slightly large particle-event frequencies. On the other hand, the CC method tends to underestimate the particle-event frequencies. In the situation when the S/B ratio is 1, the error in the particle-event frequency estimated by the BE method was +3.5%, while the error in the particle-event frequency estimated by the CC method was −37%.
Regarding the particle-event intensity, both the BE and the CC methods provided estimates relatively close to the true value (Fig. S9B†). Compared to the BE method, the CC method tended to estimate a larger particle-event intensity, but even when the S/B ratio was 1, the error was only +5.2%.
In this simulation, we used the SIP for δR. δR is a parameter that does not directly relate to NP information (see S8† for more detail). By setting an almost constant-like prior distribution close to the expected value of δR (0.5), reasonable estimation results were obtained in situations where the S/B ratio is more than 3. However, when the S/B ratio is 1, it is necessary to set the SIP distribution for parameters other than δR.
By using the specific informative prior distribution, we estimated the NP to be 3.7 ± 0.2 counts (Fig. 6). Although the nominal diameter (10 nm) was not distributed within the 95%CrI of BENP (11.5–12.3 nm), the analytical results of BENP (12.2 ± 0.4 nm) for triplicate measurements were closer to the nominal diameter compared to the CCNP (13.7 ± 0.3 nm) (Table 2). Whatever the case, both methods provided estimates within the uncertainty range (10 ± 4 nm). The prepared CPN (1.8 × 104) was distributed within the 95% CrI of ĈBEPN [(1.8–2.1) × 104 particle per mL]. The ĈCCPN (0.81 × 104 particle per mL) was less than the half of the prepared CPN. The analytical results of ĈBEPN [(1.9 ± 0.1) × 104 particle per mL] for triplicate measurements were closer to the prepared CPN compared to the ĈCCPN [(0.84 ± 0.05) × 104 particle per mL] (Table 2). These results indicate that the BE method can provide a more valid estimate for low-count NP-delivered signals compared to the CC method.
Estimation method | Concentration (×104 particle per mL) | Estimated diameter (nm) |
---|---|---|
a Values indicate mean ± combined uncertainly. CC: conventional criteria method, BE: Bayesian estimation method. | ||
CC | 0.84 ± 0.05 | 13.7 ± 0.3 |
BE | 1.9 ± 0.1 | 12.2 ± 0.4 |
Using the CC method, although the estimated particle sizes were within the uncertainty of the nominal diameter, the CCNP tend to decrease as the prepared CPN increase. Although the ĈCCPN distributed within the uncertainly of the prepared CPN derived from the uncertainly of diameter by the manufacture, except for the prepared CPN of 2.5 × 104 particle per mL, the ĈCCPN tend to decrease as the prepared CPN increase. Moreover, the influence of the prepared CPN on the ĈCCPN was more pronounced than that on CCPN. Contrary to the CC method, using the BE method, the BENP and the ĈBEPN are estimated appropriately regardless of the high concentrations of the prepared CPN.
Since TE is critical factor to calculate NP parameters, we evaluated the estimation results of TE with increasing the prepared CPN. The difference between BEN and CCN tended to increase with increasing the prepared CPN. In contrast, BEM and CCM showed almost constant values with increasing number concentration. In o ther words, the difference between CCN and CCM increased with increasing the prepared CPN in the CC method, but no such difference occurred in the BE method.
Although the use of CRC gas was reported to broaden the particle-event duration,35 which may cause the measured data to deviate from the assumed model, the particle-event duration of SiO2-NP with 500 nm diameter (median: 1.0 ms) showed equivalent value to that of Ag-NP with 60 nm diameter (median: 1.0 ms). However, there is a possibility that the broadening of particle-event duration due to the use of H2 gas was not observed under the current conditions. Since the obtained particle-event durations were within 2 readings (1.0 ms), our data did not deviate significantly from the model assumed in this study.
From these results, it was shown that particles up to about 1000 nm can be analysed by the BE method without deviating greatly from the assumptions of the model. The influence of the use of H2 gas could be ignored under the conditions of this study. Although a potential problem in analysis of the large particles with high-boiling points is incomplete vaporization, the influence of incomplete vaporization was not observed in this study. If the ionization efficiency of the particles is different from that of non-particulate chemicals due to the high boiling point and large particle size, it is expected that the BE and the CC methods will not be able to estimate the particle size equally well.
As a low-count signal, using the BE method with setting SIP distributions provided better estimation results for 10 nm Ag-NP with particle number concentration of 1.8 × 104 particles per mL compared to the CC method. The particle-event intensity estimated using the BE method at this time was 3.7 ± 0.4 counts. This result indicates that by setting SIP, the BE method could provide a smaller estimate of particle-event intensities than the CC method.
Furthermore, application of the BE method to 500 nm SiO2-NPs with particle number concentration of 0.65–6.5 × 104 particles per mL resulted in higher linearity than those obtained using the CC method even when NIP distribution was set. This result indicates that the BE method may be more robust than the CC method against increases in particle number concentration.
From the above results, it was considered that the BE method is a versatile calculation method that can be used for various spICP-MS data. Data with high background or various PNC can be dealt with by using short-DT and/or diluting the sample. However, if the sample is no longer available, the BE method can be used to re-analyse past measurement data. Although the BE method is useful in limited situations, it is a valuable alternative tool for analysing spICP-MS data.
Sources of error in the determination of the number-based transport efficiency
In this section, we discuss the sources of error in the determination of transport efficiency. It has been reported that the error in transport efficiency estimation theoretically affects the accuracy of CPN greater than particle size.20,36 While the CCN tends to decrease as the prepared CPN increased, the CCM showed almost constant values. Using the BE method, both the BEN and the BEM showed almost same values and did not change with increase in the prepared CPN. It is considered that such a difference occurred for the following reasons. The CC method underestimates the number of particle events when the particle events are completely coalescent and cannot be divided into individual particle events. On the other hand, the BE method uses a model in which two particle events are stochastically merged, thus the , which is the particle-event frequency, tends to be larger than the actual number of particle events.
Contrary, the M was not affected by the number concentration even with the CC method. Theoretically, if the number of particle events decreases by x%, the mean particle-event intensity increases by a factor of 1/(1 − x/100). However, the obtained results were not the case. The reason for this may be to eliminate outliers from the distribution in the process of selecting particle events to be used for the converting process to NP information. From the histogram of the signal intensities derived from SiO2-NP (Fig. 5), the 95% range was 76–141 counts. Since the expected particle-event intensity of the coalescent particle-event was 219 counts, the coalescent particle-event was removed as outlier. In the CC method, in addition to the particle-event detection criteria, the influence of the criteria for histogram cannot be ignored for the case of high CPN as well as high backgrounds.
The smaller difference between ηN and ηM is probably the reason why the accuracy of NP information is improved by use of the BE method. An approach that minimizes the difference between ηM and ηN may be more useful than an approach that calculates log-likelihood. In addition, in terms of reducing the discrepancy between N and M, using an introduction system that achieves 100% transport efficiency, such as a total-consumption nebulizer37 or droplet nebulizer,38 is useful method to solve the problem of TE calculation. Relative estimation errors in transport efficiency may be suppressed because conventional nebulizers are more sensitive to a 1% difference in transport efficiency compared to high-efficiency nebulizers.
Because there is no information about α and λNP for unknown samples, it is difficult to identify a logical, specific informative prior distribution. In the data with many NP signals associated with smaller particle sizes, an appropriate estimate might be obtained with use of a prior distribution with a large α and small λNP that reflected the analytical result with the CC method. In any case, it will be necessary to go through a process of trial and error using models with different prior distributions. It is then appropriate to select an optimum model based on objective indicators such as the widely applicable information criterion.28
On the other hand, in numerical simulations of various S/B ratios, there were cases in which reasonable estimation results were obtained simply by setting the SIP to δR. The random number generation model assumes that δR is a variable that follows a uniform distribution between 0 and 1. In this case, the SIP is intended to be the expected value of 0.5. Since δR is not directly involved in the information of NP, there is a possibility that it can be applied to unknown samples.
Because the selection of a prior distribution in Bayesian estimation depends on the analyst, Bayesian estimation has been criticized on the grounds that it is not objective. However, because the criteria that are used depend on the researcher (for example, whether the requirement for NPs is mean plus 3σ or 5σ), the CC method is also subjective. Prior distributions can be used as tools to allow for data-informed shrinkage, to enact regularization, or to nudge algorithms towards a likely high-density region and thereby improve estimation efficiency.39,40 These considerations suggest that it is possible to select the appropriate operation using the BE method complemented with the CC method or other published methods.
The case of NPs with multiple diameters is very different from the current model. It would be ideal if we could obtain a bimodal posterior distribution as an estimate of the result for NPs with multiple diameters. However, a numerical simulation showed that the posterior distribution was close to the average value of the counts of the two NPs (data not shown). It is therefore necessary to extend the model. Another scenario that is not assumed in the current study is a particle event that spans three or more readings. As a compromise between signal-to-noise ratio and possibilities for transient time measurement, a value of 50–100 μs is often preferred.29 The conventional goal of spICP-MS analysis is to measure with a shorter DT (∼100 μs) and detect the particle event at multiple readings. Bayesian algorithms that can detect particle events that span more readings will be needed to achieve that goal.
In this study, a Poisson distribution with one parameter was used to facilitate making estimates. Because the variance of the Poisson distribution is equivalent to the expectation value, it is likely to be unreliable for under- and over-dispersed data. For such data, consideration should be given to models that use other probability density functions, such as a hypergeometric distribution, negative binomial distribution, or double Poisson distribution. In such distributions, more than 2 parameters take on non-negative integer values.
Many studies that have used spICP-MS to obtain information about NPs have treated the distribution of the NP signal as if it represented the distribution of the NP mass. Laborda et al.41 have reported that the particle size distributions analysed using spICP-MS are broadened relative to distributions analysed with other techniques. In general, spICP-MS analyses evaluate NP information without classifying the variance of the ICP-MS signals or the variance of the NP masses. We consider that this characteristic of spICP-MS analyses is the main cause of the broadened size distribution. In the Bayesian framework, it may be possible to separately analyse the variation of the ICP-MS signal itself and the variation of the signal caused by NPs by extension to a hierarchical Bayesian model that considers random effects. In general, hierarchical Bayesian models take longer to calculate than non-hierarchical Bayesian models. It is therefore presently difficult to analyse large datasets such as spICP-MS data in a realistic amount of time. We expect that it will be possible to implement more realistic algorithms by improving the analysis environment as well as improving the system of detection by ICP-MS.
BE | Bayesian estimation |
bkg | Background |
CC | Conventional criteria (method) |
C PN | Particle number concentration |
CrI | Credible interval |
DT | Dwell time |
IBE | Indirect Bayesian estimation |
L | Likelihood function |
MCMC | Markov chain Monte Carlo |
MLE | Maximum likelihood estimation |
N a | Number of readings per measurement |
N r | Number of readings per minute |
NP | Nanoparticle |
N(μ,σ) | Normal distribution with μ and σ |
PMF | probability mass function |
Po(Y|λ) | Variable Y follows Poisson distribution with parameter λ |
R ion | Sensitivity of the ionic standard solution (cps ng−1 mL) |
α | Frequency parameter of particle event |
δ | Signal split coefficient for particle volume |
δ R | Signal split coefficient for particle diameter |
η | Transport efficiency |
η M | Mass-based transport efficiency |
η N | Number-based transport efficiency |
λ bkg | Parameter for background signal (count) |
λ NP | Parameter for particle-event intensity (count) |
μ | Mean |
ρ NP | Density of the NP (g cm−3) |
ϕ NP | NP diameter |
σ | Standard deviation |
ψ NP | Mass fraction of analytical elements in the NP Hat notation indicates an estimate. Upper-right letters in variable indicate estimation methods. Lower-right letters in variable indicate simple description for variables |
Footnotes |
† Electronic supplementary information (ESI) available: The detail description of Stan codes and other examples were shown. See DOI: https://doi.org/10.1039/d3ja00220a |
‡ Although Fig. 4G, 5G, and 6G show histograms of diameters, it does not show the particle size distribution, which has frequently been shown in other papers discussing spICP-MS. The mean value of the sample particle size and its uncertainty are expressed as a posterior distribution. |
This journal is © The Royal Society of Chemistry 2024 |