Chetan R. Chilkunda,
John R. Kitchin and
Robert D. Tilton
*
Department of Chemical Engineering, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA 15213, USA. E-mail: tilton@andrew.cmu.edu
First published on 11th April 2025
The commercial formulation development for multicomponent complex fluids is time-intensive and data-intensive. There is a need for tools to expedite this process. This work develops an experimental and analytical high-throughput methodology to quantify binary surfactant mixture micellization in a 96-well plate. The novelty of this work lies in (1) employing model-driven design of experiments for efficient experimentation and (2) using physics-informed classification to quantify the mixture critical micelle concentration. This work employs a novel classification-based approach to map the binary critical micelle concentration as a function of the surfactant mixture ratio. Regular solution theory is used as the physics basis for modeling the binary interactions between surfactants (quantifying the β interaction parameter). Other, more complex surfactant interaction models exist; however, this simple model is used to demonstrate the efficacy of this methodology as a high-throughput screening tool for binary surfactant mixtures. Using regular solution theory as a guide to map the critical micelle concentration against the surfactant ratio, the SDS-C8E4 surfactant system was determined to have a β = −3.6 ± 0.5, a 14.9% difference from the literature reference of β = −3.1. We demonstrate the utility of the method on the SDS-C8E4 system in 0.5 M NaCl which was determined to have a β = −3.1 ± 0.4, which is a 17.5% difference from a similar literature system of SDS-C12E8 in 0.5 M NaCl with β = −2.6. These two systems support the efficacy and generalizability of this high-throughput methodology to any binary surfactant mixture and future work involves extending this methodology to ternary surfactant mixtures.
Current formulation development includes testing a large number of candidate surfactant mixtures, a process that is time-intensive and data-intensive. Surfactant mixtures tend to be non-ideal, and so quantifying surfactant mixture CMCs can require many experiments to quantify the non-ideal behavior. Efficiently quantifying this behavior requires systematic and deliberate planning and experimentation. This study details the development of a high-throughput methodology based on 96-well plates that efficiently quantifies the CMC for a binary mixture of surfactants by incorporating a simple model for the CMC of a binary surfactant solution into a classification framework to determine the best decision boundary that separates points labeled below the CMC from points labeled above the CMC.
The CMC is a boundary that separates mixtures that form micelles from those that do not. In a classification model, one seeks to find a boundary that separates two classes of observations. In a conventional approach, one finds values of a CMC at various composition ratios and then fits a model using nonlinear regression through those values to obtain the CMC function. In our work we will use a machine learning classifier to fit that model directly using the boundary to most accurately separate mixtures that form micelles from those that do not. In that context, classification is a natural approach to finding the composition dependent CMC of a surfactant mixture.
There are several binary surfactant micellization models to choose from, each with its own degree of complexity. Beyond the one-constant Margules activity coefficient model as in regular solution theory, there exist more complex activity coefficient models as well as models that quantify surfactant packing in micelle formation.2,3 In this work, regular solution theory is used for its simplicity and common usage.4–7 Regular solution theory is a thermodynamic model that characterizes the interactions and the potential for synergism in a binary surfactant mixture as a function of the pure surfactant CMCs, an empirical interaction parameter, and the molar ratio of each surfactant in the binary mixture:8
![]() | (1) |
![]() | (2) |
In eqn (1) and (2), for a given binary mixture of surfactants (marked as 1 and 2), x1 and x2 where x1 + x2 = 1, represent total surfactant molar fractions, xM1 and xM2 represent the molar fractions of the two surfactants in the mixed micelle, CMC1 and CMC2 represent the pure critical micelle concentrations of the two surfactants, CMCmixed represents the mixed critical micelle concentration, and β represents an empirical interaction parameter that quantifies the deviation of the mixture from ideal mixing. β values generally fall within the range of 0 (corresponding to ideal mixing) to approximately −20 (corresponding to strong attractive interactions between the surfactant species as occurring in mixtures of anionic and cationic surfactants).9 It is also possible to have positive values of β with fluorocarbon based surfactants.10,11 At these large, negative β values, the predicted mixture CMC curve resembles the x−1 function with endpoints at the pure surfactant CMCs. This model predicts that the surfactant interactions tend towards equal molar fractions inside the micelle (xM1 ≈ xM2 = 0.5) over a wide range of overall surfactant ratios, and that less total surfactant may be required to achieve micellization than would be the case for either pure surfactant, the condition referred to as micellization synergism.8 In this study, regular solution theory serves as the basis for the model-driven DoE (Design of Experiments) and as the governing physics-imposed guide in the analysis methods.
There are several ways to experimentally determine the CMC including capillary electrophoresis,12,13 Taylor dispersion analysis,14 bipolar pulse conductance, spectroscopic methods,15 surface tension16 and even analysis of “coffee rings” formed from surfactant mixtures.17 More commonly, measuring dye solubilization via fluorescence or absorbance methods as a function of total surfactant concentration for a fixed ratio of the two surfactants in solution15,16,18,19 is used. Previous research on the high-throughput determination of the CMC leveraged a 96-well plate20 to assess the fluorescence of detergents. An alternative approach used absolute aggregation-caused quenching probes in a 96-well platform.21 In this study, the fluorescence-based dye solubilization approach is used based on its suitability for high-throughput implementation in a 96-well plate format.
This work uses Nile Red for the dye. This dye produces low fluorescence intensity when it is in a polar environment (i.e. in aqueous solution) and a relatively high fluorescence intensity when it is in a local non-polar environment (i.e. inside the non-polar micelle core).22,23 Preferential dye partitioning into the non-polar cores of micelles allows for dye solubilization experiments to measure fluorescence intensity as a function of total surfactant concentration. Fluorescence intensity serves as a proxy measurement for micelle formation. Typically there is a constant or weakly varying intensity for surfactant concentrations below the CMC, with an abrupt transition at the CMC.24 At concentrations above the CMC, the dye fluorescence intensity typically increases approximately linearly with increasing surfactant concentration, with a slope that depends on the solubilizing power of the mixed micelles.25–27
In measuring fluorescence intensity over the 96-well plate, a SYNERGY H1 Biotek microplate spectrofluorometer was used to (1) measure the excitation and emission spectra of the highest concentration samples and internal standard and record the wavelengths at peak absorption and (2) perform a fluorescence intensity endpoint scan with a manual gain setting of 100 and bottom-plate optical readings using the mean excitation maximum wavelength and mean emission maximum wavelength as the lower and upper bounds. The output endpoint fluorescence intensity scan was measured 5–7 times to account for instrument variability and averaged to get the expected fluorescence intensity for each well.
We aim to determine the CMC boundary across all mixture compositions for the SDS-C8E4 blend using regular solution theory as the reference physics-based model. For the most general scenario of binary surfactant mixtures, the experimental space is quite large, with β values in the range of [−20, 0]. To expedite the procedure one would take advantage of well-established trends. Here, the trends indicate that mixtures of anionic surfactants and nonionic surfactants with oligo(ethylene oxide) headgroups exhibit small, negative β values. Thus, we restrict the experimental sampling space to β ∈ [−6, 0].29 This restriction gives targeted information about where the boundary lies.
Although the literature indicates that one should expect β = −3.1 for the SDS-C8E4 system, the value of β would not generally be known a priori. Only a rough idea of the β value is needed to expedite the experiment design. To sample across the entire space, the top-most ideal mixing line (β = 0.0) is broken into six equivalent line segments as shown in Fig. 1. For surfactants with unknown β values, one might use the head group character (e.g. anionic, zwitterionic, ionic, etc.) to narrow down the search space, or rely on an iterative approach to search broadly and narrow down in subsequent experiments.
The five black lines in Fig. 1 represent five mixed surfactant blends, where each solution has a constant ratio of C8E4 concentration to SDS concentration. Experimental trial samples are prepared along each of the five lines at their intersections with the predicted regular solution theory curves with additional trials both below and above the predicted curves.
In Fig. 1, each constant composition line has a trial at each of the seven different β interaction values and an additional two trials below the lowest curve and one trial above the ideal mixing line. The two trials below the lowest curve map to 10% and 20% of the trial at the ideal mixing line and are necessary to characterize fluorescence in the system at very low surfactant concentrations that are highly likely to fall below the CMC. This information aids in determining the empirical fluorescence intensity threshold for classification. The trial above the ideal mixing line signifies a point with expected maximum fluorescence, marking an upper bound on fluorescence intensity for each respective mixture blend.
Altogether, each line corresponds to ten unique experimental trials-ten different aliquots from a premade mixed stock solution blend. Given the total 50 unique trials needed to map out the experimental space and the applicability of using a plate reading fluorescence spectrofluorometer, a 96-well plate facilitates a simple, high-throughput setup for the experiment. The 96-well plate experimental design is as follows: except for the trials above the zero interaction line, every trial is replicated twice. This amounts to 19 trials per line over five lines for a total of 95 trials. The trials above the zero interaction line are chosen to not have a second replicate because (1) the experiment needs to fit inside a single 96-well plate and (2) these trials are far from the expected CMC boundary so they do not directly influence the boundary determination. With 95 of the 96 wells occupied, the last remaining well was used to hold an internal fluorescence standard (of high fluorescence intensity) to normalize all spectrofluorometer readings between 0 and 1.
The pure CMCs of both SDS and C8E4 must first be accurately determined. These pure CMCs, marked by the starred endpoints in Fig. 1, mark the bounds of the experimental sampling space as the mixture CMC will occur at a lower total surfactant concentration. This two-stage experimental procedure of (1) determining the pure CMCs and (2) the physics-based sampling and experimentation across mixture compositions allows for the efficient quantification of the mixture's CMC across all compositions. After pure CMC determination of both SDS and C8E4, the pipeline for the experimentation is as follows: (1) use model-driven design of experiments to determine the specified 96-well plate mixed surfactant trials, (2) prepare 96 surfactant samples for the high-throughput dye solubilization experiment and measure their respective fluorescence intensity, and (3) analyze the experimental data using physics-based classification to quantify the empirical binary interaction parameter and the mixture CMC across all compositions.
Description | CMC (mM) | Literature ref. |
---|---|---|
SDS | 8.0 ± 0.2 | 8.2 |
SDS (in 0.5 M NaCl) | 0.6 ± 0.1 | 0.5–0.9 |
C8E4 | 9.3 ± 0.3 | 9.1 |
C8E4 (in 0.5 M NaCl) | 7.0 ± 0.2 | N/A |
The CMC values for pure SDS and pure C8E4 in Table 1 have less than a 2.5% difference from their literature references detailed above. Additionally, the SDS in 0.5 M NaCl CMC value of 0.6 mM falls within the expected range of 0.5–0.9 mM based on surface tension CMC experiments.31
To determine the CMC of a binary mixture of SDS and C8E4 we prepared a 96-well plate in the design shown in Fig. 2 which shows the normalized fluorescence intensity (NFU) as a function of each of the two surfactant concentrations.
In Fig. 2, we observed a boundary between the bottom blackish points and the first colored points around NFU = 0.2. This is an indication that the sampling approach successfully encompassed the binary interaction micellization space. That boundary demarcates the onset of micellization and the shielding of the non-polar Nile Red dye from the polar aqueous solution. Qualitatively, we can assert that the binary CMC boundary between SDS and C8E4 lies at the first jump in fluorescence intensity (between the black and dark blue points).
The remainder of this section details the CMC boundary quantification via (1) Leaky ReLU analysis and (2) physics-based classification analysis. These two analysis methods are introduced in order of increasing generalizability and complexity. The ReLU approach is most subject to the local noise in the data and does not account for the underlying physics constraints. In contrast, the physics-based classification approach is the least subject to local noise and directly imposes the physics constraints to robustly identify the CMC boundary from the data.
From Fig. 3, we observe that the data obtained are fit well by Leaky ReLU regressions. After determining the mixed CMC for all five blends, regular solution theory is regressed upon these five CMC points and the two pure endpoint CMCs and the final result of the ReLU regression analysis is depicted in Fig. 4.
Fig. 4 illustrates the final result from the ReLU analysis approach, where β = −2.8 ± 0.2 closely agrees with the literature value of β = −3.1.8 This analysis approach is easy to understand and implement; however, it has a limitation: local noise in each constant composition line affects the fits independently. Each Leaky ReLU fit is regressed separately so the constraint of a smooth CMC boundary is not present which can introduce some uncertainty when choosing the initial guesses for the Leaky ReLU regression. Another limitation to this analysis method is that the second regression is performed on only seven points (five CMC values and two endpoints). This limits uncertainty quantification. A more robust approach than this two-step regression analysis is to enforce the mixed CMC boundary directly into the first regression, and perform the analysis over all five constant composition lines simultaneously.
To label the points, we fit a plane to the low-curvature region (the bottom most two trials across all blends including replicates) and extrapolate this plane to determine an empirical fluorescence threshold. In Fig. 5, the blue center plane represents the fitted composition-based fluorescence threshold and the two red planes illustrate the 95% confidence interval in the fit. From Fig. 5, it is clear that there is a significant difference in the low-curvature region's fluorescence intensity when compared to the fluorescence intensity above the CMC. This difference in intensity from below to above the CMC indicates the robustness of the methodology to the choice of the confidence interval. One note is that this empirical fluorescence threshold stems directly from the two bottommost trials outlined in the DoE sampling setup, and that this threshold assumes linearity within the low-curvature region. This is used to label the points (Fig. 6).
The final results of this physics-based binary classification approach are summarized in Fig. 6 which depicts the original seven β curves, the classification truth values determined from the plane fluorescence threshold, and the predicted optimal β range. Given the literature value for the SDS-C8E4 system β = −3.1, the predicted β from this approach is β = −3.6 (midpoint of the confidence interval [−4.1, −3.1]).8 A 14.9% difference between the two values indicates that this overarching experimental and analytical methodology efficiently and accurately quantified the binary interaction parameter β for the SDS-C8E4 system.
From Fig. 7, for the electrolyte system, β ∈ [−3.45, −2.71] based on the confidence (shaded) region and resolution of the data via the physics-based classification analysis. Equivalently, β = −3.1 ± 0.4 and this result highlights the applicability of the high-throughput methodology to an asymmetric binary surfactant formulation with drastically different pure surfactant CMCs. The experimentally determined β = −3.1 for the SDS-C8E4-electrolyte system sees a 17.5% difference from the value reported for a similar literature system of the SDS-C12E8-electrolyte system. As the value of β is responsive to the chemical composition of the headgroups, it is reasonable that the SDS-C8E4-electrolyte β value is close to that of the SDS-C12E8-electrolyte system.
The addition of 0.5 M NaCl to the SDS-C8E4 system decreased the predicted β from β = −3.6 ± 0.5 to β = −3.1 ± 0.4. At 95% confidence, there is an overlap between the two predicted values due to the sampling resolution of the data; however, the smaller negative β associated with the electrolyte system implies less cooperative interactions between C8E4 and SDS. The less cooperative interactions can be attributed to the already screened electrostatic repulsions among the sulfate headgroups in the presence of NaCl so there is less ethylene oxide incorporation (and interactions) among the headgroups in the micelles.32
In summary, this experiment illustrates the efficacy and accuracy of the high-throughput methodology by reproducing the literature binary interaction parameter from regular solution theory for the SDS-C8E4-electrolyte system. We note that this methodology could have been used with any physics model, and regular solution theory serves as a simple model to demonstrate the viability of this experimental and analytical framework. Additionally, in cases where the governing models turn out to be inconsistent with the data, any mathematical expression can be fit to the data to determine the nonlinear CMC boundary. Some areas of future work could include extending the physical model to include three or more surfactants, as well as other electrolyte and solvent effects.
This journal is © The Royal Society of Chemistry 2025 |