Tyler C.
Bradley
*ab,
Sheldon V.
Masters
c,
Timothy A.
Bartrand
d and
Christopher M.
Sales
b
aPhiladelphia Water Department, USA. E-mail: tyler.bradley@phila.gov
bCollege of Civil, Architectural, and Environmental Engineering, Drexel University, USA
cUniversity of Colorado Boulder, USA
dEnvironmental, Science, and Policy Research Institute, USA
First published on 14th November 2023
The United States Environmental Protection Agency's (USEPA's) Lead and Copper Rule Revisions (LCRR) introduced many changes to the existing regulation. Two major changes are the change in sample methodology to fifth-liter (L5) sampling for homes with lead service lines and the find-and-fix (FaF) provision following any single home lead action level exceedance. This research proposes a method which estimates L5 lead levels from first-draw (L1) LCR data. Using L1 data along with paired L5–L1 difference data from other systems with similar L1 results, L5 data can be estimated accurately by bootstrapping. Using L1 data from two utilities (DC Water and Utility B) with known L5 data, this method was validated to accurately estimate L5 data. This method was then applied to a third utility (Philadelphia Water Department, PWD) with LCR data without paired L5 results to estimate what it can expect from this sample methodology. This same method was then applied to PWD to estimate the impact that FaF would have on the system by identifying how quickly new, permanent Water Quality Parameter (WQP) sites would have to be added. Under all simulations, PWD eventually would reach the maximum number of required WQP sites.
Water impactThere are many unknowns for utilities switching to fifth-liter sampling under the Lead and Copper Rule Revisions. This study demonstrates a method for utilities to estimate their fifth-liter lead levels from first-liter lead levels to assess the impacts of this change on their system. |
In December 2020, the U.S. Environmental Protection Agency (EPA) published the final Lead and Copper Rule Revisions (LCRR).3 On January 16, 2021, the effective date of LCRR was delayed until December 16, 2021, to allow the Agency sufficient time to review the LCRR requirements and determine whether additional regulatory changes were needed.4 The LCRR contains numerous new aspects such as changes to sampling methods (e.g., 5th liter samples) and sample location (e.g., all sample sites in systems with lead service lines (LSLs) must have a LSL) requirements, the inclusion of required “find-and-fix” (FaF) follow up procedures for single-home exceedances of the AL, the addition of mandatory school and child-care facility lead monitoring and education programs, development of service line inventories, and new requirements for sampling and education following lead service line replacements.3 The LCRR also established a trigger level (TL) of 10 ppb as a secondary threshold that preemptively initiates action by the utility before they exceed the AL.
The change in sampling methodology adds significant complexity to the LCRR compliance programs for utilities. This method requires samplers to accurately capture five consecutive samples while losing as little water as possible between samples. In the authors' experience, this change greatly increases the level of difficulty in performing the sampling. Additional complexity is added due to the fact that L5 sampling does not apply to all tiers of homes sampled during compliance. This change only applies to tier 1 and tier 2 sites (homes with a LSL). For the remainder of homes sampled, the documented water lead level is based on the L1 sample. All results collected using these different sampling methodology are included in the final calculation for the 90th percentile. Systems with LSLs will be required to collect all LCRR compliance samples from homes with LSLs. This is an increase from previous requirements that only required 50% of homes sampled to have a LSL.
The FaF provision also requires utilities to collect a WQP sample be collected after every single-home exceedance of the AL. This WQP sample must be collected from a tap (that is not the LCR tap itself) from a nearby site that is within half a mile, within the same pressure district, and connected to the same size main as the LCR site.3 If an existing site does not exist, then the utility must identify a new site and perform sampling within the 5-day window. This new site would then become a permanent WQP site. While utilities certainly have routine grab sampling locations spread throughout their systems, most systems are likely to have varying main diameters that will result in mismatches between LCR sites and grab sampling sites.
This study investigates the potential impacts of L5 sampling and the FaF provision of the LCRR on a large public water system (i.e., serving over 50000 people): Philadelphia Water Department (PWD). In order to assess the potential impact of L5 sampling and the FaF requirements, a simulation method is proposed to estimate L5 results directly from L1 results. This method is validated against paired L1 and L5 data from two other large public water systems: DC Water and Utility B. The method is then applied to PWD to evaluate how L5 sampling will impact PWD's compliance with the AL and TL and how the FaF requirements will impact the system, specifically looking at the requirement to add WQP locations to match characteristics of sites with single-home exceedances. This method will allow systems to preemptively begin to assess their corrosion control programs to improve water quality for their consumers and plan for any upcoming capital costs that may be required to comply with the LCRR.
Michigan LCR compliance data used as the L5–L1 difference data in this study was obtained from Masters et al.8 This data covered the 2019 compliance sampling from Michigan systems following Michigan's state LCR revisions, which required a paired L1 and L5 sample for all homes that had a LSL (n = 2909). No difference data points over 100 ppb were included in this analysis (see ESI† for more details).12,13 From this overall dataset, three data subsets were defined to ensure that difference data was only used from Michigan utilities that matched the target utilities (i.e., DC Water, PWD, or Utility B) L1 data characteristics. The first subset included only results from Michigan water systems with a 90th percentile value less than 5 ppb (n = 1817) and was used for simulations for DC Water and PWD. The second subset included only results from Michigan water systems with a 90th percentile value less than 5 ppb and a standard deviation greater than 3 ppb (n = 689) and was used for simulations for PWD. The third subset included only results from Michigan water systems with a 90th percentile value between 5 ppb and 15 ppb (n = 907) and was used for comparison with Utility B. There was no information available regarding the corrosion control methods employed by the systems included in the Michigan LCR compliance dataset.
All data used for this study were from homes with LSLs.
This bootstrap method makes two major assumptions: 1) the L5 sample result is dependent on the L1 sample results (i.e., if a home has a high L1 result, they are more likely to have a high L5 result) and 2) The difference between paired L5 and L1 results is independent of the L1 level within the home. To confirm these assumptions, Spearman rank correlations were performed between paired L1 and L5 results and L1 and L5–L1 differences for the DC Water, Utility B, and 2019 Michigan LCR data sets.
This method was applied to LCR compliance data for three utilities: DC Water, Utility B, and PWD. Using DC Water and Utility B paired L1 and L5 sample results from the sampling rounds included in this study (six sampling rounds for DC Water and one for Utility B), “true” summary statistics were calculated for each of the 1000 simulated sampling rounds. These true summary statistics were calculated from the paired L5 samples that were collected from the 100 randomly selected L1 results for that iteration. It is important to note that the “true” 90th percentile associated with each of the iterations does not represent the system's actual compliance 90th percentile for each LCR sampling round. Using the calculated summary statistics for the “estimated” L5 data and the “true” L5 data, the efficacy of the simulation method was assessed. 95% confidence intervals (CI) were calculated for each of the summary statistics by taking the 2.5% and 97.5% percentile values of the 1000 values. If the mean value of the summary statistic for the “estimated” L5 values fell within the 95% CI of the “true” L5 values than the distributions were not considered significantly different. In addition to comparing the summary statistics of “true” vs. “estimated” L5 results, the differences between true and estimated L5 results were compared on a (simulated) round by round basis to determine if there was a significant difference between the distribution of results. Differences between the individual sampling round data sets were assessed using the non-parametric Mann–Whitney U test. This analysis was applied to four different subsets of the DC Water LCR data set: 1) all six sampling rounds 2018–2020, 2) the two 2018 sampling rounds, 3) the two 2019 sampling rounds, and 4) the two 2020 sampling rounds. Comparisons were made between the “true” and “estimated” L5 results for each of these four subsets and the Utility B paired data to further assess the efficacy of this method on smaller data sets.
PWD did not have paired L1 and L5 data for the sampling rounds included in this study (2016, 2017, and 2019) to directly compare “estimated” vs. “true” L5 results. However, this method can be applied to give the utility an estimate of where the L5 90th percentile may likely fall when L5 sampling is implemented. The bootstrap simulation was applied to PWD using two different subsets of the Michigan L5–L1 difference data. PWD's 90th percentile was less than 5 ppb for all three sampling rounds included in this study (2016, 2017, and 2019), but its standard deviation was greater than 3 ppb for all sampling rounds. To get an idea of how L5 could vary, two subsets of the overall Michigan data set were defined to include 1) all systems with a 90th percentile less than 5 ppb (53 systems and 1817 paired lead results) and 2) all systems with a 90th percentile less than 5 ppb and a standard deviation greater than 3 ppb (15 systems and 689 paired lead results). These two subsets will be defined within as “best case” and “conservative case” for PWD's simulation results, respectively. For both “best” and “conservative” cases, estimated L5 results were assessed to determine what PWD can expect from L5 sampling and how likely they are to exceed both the TL and the AL. 95% confidence intervals were calculated for the 90th percentile statistic for each of these cases. Lead distribution tables were developed for five randomly selected iterations to assess how L5 results compare to lead distribution tables previously published for PWD's L1 data.14,15
1. An existing WQP monitoring site that meets the FaF requirements,
2. An existing non-WQP, RTCR monitoring site that meets FaF requirements, or
3. The closest hydrant that meets FaF requirements.
If either option 2 or 3 had to be used, then that RTCR site or hydrant was added to the permanent list of WQP sites for the rest of that iteration. This means that for all the remaining consecutive sample rounds within that iteration, that RTCR site or hydrant would now be included in the first category of existing WQP monitoring sites. By simulating these 100 consecutive sampling rounds (which would cover at a minimum 50 years of biannual sampling periods), water systems can see how the change to L5 sampling and the implementation of FaF may impact their WQP over the coming years and decades. This method resulted in 1000 simulations of the increase of WQP monitoring requirements over time for a utility and 100000 individual LCRR sampling round simulations.
Only 12 grab locations are currently monitored by PWD for OCCT on a quarterly basis, with 10 being required during reduced monitoring under the current LCR. The new LCRR will require this number to be increased to 25 as PWD will be moved back to standard monitoring. In order to perform this simulation, the initial list of 25 starting WQP sites was made by combining the 12 current WQP sites, the 11 disinfection by-product (DBP) monitoring sites that are not already WQP monitoring sites, and two easily accessible coliform sites. All of the DBP monitoring locations are also used for RTCR monitoring. These 25 sample locations are spread geographically throughout PWD's system covering all service areas.
Sampling round | First liter | Fifth liter | ||||||
---|---|---|---|---|---|---|---|---|
N | Mean (ppb) | 90% (ppb) | Std. dev. (ppb) | N | Mean (ppb) | 90% (ppb) | Std. dev. (ppb) | |
Philadelphia Water Department | ||||||||
2016 | 68 | 1.8 | 3.0 | 3.5 | 0 | — | — | — |
2017 | 89 | 13.8 | 2.2 | 80.2 | 0 | — | — | — |
2019 | 99 | 2.4 | 3.0 | 7.8 | 0 | — | — | — |
DC Water | ||||||||
2018a | 118 | 7.0 | 2.8 | 56.2 | 116 | 2.8 | 4.8 | 8.9 |
2018b | 104 | 1.0 | 2.3 | 1.0 | 101 | 2.3 | 6.0 | 2.4 |
2019a | 109 | 1.5 | 2.2 | 4.0 | 105 | 1.7 | 3.8 | 3.0 |
2019b | 108 | 3.0 | 2.3 | 20.0 | 107 | 2.5 | 6.0 | 2.5 |
2020a | 107 | 1.2 | 1.8 | 2.8 | 104 | 2.3 | 3.2 | 10.5 |
2020b | 105 | 1.4 | 2.8 | 3.7 | 104 | 2.1 | 5.1 | 2.4 |
Utility B | ||||||||
2021a | 88 | 4.3 | 9.1 | 10.6 | 88 | 6.6 | 19.3 | 15.7 |
Since the first 2018 sampling round, DC Water has collected paired L1 and L5 samples during LCR compliance monitoring. In total, DC Water performed six LCR compliance sampling rounds between 2018 and 2020 with paired L1 and L5 samples collected during all of these (Table 1). The L1 90th percentile for all six sampling rounds has been below 3 ppb, with the highest 90th percentile value being 2.8 ppb. The L5 90th percentile value has been slightly higher for the six sampling rounds, although still well below the TL and AL, with a maximum value of 6 ppb. Similar to PWD, one of the sample rounds from DC Water had an increased standard deviation that was driven by a single high lead result. It is unexpected that this will impact the effectiveness of the model.
Utility B collected paired L1 and L5 samples during the January–June 2021 LCR sampling round. 88 paired samples were collected during this sample round. Unlike, DC Water and PWD, Utility B experiences slightly higher L1 levels with an L1 90th percentile value of 9.1 ppb. While this value is less than both the AL and the TL, the L5 90th percentile value from this sample round would have exceeded both limits with a value of 19.3 ppb (Table 1).
A major assumption used in this method is that the L5 result is dependent on the L1 result, however, the difference between L5 and L1 result is independent of L1 result. To verify this assumption, Spearman rank correlations were performed between L1 and L5 paired results from the Michigan 2019 LCR sampling round, DC Water, and Utility B paired L1/L5 data sets. Similarly, Spearman rank correlations were performed between L1 and paired L5–L1 difference results for the same three data sets. All three of these data sets demonstrated statistically significant, strong correlations (rho = 0.73–0.89) between L1 and L5 paired results. From these results, it is clear that L5 results increase as L1 results increase, indicating that the L5 result is dependent on the L1 result. Conversely, the correlations between L1 results and L1/L5 difference results indicate no or very weak correlations (rho = 0–0.27). This confirms the assumption that while the L5 result is dependent on the L1 result, the difference between the two is independent of the L1 lead level.
Similarly, the mean of the estimated 90th percentile distribution fell within the 95% CI for the true 90th percentile distribution when simulated sample rounds were generated only using 2018 and 2019 LCR data, respectively. In 2018, the estimated 90th percentile distribution had a mean of 5.54 ppb, which falls between the true 90th percentile distributions 95% CI of 3.61 and 7.52 ppb. In 2019, the estimated 90th percentile distribution had a mean of 5.3 ppb, which falls between the true 90th percentile distributions 95% CI of 4.11 and 6.2 ppb. Unlike the other three simulated data sets, the simulated LCR rounds using only 2020 LCR data did have a significant difference between the estimated 90th percentile distribution and the true 90th percentile distribution. In 2020, the estimated 90th percentile distribution had a mean of 5.27 ppb, which falls just outside of the 95% CI of the true 90th percentile of 3.31 and 5.23.
For all four of the bootstrap simulations, the estimated 90th percentile values tended to be slightly higher than the true 90th percentile levels. From the cumulative density plot of the true vs. estimated distributions, the estimated 90th percentile distribution is shifted to right of the true 90th percentile distribution (Fig. S3†). This indicates that while there are not significant differences between the estimated and true distributions, the estimated distribution may slightly overestimate L5 lead results. However, given that this method is designed to give utilities an understanding of how the LCRR change in sampling method may impact their compliance and the level of effort required to comply with the new rule, the authors believe that a slight overestimation is not an issue.
Non-parametric Wilcoxon-rank sum tests were performed to assess whether the distribution of true and estimated L5 results were significantly different for each simulated sampling round. Of the 1000 simulated sampling rounds, 265 had a statistically significant difference between the two distributions. Randomly sampling 10 sampling rounds from the 1000 simulations shows us that the overall spread of the data is relatively the same for both the true and estimated data sets (Fig. S4†). In general, estimated L5 results tended to have more single sample exceedances of the 15 ppb AL than true L5 results, with averages of 2.3 and 1.1 samples exceeding 15 ppb, respectively. The estimated L5 results closely matched the distribution of true L5 results from the same subset of homes. By accurately estimating the L5 distribution, this method gives utilities an understanding of where their L5 results may fall.
One of the factors that is likely to have an impact on the effectiveness of this simulation method is how many L1 data points are used to estimate the L5 90th percentiles. To assess this factors impact on the method, the method was repeated on all DC water data five different times (1000 simulations each time) using 30, 50, 70, 90, and 110 lead results in each simulation round, respectively. These simulations show that even at smaller sample sizes (n = 30) the centroid of the distribution was consistent with simulations using larger sample sizes and with the actual 90th percentiles from DC Water's sampling rounds (Fig. S9†). However, the distribution of 90th percentile results from simulations that used smaller sample sizes showed longer tails both in the estimated and true 90th percentile values calculated from the model. This indicates that the while the method can still provide utilities with an estimate of the L5 90th percentile with limited L1 data, these estimates will be less accurate then simulations performed where more L1 data is available. Based on this sensitivity analysis, utilities would want to include at least 70 L1 data points in the simulation to get a more accurate range of possible L5 90th percentiles.
The distributions of the “true” and “estimated” 90th percentile result overall shared a similar profile (Fig. S5†). The distribution of “true” 90th percentile values displayed a bimodal distribution with more results skewing higher than what is seen in the “estimated” 90th percentile distribution. However, the largest peak in the “true” 90th percentile distribution matches that of the “estimated” 90th percentile distribution with results ranging primarily from 10 ppb to 30 ppb. From the estimated distribution, Utility B would exceed the AL 62% of the time and would exceed the TL 97% of the time. The true distribution shows that Utility B would have exceeded the AL and TL 56% and 86% of the time, respectively. The results from the estimated 90th percentile simulations perform well when estimating whether a system is likely to exceed either the TL or AL.
When comparing the distribution of estimated L5 results to true L5 results, not 90th percentiles, 30% of the 1000 simulations saw significant differences (Mann–Whitney p-value < 0.05) between the two difference L5 distributions. Overall, the majority of simulated sampling rounds did not experience a significant difference between the true and estimated L5 results. This illustrates that this simulation method is effective in estimating what a utility is likely to experience when collecting L5 lead samples in their system by using only the L1 data that they have available.
While the results presented here show that the method was effective in estimating the L5 results for Utility B, it is worth noting that the agreement between true and estimated L5 results was lower for Utility B compared to DC Water. The two suspected causes for this decreased accuracy are either the increased magnitude and variability in lead levels observed at Utility B or the use of pH adjustment as a CCT. From the true L1 and L5 data for Utility B, lead levels are less controlled in this system compared to DC Water and as such, that could result in less consistent differences between L1 and L5 results. This may indicate that it is potentially harder to estimate L5 results in systems that do not have stable corrosion control and consistent lead levels. Unfortunately, it is hard to assess the impacts of the choice of corrosion control on the efficacy of the simulation method with only one utility with known L5 data investigated using each type of corrosion control. It would be of interest in future research to test this method on more systems with each type of corrosion control to evaluate its accuracy between corrosion control treatment techniques.
Fig. 2 Distribution of estimated L5 90th percentile values for PWD calculated from 1000 simulations. |
During the 2022 LCR monitoring period, PWD modified its sampling methodology to have customers collect both L1 and L5 samples. All EPA recommended sampling methods (i.e. no pre-flush, no aerator removal, etc.) were followed as they have been since 2016. A total of 104 homes were sampled during this sampling round. The L1 90th percentile remained consistent with previous years, matching the systems lowest L1 90th percentile, at 2 ppb. The L5 90th percentile value for this non-regulatory sampling was 5 ppb. While this is only a single monitoring period, this is indicative to that fact that PWD may be more closely grouped in the “best case” scenario when looking ahead at future monitoring rounds for the L5 sample results.
While investigating the impacts of L5 samples on the 90th percentile is critically important for compliance, it is also still important to investigate the overall distribution of simulated L5 sampling rounds. Fig. S6 and Table S1† illustrates the overall distribution of lead levels in ten and five randomly selected iterations, respectively. We can see from Fig. S6† that for all ten iterations that the majority of samples fall between 1 ppb and 10 ppb. However, investigating the lead distribution table (Table S1†) we can see that the percentage of samples less than 5 ppb has shifted from previously reported values for PWD in the mid-90s for L1 samples to the mid- to low-80s for L5 samples.14,15
While the EPA suggests that large utilities will likely be able to rely on coliform sites to collect WQP FaF samples, this may not be true for all systems and systems should assess the level of effort that this requirement will have on them. In the three sampling rounds included, there were only 32 LCR sampling sites that had an RTCR grab sampling site meeting the requirements of the LCRR FaF mandate, compared with 114 LCR sites without RTCR sites meeting the LCRR requirements. If these FaF requirements had been in place during the 2016, 2017, and 2019 sampling round, 0, 2, and 1 new WQP grab sampling sites would have had to have been established in each round, respectively. While one could argue that establishing only three new locations over three monitoring periods is not a large burden, it is likely still understating the level of effort that this FaF requirement may have under the new LCRR. Firstly, PWD has a stable CCT in place and experiences very few single home exceedances of the AL. However, systems that are well within the 90th percentile AL, but experience more single home exceedances could see this number increase from 1 to 2 new WQP sites per sampling round to 3 to 5. This could result in these systems hitting the maximum number of WQP sites (50) within only a few monitoring periods. Secondly, under the new LCRR, with all systems being reverted to standard monitoring, systems will have single home exceedances more frequently than if they were on reduced monitoring. Finally, with the change in sampling methods, if the L5 sample does indeed result in more single home exceedances, then the number of new WQP sites required could increase at an even faster rate.
Systems must prepare for what impact this will then subsequently have on FaF and on WQP monitoring within their systems. From the 1000 FaF simulations performed, on average PWD reached 50 permanent WQP sites by round 26 in the “best case” scenario and by round 13 in the “conservative case” scenario. The fastest that 50 permanent WQP sites were reached was after 8 rounds in the “best case” scenario compared to 5 in the “conservative case”. The highest number of sampling rounds to reach 50 WQP sites could be as high as 48 and 23 rounds in the “best case” and “conservative case” scenarios, respectively. Based on this simulation, it would take between 13 and 72 (“best case”) or 6 and 34 (“conservative case”) for PWD to reach 50 WQP sites depending on if they are on standard or reduced monitoring. However, since every simulation reached 50 WQP monitoring sites at some point, unless the water system removes all LSL, they will eventually have 50 unique WQP sites (Fig. 3).
On average, PWD would be adding new WQP sites in 41 of the first 100 sampling rounds for “best case” and 50 of the first 100 sampling rounds for “conservative case” under the LCRR. In the first 10 rounds, PWD would be adding new sites in 7 and 9 of these rounds under “best case” and “conservative case”, respectively. This is likely to be a slight underestimation because the same 146 sites are used as the sampling pool for all 100 consecutive rounds. It is likely that the sampling pool will change over time, introducing sites that do not have an existing WQP site meeting the FaF requirements. However, this analysis demonstrates that eventually they all reach 50 WQP sites that must permanently be monitored.
The LCRR will have a major impact on water systems across the country. The change in sample methodology in the LCRR may leave water systems unsure of whether they will remain in compliance, even if they have been in compliance with the current LCR for years. It is imperative that water systems start to assess their L5 lead levels to begin effectively planning for compliance with the LCRR. This method can provide systems with a starting point for this assessment of their corrosion control programs to improve water quality for its customers and plan for any potentially required capital costs associated with this regulation.
Footnote |
† Electronic supplementary information (ESI) available: Additional water system details, methodological details, and figures and tables supporting analysis results. See DOI: https://doi.org/10.1039/d3ew00631j |
This journal is © The Royal Society of Chemistry 2024 |