Issue 8, 2023

Emerging investigator series: a machine learning approach to quantify the impact of meteorology on tropospheric ozone in the inland southern California

Abstract

The role of meteorology in facilitating the formation and accumulation of ground-level ozone is of great theoretical and practical interest, especially due to changing global climate. In this study, with appropriate machine learning algorithms, we analyzed large meteorology and air quality datasets to train machine learning models to (1) enhance the prediction of ozone levels in the South Coast Air Basin of California, (2) investigate the impact of recent meteorological shifts on ozone formation, and (3) determine the most critical factors influencing ozone exceedance hours. Random forest regression was used to predict historical and future trends of ozone levels, and k-nearest neighbor was used as a binary classifier for ozone exceedance prediction. The models were trained on meteorology data from Ontario and Los Angeles International Airport stations and air quality data from the Fontana, California air monitoring station, and data were collected for the 1994 to 2018 time period. Upon model evaluation, the correlation of the RFR model was 0.92, and the probability of detection for ozone exceedances using k-nearest neighbors was 0.81 for the most recent years of the analysis (2014–2018). We also ran a 4 km Community Multiscale Air Quality model simulation to generate air pollution estimates over Southern California. As expected, ozone in Fontana was positively correlated with temperature. The ozone exceedance hours usually occurred when the temperature was above 25 °C, and the wind direction was from 270° (westerly). Ozone sensitivity as a function of temperature and NOx was also examined. Observed troughs in hourly NOx concentrations during midday under high temperatures suggests that most of the ambient NOx reacted, also as expected. The results indicate that machine learning can support state implementation planning by complementing traditional air quality modeling, reducing simulation time, and exploiting large datasets for historical simulations and future air quality predictions.

Graphical abstract: Emerging investigator series: a machine learning approach to quantify the impact of meteorology on tropospheric ozone in the inland southern California

Supplementary files

Article information

Article type
Paper
Submitted
01 juil. 2022
Accepted
22 mai 2023
First published
15 juin 2023
This article is Open Access
Creative Commons BY-NC license

Environ. Sci.: Atmos., 2023,3, 1159-1173

Emerging investigator series: a machine learning approach to quantify the impact of meteorology on tropospheric ozone in the inland southern California

K. Do, M. Mahish, A. K. Yeganeh, Z. Gao, C. L. Blanchard and C. E. Ivey, Environ. Sci.: Atmos., 2023, 3, 1159 DOI: 10.1039/D2EA00077F

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements