6 research outputs found
Spatiotemporal Prediction of Fine Particulate Matter During the 2008 Northern California Wildfires Using Machine Learning
Estimating population exposure to particulate matter during wildfires
can be difficult because of insufficient monitoring data to capture
the spatiotemporal variability of smoke plumes. Chemical transport
models (CTMs) and satellite retrievals provide spatiotemporal data
that may be useful in predicting PM<sub>2.5</sub> during wildfires.
We estimated PM<sub>2.5</sub> concentrations during the 2008 northern
California wildfires using 10-fold cross-validation (CV) to select
an optimal prediction model from a set of 11 statistical algorithms
and 29 predictor variables. The variables included CTM output, three
measures of satellite aerosol optical depth, distance to the nearest
fires, meteorological data, and land use, traffic, spatial location,
and temporal characteristics. The generalized boosting model (GBM)
with 29 predictor variables had the lowest CV root mean squared error
and a CV-R<sup>2</sup> of 0.803. The most important predictor variable
was the Geostationary Operational Environmental Satellite Aerosol/Smoke
Product (GASP) Aerosol Optical Depth (AOD), followed by the CTM output
and distance to the nearest fire cluster. Parsimonious models with
various combinations of fewer variables also predicted PM<sub>2.5</sub> well. Using machine learning algorithms to combine spatiotemporal
data from satellites and CTMs can reliably predict PM<sub>2.5</sub> concentrations during a major wildfire event
A Hybrid Approach to Estimating National Scale Spatiotemporal Variability of PM<sub>2.5</sub> in the Contiguous United States
Airborne
fine particulate matter exhibits spatiotemporal variability
at multiple scales, which presents challenges to estimating exposures
for health effects assessment. Here we created a model to predict
ambient particulate matter less than 2.5 Ī¼m in aerodynamic diameter
(PM<sub>2.5</sub>) across the contiguous United States to be applied
to health effects modeling. We developed a hybrid approach combining
a land use regression model (LUR) selected with a machine learning
method, and Bayesian Maximum Entropy (BME) interpolation of the LUR
space-time residuals. The PM<sub>2.5</sub> data set included 104,172
monthly observations at 1464 monitoring locations with approximately
10% of locations reserved for cross-validation. LUR models were based
on remote sensing estimates of PM<sub>2.5</sub>, land use and traffic
indicators. Normalized cross-validated <i>R</i><sup>2</sup> values for LUR were 0.63 and 0.11 with and without remote sensing,
respectively, suggesting remote sensing is a strong predictor of ground-level
concentrations. In the models including the BME interpolation of the
residuals, cross-validated <i>R</i><sup>2</sup> were 0.79
for both configurations; the model without remotely sensed data described
more fine-scale variation than the model including remote sensing.
Our results suggest that our modeling framework can predict ground-level
concentrations of PM<sub>2.5</sub> at multiple scales over the contiguous
U.S
A Hybrid Approach to Estimating National Scale Spatiotemporal Variability of PM<sub>2.5</sub> in the Contiguous United States
Airborne
fine particulate matter exhibits spatiotemporal variability
at multiple scales, which presents challenges to estimating exposures
for health effects assessment. Here we created a model to predict
ambient particulate matter less than 2.5 Ī¼m in aerodynamic diameter
(PM<sub>2.5</sub>) across the contiguous United States to be applied
to health effects modeling. We developed a hybrid approach combining
a land use regression model (LUR) selected with a machine learning
method, and Bayesian Maximum Entropy (BME) interpolation of the LUR
space-time residuals. The PM<sub>2.5</sub> data set included 104,172
monthly observations at 1464 monitoring locations with approximately
10% of locations reserved for cross-validation. LUR models were based
on remote sensing estimates of PM<sub>2.5</sub>, land use and traffic
indicators. Normalized cross-validated <i>R</i><sup>2</sup> values for LUR were 0.63 and 0.11 with and without remote sensing,
respectively, suggesting remote sensing is a strong predictor of ground-level
concentrations. In the models including the BME interpolation of the
residuals, cross-validated <i>R</i><sup>2</sup> were 0.79
for both configurations; the model without remotely sensed data described
more fine-scale variation than the model including remote sensing.
Our results suggest that our modeling framework can predict ground-level
concentrations of PM<sub>2.5</sub> at multiple scales over the contiguous
U.S
Development and field validation of a community-engaged particulate matter air quality monitoring network in Imperial, California, USA
<p>The Imperial County Community Air Monitoring Network was developed as part of a community-engaged research study to provide real-time particulate matter (PM) air quality information at a high spatial resolution in Imperial County, California. The network augmented the few existing regulatory monitors and increased monitoring near susceptible populations. Monitors were both calibrated and field validated, a key component of evaluating the quality of the data produced by the community monitoring network. This paper examines the performance of a customized version of the low-cost Dylos optical particle counter used in the community air monitors compared with both PM<sub>2.5</sub> and PM<sub>10</sub> (particulate matter with aerodynamic diameters <2.5 and <10 Ī¼m, respectively) federal equivalent method (FEM) beta-attenuation monitors (BAMs) and federal reference method (FRM) gravimetric filters at a collocation site in the study area. A conversion equation was developed that estimates particle mass concentrations from the native Dylos particle counts, taking into account relative humidity. The <i>R</i><sup>2</sup> for converted hourly averaged Dylos mass measurements versus a PM<sub>2.5</sub> BAM was 0.79 and that versus a PM<sub>10</sub> BAM was 0.78. The performance of the conversion equation was evaluated at six other sites with collocated PM<sub>2.5</sub> environmental beta-attenuation monitors (EBAMs) located throughout Imperial County. The agreement of the Dylos with the EBAMs was moderate to high (<i>R</i><sup>2</sup> = 0.35ā0.81).</p> <p><i>Implications</i>: The performance of low-cost air quality sensors in community networks is currently not well documented. This paper provides a methodology for quantifying the performance of a next-generation Dylos PM sensor used in the Imperial County Community Air Monitoring Network. This air quality network provides data at a much finer spatial and temporal resolution than has previously been possible with government monitoring efforts. Once calibrated and validated, these high-resolution data may provide more information on susceptible populations, assist in the identification of air pollution hotspots, and increase community awareness of air pollution.</p
Development of Land Use Regression Models for PM<sub>2.5</sub>, PM<sub>2.5</sub> Absorbance, PM<sub>10</sub> and PM<sub>coarse</sub> in 20 European Study Areas; Results of the ESCAPE Project
Land Use Regression (LUR) models have been used increasingly
for
modeling small-scale spatial variation in air pollution concentrations
and estimating individual exposure for participants of cohort studies.
Within the ESCAPE project, concentrations of PM<sub>2.5</sub>, PM<sub>2.5</sub> absorbance, PM<sub>10</sub>, and PM<sub>coarse</sub> were
measured in 20 European study areas at 20 sites per area. GIS-derived
predictor variables (e.g., traffic intensity, population, and land-use)
were evaluated to model spatial variation of annual average concentrations
for each study area. The median model explained variance (<i>R</i><sup>2</sup>) was 71% for PM<sub>2.5</sub> (range across
study areas 35ā94%). Model <i>R</i><sup>2</sup> was
higher for PM<sub>2.5</sub> absorbance (median 89%, range 56ā97%)
and lower for PM<sub>coarse</sub> (median 68%, range 32ā 81%).
Models included between two and five predictor variables, with various
traffic indicators as the most common predictors. Lower <i>R</i><sup>2</sup> was related to small concentration variability or limited
availability of predictor variables, especially traffic intensity.
Cross validation <i>R</i><sup>2</sup> results were on average
8ā11% lower than model <i>R</i><sup>2</sup>. Careful
selection of monitoring sites, examination of influential observations
and skewed variable distributions were essential for developing stable
LUR models. The final LUR models are used to estimate air pollution
concentrations at the home addresses of participants in the health
studies involved in ESCAPE
Development of Land Use Regression Models for Particle Composition in Twenty Study Areas in Europe
Land Use Regression (LUR) models
have been used to describe and
model spatial variability of annual mean concentrations of traffic
related pollutants such as nitrogen dioxide (NO<sub>2</sub>), nitrogen
oxides (NO<sub><i>x</i></sub>) and particulate matter (PM).
No models have yet been published of elemental composition. As part
of the ESCAPE project, we measured the elemental composition in both
the PM<sub>10</sub> and PM<sub>2.5</sub> fraction sizes at 20 sites
in each of 20 study areas across Europe. LUR models for eight a priori
selected elements (copper (Cu), iron (Fe), potassium (K), nickel (Ni),
sulfur (S), silicon (Si), vanadium (V), and zinc (Zn)) were developed.
Good models were developed for Cu, Fe, and Zn in both fractions (PM<sub>10</sub> and PM<sub>2.5</sub>) explaining on average between 67 and
79% of the concentration variance (<i>R</i><sup>2</sup>)
with a large variability between areas. Traffic variables were the
dominant predictors, reflecting nontailpipe emissions. Models for
V and S in the PM<sub>10</sub> and PM<sub>2.5</sub> fractions and
Si, Ni, and K in the PM<sub>10</sub> fraction performed moderately
with <i>R</i><sup>2</sup> ranging from 50 to 61%. Si, NI,
and K models for PM<sub>2.5</sub> performed poorest with <i>R</i><sup>2</sup> under 50%. The LUR models are used to estimate exposures
to elemental composition in the health studies involved in ESCAPE