6 research outputs found

    Spatiotemporal Prediction of Fine Particulate Matter During the 2008 Northern California Wildfires Using Machine Learning

    No full text
    Estimating population exposure to particulate matter during wildfires can be difficult because of insufficient monitoring data to capture the spatiotemporal variability of smoke plumes. Chemical transport models (CTMs) and satellite retrievals provide spatiotemporal data that may be useful in predicting PM<sub>2.5</sub> during wildfires. We estimated PM<sub>2.5</sub> concentrations during the 2008 northern California wildfires using 10-fold cross-validation (CV) to select an optimal prediction model from a set of 11 statistical algorithms and 29 predictor variables. The variables included CTM output, three measures of satellite aerosol optical depth, distance to the nearest fires, meteorological data, and land use, traffic, spatial location, and temporal characteristics. The generalized boosting model (GBM) with 29 predictor variables had the lowest CV root mean squared error and a CV-R<sup>2</sup> of 0.803. The most important predictor variable was the Geostationary Operational Environmental Satellite Aerosol/Smoke Product (GASP) Aerosol Optical Depth (AOD), followed by the CTM output and distance to the nearest fire cluster. Parsimonious models with various combinations of fewer variables also predicted PM<sub>2.5</sub> well. Using machine learning algorithms to combine spatiotemporal data from satellites and CTMs can reliably predict PM<sub>2.5</sub> concentrations during a major wildfire event

    A Hybrid Approach to Estimating National Scale Spatiotemporal Variability of PM<sub>2.5</sub> in the Contiguous United States

    No full text
    Airborne fine particulate matter exhibits spatiotemporal variability at multiple scales, which presents challenges to estimating exposures for health effects assessment. Here we created a model to predict ambient particulate matter less than 2.5 Ī¼m in aerodynamic diameter (PM<sub>2.5</sub>) across the contiguous United States to be applied to health effects modeling. We developed a hybrid approach combining a land use regression model (LUR) selected with a machine learning method, and Bayesian Maximum Entropy (BME) interpolation of the LUR space-time residuals. The PM<sub>2.5</sub> data set included 104,172 monthly observations at 1464 monitoring locations with approximately 10% of locations reserved for cross-validation. LUR models were based on remote sensing estimates of PM<sub>2.5</sub>, land use and traffic indicators. Normalized cross-validated <i>R</i><sup>2</sup> values for LUR were 0.63 and 0.11 with and without remote sensing, respectively, suggesting remote sensing is a strong predictor of ground-level concentrations. In the models including the BME interpolation of the residuals, cross-validated <i>R</i><sup>2</sup> were 0.79 for both configurations; the model without remotely sensed data described more fine-scale variation than the model including remote sensing. Our results suggest that our modeling framework can predict ground-level concentrations of PM<sub>2.5</sub> at multiple scales over the contiguous U.S

    A Hybrid Approach to Estimating National Scale Spatiotemporal Variability of PM<sub>2.5</sub> in the Contiguous United States

    No full text
    Airborne fine particulate matter exhibits spatiotemporal variability at multiple scales, which presents challenges to estimating exposures for health effects assessment. Here we created a model to predict ambient particulate matter less than 2.5 Ī¼m in aerodynamic diameter (PM<sub>2.5</sub>) across the contiguous United States to be applied to health effects modeling. We developed a hybrid approach combining a land use regression model (LUR) selected with a machine learning method, and Bayesian Maximum Entropy (BME) interpolation of the LUR space-time residuals. The PM<sub>2.5</sub> data set included 104,172 monthly observations at 1464 monitoring locations with approximately 10% of locations reserved for cross-validation. LUR models were based on remote sensing estimates of PM<sub>2.5</sub>, land use and traffic indicators. Normalized cross-validated <i>R</i><sup>2</sup> values for LUR were 0.63 and 0.11 with and without remote sensing, respectively, suggesting remote sensing is a strong predictor of ground-level concentrations. In the models including the BME interpolation of the residuals, cross-validated <i>R</i><sup>2</sup> were 0.79 for both configurations; the model without remotely sensed data described more fine-scale variation than the model including remote sensing. Our results suggest that our modeling framework can predict ground-level concentrations of PM<sub>2.5</sub> at multiple scales over the contiguous U.S

    Development and field validation of a community-engaged particulate matter air quality monitoring network in Imperial, California, USA

    No full text
    <p>The Imperial County Community Air Monitoring Network was developed as part of a community-engaged research study to provide real-time particulate matter (PM) air quality information at a high spatial resolution in Imperial County, California. The network augmented the few existing regulatory monitors and increased monitoring near susceptible populations. Monitors were both calibrated and field validated, a key component of evaluating the quality of the data produced by the community monitoring network. This paper examines the performance of a customized version of the low-cost Dylos optical particle counter used in the community air monitors compared with both PM<sub>2.5</sub> and PM<sub>10</sub> (particulate matter with aerodynamic diameters <2.5 and <10 Ī¼m, respectively) federal equivalent method (FEM) beta-attenuation monitors (BAMs) and federal reference method (FRM) gravimetric filters at a collocation site in the study area. A conversion equation was developed that estimates particle mass concentrations from the native Dylos particle counts, taking into account relative humidity. The <i>R</i><sup>2</sup> for converted hourly averaged Dylos mass measurements versus a PM<sub>2.5</sub> BAM was 0.79 and that versus a PM<sub>10</sub> BAM was 0.78. The performance of the conversion equation was evaluated at six other sites with collocated PM<sub>2.5</sub> environmental beta-attenuation monitors (EBAMs) located throughout Imperial County. The agreement of the Dylos with the EBAMs was moderate to high (<i>R</i><sup>2</sup> = 0.35ā€“0.81).</p> <p><i>Implications</i>: The performance of low-cost air quality sensors in community networks is currently not well documented. This paper provides a methodology for quantifying the performance of a next-generation Dylos PM sensor used in the Imperial County Community Air Monitoring Network. This air quality network provides data at a much finer spatial and temporal resolution than has previously been possible with government monitoring efforts. Once calibrated and validated, these high-resolution data may provide more information on susceptible populations, assist in the identification of air pollution hotspots, and increase community awareness of air pollution.</p

    Development of Land Use Regression Models for PM<sub>2.5</sub>, PM<sub>2.5</sub> Absorbance, PM<sub>10</sub> and PM<sub>coarse</sub> in 20 European Study Areas; Results of the ESCAPE Project

    No full text
    Land Use Regression (LUR) models have been used increasingly for modeling small-scale spatial variation in air pollution concentrations and estimating individual exposure for participants of cohort studies. Within the ESCAPE project, concentrations of PM<sub>2.5</sub>, PM<sub>2.5</sub> absorbance, PM<sub>10</sub>, and PM<sub>coarse</sub> were measured in 20 European study areas at 20 sites per area. GIS-derived predictor variables (e.g., traffic intensity, population, and land-use) were evaluated to model spatial variation of annual average concentrations for each study area. The median model explained variance (<i>R</i><sup>2</sup>) was 71% for PM<sub>2.5</sub> (range across study areas 35ā€“94%). Model <i>R</i><sup>2</sup> was higher for PM<sub>2.5</sub> absorbance (median 89%, range 56ā€“97%) and lower for PM<sub>coarse</sub> (median 68%, range 32ā€“ 81%). Models included between two and five predictor variables, with various traffic indicators as the most common predictors. Lower <i>R</i><sup>2</sup> was related to small concentration variability or limited availability of predictor variables, especially traffic intensity. Cross validation <i>R</i><sup>2</sup> results were on average 8ā€“11% lower than model <i>R</i><sup>2</sup>. Careful selection of monitoring sites, examination of influential observations and skewed variable distributions were essential for developing stable LUR models. The final LUR models are used to estimate air pollution concentrations at the home addresses of participants in the health studies involved in ESCAPE

    Development of Land Use Regression Models for Particle Composition in Twenty Study Areas in Europe

    No full text
    Land Use Regression (LUR) models have been used to describe and model spatial variability of annual mean concentrations of traffic related pollutants such as nitrogen dioxide (NO<sub>2</sub>), nitrogen oxides (NO<sub><i>x</i></sub>) and particulate matter (PM). No models have yet been published of elemental composition. As part of the ESCAPE project, we measured the elemental composition in both the PM<sub>10</sub> and PM<sub>2.5</sub> fraction sizes at 20 sites in each of 20 study areas across Europe. LUR models for eight a priori selected elements (copper (Cu), iron (Fe), potassium (K), nickel (Ni), sulfur (S), silicon (Si), vanadium (V), and zinc (Zn)) were developed. Good models were developed for Cu, Fe, and Zn in both fractions (PM<sub>10</sub> and PM<sub>2.5</sub>) explaining on average between 67 and 79% of the concentration variance (<i>R</i><sup>2</sup>) with a large variability between areas. Traffic variables were the dominant predictors, reflecting nontailpipe emissions. Models for V and S in the PM<sub>10</sub> and PM<sub>2.5</sub> fractions and Si, Ni, and K in the PM<sub>10</sub> fraction performed moderately with <i>R</i><sup>2</sup> ranging from 50 to 61%. Si, NI, and K models for PM<sub>2.5</sub> performed poorest with <i>R</i><sup>2</sup> under 50%. The LUR models are used to estimate exposures to elemental composition in the health studies involved in ESCAPE
    corecore