4 research outputs found

    Estimation of hourly near surface air temperature across Israel using an ensemble model

    Get PDF
    Mapping of near-surface air temperature (Ta) at high spatio-temporal resolution is essential for unbiased assessment of human health exposure to temperature extremes, not least given the observed trend of urbanization and global climate change. Data constraints have led previous studies to focus merely on daily Ta metrics, rather than hourly ones, making them insufficient for intra-day assessment of health exposure. In this study, we present a three-stage machine learning-based ensemble model to estimate hourly Ta at a high spatial resolution of 1 × 1 km2, incorporating remotely sensed surface skin temperature (Ts) from geostationary satellites, reanalysis synoptic variables, and observations from weather stations, as well as auxiliary geospatial variables, which account for spatio-temporal variability of Ta. The Stage 1 model gap-fills hourly Ts at 4 × 4 km2 from the Spinning Enhanced Visible and InfraRed Imager (SEVIRI), which are subsequently fed into the Stage 2 model to estimate hourly Ta at the same spatio-temporal resolution. The Stage 3 model downscales the residuals between estimated and measured Ta to a grid of 1 × 1 km2, taking into account additionally the monthly diurnal pattern of Ts derived from the Moderate Resolution Imaging Spectroradiometer (MODIS) data. In each stage, the ensemble model synergizes estimates from the constituent base learners—random forest (RF) and extreme gradient boosting (XGBoost)—by applying a geographically weighted generalized additive model (GAM), which allows the weights of results from individual models to vary over space and time. Demonstrated for Israel for the period 2004–2017, the proposed ensemble model outperformed each of the two base learners. It also attained excellent five-fold cross-validated performance, with overall root mean square error (RMSE) of 0.8 and 0.9 °C, mean absolute error (MAE) of 0.6 and 0.7 °C, and R2 of 0.95 and 0.98 in Stage 1 and Stage 2, respectively. The Stage 3 model for downscaling Ta residuals to 1 km MODIS grids achieved overall RMSE of 0.3 °C, MAE of 0.5 °C, and R2 of 0.63. The generated hourly 1 × 1 km2 Ta thus serves as a foundation for monitoring and assessing human health exposure to temperature extremes at a larger geographical scale, helping to further minimize exposure misclassification in epidemiological studies

    Towards an operational model for estimating day and night instantaneous near-surface air temperature for urban heat island studies: outline and assessment

    Get PDF
    Near-surface air temperature (NSAT) is key for assessing urban heat islands, human health, and well-being. However, a widely recognized and cost- and time-effective replicable approach for estimating hourly NSAT is still urgent. In this study, we outline and validate an easy-to-replicate, yet effective, operational model, for automating the estimation of high-resolution day and night instantaneous NSAT. The model is tested on a heat wave event and for a large geographical area. The model combines remotely sensed land surface temperature and digital elevation model, with air temperature from local fixed weather station networks. Achieved NSAT has daily and hourly frequency consistent with MODIS revisiting time. A geographically weighted regression method is employed, with exponential weighting found to be highly accurate for our purpose. A robust assessment of different methods, at different time slots, both day- and night-time, and during a heatwave event, is provided based on a cross-validation protocol. Four-time periods are modelled and tested, for two consecutive days, i.e. 31st of July 2020 at 10:40 and 21:50, and 1st of August 2020 at 02:00 and 13:10 local time. High R2 was found for all time slots, ranging from 0.82 to 0.88, with a bias close to 0, RMSE ranging from 1.45 °C to 1.77 °C, and MAE from 1.15 °C to 1.36 °C. Normalized RMSE and MAE are roughly 0.05 to 0.08. Overall, if compared to other recognized regression models, higher effectiveness is allowed also in terms of spatial autocorrelation of residuals, as well as in terms of model sensitivity

    The Technological Emergence of AutoML: A Survey of Performant Software and Applications in the Context of Industry

    Full text link
    With most technical fields, there exists a delay between fundamental academic research and practical industrial uptake. Whilst some sciences have robust and well-established processes for commercialisation, such as the pharmaceutical practice of regimented drug trials, other fields face transitory periods in which fundamental academic advancements diffuse gradually into the space of commerce and industry. For the still relatively young field of Automated/Autonomous Machine Learning (AutoML/AutonoML), that transitory period is under way, spurred on by a burgeoning interest from broader society. Yet, to date, little research has been undertaken to assess the current state of this dissemination and its uptake. Thus, this review makes two primary contributions to knowledge around this topic. Firstly, it provides the most up-to-date and comprehensive survey of existing AutoML tools, both open-source and commercial. Secondly, it motivates and outlines a framework for assessing whether an AutoML solution designed for real-world application is 'performant'; this framework extends beyond the limitations of typical academic criteria, considering a variety of stakeholder needs and the human-computer interactions required to service them. Thus, additionally supported by an extensive assessment and comparison of academic and commercial case-studies, this review evaluates mainstream engagement with AutoML in the early 2020s, identifying obstacles and opportunities for accelerating future uptake
    corecore