2 research outputs found

    Sensitivity analysis of the PC hyperprior for range and standard deviation components in Bayesian Spatiotemporal high-resolution prediction: An application to PM2.5 prediction in Jakarta, Indonesia

    Get PDF
    The Gaussian Markov Random Field (GMRF) is widely acknowledged for its remarkable flexibility, especially in the realm of high-resolution prediction, when compared to conventional Kriging methods. Rooted in the fundamental principles of Bayesian estimation, this methodology underscores the importance of a meticulous examination of prior and hyperprior distributions, along with their corresponding parameter values. Sensitivity analyses are crucial for evaluating the potential impact of these distributions and parameter values on prediction results. To determine the most effective values for hyperprior parameters, an iterative trial-and-error approach is commonly employed. In our research, we systematically assessed a variety of parameter values through exhaustive cross-validation. Our study is focused on optimizing hyperprior parameter values, with a particular emphasis on Penalized Complexity (PC). We applied our method to conduct spatiotemporal high-resolution predictions of PM2.5 concentrations in Jakarta province, Indonesia. Achieving accurate predictive modeling of PM2.5 concentrations in Jakarta is contingent upon this optimization. We identified that the optimal values for PC hyperprior parameters, with a range of less than 2,000 and a hyperprior standard deviation greater than 1 with a 0.1 probability, yield the most accurate predictions. These parameter values result in the minimum mean absolute percentage error (MAPE) of 19.35393, along with a deviation information criterion (DIC) of -154.23. Our findings highlight that the standard deviation parameter significantly influences model fit compared to the relatively insignificant impact of the range parameter. When coupled with high-resolution mapping, these optimized parameters facilitate a comprehensive understanding of distribution patterns. This process aids in detecting areas particularly susceptible to risks, thereby enhancing decision-making efficacy regarding air quality management

    Boosting Algorithm to Handle Unbalanced Classification of PM<sub>2.5</sub> Concentration Levels by Observing Meteorological Parameters in Jakarta-Indonesia Using AdaBoost, XGBoost, CatBoost, and LightGBM

    No full text
    Air quality conditions are now more severe in the Jakarta area that is among the world&#x2019;s top eight worst cities according to the 2022 Air Quality Index (AQI) report. In particular, the data from the Meteorological, Climatological, and Geophysical Agency (BMKG) of the Republic of Indonesia, the latest outcomes in air quality conditions in Jakarta and surrounding areas, says that PM2.5 concentrations have increased and peaked at 148 μg/m3148~\mu \text{g}/\text{m}^{3} in 2022. While a classification system for this pollution is necessary and critical, the observation of PM2.5 concentrations measured through the BMKG Kemayoran station, Jakarta, turns out to be identified as an unbalanced data class. Thus, in this work, we perform boosting algorithm supervised learning to handle such an unbalanced classification toward PM2.5 concentration levels by observing meteorological patterns in Jakarta during 1 January 2015 to 7 July 2022. The boosting algorithms considered in this research include Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Light Gradient Boosting Machine (LightGBM). Our simulations have proven that boosting classification can significantly reduce bias in combination with variance reduction with unbalanced within-class coefficients, with the classification of PM2.5 class values: good 62&#x0025;, moderate 34&#x0025;, and unhealthy 59&#x0025;, respectively
    corecore