181 research outputs found
Air Quality Prediction in Smart Cities Using Machine Learning Technologies Based on Sensor Data: A Review
The influence of machine learning technologies is rapidly increasing and penetrating almost in every field, and air pollution prediction is not being excluded from those fields. This paper covers the revision of the studies related to air pollution prediction using machine learning algorithms based on sensor data in the context of smart cities. Using the most popular databases and executing the corresponding filtration, the most relevant papers were selected. After thorough reviewing those papers, the main features were extracted, which served as a base to link and compare them to each other. As a result, we can conclude that: (1) instead of using simple machine learning techniques, currently, the authors apply advanced and sophisticated techniques, (2) China was the leading country in terms of a case study, (3) Particulate matter with diameter equal to 2.5 micrometers was the main prediction target, (4) in 41% of the publications the authors carried out the prediction for the next day, (5) 66% of the studies used data had an hourly rate, (6) 49% of the papers used open data and since 2016 it had a tendency to increase, and (7) for efficient air quality prediction it is important to consider the external factors such as weather conditions, spatial characteristics, and temporal features
A Review of 21st-Century Studies
PM10 prediction has attracted special legislative and scientific attention due
to its harmful effects on human health. Statistical techniques have the
potential for high-accuracy PM10 prediction and accordingly, previous studies
on statistical methods for temporal, spatial and spatio-temporal prediction of
PM10 are reviewed and discussed in this paper. A review of previous studies
demonstrates that Support Vector Machines, Artificial Neural Networks and
hybrid techniques show promise for suitable temporal PM10 prediction. A review
of the spatial predictions of PM10 shows that the LUR (Land Use Regression)
approach has been successfully utilized for spatial prediction of PM10 in
urban areas. Of the six introduced approaches for spatio-temporal prediction
of PM10, only one approach is suitable for high-resolved prediction (Spatial
resolution < 100 m; Temporal resolution ¤ 24 h). In this approach, based upon
the LUR modeling method, short-term dynamic input variables are employed as
explanatory variables alongside typical non-dynamic input variables in a non-
linear modeling procedure
Features Exploration from Datasets Vision in Air Quality Prediction Domain
Air pollution and its consequences are negatively impacting on the world population
and the environment, which converts the monitoring and forecasting air quality techniques as
essential tools to combat this problem. To predict air quality with maximum accuracy, along with the
implemented models and the quantity of the data, it is crucial also to consider the dataset types. This
study selected a set of research works in the field of air quality prediction and is concentrated on the
exploration of the datasets utilised in them. The most significant findings of this research work are:
(1) meteorological datasets were used in 94.6% of the papers leaving behind the rest of the datasets
with a big difference, which is complemented with others, such as temporal data, spatial data, and
so on; (2) the usage of various datasets combinations has been commenced since 2009; and (3) the
utilisation of open data have been started since 2012, 32.3% of the studies used open data, and 63.4%
of the studies did not provide the data
Air quality and urban sustainable development: the application of machine learning tools
[EN] Air quality has an efect on a populationÂżs quality of life. As a dimension of sustainable urban development, governments have been concerned about this indicator. This is refected in the references consulted that have demonstrated progress in forecasting pollution events to issue early warnings using conventional tools which, as a result of the new era of big data, are becoming obsolete. There are a limited number of studies with applications of machine learning tools to characterize and forecast behavior of the environmental, social and economic dimensions of sustainable development as they pertain to air quality. This article presents an analysis of studies that developed machine learning models to forecast sustainable development and air quality. Additionally, this paper sets out to present research that studied the relationship between air quality and urban sustainable development to identify the reliability and possible applications in diferent urban contexts of these machine learning tools. To that end, a systematic review was carried out, revealing that machine learning tools have been primarily used for clustering and classifying variables and indicators according to the problem analyzed, while tools such as artifcial neural networks and support vector machines are the most widely used to predict diferent types of events. The nonlinear nature and synergy of the dimensions of sustainable development are of great interest for the application of machine learning tools.Molina-GĂłmez, NI.; DĂaz-ArĂ©valo, JL.; LĂłpez JimĂ©nez, PA. (2021). Air quality and urban sustainable development: the application of machine learning tools. International Journal of Environmental Science and Technology. 18(4):1-18. https://doi.org/10.1007/s13762-020-02896-6S118184Al-Dabbous A, Kumar P, Khan A (2017) Prediction of airborne nanoparticles at roadside location using a feed–forward artificial neural network. Atmos Pollut Res 8:446–454. https://doi.org/10.1016/j.apr.2016.11.004Antanasijević D, Pocajt V, Povrenović D, Ristić M, Perić-Grujić A (2013) PM10 emission forecasting using artificial neural networks and genetic algorithm input variable optimization. Sci Total Environ 443:511–519. https://doi.org/10.1016/j.scitotenv.2012.10.110Brink H, Richards JW, Fetherolf M (2016) Real-world machine learning. Richards JW, Fetherolf M (eds) Manning Publications Co. Berkeley, CA. https://www.manning.com/books/real-world-machine-learning. Accessed 26 Apr 2020Cervone G, Franzese P, Ezber Y, Boybeyi Z (2008) Risk assessment of atmospheric emissions using machine learning. Nat Hazard Earth Syst 8:991–1000. https://doi.org/10.5194/nhess-8-991-2008Chen S, Kan G, Li J, Liang K, Hong Y (2018) Investigating China’s urban air quality using big data, information theory, and machine learning. Pol J Environ Stud 27:565–578. https://doi.org/10.15244/pjoes/75159Corani (2005) Air quality prediction in Milan: feed-forward neural networks, pruned neural networks and lazy learning. Ecol Model 185:513–529. https://doi.org/10.1016/j.ecolmodel.2005.01.008Cruz C, GĂłmez A, RamĂrez L, Villalva A, Monge O, Varela J, Quiroz J, Duarte H (2017) Calidad del aire respecto de metales (Pb, Cd, Ni, Cu, Cr) y relaciĂłn con salud respiratoria: caso Sonora, MĂ©xico. Rev Int Contam Ambient 33:23–34. https://doi.org/10.20937/RICA.2017.33.esp02.02de Hoogh K, HĂ©ritier H, Stafoggia M, KĂĽnzli N, Kloog I (2018) Modelling daily PM2.5 concentrations at high spatio-temporal resolution across Switzerland. Environ Pollut 233:1147–1154. https://doi.org/10.1016/j.envpol.2017.10.025Franceschi F, Cobo M, Figueredo M (2018) Discovering relationships and forecasting PM10 and PM2.5 concentrations in Bogotá, Colombia, using Artificial Neural Networks, Principal Component Analysis, and k-means clustering. Atmos Pollut Res 9:912–922. https://doi.org/10.1016/j.apr.2018.02.006GarcĂa N, Combarro E, del Coz J, Montañes E (2013) A SVM-based regression model to study the air quality at local scale in Oviedo urban area (Northern Spain): a case study. Appl Math Comput 219:8923–8937. https://doi.org/10.1016/j.amc.2013.03.018Gibert K, SĂ nchez-MĂ rre M, Sevilla B (2012) Tools for environmental data mining and intelligent decision support. In iEMSs. Leipzig, Germany. http://www.iemss.org/society/index.php/iemss-2012-proceedings. Accessed 26 Nov 2018Gibert K, SĂ nchez-Marrè M, Izquierdo J (2016) A survey on pre-processing techniques: relevant issues in the context of environmental data mining. Ai Commun 29:627–663. https://doi.org/10.3233/AIC-160710Gounaridis D, Chorianopoulos I, Koukoulas S (2018) Exploring prospective urban growth trends under different economic outlooks and land-use planning scenarios: the case of Athens. Appl Geogr 90:134–144. https://doi.org/10.1016/j.apgeog.2017.12.001Holloway J, Mengersen K (2018) Statistical machine learning methods and remote sensing for sustainable development goals: a review. Remote Sens 10:1–21. https://doi.org/10.3390/rs10091365Ifaei P, Karbassi A, Lee S, Yoo Ch (2017) A renewable energies-assisted sustainable development plan for Iran using techno-econo-socio-environmental multivariate analysis and big data. Energy Convers Manag 153:257–277. https://doi.org/10.1016/j.enconman.2017.10.014Kadiyala A, Kumar A (2017a) Applications of R to evaluate environmental data science problems. Environ Prog Sustain 36:1358–1364. https://doi.org/10.1002/ep.12676Kadiyala A, Kumar A (2017b) Vector time series-based radial basis function neural network modeling of air quality inside a public transportation bus using available software. Environ Prog Sustain 36:4–10. https://doi.org/10.1002/ep.12523Karimian H, Li Q, Wu Ch, Qi Y, Mo Y, Chen G, Zhang X, Sachdeva S (2019) Evaluation of different machine learning approaches to forecasting PM2.5 mass concentrations. Aerosol Air Qual Res 19:1400–1410. https://doi.org/10.4209/aaqr.2018.12.0450Krzyzanowski M, Apte J, Bonjour S, Brauer M, Cohen A, PrĂĽss-Ustun A (2014) Air pollution in the mega-cities. Curr Environ Health Rep 1:185–191. https://doi.org/10.1007/s40572-014-0019-7Lässig K, Morik (2016) Computat sustainability. Springer, Berlin. https://doi.org/10.1007/978-3-319-31858-5Li Y, Wu Y-X, Zeng Z-X, Guo L (2006) Research on forecast model for sustainable development of economy-environment system based on PCA and SVM. In: Proceedings of the 2006 international conference on machine learning and cybernetics, vol 2006. IEEE, Dalian, China, pp 3590–3593. https://doi.org/10.1109/ICMLC.2006.258576Liu B-Ch, Binaykia A, Chang P-Ch, Tiwari M, Tsao Ch-Ch (2017) Urban air quality forecasting based on multi- dimensional collaborative support vector regression (SVR): a case study of Beijing-Tianjin-Shijiazhuang. PLoS ONE 12:1–17. https://doi.org/10.1371/journal.pone.0179763Lubell M, Feiock R, Handy S (2009) City adoption of environmentally sustainable policies in California’s Central Valley. J Am Plan Assoc 75:293–308. https://doi.org/10.1080/01944360902952295Ma D, Zhang Z (2016) Contaminant dispersion prediction and source estimation with integrated Gaussian-machine learning network model for point source emission in atmosphere. J Hazard Mater 311:237–245. https://doi.org/10.1016/j.jhazmat.2016.03.022Madu C, Kuei N, Lee P (2017) Urban sustainability management: a deep learning perspective. Sustain Cities Soc 30:1–17. https://doi.org/10.1016/j.scs.2016.12.012Mellos K (1988) Theory of eco-development. In: Perspectives on ecology. Palgrave Macmillan, London. https://doi.org/10.1007/978-1-349-19598-5_4Ni XY, Huang H, Du WP (2017) Relevance analysis and short-term prediction of PM2.5 concentrations in Beijing based on multi-source data. Atmos Environ 150:146–161. https://doi.org/10.1016/j.atmosenv.2016.11.054Oprea M, Dragomir E, Popescu M, Mihalache S (2016) Particulate matter air pollutants forecasting using inductive learning approach. Rev Chim 67:2075–2081Paas B, Stienen J, Vorländer M, Schneider Ch (2017) Modelling of urban near-road atmospheric PM concentrations using an artificial neural network approach with acoustic data input. Environments 4:1–25. https://doi.org/10.3390/environments4020026Pandey G, Zhang B, Jian L (2013) Predicting submicron air pollution indicators: a machine learning approach. Environ Sci Proc Impacts 15:996–1005. https://doi.org/10.1039/c3em30890aPeng H, Lima A, Teakles A, Jin J, Cannon A, Hsieh W (2017) Evaluating hourly air quality forecasting in Canada with nonlinear updatable machine learning methods. Air Qual Atmos Health 10:195–211. https://doi.org/10.1007/s11869-016-0414-3PĂ©rez-OrtĂz M, de La Paz-MarĂn M, GutiĂ©rrez PA, Hervás-MartĂnez C (2014) Classification of EU countries’ progress towards sustainable development based on ordinal regression techniques. Knowl Based Syst 66:178–189. https://doi.org/10.1016/j.knosys.2014.04.041Phillis Y, Kouikoglou V, Verdugo C (2017) Urban sustainability assessment and ranking of cities. Comput Environ Urban 64:254–265. https://doi.org/10.1016/j.compenvurbsys.2017.03.002Saeed S, Hussain L, Awan I, Idris A (2017) Comparative analysis of different statistical methods for prediction of PM2.5 and PM10 concentrations in advance for several hours. Int J Comput Sci Netw Secur 17:45–52Sayegh A, Munir S, Habeebullah T (2014) Comparing the performance of statistical models for predicting PM10 concentrations. Aerosol Air Qual Res 14:653–665. https://doi.org/10.4209/aaqr.2013.07.0259Shaban K, Kadri A, Rezk E (2016) Urban air pollution monitoring system with forecasting models. IEEE Sens J 16:2598–2606. https://doi.org/10.1109/JSEN.2016.2514378Sierra B (2006) Aprendizaje automático conceptos básicos y avanzados Aspectos prácticos utilizando el software Weka. Madrid Pearson Prentice Hall, MadridSingh K, Gupta S, Rai P (2013) Identifying pollution sources and predicting urban air quality using ensemble learning methods. Atmos Environ 80:426–437. https://doi.org/10.1016/j.atmosenv.2013.08.023Song L, Pang S, Longley I, Olivares G, Sarrafzadeh A (2014) Spatio-temporal PM2.5 prediction by spatial data aided incremental support vector regression. In: International joint conference on neural networks. IEEE, Beijing, pp 623–630. https://doi.org/10.1109/IJCNN.2014.6889521Souza R, Coelho G, da Silva A, Pozza S (2015) Using ensembles of artificial neural networks to improve PM10 forecasts. Chem Eng Trans 43:2161–2166. https://doi.org/10.3303/CET1543361Suárez A, GarcĂa PJ, Riesgo P, del Coz JJ, Iglesias-RodrĂguez FJ (2011) Application of an SVM-based regression model to the air quality study at local scale in the AvilĂ©s urban area (Spain). Math Comput Model 54:453–1466. https://doi.org/10.1016/j.mcm.2011.04.017Tamas W, Notton G, Paoli C, Nivet M, Voyant C (2016) Hybridization of air quality forecasting models using machine learning and clustering: an original approach to detect pollutant peaks. Aerosol Air Qual Res 16:405–416. https://doi.org/10.4209/aaqr.2015.03.0193Toumi O, Le Gallo J, Ben Rejeb J (2017) Assessment of Latin American sustainability. Renew Sustain Energy Rev 78:878–885. https://doi.org/10.1016/j.rser.2017.05.013Tzima F, Mitkas P, Voukantsis D, Karatzas K (2011) Sparse episode identification in environmental datasets: the case of air quality assessment. Expert Syst Appl 38:5019–5027. https://doi.org/10.1016/j.eswa.2010.09.148United Nations, Department of Economic and Social Affairs (2019) World urbanization prospects The 2018 Revision. New York. https://doi.org/10.18356/b9e995fe-enWang B (2019) Applying machine-learning methods based on causality analysis to determine air quality in China. Pol J Environ Stud 28:3877–3885. https://doi.org/10.15244/pjoes/99639Wang X, Xiao Z (2017) Regional eco-efficiency prediction with support vector spatial dynamic MIDAS. J Clean Prod 161:165–177. https://doi.org/10.1016/j.jclepro.2017.05.077Wang W, Men C, Lu W (2008) Online prediction model based on support vector machine. Neurocomputing 71:550–558. https://doi.org/10.1016/j.neucom.2007.07.020WCED (1987) Report of the world commission on environment and development: our common future: report of the world commission on environment and development. WCED, Oslo. https://doi.org/10.1080/07488008808408783Weizhen H, Zhengqiang L, Yuhuan Z, Hua X, Ying Z, Kaitao L, Donghui L, Peng W, Yan M (2014) Using support vector regression to predict PM10 and PM2.5. In: IOP conference series: earth and environmental science, vol 17. IOP. https://doi.org/10.1088/1755-1315/17/1/012268WHO (2016) OMS | La OMS publica estimaciones nacionales sobre la exposiciĂłn a la contaminaciĂłn del aire y sus repercusiones para la salud. WHO. http://www.who.int/mediacentre/news/releases/2016/air-pollution-estimates/es/. Accesed 26 Nov 2018Yeganeh N, Shafie MP, Rashidi Y, Kamalan H (2012) Prediction of CO concentrations based on a hybrid partial least square and support vector machine model. Atmos Environ 55:357–365. https://doi.org/10.1016/j.atmosenv.2012.02.092Zalakeviciute R, Bastidas M, Buenaño A, Rybarczyk Y (2020) A traffic-based method to predict and map urban air quality. Appl Sci. https://doi.org/10.3390/app10062035Zeng L, Guo J, Wang B, Lv J, Wang Q (2019) Analyzing sustainability of Chinese coal cities using a decision tree modeling approach. Resour Policy 64:101501. https://doi.org/10.1016/j.resourpol.2019.101501Zhan Y, Luo Y, Deng X, Grieneisen M, Zhang M, Di B (2018) Spatiotemporal prediction of daily ambient ozone levels across China using random forest for human exposure assessment. Environ Pollut 233:464–473. https://doi.org/10.1016/j.envpol.2017.10.029Zhang Y, Huan Q (2006) Research on the evaluation of sustainable development in Cangzhou city based on neural-network-AHP. In: Proceedings of the fifth international conference on machine learning and cybernetics, vol 2006. pp 3144–3147. https://doi.org/10.1109/ICMLC.2006.258407Zhang Y, Shang W, Wu Y (2009) Research on sustainable development based on neural network. In: 2009 Chinese control and decision conference. IEEE, pp 3273–3276. https://doi.org/10.1109/CCDC.2009.5192476Zhou Y, Chang F-J, Chang L-Ch, Kao I-F, Wang YS (2019) Explore a deep learning multi-output neural network for regional multi-step-ahead air quality forecasts. J Clean Prod 209:134–145. https://doi.org/10.1016/j.jclepro.2018.10.24
A comparison of statistical and machine learning methods for creating national daily maps of ambient PM concentration
A typical problem in air pollution epidemiology is exposure assessment for
individuals for which health data are available. Due to the sparsity of
monitoring sites and the limited temporal frequency with which measurements of
air pollutants concentrations are collected (for most pollutants, once every 3
or 6 days), epidemiologists have been moving away from characterizing ambient
air pollution exposure solely using measurements. In the last few years,
substantial research efforts have been placed in developing statistical methods
or machine learning techniques to generate estimates of air pollution at finer
spatial and temporal scales (daily, usually) with complete coverage. Some of
these methods include: geostatistical techniques, such as kriging; spatial
statistical models that use the information contained in air quality model
outputs (statistical downscaling models); linear regression modeling approaches
that leverage the information in GIS covariates (land use regression); or
machine learning methods that mine the information contained in relevant
variables (neural network and deep learning approaches). Although some of these
exposure modeling approaches have been used in several air pollution
epidemiological studies, it is not clear how much the predicted exposures
generated by these methods differ, and which method generates more reliable
estimates. In this paper, we aim to address this gap by evaluating a variety of
exposure modeling approaches, comparing their predictive performance and
computational difficulty. Using PM in year 2011 over the continental
U.S. as case study, we examine the methods' performances across seasons, rural
vs urban settings, and levels of PM concentrations (low, medium, high)
Environmental risk assessment in the mediterranean region using artificial neural networks
Los mapas auto-organizados han demostrado ser una herramienta apropiada para la clasificaciĂłn y visualizaciĂłn de grupos de datos complejos. Redes neuronales, como los mapas auto-organizados (SOM) o las redes difusas ARTMAP (FAM), se utilizan en este estudio para evaluar el impacto medioambiental acumulativo en diferentes medios (aguas subterráneas, aire y salud humana). Los SOMs tambiĂ©n se utilizan para generar mapas de concentraciones de contaminantes en aguas subterráneas simulando las tĂ©cnicas geostadĂsticas de interpolaciĂłn como kriging y cokriging. Para evaluar la confiabilidad de las metodologĂas desarrolladas en esta tesis, se utilizan procedimientos de referencia como puntos de comparaciĂłn: la metodologĂa DRASTIC para el estudio de vulnerabilidad en aguas subterráneas y el mĂ©todo de interpolaciĂłn espacio-temporal conocido como Bayesian Maximum Entropy (BME) para el análisis de calidad del aire.
Esta tesis contribuye a demostrar las capacidades de las redes neuronales en el desarrollo de nuevas metodologĂas y modelos que explĂcitamente permiten evaluar las dimensiones temporales y espaciales de riesgos acumulativos
PM2.5-GNN: A Domain Knowledge Enhanced Graph Neural Network For PM2.5 Forecasting
When predicting PM2.5 concentrations, it is necessary to consider complex
information sources since the concentrations are influenced by various factors
within a long period. In this paper, we identify a set of critical domain
knowledge for PM2.5 forecasting and develop a novel graph based model,
PM2.5-GNN, being capable of capturing long-term dependencies. On a real-world
dataset, we validate the effectiveness of the proposed model and examine its
abilities of capturing both fine-grained and long-term influences in PM2.5
process. The proposed PM2.5-GNN has also been deployed online to provide free
forecasting service.Comment: Pre-print version of a ACM SIGSPATIAL 2020 poster
[paper](https://dl.acm.org/doi/10.1145/3397536.3422208). The code is
available at [Github](https://github.com/shawnwang-tech/PM2.5-GNN), and the
talk is available at [YouTube](https://www.youtube.com/watch?v=VX93vMthkGM
Estimating particulate matter using sattellite based aerosol optical depth and meteorological parameters in Malaysia
The insufficient number of ground-based stations for measuring Particulate Matter less than 10µm (PM10), especially in the developing countries hinders PM10 monitoring at a regional scale. The present study aims to develop empirical models for PM10 estimates from space over Malaysia using Aerosol Optical Depth (AOD550) retrieval from Moderate Resolution Imaging Spectroradiometer (MODIS), Medium Resolution Imaging Spectrometer/Advanced Along-Track Scanner Radiometer (MERIS/AATSR) synergy algorithm and meteorological data that include surface temperature, relative humidity and atmospheric stability from 2007-2011. Accuracy of meteorological parameters that have been used in the estimation of PM10 are examined. The estimated relative humidity and surface temperature using satellite data agree well with ground data where coefficient of determination (R2) = 0.78 and 0.49 and Root Mean Square Error (RMSE) = 5.14% and 2.68?C for relative humidity and surface temperature respectively. Multiple Linear Regressions (MLR) and Artificial Neural Network (ANN) techniques are utilized to develop the empirical models. The models were developed using PM10 data measured at 29 stations over Malaysia. Result of the research reveals that the ANN using MODIS AOD550 provide higher accuracy with R2 = 0.71 and RMSE = 11.61?gm-3 compared to the MLR method where R2 = 0.66 and RMSE = 12.39?gm-3 or models that use MERIS/AATSR AOD data. Stepwise regression analysis performed on the MLR method reveals that the MODIS AOD550 is the most important parameter for PM10 predictions where R2 = 0.59 and RMSE = 13.61?gm-3. However, the inclusion of the meteorological parameters in the MLR increases the accuracy of the PM10 estimations. The significance of the meteorological parameters in prediction of PM10 concentrations is in the order of (i) atmospheric stability, (ii) relative humidity and (iii) surface temperature. The estimated PM10 concentrations are validated against another 16 stations dataset of measured PM10 with the ANN model to result in higher accuracy (R2= 0.58, RMSE = 10.16?gm-3) compared to the MLR technique (R2 = 0.56, RMSE = 10.58?gm-3). The higher accuracy that has been attained in PM10 estimations from space allows (i) to map the PM10 distribution at large spatial and temporal scales and (ii) permits for future estimates of PM2.5 concentrations from space for monitoring of the Environmental Performance Index (EPI)
- …