5,035 research outputs found

    Matrix Completion With Variational Graph Autoencoders: Application in Hyperlocal Air Quality Inference

    Get PDF
    Inferring air quality from a limited number of observations is an essential task for monitoring and controlling air pollution. Existing inference methods typically use low spatial resolution data collected by fixed monitoring stations and infer the concentration of air pollutants using additional types of data, e.g., meteorological and traffic information. In this work, we focus on street-level air quality inference by utilizing data collected by mobile stations. We formulate air quality inference in this setting as a graph-based matrix completion problem and propose a novel variational model based on graph convolutional autoencoders. Our model captures effectively the spatio-temporal correlation of the measurements and does not depend on the availability of additional information apart from the street-network topology. Experiments on a real air quality dataset, collected with mobile stations, shows that the proposed model outperforms state-of-the-art approaches

    A stigmergy-based analysis of city hotspots to discover trends and anomalies in urban transportation usage

    Full text link
    A key aspect of a sustainable urban transportation system is the effectiveness of transportation policies. To be effective, a policy has to consider a broad range of elements, such as pollution emission, traffic flow, and human mobility. Due to the complexity and variability of these elements in the urban area, to produce effective policies remains a very challenging task. With the introduction of the smart city paradigm, a widely available amount of data can be generated in the urban spaces. Such data can be a fundamental source of knowledge to improve policies because they can reflect the sustainability issues underlying the city. In this context, we propose an approach to exploit urban positioning data based on stigmergy, a bio-inspired mechanism providing scalar and temporal aggregation of samples. By employing stigmergy, samples in proximity with each other are aggregated into a functional structure called trail. The trail summarizes relevant dynamics in data and allows matching them, providing a measure of their similarity. Moreover, this mechanism can be specialized to unfold specific dynamics. Specifically, we identify high-density urban areas (i.e hotspots), analyze their activity over time, and unfold anomalies. Moreover, by matching activity patterns, a continuous measure of the dissimilarity with respect to the typical activity pattern is provided. This measure can be used by policy makers to evaluate the effect of policies and change them dynamically. As a case study, we analyze taxi trip data gathered in Manhattan from 2013 to 2015.Comment: Preprin

    Spatiotemporal Deep Learning ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๋„์‹œ ์ „์—ญ์˜ ๋Œ€๊ธฐ ์˜ค์—ผ ๋ณด๊ฐ„๊ณผ ์˜ˆ์ธก

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€, 2019. 2. ์ฐจ์ƒ๊ท .๋Œ€๊ธฐ ์˜ค์—ผ์€ ๋Œ€๋„์‹œ์—์„œ ๊ฐ€์žฅ ํฐ ๋ฌธ์ œ ์ค‘ ํ•˜๋‚˜์ด๋‹ค. ๋งŽ์€ ๊ตญ๊ฐ€๋“ค์€ ์ฃผ์š” ๋„์‹œ ์ฃผ๋ณ€์— ๋Œ€๊ธฐ ์˜ค์—ผ ๋ชจ๋‹ˆํ„ฐ๋ง ์„ผํ„ฐ๋ฅผ ๊ฑด์„คํ•˜์—ฌ ๋Œ€๊ธฐ ์˜ค์—ผ ๋ฌผ์งˆ์„ ์ˆ˜์ง‘ํ•˜๊ณ  ํ•ด๋‹น ์ง€์—ญ์˜ ์‹œ๋ฏผ๋“ค์—๊ฒŒ ๋Œ€๊ธฐ ์˜ค์—ผ์„ ๊ฒฝ๊ณ ํ•œ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋„์‹œ์—์„œ์˜ ๋Œ€๊ธฐ ์˜ค์—ผ์€ ๊ท ์ผํ•˜์ง€ ์•Š์œผ๋ฉฐ ์‹œ๊ณต๊ฐ„ (spatiotemporal)์ ์ธ ๋ฌธ์ œ์ด๋‹ค. ๋Œ€๊ธฐ ์˜ค์—ผ์€ ์œ„์น˜ (๊ณต๊ฐ„์  ํŠน์„ฑ)๊ณผ ์‹œ๊ฐ (์‹œ๊ฐ„์  ํŠน์„ฑ)์— ๋”ฐ๋ผ ๋‹ฌ๋ผ์ง„๋‹ค. ๋”ฐ๋ผ์„œ, ๋„์‹œ ์ „์ฒด์˜ ๋Œ€๊ธฐ ์˜ค์—ผ ๋ณด๊ฐ„๊ณผ ์˜ˆ์ธก์€ ์‹œ๋ฏผ๋“ค์ด ์‹œ๊ฐ„๊ณผ ๊ณต๊ฐ„์— ๋Œ€ํ•ด ๋Œ€๊ธฐ์˜ ์งˆ์„ ํŒŒ์•…ํ•˜๊ณ , ๋‚˜์•„๊ฐ€ ๊ฑด๊ฐ•์— ๋Œ€ํ•œ ์œ„ํ˜‘์„ ์ œ๊ฑฐํ•˜๊ธฐ ์œ„ํ•œ ํ•„์š” ์กฐ๊ฑด์ด๋‹ค. ๋Œ€๊ธฐ ์˜ค์—ผ์€ ๋„์‹œ ์ „์—ญ์˜ ์—ฌ๋Ÿฌ ์‹œ๊ณต๊ฐ„์  ์š”์ธ์— ์˜ํ•ด ์˜ํ–ฅ์„ ๋ฐ›๋Š” ๊ฒƒ์œผ๋กœ ์•Œ๋ ค์ ธ ์žˆ๋‹ค. ๊ทธ ์ค‘, ๊ธฐ์ƒ์ด ๋Œ€๊ธฐ ์˜ค์—ผ์— ๊ฐ€์žฅ ํฐ ์˜ํ–ฅ์„ ์ฃผ๋Š” ๊ฒƒ์œผ๋กœ ์ธ์‹๋˜๊ณ  ์žˆ๋‹ค. ๊ทธ ์™ธ์—, ๊ตํ†ต๋Ÿ‰์€ ๋Œ€๊ธฐ ์˜ค์—ผ์˜ ์ฃผ์š” ์›์ธ์ธ ๋„๋กœ์˜ ์ฐจ๋Ÿ‰ ๋ฐ€๋„๋ฅผ ๋ฐ˜์˜ํ•œ๋‹ค. ํ‰๊ท  ์ฃผํ–‰ ์†๋„๋Š” ๋„์‹œ ๋Œ€๊ธฐ ์˜ค์—ผ์— ์˜ํ–ฅ์„ ์ค€๋‹ค๊ณ  ํŒ๋‹จ๋˜๋Š” ๊ตํ†ต ์ฒด์ฆ์„ ๋‚˜ํƒ€๋‚ธ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ, ์™ธ๋ถ€ ๋Œ€๊ธฐ ์˜ค์—ผ์›์€ ๋„์‹œ ๋Œ€๊ธฐ ์˜ค์—ผ ๋ฌธ์ œ์˜ ๊ทผ์› ์ค‘ ํ•˜๋‚˜๋ผ๊ณ  ์ฃผ์žฅ๋œ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” ์„œ์šธ์‹œ์˜ ๋Œ€๊ธฐ ์˜ค์—ผ ๋ฐ์ดํ„ฐ, ๊ธฐ์ƒ ๋ฐ์ดํ„ฐ, ๊ตํ†ต๋Ÿ‰, ํ‰๊ท  ์ฃผํ–‰ ์†๋„์™€ ๊ฐ™์€ ๋งŽ์€ ์‹œ๊ณต๊ฐ„์  ๋ฐ์ดํ„ฐ์™€ ์„œ์šธ์˜ ๋Œ€๊ธฐ ์˜ค์—ผ์— ์˜ํ–ฅ์„ ์ค€๋‹ค๊ณ  ์•Œ๋ ค์ง„ ์ค‘๊ตญ์˜ 3๊ฐœ ์ง€๋ฐฉ(๋ฒ ์ด์ง•, ์ƒํ•˜์ด, ์‚ฐ๋™)์˜ ๋Œ€๊ธฐ ์˜ค์—ผ ๋ฐ์ดํ„ฐ๋ฅผ ์ œ์‹œํ•˜์˜€๋‹ค. ๋Œ€๊ธฐ ์˜ค์—ผ์— ๋Œ€ํ•œ ์ตœ๊ทผ์˜ ์—ฐ๊ตฌ์—์„œ๋Š” ํŠน์ • ์œ„์น˜์™€ ์‹œ๊ฐ„์˜ ๋Œ€๊ธฐ ์˜ค์—ผ ์˜ˆ์ธก ๋ชจ๋ธ์„ ๊ตฌ์ถ•ํ•˜๋ ค๊ณ  ์‹œ๋„ํ•ด์™”๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋Œ€๋ถ€๋ถ„ ์—ฐ์†๋˜์ง€ ์•Š์€ ์œ„์น˜์—๋Œ€ํ•œ ๋Œ€๊ธฐ ์˜ค์—ผ์„ ์˜ˆ์ธกํ•˜๊ฑฐ๋‚˜ ์ง์ ‘ ๋งŒ๋“  ๊ณต๊ฐ„ ๋ฐ ์‹œ๊ฐ„์  ํŠน์„ฑ์„ ์‚ฌ์šฉํ•˜๋Š” ๋ฐ ์ค‘์ ์„ ๋‘์—ˆ๋‹ค. ์ตœ๊ทผ CNN (Convolutional Neural Network), RNN (Recurrent Neural Network) ๋ฐ LSTM (Long-Short Term Memory)๊ณผ ๊ฐ™์€ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์ด ๊ณต๊ฐ„ ๋ฐ ์‹œ๊ฐ„ ๊ด€๋ จ ๋ฌธ์ œ์—์„œ ์šฐ์ˆ˜ํ•˜๋‹ค๊ณ  ์•Œ๋ ค์ ธ์žˆ๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ๋Š” CNN๊ณผ LSTM์„ ๊ฒฐํ•ฉํ•œ ConvLSTM (Convolutional Long-Short Term Memory) ๋ชจ๋ธ์„ ์ œ์•ˆํ•˜์˜€์œผ๋ฉฐ, ์ด๋ฅผ ํ†ตํ•ด ๋ฐ์ดํ„ฐ์˜ ๊ณต๊ฐ„ ๋ฐ ์‹œ๊ฐ„์  ํŠน์„ฑ์„ ํšจ์œจ์ ์œผ๋กœ ์ฒ˜๋ฆฌํ•˜๊ณ  ์ตœ๊ทผ์˜ ๋‹ค๋ฅธ ์—ฐ๊ตฌ ๊ฒฐ๊ณผ๋ณด๋‹ค ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋‹ฌ์„ฑํ•˜์˜€๋‹ค.Air pollution is one of the most concerns of big cities. Many countries in the world have constructed air quality monitoring stations around major cities to collect air pollutants and make the warning to urban citizens about the air pollution around them. However, air pollution is not uniform in the city, but it is a spatiotemporal problem. It changes by locations (spatial feature) and by time (temporal feature). Consequently, citywide air pollution interpolation and prediction is a requirement of urban people to know the air quality through time and spaces to eliminate the health risks. Moreover, air pollution is affected by many spatiotemporal factors throughout the whole city. Among them, meteorology is recognized to be one the most significant effects to air pollution. Besides that, traffic volume reflects the density of vehicles on roads which is the primary cause of air pollution. Average driving speed indicates the traffic congestion which also reasonably influences air pollution over the city. Finally, external air pollution sources from outside areas are claimed to be the reason contributing to a city's air pollution problem. In this thesis, we present many spatiotemporal datasets collected over Seoul city, Korea such as air pollution data, meteorological data, traffic volume, average driving speed, and air pollution of 3 China areas like Beijing, Shanghai, Shandong, which are known to have the effect to Seoul's air pollution. Recent research in air pollution has tried to build models to predict air pollution by locations and in the future time. Nonetheless, they mostly focused on predicting air pollution in discrete locations or used hand-crafted spatial and temporal features. Recently, Deep learning models such as Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Long-Short Term Memory (LSTM) are known to be superior in spatial and temporal relating problems. In this thesis, we propose the usage of Convolutional Long-Short Term Memory (ConvLSTM) model, a combination of CNN and LSTM, which efficiently manipulates the spatial and temporal features of the data and outperforms other recent research.1 INTRODUCTION 1 1.1 Air pollution description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Citywide Air pollution Interpolation and Prediction . . . . . . . . . . . . . . . . . . . . . . . 2 1.3 Spatiotemporal datasets introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.4 Thesis contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .8 2 RELATED WORK 11 2.1 Spatiotemporal Air pollution interpolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.2 Machine Learning/Neural Networks based Air pollution prediction models . . . .12 2.3 Spatiotemporal Deep Learning models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 3 Spatiotemporal Deep Learning Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 3.1 CNN and LSTM models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .16 3.2 ConvLSTM model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 3.3 Air Pollution Interpolation and Prediction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 4 EXPERIMENTS AND EVALUATIONS 29 4.1 Baselines description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 4.2 Experiments and Evaluations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 4.2.1 Air pollution Interpolation: experiments and evaluations . . . . . . . . . . . . . . . . . 34 4.2.2 Air pollution Forecasting: experiments and evaluations . . . . . . . . . . . . . . . . . . 41 5 CONCLUSIONS AND FUTURE WORK 45 5.1 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 5.2 Future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46Maste

    Quantifying the health burden misclassification from the use of different PM2.5 exposure tier models: A case study of London

    Get PDF
    Exposure to PM2.5 has been associated with increased mortality in urban areas. Hence, reducing the uncertainty in human exposure assessments is essential for more accurate health burden estimates. Here we quantify the misclassification that occurs when using different exposure approaches to predict the mortality burden of a population using London as a case study. We develop a framework for quantifying the misclassification of the total mortality burden attributable to exposure to fine particulate matter (PM2.5) in four major microenvironments (MEs) (dwellings, aboveground transportation, London Underground (LU) and outdoors)in the Greater London Area (GLA), in 2017. We demonstrate that differences exist between five different exposure Tier-models with incrementally increasing complexity, moving from static to more dynamic approaches. BenMap-CE, the open source software developed by the U.S. Environmental Protection Agency, is used as a tool to achieve spatial distribution of the ambient concentration by interpolating the monitoring data to the unmonitored areas and ultimately estimate the change in mortality on a fine resolution. Our results showed that using the outdoor concentration as a surrogate for the total population exposure but ignoring the different exposure concentration that occurs indoors and the time spent in transit, would lead to a misclassification of 1,174 predicted mortalities in GLA. Indoor exposure to PM2.5 is the largest contributor to total population exposure, accounting for 80% of total mortality, followed by the London Underground which contributes 15%, albeit the average percentage of time spent there by Londoners is only 0.4%. We generally confirmed that increasing the complexity and incorporating important microenvironments, such as the highly polluted LU, could significantly reduce the misclassification in health burden assessments
    • โ€ฆ
    corecore