1,554 research outputs found

    Exploiting Recurring Patterns to Improve Scalability of Parking Availability Prediction Systems

    Get PDF
    Parking Guidance and Information (PGI) systems aim at supporting drivers in finding suitable parking spaces, also by predicting the availability at driver’s Estimated Time of Arrival (ETA), leveraging information about the general parking availability situation. To do these predictions, most of the proposals in the literature dealing with on-street parking need to train a model for each road segment, with significant scalability issues when deploying a city-wide PGI. By investigating a real dataset we found that on-street parking dynamics show a high temporal auto-correlation. In this paper we present a new processing pipeline that exploits these recurring trends to improve the scalability. The proposal includes two steps to reduce both the number of required models and training examples. The effectiveness of the proposed pipeline has been empirically assessed on a real dataset of on-street parking availability from San Francisco (USA). Results show that the proposal is able to provide parking predictions whose accuracy is comparable to state-of-the-art solutions based on one model per road segment, while requiring only a fraction of training costs, thus being more likely scalable to city-wide scenarios

    Urban traffic flow prediction, a spatial-temporal approach

    Get PDF
    Dissertation submitted in partial fulfilment of the requirements for the Degree of Master of Science in Geospatial TechnologiesCurrent advances in computational technologies such as machine learning combined with traffic data availability are inspiring the development and growth of intelligent transport Systems (ITS). As urban authorities strive for efficient traffic systems, traffic forecasting is a vital element for effective control and management of traffic networks. Traffic forecasting methods have progressed from traditional statistical techniques to optimized data driven methods eulogised with artificial intelligence. Today, most techniques in traffic forecasting are mainly timeseries methods that ignore the spatial impact of traffic networks in traffic flow modelling. The consideration of both spatial and temporal dimensions in traffic forecasting efforts is key to achieving inclusive traffic forecasts. This research paper presents approaches to analyse spatial temporal patterns existing in networks and goes on to use a machine learning model that integrates both spatial and temporal dependency in traffic flow prediction. The application of the model to a traffic dataset for the city of Singapore shows that we can accurately predict traffic flow up to 15 minutes in advance and also accuracy results obtained outperform other classical traffic prediction methods

    Short Term Forecasting of Solar Radiation

    Get PDF
    This paper details how to predict solar radiation at a location for the next few hours using machine learning techniques like Facebook’s Prophet, and Amazon’s DeepAR+. Multiple techniques like AutoRegressive (ARIMA) and Exponential Smoothing (ES) have been used to forecast solar radiation, but they lack accuracy and are not scalable. Whereas Prophet, and Amazon’s DeepAR+ are scalable, accurate, and easily integrated into other machine learning techniques. This will be the first time where the combination of these techniques along with Linear Regression, Random Forest, XGBoost and Decision Tree will be leveraged to forecast solar radiation for the short term. Predicting solar energy accurately depends on multiple factors (including weather conditions) that make forecasting highly resource-intensive, and accuracy remains a challenge. Improving the accuracy of the short-term forecast of solar energy production would provide a massive value to the companies operating IoT Devices and drones to have a more efficient operation and reduced cost. The objective is to improve the accuracy of forecasting short-term solar radiation to power drones and IoT devices, leveraging the ensemble techniques by combing the outcome of Prophet and DeepAR+. Facebook’s Prophet, and Amazon’s DeepAR+ used to carry out shortterm solar forecasting can be scaled by leveraging the supercomputer. Amazon’s DeepAR+ runs on the AWS cloud platform, so they align well with scaling and bring in all the enhancement that comes with cloud technology. Multiple models were used to identify the best way to forecast short-term solar radiation. Random Forest and ensemble models outperformed the Facebook Prophet and Amazon’s DeepAR+, achieving a coefficient of determination R2 of 99 % in Dallas, Texas. Ensemble Model was created to minimize the bias and variance of the outcome

    Truck Activity Pattern Classification Using Anonymous Mobile Sensor Data

    Get PDF
    To construct, operate, and maintain a transportation system that supports the efficient movement of freight, transportation agencies must understand economic drivers of freight flow. This is a challenge since freight movement data available to transportation agencies is typically void of commodity and industry information, factors that tie freight movements to underlying economic conditions. With recent advances in the resolution and availability of big data from Global Positioning Systems (GPS), it may be possible to fill this critical freight data gap. However, there is a need for methodological approaches to enable usage of this data for freight planning and operations. To address this methodological need, we use advanced machine-learning techniques and spatial analyses to classify trucks by industry based on activity patterns derived from large streams of truck GPS data. The major components are: (1) derivation of truck activity patterns from anonymous GPS traces, (2) development of a classification model to distinguish trucks by industry, and (3) estimation of a spatio-temporal regression model to capture rerouting behavior of trucks. First, we developed a K-means unsupervised clustering algorithm to find unique and representative daily activity patterns from GPS data. For a statewide GPS data sample, we are able to reduce over 300,000 daily patterns to a representative six patterns, thus enabling easier calibration and validation of the travel forecasting models that rely on detailed activity patterns. Next, we developed a Random Forest supervised machine learning model to classify truck daily activity patterns by industry served. The model predicts five distinct industry classes, i.e., farm products, manufacturing, chemicals, mining, and miscellaneous mixed, with 90% accuracy, filling a critical gap in our ability to tie truck movements to industry served. This ultimately allows us to build travel demand forecasting models with behavioral sensitivity. Finally, we developed a spatio-temporal model to capture truck rerouting behaviors due to weather events. The ability to model re-routing behaviors allows transportation agencies to identify operational and planning solutions that mitigate the impacts of weather on truck traffic. For freight industries, the prediction of weather impacts on truck driver’s route choices can inform a more accurate estimation of billable miles

    Hierarchical learning, forecasting coherent spatio-temporal individual and aggregated building loads

    Full text link
    Optimal decision-making compels us to anticipate the future at different horizons. However, in many domains connecting together predictions from multiple time horizons and abstractions levels across their organization becomes all the more important, else decision-makers would be planning using separate and possibly conflicting views of the future. This notably applies to smart grid operation. To optimally manage energy flows in such systems, accurate and coherent predictions must be made across varying aggregation levels and horizons. With this work, we propose a novel multi-dimensional hierarchical forecasting method built upon structurally-informed machine-learning regressors and established hierarchical reconciliation taxonomy. A generic formulation of multi-dimensional hierarchies, reconciling spatial and temporal hierarchies under a common frame is initially defined. Next, a coherency-informed hierarchical learner is developed built upon a custom loss function leveraging optimal reconciliation methods. Coherency of the produced hierarchical forecasts is then secured using similar reconciliation technics. The outcome is a unified and coherent forecast across all examined dimensions. The method is evaluated on two different case studies to predict building electrical loads across spatial, temporal, and spatio-temporal hierarchies. Although the regressor natively profits from computationally efficient learning, results displayed disparate performances, demonstrating the value of hierarchical-coherent learning in only one setting. Yet, supported by a comprehensive result analysis, existing obstacles were clearly delineated, presenting distinct pathways for future work. Overall, the paper expands and unites traditionally disjointed hierarchical forecasting methods providing a fertile route toward a novel generation of forecasting regressors

    Joint Air Quality and Weather Prediction Based on Multi-Adversarial Spatiotemporal Networks

    Full text link
    Accurate and timely air quality and weather predictions are of great importance to urban governance and human livelihood. Though many efforts have been made for air quality or weather prediction, most of them simply employ one another as feature input, which ignores the inner-connection between two predictive tasks. On the one hand, the accurate prediction of one task can help improve another task's performance. On the other hand, geospatially distributed air quality and weather monitoring stations provide additional hints for city-wide spatiotemporal dependency modeling. Inspired by the above two insights, in this paper, we propose the Multi-adversarial spatiotemporal recurrent Graph Neural Networks (MasterGNN) for joint air quality and weather predictions. Specifically, we first propose a heterogeneous recurrent graph neural network to model the spatiotemporal autocorrelation among air quality and weather monitoring stations. Then, we develop a multi-adversarial graph learning framework to against observation noise propagation introduced by spatiotemporal modeling. Moreover, we present an adaptive training strategy by formulating multi-adversarial learning as a multi-task learning problem. Finally, extensive experiments on two real-world datasets show that MasterGNN achieves the best performance compared with seven baselines on both air quality and weather prediction tasks.Comment: 9 pages, 6 figure

    Transforming Graph Representations for Statistical Relational Learning

    Full text link
    Relational data representations have become an increasingly important topic due to the recent proliferation of network datasets (e.g., social, biological, information networks) and a corresponding increase in the application of statistical relational learning (SRL) algorithms to these domains. In this article, we examine a range of representation issues for graph-based relational data. Since the choice of relational data representation for the nodes, links, and features can dramatically affect the capabilities of SRL algorithms, we survey approaches and opportunities for relational representation transformation designed to improve the performance of these algorithms. This leads us to introduce an intuitive taxonomy for data representation transformations in relational domains that incorporates link transformation and node transformation as symmetric representation tasks. In particular, the transformation tasks for both nodes and links include (i) predicting their existence, (ii) predicting their label or type, (iii) estimating their weight or importance, and (iv) systematically constructing their relevant features. We motivate our taxonomy through detailed examples and use it to survey and compare competing approaches for each of these tasks. We also discuss general conditions for transforming links, nodes, and features. Finally, we highlight challenges that remain to be addressed

    An Improved Deep Learning Model for Electricity Price Forecasting

    Get PDF
    Accurate electricity price forecasting (EPF) is important for the purpose of bidding strategies and minimizing the risk for market participants in the competitive electricity market. Besides that, EPF becomes critically important for effective planning and efficient operation of a power system due to deregulation of electricity industry. However, accurate EPF is very challenging due to complex nonlinearity in the time series-based electricity prices. Hence, this work proposed two-fold contributions which are (1) effective time series preprocessing module to ensure feasible time-series data is fitted in the deep learning model, and (2) an improved long short-term memory (LSTM) model by incorporating linear scaled hyperbolic tangent (LiSHT) layer in the EPF. In this work, the time series pre-processing module adopted linear trend of the correlated features of electricity price series and the time series are tested by using Augmented Dickey Fuller (ADF) test method. In addition, the time series are transformed using boxcox transformation method in order to satisfy the stationarity property. Then, an improved LSTM prediction module is proposed to forecast electricity prices where LiSHT layer is adopted to optimize the parameters of the heterogeneous LSTM. This study is performed using the Australian electricity market price, load and renewable energy supply data. The experimental results obtained show that the proposed EPF framework performed better compared to previous techniques

    Web Traffic Prediction Using Autoregressive, LSTM, and XGBoost Time Series Models

    Get PDF
    Web traffic is vital to the success of any online company or website in the current era of digital technology. Insightful marketing, web development, and resource allocation choices may be made with the support of reliable online traffic forecasts. In this study, we investigate the effectiveness of the Autoregressive (AR), Long Short-Term Memory (LSTM), and eXtreme Gradient Boosting (XGBoost) time series modeling strategies for forecasting website traffic. We evaluate the accuracy of these models in forecasting future online traffic by comparing their results on a real-world dataset. The performance of four different models for predicting a target variable was evaluated based on the provided information. The AR model had the highest test error, indicating poor performance, while the ARIMA model had a lower test error than the AR model, but its high SMAPE value on the training dataset suggested overfitting. The LSTM model had the lowest test error, but its high SMAPE value on the training dataset indicated that it may not have captured underlying patterns in the data well. The XGBoost model had a relatively low test error, suggesting good performance, and performed slightly better on the testing dataset than the ARIMA model. The study did not consider external factors that may impact website traffic, such as changes in search engine algorithms or other external shocks. These external factors can significantly impact website traffic, and not considering them may limit the generalizability of our study's findings
    • …
    corecore