6,159 research outputs found
Recommended from our members
Explainable and Advisable Learning for Self-driving Vehicles
Deep neural perception and control networks are likely to be a key component of self-driving vehicles. These models need to be explainable - they should provide easy-to-interpret rationales for their behavior - so that passengers, insurance companies, law enforcement, developers, etc., can understand what triggered a particular behavior. Explanations may be triggered by the neural controller, namely introspective explanations, or informed by the neural controller's output, namely rationalizations. Our work has focused on the challenge of generating introspective explanations of deep models for self-driving vehicles. In Chapter 3, we begin by exploring the use of visual explanations. These explanations take the form of real-time highlighted regions of an image that causally influence the network's output (steering control). In the first stage, we use a visual attention model to train a convolution network end-to-end from images to steering angle. The attention model highlights image regions that potentially influence the network's output. Some of these are true influences, but some are spurious. We then apply a causal filtering step to determine which input regions actually influence the output. This produces more succinct visual explanations and more accurately exposes the network's behavior. In Chapter 4, we add an attention-based video-to-text model to produce textual explanations of model actions, e.g. "the car slows down because the road is wet". The attention maps of controller and explanation model are aligned so that explanations are grounded in the parts of the scene that mattered to the controller. We explore two approaches to attention alignment, strong- and weak-alignment. These explainable systems represent an externalization of tacit knowledge. The network's opaque reasoning is simplified to a situation-specific dependence on a visible object in the image. This makes them brittle and potentially unsafe in situations that do not match training data. In Chapter 5, we propose to address this issue by augmenting training data with natural language advice from a human. Advice includes guidance about what to do and where to attend. We present the first step toward advice-giving, where we train an end-to-end vehicle controller that accepts advice. The controller adapts the way it attends to the scene (visual attention) and the control (steering and speed). Further, in Chapter 6, we propose a new approach that learns vehicle control with the help of long-term (global) human advice. Specifically, our system learns to summarize its visual observations in natural language, predict an appropriate action response (e.g. "I see a pedestrian crossing, so I stop"), and predict the controls, accordingly
SPATIO-TEMPORAL DYNAMICS OF SHORT-TERM TRAFFIC
Short-term traffic forecasting and missing data imputation can benefit from the use of neighboring traffic information, in addition to temporal data alone. However, little attention has been given to quantifying the effect of upstream and downstream traffic on the traffic at current location. The knowledge about temporal and spatial propagation of traffic is still limited in the current literature. To fill this gap, this dissertation research focus on revealing the spatio-temporal correlations between neighboring traffic to develop reliable algorithms for short-term traffic forecasting and data imputation based on spatio-temporal dynamics of traffic.
In the first part of this dissertation, spatio-temporal relationships of speed series from consecutive segments were studied for different traffic conditions. The analysis results show that traffic speeds of consecutive segments are highly correlated. While downstream traffic tends to replicate the upstream condition under light traffic conditions, it may also affect upstream condition during congestion and build up situations. These effects were statistically quantified and an algorithm for properly choosing the “best” or most correlated neighbor(s), for potential traffic prediction or imputation purposes was proposed.
In the second part of the dissertation, a spatio-temporal kriging (ST-Kriging) model that determines the most desirable extent of spatial and temporal traffic data from neighboring locations was developed for short-term traffic forecasting. The new ST-Kriging model outperforms all benchmark models under various traffic conditions.
In the final part of the dissertation, a spatio-temporal data imputation approach was proposed and its performance was evaluated under scenarios with different data missing rates. Compared against previous methods, better flexibility and stable imputation accuracy were reported for this new imputation technique
Forecasting monthly airline passenger numbers with small datasets using feature engineering and a modified principal component analysis
In this study, a machine learning approach based on time series models, different feature engineering, feature extraction, and feature derivation is proposed to improve air passenger forecasting. Different types of datasets were created to extract new features from the core data. An experiment was undertaken with artificial neural networks to test the performance of neurons in the hidden layer, to optimise the dimensions of all layers and to obtain an optimal choice of connection weights – thus the nonlinear optimisation problem could be solved directly. A method of tuning deep learning models using H2O (which is a feature-rich, open source machine learning platform known for its R and Spark integration and its ease of use) is also proposed, where the trained network model is built from samples of selected features from the dataset in order to ensure diversity of the samples and to improve training. A successful application of deep learning requires setting numerous parameters in order to achieve greater model accuracy. The number of hidden layers and the number of neurons, are key parameters in each layer of such a network. Hyper-parameter, grid search, and random hyper-parameter approaches aid in setting these important parameters. Moreover, a new ensemble strategy is suggested that shows potential to optimise parameter settings and hence save more computational resources throughout the tuning process of the models. The main objective, besides improving the performance metric, is to obtain a distribution on some hold-out datasets that resemble the original distribution of the training data. Particular attention is focused on creating a modified version of Principal Component Analysis (PCA) using a different correlation matrix – obtained by a different correlation coefficient based on kinetic energy to derive new features. The data were collected from several airline datasets to build a deep prediction model for forecasting airline passenger numbers. Preliminary experiments show that fine-tuning provides an efficient approach for tuning the ultimate number of hidden layers and the number of neurons in each layer when compared with the grid search method. Similarly, the results show that the modified version of PCA is more effective in data dimension reduction, classes reparability, and classification accuracy than using traditional PCA.</div
- …