8 research outputs found

    Understanding error log event sequence for failure analysis

    Large-scale parallel systems have evolved to the point where they are mostly employed for mission-critical applications. Anticipating and accommodating failure occurrences is therefore crucial to their design. Failure is a commonplace feature of these large-scale systems and cannot be treated as an exception. The system state is mostly captured through the logs, which contain the “health” information of the system, so a proper understanding of these error logs is extremely important for failure analysis. In this paper we design an approach that seeks similarities in the patterns of log events that lead to failures. Our experiment shows that several root causes of soft lockup failures can be traced through the logs. We capture the behavior of failure-inducing patterns and find that the log patterns of failure and non-failure sequences are dissimilar.
    Keywords: Failure Sequences; Cluster; Error Logs; HPC; Similarit

    Prediction of flight delay using deep operator network with gradient-mayfly optimisation algorithm

    Accurate flight delay prediction is fundamental to establishing an efficient airline business. It is considered one of the most critical components of intelligent aviation systems. Recently, flight delay has been a significant cause that deprives airlines of good performance. Hence, airlines must accurately forecast flight delays and comprehend their sources to provide excellent passenger experiences, increase income and minimise unwanted revenue loss. In this paper, we develop a novel optimisation-driven deep learning model for predicting flight delays by extending a state-of-the-art method, DeepONet. We utilise the Box-Cox transformation for data conversion with a minimal error rate. We also employ a deep residual network for feature fusion before training our model. Furthermore, this research uses flight on-time data for flight delay prediction. To validate our proposed model, we conducted a numerical study using data from the US Bureau of Transportation Statistics. We predict the flight delay by selecting the optimum weights using the novel DeepONet with the Gradient Mayfly Optimisation Algorithm (GMOA). Our experimental results show that the proposed GMOA-based DeepONet outperformed the existing methods with a Root Mean Square Error of 0.0765, Mean Square Error of 0.0058, Mean Absolute Error of 0.0049 and Mean Absolute Percent Error of 0.0043. When we apply 4-fold cross-validation, the proposed GMOA-based DeepONet outperformed the existing methods with minimal standard error. These results also show the importance of optimisation algorithms in deciding the optimal weights to improve model performance. The efficacy of our proposed approach in predicting flight delays with minimal errors is well demonstrated by all the evaluation metrics. 
Also, utilising the prediction outcome of our robust model to release information about the delayed flight in advance from the aviation decision systems can effectively alleviate the passengers’ nervousness.
Funding: UKRI COVID-19 recovery grant under the budget code SA077N. This research was heavily affected by the COVID-19 pandemic during the first author's PhD studies. This led to a 3-month extension to registration, which was funded by the UKRI doctoral extension recovery grant. (PTDF is the main funder of the PhD.)
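The four error measures quoted in the abstract above (RMSE, MSE, MAE and MAPE) have standard textbook definitions, and a minimal sketch of them may help when reading the reported figures. The function name `regression_errors` and the sample values are hypothetical and are not taken from the paper. MAPE is returned here as a fraction rather than a percentage, and actual values are assumed to be non-zero.

```python
import math

def regression_errors(actual, predicted):
    """Return MSE, RMSE, MAE and MAPE for two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    residuals = [a - p for a, p in zip(actual, predicted)]
    n = len(residuals)
    mse = sum(r * r for r in residuals) / n
    mae = sum(abs(r) for r in residuals) / n
    # MAPE is undefined when an actual value is zero; non-zero values are assumed.
    mape = sum(abs(r / a) for r, a in zip(residuals, actual)) / n
    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae, "mape": mape}

# Hypothetical normalised delay values, for illustration only.
errors = regression_errors([0.50, 0.20, 0.80, 0.40], [0.45, 0.25, 0.70, 0.40])
print(errors)
```

By definition, RMSE is the square root of MSE, which gives a quick consistency check on any pair of reported values.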

    Illuminating the neural landscape of pilot mental states: a convolutional neural network approach with Shapley Additive explanations interpretability

    Predicting pilots’ mental states is a critical challenge in aviation safety and performance, with electroencephalogram data offering a promising avenue for detection. However, the interpretability of machine learning and deep learning models, which are often used for such tasks, remains a significant issue. This study aims to address these challenges by developing an interpretable model to detect four mental states—channelised attention, diverted attention, startle/surprise, and normal state—in pilots using EEG data. The methodology involves training a convolutional neural network on power spectral density features of EEG data from 17 pilots. The model’s interpretability is enhanced via the use of SHapley Additive exPlanations values, which identify the top 10 most influential features for each mental state. The results demonstrate high performance in all metrics, with an average accuracy of 96%, a precision of 96%, a recall of 94%, and an F1 score of 95%. An examination of the effects of mental states on EEG frequency bands further elucidates the neural mechanisms underlying these states. The innovative nature of this study lies in its combination of high-performance model development, improved interpretability, and in-depth analysis of the neural correlates of mental states. This approach not only addresses the critical need for effective and interpretable mental state detection in aviation but also contributes to our understanding of the neural underpinnings of these states. This study thus represents a significant advancement in the field of EEG-based mental state detection

    Social ski driver conditional autoregressive-based deep learning classifier for flight delay prediction

    The importance of robust flight delay prediction has recently increased in the air transportation industry. This industry seeks alternative methods and technologies for more robust flight delay prediction because of its significance for all stakeholders. The most affected are airlines, which suffer monetary and passenger loyalty losses. Several studies have attempted to analyse and solve flight delay prediction problems using machine learning methods. This research proposes a novel alternative method, namely social ski driver conditional autoregressive-based (SSDCA-based) deep learning. Our proposed method combines the Social Ski Driver algorithm with Conditional Autoregressive Value at Risk by Regression Quantiles. We consider the most relevant instances from the training dataset, which are the delayed flights. We apply the Yeo-Johnson transformation to stabilise the data variance. We then perform the training and testing of our data using deep recurrent neural network (DRNN) and SSDCA-based algorithms. The SSDCA-based optimisation algorithm helped us choose the right network architecture with better accuracy and less error than the existing literature. The results, efficiency and computational time of our proposed SSDCA-based method were compared against existing benchmark methods. The SSDCA-based DRNN provides a more accurate flight delay prediction, with accuracy rates of 0.9361 and 0.9252 on dataset-1 and dataset-2, respectively. To show the reliability of our method, we compared it with other meta-heuristic approaches. The result is that the SSDCA-based DRNN outperformed all existing benchmark methods tested in our experiment.
    The Petroleum Trust Development Fund (PTDF) Nigeri
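The abstract above mentions the Yeo-Johnson transformation for variance stabilisation without giving details. The sketch below shows the standard definition of the transform for a single value. The parameter `lam` is fixed by hand here, which is an assumption: in practice it is usually estimated from the data, and the abstract does not say how the authors chose it. The sample delays are hypothetical.

```python
import math

def yeo_johnson(y, lam):
    """Yeo-Johnson power transform of a single value y with parameter lam."""
    if y >= 0:
        if abs(lam) < 1e-12:
            return math.log1p(y)                      # limit case lam = 0
        return ((y + 1.0) ** lam - 1.0) / lam
    if abs(lam - 2.0) < 1e-12:
        return -math.log1p(-y)                        # limit case lam = 2
    return -(((-y + 1.0) ** (2.0 - lam)) - 1.0) / (2.0 - lam)

# Hypothetical delays in minutes; negative values stand for early arrivals.
delays = [-5.0, 0.0, 12.0, 90.0]
transformed = [yeo_johnson(d, 0.5) for d in delays]
print(transformed)
```

Unlike Box-Cox, this transform accepts zero and negative inputs, which may be why it suits delay data that include early arrivals. With `lam = 1` it reduces to the identity.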

    Identifying recovery patterns from resource usage data of cluster systems

    Failure of cluster systems has proven to have adverse effects and can be costly. System administrators have employed a divide-and-conquer approach to diagnosing the root cause of such failures in order to take corrective or preventive measures. Most of the time, event logs are the source of information about the failures. Events that characterize failures are then noted and categorized as causes of failure. However, not all the 'causative' events lead to eventual failure, as some fault sequences experience recovery. Such sequences or patterns pose a challenge to system administrators and failure prediction tools because they add to false positives: they are always predicted as “failure causing” while, in reality, they do not lead to failure. In order to distinguish such recovery patterns of events from failure patterns, we propose a novel approach that utilizes resource usage data of cluster systems to identify recovery and failure sequences. We further propose an online detection approach to the same problem. We evaluate our approach on data from the Ranger Supercomputer System and the results are positive.
    Keywords: Change point detection; resource usage data; recovery sequence; detection; large-scale HPC system

    Analysing the sentiment of air-traveller: a comparative analysis

    Airport service quality is considered to be an indicator of passenger satisfaction. However, assessing this by conventional methods requires continuous observation and monitoring. Therefore, during the past few years, the use of machine learning techniques for this purpose has attracted considerable attention for analysing the sentiment of the air traveller. A sentiment analysis system for textual data analytics leverages natural language processing and machine learning techniques to determine whether a piece of writing is positive, negative or neutral. Numerous methods exist for estimating sentiments, which include lexical-based methodologies and directed artificial intelligence strategies. Despite the wide use and ubiquity of certain strategies, it remains unclear which is the best strategy for recognising the intensity of the sentiments of a message. It is necessary to compare these techniques in order to understand their advantages, disadvantages and limitations. In this paper, we compared the Valence Aware Dictionary and sentiment Reasoner, a sentiment analysis technique specifically attuned to, and well known for performing well on, social media data, with the conventional machine learning techniques of handling the textual data by converting it into numerical form. We used the review data obtained from the SKYTRAX website for each airport. The machine learning algorithms evaluated in this paper are VADER sentiment and logistic regression. The term frequency-inverse document frequency is used to convert the textual review data into the resulting numerical columns. This was formulated as a classification problem, whereby the prediction of the algorithm was compared with the actual recommendation of the passenger in the dataset. The results were analysed according to the accuracy, precision, recall and F1-score. 
From the analysis of the results, we observed that logistic regression outperformed the VADER sentiment analysis.
Homaid M, Bisandu D, Moulitsas I, Jenkins K. (2022) Analysing the sentiment of air-traveller: a comparative analysis, International Journal of Computer Theory and Engineering, Volume 14, Issue 2, May 2022, pp. 48-5
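The abstract above states that term frequency-inverse document frequency converts review text into numerical columns. A minimal sketch of that conversion is given below, using the plain textbook weighting (relative term frequency times log(N / document frequency)). This is an assumption: the paper does not state which TF-IDF variant or tokeniser was used, and common libraries apply smoothed variants. The function name `tf_idf` and the sample reviews are hypothetical.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return (vocabulary, rows): one TF-IDF weight per vocabulary term per document."""
    tokenised = [doc.lower().split() for doc in documents]
    vocabulary = sorted({term for tokens in tokenised for term in tokens})
    n_docs = len(tokenised)
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter(term for tokens in tokenised for term in set(tokens))
    rows = []
    for tokens in tokenised:
        counts = Counter(tokens)
        total = len(tokens) or 1
        rows.append([
            (counts[term] / total) * math.log(n_docs / doc_freq[term])
            for term in vocabulary
        ])
    return vocabulary, rows

# Hypothetical airport reviews, for illustration only.
reviews = ["clean airport friendly staff", "long queues rude staff", "clean lounge"]
vocabulary, matrix = tf_idf(reviews)
print(vocabulary)
print(matrix[0])
```

Each row of `matrix` is a fixed-length numeric vector, which is the form a classifier such as logistic regression expects as input.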

    A deep feedforward neural network and shallow architectures effectiveness comparison: Flight delays classification perspective

    Flight delays have negatively impacted the socio-economic state of passengers, airlines and airports, resulting in huge economic losses. Hence, it has become necessary to correctly predict their occurrence in decision-making, because this is important for the effective management of the aviation industry. Developing accurate flight delay classification models depends mostly on the complexity of the air transportation system and the infrastructure available in airports, which may be a region-specific issue. However, no specific prediction or classification model can handle the individual characteristics of all airlines and airports at the same time. Hence, the need to further develop and compare predictive models for the aviation decision system of the future cannot be over-emphasised. In this research, flight on-time data records from the United States Bureau of Transportation Statistics were employed to evaluate the performance of Deep Feedforward Neural Network, Neural Network, and Support Vector Machine models on a binary classification problem. The research revealed that the models achieved different flight delay classification accuracies. The Support Vector Machine had a lower average accuracy than the Neural Network and the Deep Feedforward Neural Network in the initial experiment. The Deep Feedforward Neural Network outperformed the Support Vector Machine and the Neural Network with the best average percentage accuracies. Further investigation of the Deep Feedforward Neural Network architecture under different parameter settings suggests that, regardless of training data size, the classification accuracy peaks. We examine which number of epochs works best in our flight delay classification settings for the Deep Feedforward Neural Network. 
Our experimental results demonstrate that having many epochs affects the convergence rate of the model; unlike when hidden layers are increased, it does not ensure better or higher accuracy in a binary classification of flight delays. Finally, we recommend further studies on the applicability of the Deep Feedforward Neural Network in flight delay prediction, with specific case studies of either airlines or airports, to check the impact on the model’s performance.

    Data supporting: 'Analysing the Sentiment of Air-Traveller: A Comparative Analysis'

    Airport service quality is considered to be an indicator of passenger satisfaction. However, assessing this by conventional methods requires continuous observation and monitoring. Therefore, during the past few years, the use of machine learning techniques for this purpose has attracted considerable attention for analysing the sentiment of the air traveller. A sentiment analysis system for textual data analytics leverages natural language processing and machine learning techniques to determine whether a piece of writing is positive, negative or neutral. Numerous methods exist for estimating sentiments, which include lexical-based methodologies and directed artificial intelligence strategies. Despite the wide use and ubiquity of certain strategies, it remains unclear which is the best strategy for recognising the intensity of the sentiments of a message. It is necessary to compare these techniques in order to understand their advantages, disadvantages and limitations. In this paper, we compared the Valence Aware Dictionary and sentiment Reasoner, a sentiment analysis technique specifically attuned to, and well known for performing well on, social media data, with the conventional machine learning techniques of handling the textual data by converting it into numerical form. We used the review data obtained from the SKYTRAX website for each airport. The machine learning algorithms evaluated in this paper are VADER sentiment and logistic regression. The term frequency-inverse document frequency is used to convert the textual review data into the resulting numerical columns. This was formulated as a classification problem, whereby the prediction of the algorithm was compared with the actual recommendation of the passenger in the dataset. The results were analysed according to the accuracy, precision, recall and F1-score. From the analysis of the results, we observed that logistic regression outperformed the VADER sentiment analysis.