38 research outputs found

    A Lightweight Anomaly Detection Approach in Large Logs Using Generalizable Automata

    Get PDF
    In this thesis, we focus on the problem of detecting anomalies in large log data. Logs are generated at runtime and contain a wealth of information, useful for various software engineering tasks, including debugging, performance analysis, and fault diagnosis. Our anomaly detection approach is based on the multiresolution abnormal trace detection algorithm proposed in the literature. The algorithm exploits the causal relationship of events in large execution traces to build a model that represents the normal behaviour of a system using varying length n-grams and a generalizable automaton. The resulting model is later used to detect deviations from normalcy. In this thesis, we investigate the application of this algorithm in detecting anomalies in log data. Logs and execution traces are different. Unlike traces, logs do not exhibit a causal relationship among their events, raising questions as to the effectiveness of automata to model log data for anomaly detection. Logs are unstructured data and hence require the use of parsing and abstraction techniques. We propose a process, called LogAutomata, which uses the multiresolution abnormal trace detection algorithm as its primary mechanism. When applying LogAutomata to a large log file generated from the execution of Hadoop Distributed File System (HDFS), we show that the multiresolution algorithm can be a very effective way to detect anomalies in log data

    Sixth Biennial Report : August 2001 - May 2003

    No full text

    Character Recognition

    Get PDF
    Character recognition is one of the pattern recognition technologies that are most widely used in practical applications. This book presents recent advances that are relevant to character recognition, from technical topics such as image processing, feature extraction or classification, to new applications including human-computer interfaces. The goal of this book is to provide a reference source for academic research and for professionals working in the character recognition field

    Radioactive Waste

    Get PDF
    The safe management of nuclear and radioactive wastes is a subject that has recently received considerable recognition due to the huge volume of accumulative wastes and the increased public awareness of the hazards of these wastes. This book aims to cover the practice and research efforts that are currently conducted to deal with the technical difficulties in different radioactive waste management activities and to introduce to the non-technical factors that can affect the management practice. The collective contribution of esteem international experts has covered the science and technology of different management activities. The authors have introduced to the management system, illustrate how old management practices and radioactive accident can affect the environment and summarize the knowledge gained from current management practice and results of research efforts for using some innovative technologies in both pre-disposal and disposal activities

    Modeling and experiments with low-frequency pressure wave propagation in liquid-filled, flexible tubes

    Get PDF

    Predicting room acoustical behavior with the ODEON computer model

    Get PDF

    Treatment of early and late reflections in a hybrid computer model for room acoustics

    Get PDF

    Advanced Computational Methods for Oncological Image Analysis

    Get PDF
    [Cancer is the second most common cause of death worldwide and encompasses highly variable clinical and biological scenarios. Some of the current clinical challenges are (i) early diagnosis of the disease and (ii) precision medicine, which allows for treatments targeted to specific clinical cases. The ultimate goal is to optimize the clinical workflow by combining accurate diagnosis with the most suitable therapies. Toward this, large-scale machine learning research can define associations among clinical, imaging, and multi-omics studies, making it possible to provide reliable diagnostic and prognostic biomarkers for precision oncology. Such reliable computer-assisted methods (i.e., artificial intelligence) together with clinicians’ unique knowledge can be used to properly handle typical issues in evaluation/quantification procedures (i.e., operator dependence and time-consuming tasks). These technical advances can significantly improve result repeatability in disease diagnosis and guide toward appropriate cancer care. Indeed, the need to apply machine learning and computational intelligence techniques has steadily increased to effectively perform image processing operations—such as segmentation, co-registration, classification, and dimensionality reduction—and multi-omics data integration.

    On the classification of time series and cross wavelet phase variance

    Get PDF
    The continuous wavelet transform (CWT) is arguably one of the best tools to explore underlying characteristic features of time series data. Its application in large time series classification experiments, however, has been severely limited due to the large amount of redundant associated information. By extending the capabilities of the CWT to perform cross wavelet analysis (CWA), common frequency behaviour between two time series is highlighted, and the potential to extract amplitude modulated (AM) and frequency modulation (FM) characteristics in an automated way is possible. Characterisation of AM is relatively straightforward and can be resolved by any number of Euclidean based techniques in both the time and frequency domains. FM on the other hand, is somewhat more difficult as it transcends multiple wavelet scales. In this study, linear combinations of scales are used to extract both AM similarity (derived from global wavelet power spectra) and FM coherency, using a new method developed called cross wavelet phase variance (CWPV). The methodology is applied to large scale classification problems (using benchmark time series), in which the method clearly outperforms other common distance-based measures. Lastly, the approach provides a powerful framework in which AM and FM characteristics common between time series can be explicitly mapped to their corresponding scales, and with some initial optimisation, can be applied to any classification problem
    corecore