Search CORE

5,925 research outputs found

Using patterns position distribution for software failure detection

Author: Agrawal R.
Agrawal R.
Bowring J.F.
Chandola V.
Cheng H.
Deshpande M.
Dickinson W.
Duda R.
Gamma E.
Han J.
Han J.
Hutchins M.
Liu C.
Lo D.
Lo D.
Lo D.
Mannila H.
Wang J.
Xing Z.
Yan X.
Yan X.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2013
Field of study

Pattern-based software failure detection is an important topic of research in recent years. In this method, a set of patterns from program execution traces are extracted, and represented as features, while their occurrence frequencies are treated as the corresponding feature values. But this conventional method has its limitation due to ignore the pattern’s position information, which is important for the classification of program traces. Patterns occurs in the different positions of the trace are likely to represent different meanings. In this paper, we present a novel approach for using pattern’s position distribution as features to detect software failure. The comparative experiments in both artificial and real datasets show the effectiveness of this method

Crossref

Directory of Open Access Journals

Middlesex University Research Repository

APPLYING THE ATTRIBUTED PREFIX TREE FOR MINING CLOSED SEQUENTIAL PATTERNS

Author: Thiet Pham Thi
Publication venue: 'Publishing House for Science and Technology, Vietnam Academy of Science and Technology'
Publication date: 20/03/2018
Field of study

Mining closed sequential patterns is one of important tasks in data mining. It is proposed to resolve difficult problems in mining sequential pattern such as mining long frequent sequences that contain a combinatorial number of frequent subsequences or using very low support thresholds to mine sequential patterns is usually both time- and memory-consuming. This paper applies the characteristics of closed sequential patterns and sequence extensions into the prefix tree structure to mine closed sequential patterns from the sequence database. The paper uses the parent–child relationship on prefix tree structure and each node on prefix tree is also added fields to determine whether that is a closed sequential pattern or not. Experimental results show that the number of sequential patterns is reduced significantly

Vietnam Academy of Science and Technology: Journals Online

Using patterns position distribution for software failure detection

Author: Augusto J.
Augusto J.
Chen Z.
Chen Z.
Du H.
Du H.
Li C.
Li C.
Liu J.
Liu J.
Wang H.
Wang H.
Wilkie G.
Wilkie G.
Publication venue: Atlantis Press
Publication date: 01/01/2013
Field of study

Middlesex University Research Repository

Mining complex structured data: Enhanced methods and applications

Author: Bui Dang Bach
Publication venue: Curtin University
Publication date: 01/01/2015
Field of study

Conventional approaches to analysing complex business data typically rely on process models, which are difficult to construct and use. This thesis addresses this issue by converting semi-structured event logs to a simpler flat representation without any loss of information, which then enables direct applications of classical data mining methods. The thesis also proposes an effective and scalable classification method which can identify distinct characteristics of a business process for further improvements

espace@Curtin

Mining Positional Data Streams

Author: Jens Haase
Ulf Brefeld
Publication venue
Publication date: 05/03/2020
Field of study

Abstract. We study frequent pattern mining from positional data streams. Existing approaches require discretised data to identify atomic events and are not applicable in our continuous setting. We propose an efficient trajectory-based preprocessing to identify similar movements and a distributed pattern mining algorithm to identify frequent trajectories. We empirically evaluate all parts of the processing pipeline

CiteSeerX

Discovery and Extraction of Protein Sequence Motif Information that Transcends Protein Family Boundaries

Author: Chen Bernard
Publication venue: ScholarWorks @ Georgia State University
Publication date: 17/07/2009
Field of study

Protein sequence motifs are gathering more and more attention in the field of sequence analysis. The recurring patterns have the potential to determine the conformation, function and activities of the proteins. In our work, we obtained protein sequence motifs which are universally conserved across protein family boundaries. Therefore, unlike most popular motif discovering algorithms, our input dataset is extremely large. As a result, an efficient technique is essential. We use two granular computing models, Fuzzy Improved K-means (FIK) and Fuzzy Greedy K-means (FGK), in order to efficiently generate protein motif information. After that, we develop an efficient Super Granular SVM Feature Elimination model to further extract the motif information. During the motifs searching process, setting up a fixed window size in advance may simplify the computational complexity and increase the efficiency. However, due to the fixed size, our model may deliver a number of similar motifs simply shifted by some bases or including mismatches. We develop a new strategy named Positional Association Super-Rule to confront the problem of motifs generated from a fixed window size. It is a combination approach of the super-rule analysis and a novel Positional Association Rule algorithm. We use the super-rule concept to construct a Super-Rule-Tree (SRT) by a modified HHK clustering, which requires no parameter setup to identify the similarities and dissimilarities between the motifs. The positional association rule is created and applied to search similar motifs that are shifted some residues. By analyzing the motifs results generated by our approaches, we realize that these motifs are not only significant in sequence area, but also in secondary structure similarity and biochemical properties

ScholarWorks @ Georgia State University

A Survey on Behavioral Pattern Mining from Sensor Data in Internet of Things

Author: Bhuiyan Md Zakirul
Hassan Mohammad
Kamruzzaman Joarder
Rashid Md Mamunur
Shahriar Shafin Sakib
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

The deployment of large-scale wireless sensor networks (WSNs) for the Internet of Things (IoT) applications is increasing day-by-day, especially with the emergence of smart city services. The sensor data streams generated from these applications are largely dynamic, heterogeneous, and often geographically distributed over large areas. For high-value use in business, industry and services, these data streams must be mined to extract insightful knowledge, such as about monitoring (e.g., discovering certain behaviors over a deployed area) or network diagnostics (e.g., predicting faulty sensor nodes). However, due to the inherent constraints of sensor networks and application requirements, traditional data mining techniques cannot be directly used to mine IoT data streams efficiently and accurately in real-time. In the last decade, a number of works have been reported in the literature proposing behavioral pattern mining algorithms for sensor networks. This paper presents the technical challenges that need to be considered for mining sensor data. It then provides a thorough review of the mining techniques proposed in the recent literature to mine behavioral patterns from sensor data in IoT, and their characteristics and differences are highlighted and compared. We also propose a behavioral pattern mining framework for IoT and discuss possible future research directions in this area. © 2013 IEEE

Federation ResearchOnline

aCQUIRe

Pattern Discovery in Time-Ordered Data

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref