13 research outputs found

    Efficient Web Usage Mining Process for Sequential Patterns

    Full text link
    The tremendous growth in volume of web usage data results in the boost of web mining research with focus on discovering potentially useful knowledge from web usage data. This paper presents a new web usage mining process for finding sequential patterns in web usage data which can be used for predicting the possible next move in browsing sessions for web personalization. This process consists of three main stages: preprocessing web access sequences from the web server log, mining preprocessed web log access sequences by a tree-based algorithm, and predicting web access sequences by using a dynamic clustering-based model. It is designed based on the integration of the dynamic clustering-based Markov model with the Pre-Order Linked WAP-Tree Mining (PLWAP) algorithm to enhance mining performance. The proposed mining process is verified by experiments with promising results

    Semantic-enhanced web-page recommender systems

    Full text link
    University of Technology, Sydney. Faculty of Engineering and Information Technology.This thesis presents a new framework for a semantic-enhanced Web-page recommender (WPR) system, and a suite of enabling techniques which include semantic network models of domain knowledge and Web usage knowledge, querying techniques, and Web-page recommendation strategies. The framework enables the system to automatically discover and construct the domain and Web usage knowledge bases, and to generate effective Webpage recommendations. The main contributions of the framework are fourfold: (1) it effectively changes the fact that knowledge base construction must rely on human experts; (2) it enriches the pool of candidate Web-pages for effective Web-page recommendations by using semantic knowledge of both Web-pages and Web usage; (3) it thoroughly resolves the inconsistency problem facing contemporary WPR systems which heavily employ heterogeneous representations of knowledge bases. Knowledge bases in the system are consistently represented in a formal Web ontology language, namely OWL; and (4) it can generate effective Web-page recommendations based on a set of thoughtfully-designed recommendation strategies. A prototype of the semantic-enhanced WPR system is developed and presented, and the experimental comparisons with existing WPR approaches convincingly prove the significantly improved performance of WPR systems based on the framework and its enabling techniques

    Web-Page Recommendation Based on Web Usage and Domain Knowledge

    Full text link
    © 1989-2012 IEEE. Web-page recommendation plays an important role in intelligent Web systems. Useful knowledge discovery from Web usage data and satisfactory knowledge representation for effective Web-page recommendations are crucial and challenging. This paper proposes a novel method to efficiently provide better Web-page recommendation through semantic-enhancement by integrating the domain and Web usage knowledge of a website. Two new models are proposed to represent the domain knowledge. The first model uses an ontology to represent the domain knowledge. The second model uses one automatically generated semantic network to represent domain terms, Web-pages, and the relations between them. Another new model, the conceptual prediction model, is proposed to automatically generate a semantic network of the semantic Web usage knowledge, which is the integration of domain knowledge and Web usage knowledge. A number of effective queries have been developed to query about these knowledge bases. Based on these queries, a set of recommendation strategies have been proposed to generate Web-page candidates. The recommendation results have been compared with the results obtained from an advanced existing Web Usage Mining (WUM) method. The experimental results demonstrate that the proposed method produces significantly higher performance than the WUM method

    Ontology-style web usage model for semantic Web applications

    Full text link
    Current semantic recommender systems aim to exploit the website ontologies to produce valuable web recommendations. However, Web usage knowledge for recommendation is presented separately and differently from the domain ontology, this leads to the complexity of using inconsistent knowledge resources. This paper aims to solve this problem by proposing a novel ontology-style model of Web usage to represent the non-taxonomic visiting relationship among the visited pages. The output of this model is an ontology-style document which enables the discovered web usage knowledge to be sharable and machine-understandable in semantic Web applications, such as recommender systems. A case study is presented to show how this model is used in conjunction of the web usage mining and web recommendation. Two real-world datasets are used in the case study. © 2010 IEEE

    Experimental Investigation of PSO Based Web User Session Clustering

    No full text
    Web user session clustering is very important in web usage mining for web personalization. This paper proposes a Particle Swarm Optimization (PSO) based sequence clustering approach and presents an experimentally investigation of the PSO based sequence clustering methods, which use three original PSO variants and their corresponding variants of a hybrid PSO with real value mutation. The investigation was conducted in 45 test cases using five web user session datasets extracted from a real world web site. The experimental results of these methods are compared with the results obtained from the traditional k-means clustering method. Some interesting observations have been made. In the most of test cases under consideration, the PSO and PSO-RVM methods have better performance than the k-means method. Furthermore, the PSO-RVM methods show better performance than the corresponding PSO methods in the cases in which the similarity measure function is more complex. © 2009 IEEE

    Investigation of sequential pattern mining techniques for web recommendation

    Full text link
    Increased application of sequence mining in web recommender systems (WRS) requires a better understanding of the performance and a clear identification of the strengths and weaknesses of existing algorithms. Among the commonly used sequence mining methods, the tree-based approach, such as pre-order linked WAP-tree mining algorithm (PLWAP-Mine) and conditional sequence mining algorithm (CS-Mine), has demonstrated high performance in web mining applications. However, its advantages over other mining methods are not well explained and understood in the context of WRS. This paper firstly reviews the existing sequence mining algorithms, and then studies the performance of two outstanding algorithms, i.e., the PLWAP-Mine and CS-Mine algorithms, with respect to their sensitivity to the dataset variability, and their practicality for web recommendation. The results show that CS-Mine performs faster than PLWAP-Mine, but the frequent patterns generated by PLWAP-Mine are more effective than CS-Mine when applied in web recommendations. These results are useful to WRS developers for the selection of appropriate sequence mining algorithms. © 2012 Inderscience Enterprises Ltd

    BSPNN: Boosted subspace probabilistic neural network for email security

    Full text link
    In the modern age of Internet connectivity, advanced information systems have accumulated huge volumes of data. Such fast growing, tremendous amount of data, collected and stored in large databases has far exceeded our human ability to comprehend without proper tools. There has been a great deal of research conducted to explore the potential applications of Machine Learning technologies in Security Informatics. This article studies the Network Security Detection problems in which predictive models are constructed to detect network security breaches such as spamming. Due to overwhelming volume ofdata, complexity and dynamics of computer networks and evolving cyber threats, current security systems suffer limited performance with low detection accuracy and high number of false alarms. To address such performance issues, a novel Machine Learning algorithm, namely Boosted Subspace Probabilistic Neural Network (BSPNN), has been proposed which combines a Radial Basis Function Neural Network with an innovative diversity-based ensemble learning framework. Extensive empirical analyses suggested that BSPNN achieved high detection accuracy with relatively small computational complexity compared with other conventional detection methods. © Springer Science+Business Media B.V. 2010
    corecore