72,783 research outputs found

    Mutual information based clustering of market basket data for profiling users

    Get PDF
    Attraction and commercial success of web sites depend heavily on the additional values visitors may find. Here, individual, automatically obtained and maintained user profiles are the key for user satisfaction. This contribution shows for the example of a cooking information site how user profiles might be obtained using category information provided by cooking recipes. It is shown that metrical distance functions and standard clustering procedures lead to erroneous results. Instead, we propose a new mutual information based clustering approach and outline its implications for the example of user profiling

    Profiling user activities with minimal traffic traces

    Full text link
    Understanding user behavior is essential to personalize and enrich a user's online experience. While there are significant benefits to be accrued from the pursuit of personalized services based on a fine-grained behavioral analysis, care must be taken to address user privacy concerns. In this paper, we consider the use of web traces with truncated URLs - each URL is trimmed to only contain the web domain - for this purpose. While such truncation removes the fine-grained sensitive information, it also strips the data of many features that are crucial to the profiling of user activity. We show how to overcome the severe handicap of lack of crucial features for the purpose of filtering out the URLs representing a user activity from the noisy network traffic trace (including advertisement, spam, analytics, webscripts) with high accuracy. This activity profiling with truncated URLs enables the network operators to provide personalized services while mitigating privacy concerns by storing and sharing only truncated traffic traces. In order to offset the accuracy loss due to truncation, our statistical methodology leverages specialized features extracted from a group of consecutive URLs that represent a micro user action like web click, chat reply, etc., which we call bursts. These bursts, in turn, are detected by a novel algorithm which is based on our observed characteristics of the inter-arrival time of HTTP records. We present an extensive experimental evaluation on a real dataset of mobile web traces, consisting of more than 130 million records, representing the browsing activities of 10,000 users over a period of 30 days. Our results show that the proposed methodology achieves around 90% accuracy in segregating URLs representing user activities from non-representative URLs

    Hybrid Profiling in Information Retrieval

    Get PDF
    Abstract-One of the main challenges in search engine quality of service is how to satisfy the needs and the interests of individual users. This raises the fundamental issue of how to identify and select the information that is relevant to a specific user. This concern over generic provision and the lack of search precision have provided the impetus for the research into Web Search personalisation. In this paper a hybrid user profiling system is proposed -a combination of explicit and implicit user profiles for improving the web search effectiveness in terms of precision and recall. The proposed system is content-based and implements the Vector Space Model. Experimental results, supported by significance tests, indicate that the system offers better precision and recall in comparison to traditional search engines

    The state-of-the-art in personalized recommender systems for social networking

    Get PDF
    With the explosion of Web 2.0 application such as blogs, social and professional networks, and various other types of social media, the rich online information and various new sources of knowledge flood users and hence pose a great challenge in terms of information overload. It is critical to use intelligent agent software systems to assist users in finding the right information from an abundance of Web data. Recommender systems can help users deal with information overload problem efficiently by suggesting items (e.g., information and products) that match users’ personal interests. The recommender technology has been successfully employed in many applications such as recommending films, music, books, etc. The purpose of this report is to give an overview of existing technologies for building personalized recommender systems in social networking environment, to propose a research direction for addressing user profiling and cold start problems by exploiting user-generated content newly available in Web 2.0

    Machine Learning of User Profiles: Representational Issues

    Full text link
    As more information becomes available electronically, tools for finding information of interest to users becomes increasingly important. The goal of the research described here is to build a system for generating comprehensible user profiles that accurately capture user interest with minimum user interaction. The research described here focuses on the importance of a suitable generalization hierarchy and representation for learning profiles which are predictively accurate and comprehensible. In our experiments we evaluated both traditional features based on weighted term vectors as well as subject features corresponding to categories which could be drawn from a thesaurus. Our experiments, conducted in the context of a content-based profiling system for on-line newspapers on the World Wide Web (the IDD News Browser), demonstrate the importance of a generalization hierarchy and the promise of combining natural language processing techniques with machine learning (ML) to address an information retrieval (IR) problem.Comment: 6 page

    WebPUM : a web-based recommendation system to predict user future movements.

    Get PDF
    Web usage mining has become the subject of exhaustive research, as its potential for Web-based personalized services, prediction of user near future intentions, adaptive Web sites, and customer profiling are recognized. Recently, a variety of recommendation systems to predict user future movements through Web usage mining have been proposed. However, the quality of recommendations in the current systems to predict user future requests in a particular Web site is below satisfaction. To effectively provide online prediction, we have developed a recommendation system called WebPUM, an action using Web usage mining system and propose a novel approach online prediction for classifying user navigation patterns to predict users’ future intentions. The approach is based on the new graph partitioning algorithm to model user navigation patterns for the navigation patterns mining phase. Furthermore, longest common subsequence algorithm is used for classifying current user activities to predict user next movement. The proposed system has been tested on CTI and MSNBC datasets. The results show an improvement in the quality of recommendations. Furthermore, experiments on scalability prove that the size of dataset and the number of the users in dataset do not significantly contribute to the percentage of accuracy

    A recommender system approach for classifying user navigation patterns using longest common subsequence algorithm.

    Get PDF
    Prediction of user future movements and intentions based on the users’ clickstream data is a main challenging problem in Web based recommendation systems. Web usage mining based on the users’ clickstream data has become the subject of exhaustive research, as its potential for web based personalized services, predicting user near future intentions, adaptive Web sites and customer profiling is recognized. A variety of the recommender systems for online personalization through web usage mining have been proposed. However, the quality of the recommendations in the current systems to predict users’ future intentions systems cannot still satisfy users in the particular huge web sites. In this paper, to provide online predicting effectively, we develop a model for online predicting through web usage mining system and propose a novel approach for classifying user navigation patterns to predict users’ future intentions. The approach is based on the using longest common subsequence algorithm to classify current user activities to predict user next movement. We have tested our proposed model on the CTI datasets. The results indicate that the approach can improve the quality of the system for the predictions

    A Novel Framework For User Customizable Privacy Preserving Search

    Get PDF
    The objective of the Personalized web search (PWS) is to provide an effective and efficient search results, which are tailor mode for individual user needs. we build user profiles based on user preference and these profiles are then used to re-rank the search results and rank the order of user-examined results.User privacy can be protected without affecting the personalized search quality. However, users are troubled, with exposing personal preference information to search engines has become a major limitation for profile based personalized web search.The Privacy-preserving personalized web search framework is called UPS framework which can generalize profiles for each query according to user-specific privacy requirements. .In general, there is a tradeoff between the search quality and the level of privacy protection achieved from generalization. Effective generalization algorithms namely GreedyDP and GreedyIL are used to support the runtime profiling. Experiments are conducted on real web search data show that the algorithms are effective in enhancing the stability of the search quality and avoids the unnecessary exposure of the user profile. DOI: 10.17762/ijritcc2321-8169.150313

    An effective approach for personalized web search based on community-cluster analysis

    Get PDF
    The concept of Personalized Web Search is commonly used for improving the quality of web search results by identifying and facilitating different users' search needs. There are several techniques such as user profiling, content analysis, hyperlink analysis and biased PageRank algorithm that are used to achieve web personalization. User Profiling is one of the widely used techniques for personalizing web search at large scale. But it contains several technical and ethical issues such as privacy violations, inefficient use of computing resources as well. Collaborative web search is also a kind of a relatively "new concept which defines the way of optimizing/personalizing search results by using details of group of people and contributing the knowledge of all of them about web search. This paper presents the details of an alternative approach for personalizing web results by using user profiling technique with community cluster analysis of collaborative web search by adapting concept of reusability 'among web results
    corecore