Search CORE

11,854 research outputs found

A fine grained heuristic to capture web navigation patterns

Author: Borges J.
Charniak E.
José Borges
Mark Levene
Perkowitz M.
Perkowltz M.
Spiliopoulou M.
Stout R.
Zin N.
Publication venue: KDD
Publication date: 01/01/2000
Field of study

In previous work we have proposed a statistical model to capture the user behaviour when browsing the web. The user navigation information obtained from web logs is modelled as a hypertext probabilistic grammar (HPG) which is within the class of regular probabilistic grammars. The set of highest probability strings generated by the grammar corresponds to the user preferred navigation trails. We have previously conducted experiments with a Breadth-First Search algorithm (BFS) to perform the exhaustive computation of all the strings with probability above a specified cut-point, which we call the rules. Although the algorithm’s running time varies linearly with the number of grammar states, it has the drawbacks of returning a large number of rules when the cut-point is small and a small set of very short rules when the cut-point is high. In this work, we present a new heuristic that implements an iterative deepening search wherein the set of rules is incrementally augmented by first exploring trails with high probability. A stopping parameter is provided which measures the distance between the current rule-set and its corresponding maximal set obtained by the BFS algorithm. When the stopping parameter takes the value zero the heuristic corresponds to the BFS algorithm and as the parameter takes values closer to one the number of rules obtained decreases accordingly. Experiments were conducted with both real and synthetic data and the results show that for a given cut-point the number of rules induced increases smoothly with the decrease of the stopping criterion. Therefore, by setting the value of the stopping criterion the analyst can determine the number and quality of rules to be induced; the quality of a rule is measured by both its length and probability

CiteSeerX

Crossref

Birkbeck Institutional Research Online

Agents, Bookmarks and Clicks: A topical model of Web traffic

Author: Flammini Alessandro
Gonçalves Bruno
Meiss Mark
Menczer Filippo
Ramasco José J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

Analysis of aggregate and individual Web traffic has shown that PageRank is a poor model of how people navigate the Web. Using the empirical traffic patterns generated by a thousand users, we characterize several properties of Web traffic that cannot be reproduced by Markovian models. We examine both aggregate statistics capturing collective behavior, such as page and link traffic, and individual statistics, such as entropy and session size. No model currently explains all of these empirical observations simultaneously. We show that all of these traffic patterns can be explained by an agent-based model that takes into account several realistic browsing behaviors. First, agents maintain individual lists of bookmarks (a non-Markovian memory mechanism) that are used as teleportation targets. Second, agents can retreat along visited links, a branching mechanism that also allows us to reproduce behaviors such as the use of a back button and tabbed browsing. Finally, agents are sustained by visiting novel pages of topical interest, with adjacent pages being more topically related to each other than distant ones. This modulates the probability that an agent continues to browse or starts a new session, allowing us to recreate heterogeneous session lengths. The resulting model is capable of reproducing the collective and individual behaviors we observe in the empirical data, reconciling the narrowly focused browsing patterns of individual users with the extreme heterogeneity of aggregate traffic measurements. This result allows us to identify a few salient features that are necessary and sufficient to interpret the browsing patterns observed in our data. In addition to the descriptive and explanatory power of such a model, our results may lead the way to more sophisticated, realistic, and effective ranking and crawling algorithms.Comment: 10 pages, 16 figures, 1 table - Long version of paper to appear in Proceedings of the 21th ACM conference on Hypertext and Hypermedi

arXiv.org e-Print Archive

Crossref

Preprocessing and Content/Navigational Pages Identification as Premises for an Extended Web Usage Mining Model Development

Author: Dan Andrei SITAR TAUT
Daniel MICAN
Publication venue
Publication date
Field of study

From its appearance until nowadays, the internet saw a spectacular growth not only in terms of websites number and information volume, but also in terms of the number of visitors. Therefore, the need of an overall analysis regarding both the web sites and the content provided by them was required. Thus, a new branch of research was developed, namely web mining, that aims to discover useful information and knowledge, based not only on the analysis of websites and content, but also on the way in which the users interact with them. The aim of the present paper is to design a database that captures only the relevant data from logs in a way that will allow to store and manage large sets of temporal data with common tools in real time. In our work, we rely on different web sites or website sections with known architecture and we test several hypotheses from the literature in order to extend the framework to sites with unknown or chaotic structure, which are non-transparent in determining the type of visited pages. In doing this, we will start from non-proprietary, preexisting raw server logs.Knowledge Management, Web Mining, Data Preprocessing, Decision Trees, Databases

Research Papers in Economics

Principles in Patterns (PiP) : User Acceptance Testing of Course and Class Approval Online Pilot (C-CAP)

Author: Macgregor George
Publication venue: University of Strathclyde
Publication date: 01/02/2012
Field of study

The PiP Evaluation Plan documents four distinct evaluative strands, the first of which entails an evaluation of the PiP system pilot (WP7:37 – Systems & tool evaluation). Phase 1 of this evaluative strand focused on the heuristic evaluation of the PiP Course and Class Approval Online Pilot system (C-CAP) and was completed in December 2011. Phase 2 of the evaluation is broadly concerned with "user acceptance testing". This entails exploring the extent to which C-CAP functionality meets users' expectations within specific curriculum design tasks, as well as eliciting data on C-CAP's overall usability and its ability to support academics in improving the quality of curricula. The general evaluative approach adopted therefore employs a combination of standard Human-Computer Interaction (HCI) approaches and specially designed data collection instruments, including protocol analysis, stimulated recall and pre- and post-test questionnaire instruments. This brief report summarises the methodology deployed, presents the results of the evaluation and discusses their implications for the further development of C-CAP

University of Strathclyde Institutional Repository

Characterizations of User Web Revisit Behavior

Author: Herder E.
Publication venue
Publication date: 01/01/2005
Field of study

In this article we update and extend on earlier long-term studies on user's page revisit behavior. Revisits ar

CiteSeerX

University of Twente Research Information

Recommended from our members

Evaluating usability of e-commerce sites by tracking eye movements

Author: Tzanidou Ekaterini
Publication venue
Publication date: 01/01/2006
Field of study

The majority of existing e-commerce design guidelines has been derived by conducting heuristic evaluations, without reporting the involvement of the users themselves. This research provides clarification on a number of existing web design guidelines for e-commerce sites based on empirical studies with users. Four studies were conducted and each study focused on a specific set of design guidelines as found in the literature. A combined qualitative and quantitative approach has been used, including a state-of-the-art technique, eye tracking. The eye movement data were complemented by user-profile data elicited through background questionnaires and user-perception data as captured through semi-structured interviews. The first study investigated users’ initial impressions of homepages of e-commerce sites. The second study examined users’ adaptability to persistent or varied placement of design elements. The third and fourth studies explored the effect of the presentation format of e-commerce web pages: the first in terms of the proportion of images, and the second in terms of how key icons related to an e-commerce transaction were presented. On the whole, the results of the studies corroborated existing design guidelines, but they also identified potential refinements. The thesis contributes both methodologically and empirically to Human-Computer Interaction. The combined methodological approach enables insight into the user experience that spans behavioural aspects such as visual search behaviour and visual search performance data, and subjective aspects such as user expectations and preferences. The empirical outcomes amplify the design guidelines from a user’s perspective

Open Research Online (The Open University)

OpenGrey Repository

Footprints of information foragers: Behaviour semantics of visual exploration

Author: Chen C
Cribbin T
Kuljis J
Macredie RD
Publication venue: Academic Press (Elsevier)
Publication date: 01/01/2002
Field of study

Social navigation exploits the knowledge and experience of peer users of information resources. A wide variety of visual–spatial approaches become increasingly popular as a means to optimize information access as well as to foster and sustain a virtual community among geographically distributed users. An information landscape is among the most appealing design options of representing and communicating the essence of distributed information resources to users. A fundamental and challenging issue is how an information landscape can be designed such that it will not only preserve the essence of the underlying information structure, but also accommodate the diversity of individual users. The majority of research in social navigation has been focusing on how to extract useful information from what is in common between users' profiles, their interests and preferences. In this article, we explore the role of modelling sequential behaviour patterns of users in augmenting social navigation in thematic landscapes. In particular, we compare and analyse the trails of individual users in thematic spaces along with their cognitive ability measures. We are interested in whether such trails can provide useful guidance for social navigation if they are embedded in a visual–spatial environment. Furthermore, we are interested in whether such information can help users to learn from each other, for example, from the ones who have been successful in retrieving documents. In this article, we first describe how users' trails in sessions of an experimental study of visual information retrieval can be characterized by Hidden Markov Models. Trails of users with the most successful retrieval performance are used to estimate parameters of such models. Optimal virtual trails generated from the models are visualized and animated as if they were actual trails of individual users in order to highlight behavioural patterns that may foster social navigation. The findings of the research will provide direct input to the design of social navigation systems as well as to enrich theories of social navigation in a wider context. These findings will lead to the further development and consolidation of a tightly coupled paradigm of spatial, semantic and social navigation

CiteSeerX

Brunel University Research Archive