16 research outputs found

    A differential semantic algorithm for query relevant web page recommendation

    Get PDF
    With the exponential rise in the amount of information in the World Wide Web, there is a need for a much efficient algorithm for Web Search. The traditional keyword matching as well as the standard statistical techniques is insufficient as the Web Pages they recommend are not highly relevant to the query. With the growth in Semantic Web, an algorithm which semantically computes the most relevant Web Pages is required. In this paper, a methodology which computes the semantic heterogeneity between the keywords, content words and query words for web page recommendation is incorporated. A Differential Adaptive PMI Algorithm is formulated for with varied thresholds for recommending the Web Pages based on the input query. The proposed methodology yields an accuracy of 0.87 which is much better than the existing strategies. ร‚ยฉ 2016 IEEE

    A taxonomy of web prediction algorithms

    Full text link
    Web prefetching techniques are an attractive solution to reduce the user-perceived latency. These techniques are driven by a prediction engine or algorithm that guesses following actions of web users. A large amount of prediction algorithms has been proposed since the first prefetching approach was published, although it is only over the last two or three years when they have begun to be successfully implemented in commercial products. These algorithms can be implemented in any element of the web architecture and can use a wide variety of information as input. This affects their structure, data system, computational resources and accuracy. The knowledge of the input information and the understanding of how it can be handled to make predictions can help to improve the design of current prediction engines, and consequently prefetching techniques. This paper analyzes fifty of the most relevant algorithms proposed along 15 years of prefetching research and proposes a taxonomy where the algorithms are classified according to the input data they use. For each group, the main advantages and shortcomings are highlighted. ยฉ 2012 Elsevier Ltd. All rights reserved.This work has been partially supported by Spanish Ministry of Science and Innovation under Grant TIN2009-08201, Generalitat Valenciana under Grant GV/2011/002 and Universitat Politecnica de Valencia under Grant PAID-06-10/2424.Domenech, J.; De La Ossa Perez, BA.; Sahuquillo Borrรกs, J.; Gil Salinas, JA.; Pont Sanjuan, A. (2012). A taxonomy of web prediction algorithms. Expert Systems with Applications. 39(9):8496-8502. https://doi.org/10.1016/j.eswa.2012.01.140S8496850239

    Referrer Graph: A cost-effective algorithm and pruning method for predicting web accesses

    Full text link
    This paper presents the Referrer Graph (RG) web prediction algorithm and a pruning method for the associated graph as a low-cost solution to predict next web users accesses. RG is aimed at being used in a real web system with prefetching capabilities without degrading its performance. The algorithm learns from users accesses and builds a Markov model. These kinds of algorithms use the sequence of the user accesses to make predictions. Unlike previous Markov model based proposals, the RG algorithm differentiates dependencies in objects of the same page from objects of different pages by using the object URI and the referrer in each request. Although its design permits us to build a simple data structure that is easier to handle and, consequently, needs lower computational cost in comparison with other algorithms, a pruning mechanism has been devised to avoid the continuous growing of this data structure. Results show that, compared with the best prediction algorithms proposed in the open literature, the RG algorithm achieves similar precision values and page latency savings but requiring much less computational and memory resources. Furthermore, when pruning is applied, additional and notable resource consumption savings can be achieved without degrading original performance. In order to reduce further the resource consumption, a mechanism to prune de graph has been devised, which reduces resource consumption of the baseline system without degrading the latency savings. 2013 Elsevier B.V. All rights reserved.This work has been partially supported by Spanish Ministry of Science and Innovation under Grant TIN2009-08201. The authors would also like to thank the technical staff of the School of Computer Science at the Polytechnic University of Valencia for providing us recent and customized trace files logged by their web server.De La Ossa Perez, BA.; Gil Salinas, JA.; Sahuquillo Borrรกs, J.; Pont Sanjuan, A. (2013). Referrer Graph: A cost-effective algorithm and pruning method for predicting web accesses. Computer Communications. 36(8):881-894. https://doi.org/10.1016/j.comcom.2013.02.005S88189436

    Time-weighted multi-touch attribution and channel relevance in the customer journey to online purchase

    Get PDF
    We address statistical issues in attributing revenue to marketing channels and inferring the importance of individual channels in customer journeys towards an online purchase. We describe the relevant data structures and introduce an example. We suggest an asymmetric bathtub shape as appropriate for time-weighted revenue attribution to the customer journey, provide an algorithm, and illustrate the method. We suggest a modification to this method when there is independent information available on the relative values of the channels. To infer channel importance, we employ sequential data analysis ideas and restrict to data which ends in a purchase. We propose metrics for source, intermediary, and destination channels based on twoand three-step transitions in fragments of the customer journey. We comment on the practicalities of formal hypothesis testing. We illustrate the ideas and computations using data from a major UK online retailer. Finally, we compare the revenue attributions suggested by the methods in this paper with several common attribution methods

    An angle-based interest model for text recommendation

    Get PDF
    Building an interest model is the key to realize personalized text recommendation. Previous interest models neglect the fact that a user may have multiple angles of interests. Different angles of interest provide different requests and criteria for text recommendation. This paper proposes an interest model that consists of two kinds of angles: persistence and pattern, which can be combined to form complex angles. The model uses a new method to represent the long-term interest and the short-term interest, and distinguishes the interest on object and the interest on the link structure of objects. Experiments with news-scale text data show that the interest on object and the interest on link structure have real requirements, and it is effective to recommend texts according to the angles

    ๋Œ€๊ทœ๋ชจ TV ์‹œ์ฒญ๋กœ๊ทธ ํด๋Ÿฌ์Šคํ„ฐ๋ง์„ ํ†ตํ•œ ์‹œ์ฒญํ–‰์œ„ ๋ฐ ์‹œ์ฒญ๊ฐ€๊ตฌ ์œ ํ˜• ๋ถ„์„ ์—ฐ๊ตฌ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ์œตํ•ฉ๊ณผํ•™๊ธฐ์ˆ ๋Œ€ํ•™์› ์œตํ•ฉ๊ณผํ•™๋ถ€, 2017. 8. ์„œ๋ด‰์›.์ตœ๊ทผ์—๋Š” ๋” ์ด์ƒ ๊ณผ๊ฑฐ์ฒ˜๋Ÿผ ๊ฐ™์€ ์‹œ๊ฐ„์— ์˜จ ๊ฐ€์กฑ์ด ๋ชจ์—ฌ ์•‰์•„ ์†Œ์œ„ ๋ณธ๋ฐฉ์‚ฌ์ˆ˜๋ฅผ ํ•˜๋Š” ํ–‰๋™๋งŒ์œผ๋กœ๋Š” TV ์‹œ์ฒญ์„ ์ดํ•ดํ•  ์ˆ˜ ์—†์„ ๋งŒํผ TV ์‹œ์ฒญ ํ–‰ํƒœ๊ฐ€ ๋งค์šฐ ๋ณต์žกํ•ด์กŒ๋‹ค. ๋‹ค์–‘ํ•œ ๋งค์ฒด์™€ ์ฝ˜ํ…์ธ  ๊ณต๊ธ‰ ์„œ๋น„์Šค๋“ค๊ณผ ์ƒํ˜ธ์ž‘์šฉํ•˜๋ฉฐ ์„œ๋กœ ์–ฝํžˆ๊ณ ์„คํ‚จ ๋ณต์žกํ•œ ์‹œ์ฒญ ํ–‰๋™์„ ๋ณด์ด๊ณ  ์žˆ๋Š” ๊ฒƒ์ด๋‹ค. TV ์‹œ์ฒญ ํ™˜๊ฒฝ์€ ์ฝ˜ํ…์ธ  ํ”Œ๋žซํผ ๋ฐ ๋””๋ฐ”์ด์Šค ํ™˜๊ฒฝ ๋ณ€ํ™”๋กœ ์ธํ•ด ๊ณผ๊ฑฐ์™€ ๋‹ฌ๋ฆฌ ํ›จ์”ฌ ์˜ˆ์ธกํ•˜๊ธฐ ์–ด๋ ค์šด ๋ณต์žกํ•œ ํ™˜๊ฒฝ์œผ๋กœ ๋ณ€๋ชจํ•˜๊ฒŒ ๋˜์—ˆ๋‹ค. TV๋ฅผ ๋‘˜๋Ÿฌ์‹ผ ํ™˜๊ฒฝ์ด ๋”์šฑ ๋ณต์žกํ•ด์ง€๋Š” ์ƒํ™ฉ์—์„œ๋„ TV ์‹œ์ฒญ์— ๋Œ€ํ•œ ์ดํ•ด๋Š” ์—ฌ์ „ํžˆ ์ค‘์š”ํ•˜๊ฒŒ ์—ฌ๊ฒจ์ง„๋‹ค. N-์Šคํฌ๋ฆฐ ์‹œ์ฒญ ํ™˜๊ฒฝ์ด ๋ณดํŽธํ™”๋˜๋ฉด์„œ TV์— ๋Œ€ํ•œ ๋น„์ค‘์ด ํ•˜๋ฝํ•˜๊ณ ๋Š” ์žˆ์œผ๋‚˜, ์•„์ง๊นŒ์ง€๋Š” TV ์‹œ์ฒญ์— ๋งŽ์€ ์‹œ๊ฐ„์„ ๋ณด๋‚ด๊ณ  ์žˆ๊ณ  ์ผ์ƒ ์ƒํ™œ์—์„œ์˜ ์ค‘์š”๋„๋„ ๋†’์€ ๋งŒํผ, TV๋Š” ์—ฌ์ „ํžˆ ์ฝ˜ํ…์ธ  ์†Œ๋น„์— ์žˆ์–ด ์ค‘์š”ํ•œ ์—ญํ• ์„ ํ•˜๊ณ  ์žˆ๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ๋‹ฌ๋ผ์ง„ ํ™˜๊ฒฝ ์†์—์„œ๋„ TV ์‹œ์ฒญ์€ ์—ฌ์ „ํžˆ ๊ฑด์žฌํ•œ ์—ฌ๊ฐ€ ํ™œ๋™์ด๋ผ ํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ๋ณต์žกํ•œ ํ™˜๊ฒฝ ์†์—์„œ ๋‹ฌ๋ผ์ง„ TV ์‹œ์ฒญ์ž์™€ ์‹œ์ฒญ ํ–‰ํƒœ์— ๋Œ€ํ•œ ์ดํ•ด๊ฐ€ ๋”์šฑ ํ•„์š”ํ•œ ์ƒํ™ฉ์ด๋ผ๋Š” ์ ์„ ์‹œ์‚ฌํ•œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ํ–‰๋™์„ ์ค‘์‹ฌ์œผ๋กœ TV ์‹œ์ฒญ์„ ์ดํ•ดํ•˜๊ณ ์ž ํ–ˆ๋˜ ๊ธฐ์กด ์—ฐ๊ตฌ์˜ ํ™•์žฅ์„ฑ ๋“ฑ์˜ ํ•œ๊ณ„์ ์„ ๊ทน๋ณตํ•˜๊ณ  ๊ถ๊ทน์ ์œผ๋กœ๋Š” ์ „ํ†ต์  ๊ด€์ ์—์„œ ๋ฒ—์–ด๋‚˜ ๋‹ค๋ณ€ํ™”๋œ TV ์‹œ์ฒญ ํ™˜๊ฒฝ์—์„œ์˜ TV ์‹œ์ฒญ์— ๋Œ€ํ•œ ์ดํ•ด์˜ ํญ์„ ๋„“ํžˆ๊ณ ์ž, ๋””์ง€ํ„ธ ์ผ€์ด๋ธ” TV ์…‹ํ†ฑ๋ฐ•์Šค ๋กœ๊ทธ๋ฅผ ํ†ตํ•ด ํš๋“ํ•œ ๋Œ€๊ทœ๋ชจ TV ์‹œ์ฒญ ๋กœ๊ทธ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ TV ์‹œ์ฒญ ํŒจํ„ด์„ ํ–‰๋™ ์ค‘์‹ฌ์œผ๋กœ ์œ ํ˜•ํ™”ํ•˜๊ณ , ์ด๋ฅผ ๋‹ค์‹œ ์‚ฌ์šฉ์ž ์ค‘์‹ฌ์œผ๋กœ ์กฐํ•ฉํ•˜์—ฌ ํ•ด์„ํ•˜๋Š” ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ์ œ์‹œํ•˜์˜€๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ๊ธฐ์กด์˜ ์›น ์‚ฌ์šฉ ๋งˆ์ด๋‹ ๋ถ„์•ผ์—์„œ ์‚ฌ์šฉ๋˜์—ˆ๋˜ ์„ธ์…˜ ํด๋Ÿฌ์Šคํ„ฐ๋ง ๊ธฐ๋ฐ˜ ์œ ํ˜•ํ™” ๋ถ„์„ ๊ธฐ๋ฒ•์„ TV ์‹œ์ฒญ ๋กœ๊ทธ์— ์ ์šฉํ•˜์˜€๋‹ค. ๋˜ํ•œ ์œ ํ˜•ํ™” ๋œ ์‹œ์ฒญ ํ–‰๋™๊ณผ ์„œ๋น„์Šค ํ•ด์ง€์œจ ๊ฐ„์˜ ์ƒ๊ด€๊ด€๊ณ„๋ฅผ ์‚ดํŽด๋ด„์œผ๋กœ์จ ๋ณธ ์—ฐ๊ตฌ์˜ ์ ‘๊ทผ ๋ฐฉ์‹์ด ์œ ํšจํ•จ์„ ์ž…์ฆํ•˜๊ณ ์ž ํ•˜์˜€๋‹ค. ์ œ์•ˆ๋œ ๋ถ„์„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ†ตํ•ด, ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ด 7๊ฐœ์˜ ์‹œ์ฒญ ํ–‰๋™ ์œ ํ˜•๊ณผ ์ด๋ฅผ ํ†ตํ•ด ์กฐํ•ฉ๋œ 8๊ฐœ์˜ ์‹œ์ฒญ ๊ฐ€๊ตฌ ์œ ํ˜•์„ ๋„์ถœํ•˜์˜€๋‹ค. ๋˜ํ•œ ๊ฐ ์‹œ์ฒญ ๊ฐ€๊ตฌ ์œ ํ˜• ๊ทธ๋ฃน ๋‚ด์˜ ์„œ๋น„์Šค ํ•ด์ง€์œจ๊ณผ ์‹œ์ฒญ ํ–‰๋™ ์œ ํ˜• ๊ตฌ์„ฑ๋น„ ๊ฐ„์˜ ์ƒ๊ด€๊ด€๊ณ„ ๋„์ถœ์„ ํ†ตํ•ด, ๋ณธ ์—ฐ๊ตฌ์—์„œ ๋„์ถœํ•œ ์‹œ์ฒญ ํ–‰๋™ ์œ ํ˜•์ด ์„œ๋น„์Šค ํ•ด์ง€๋ฅผ ์˜๋ฏธ ์žˆ๊ฒŒ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ๊ฒƒ์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด ์ œ์•ˆ๋œ ๋ถ„์„ ํ”„๋ ˆ์ž„์›Œํฌ๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ–‰๋™์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์‹œ์ฒญ ํŒจํ„ด์„ ๋ถ„์„ํ•จ์œผ๋กœ์จ, ๊ธฐ์กด์˜ ๊ฑฐ์‹œ์  ๋งฅ๋ฝ์—์„œ ์ด๋ฃจ์–ด์ง„ ์„ ํ–‰ ์—ฐ๊ตฌ๋ฅผ ํ™•์žฅํ•˜์—ฌ ํ˜„์žฌ ๋ฏธ๋””์–ด ํ™˜๊ฒฝ์—์„œ์˜ TV ์‹œ์ฒญ ํ–‰์œ„์— ๋Œ€ํ•ด ๋”์šฑ ํ’๋ถ€ํ•œ ์ดํ•ด๋ฅผ ๋„์šธ ์ˆ˜ ์žˆ์„ ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€ํ•œ๋‹ค.์ œ 1 ์žฅ ์„œ ๋ก  1 ์ œ 1 ์ ˆ ์—ฐ๊ตฌ์˜ ๋ฐฐ๊ฒฝ 1 ์ œ 2 ์ ˆ ์—ฐ๊ตฌ์˜ ๋ชฉํ‘œ 10 ์ œ 2 ์žฅ ์„ ํ–‰์—ฐ๊ตฌ 13 ์ œ 1 ์ ˆ ์ด๋ก ์  ๋ฐฐ๊ฒฝ 13 ์ œ 2 ์ ˆ ๊ธฐ์ˆ ์  ๋ฐฐ๊ฒฝ 24 ์ œ 3 ์žฅ ์—ฐ๊ตฌ ๋ฌธ์ œ 30 ์ œ 1 ์ ˆ ์—ฐ๊ตฌ ๋ฌธ์ œ 30 ์ œ 4 ์žฅ ์—ฐ๊ตฌ ๋ฐฉ๋ฒ• 33 ์ œ 1 ์ ˆ ๋ฐ์ดํ„ฐ ๊ฐœ์š” ๋ฐ ์ „์ฒ˜๋ฆฌ 34 ์ œ 2 ์ ˆ ์„ธ์…˜ ์œ ํ˜•ํ™” 38 ์ œ 3 ์ ˆ ์‹œ์ฒญ ๊ฐ€๊ตฌ ์œ ํ˜•ํ™” 46 ์ œ 4 ์ ˆ ์„ธ์…˜ ์œ ํ˜•๊ณผ ์„œ๋น„์Šค ํ•ด์ง€์˜ ์—ฐ๊ด€์„ฑ 48 ์ œ 5 ์žฅ ์—ฐ๊ตฌ ๊ฒฐ๊ณผ 49 ์ œ 1 ์ ˆ ์„ธ์…˜ ์œ ํ˜• 49 ์ œ 2 ์ ˆ ์‹œ์ฒญ ๊ฐ€๊ตฌ ์œ ํ˜• 60 ์ œ 3 ์ ˆ ์„ธ์…˜ ์œ ํ˜•๊ณผ ์„œ๋น„์Šค ํ•ด์ง€์˜ ์—ฐ๊ด€์„ฑ 77 ์ œ 6 ์žฅ ๊ฒฐ ๋ก  85 ์ œ 1 ์ ˆ ์—ฐ๊ตฌ ์š”์•ฝ 85 ์ œ 2 ์ ˆ ์—ฐ๊ตฌ์˜ ์‹œ์‚ฌ์  87 ์ œ 3 ์ ˆ ์—ฐ๊ตฌ์˜ ํ•œ๊ณ„ 90 ์ฐธ๊ณ ๋ฌธํ—Œ 92Maste

    A web page prediction model based on click-stream tree representation of user behavior

    No full text
    Predicting the next request of a user as she visits Web pages has gained importance as Web-based activity increases. Markov models and their variations, or models based on sequence mining have been found well suited for this problem. However, higher order Markov models are extremely complicated due to their large number of states whereas lower order Markov models do not capture the entire behavior of a user in a session. The models that are based on sequential pattern mining only consider the frequent sequences in the data set, making it difficult to predict the next request following a page that is not in the sequential pattern. Furthermore, it is hard to find models for mining two different kinds of information of a user session. We propose a new model that considers both the order information of pages in a session and the time spent on them. We cluster user sessions based on their pair-wise similarity and represent the resulting clusters by a click-stream tree. The new user session is then assigned to a cluster based on a similarity measure. The click-stream tree of that cluster is used to generate the recommendation set. The model can be used as part of a cache prefetching system as well as a recommendation model. Categories and Subject Descriptors I.5.2 [Pattern Recognition]: Design Methodology- classifier design and evaluatio
    corecore