Search CORE

1,054 research outputs found

Determining factors behind the PageRank log-log plot

Author: Bonato Anthony
Chung Fan R.K.
Donato Debora
Litvak Nelly
Volkovich Yana
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

We study the relation between PageRank and other parameters of information networks such as in-degree, out-degree, and the fraction of dangling nodes. We model this relation through a stochastic equation inspired by the original definition of PageRank. Further, we use the theory of regular variation to prove that PageRank and in-degree follow power laws with the same exponent. The difference between these two power laws is in a multiple coefficient, which depends mainly on the fraction of dangling nodes, average in-degree, the power law exponent, and damping factor. The out-degree distribution has a minor effect, which we explicitly quantify. Our theoretical predictions show a good agreement with experimental data on three different samples of the Web

arXiv.org e-Print Archive

CiteSeerX

University of Twente Research Information

A framework for evaluating statistical dependencies and rank correlations in power law graphs

Author: Litvak N.
Volkovich Y.V.
Zwart B.
Publication venue: Department of Applied Mathematics, University of Twente
Publication date: 01/01/2008
Field of study

We analyze dependencies in power law graph data (Web sample, Wikipedia sample and a preferential attachment graph) using statistical inference for multivariate regular variation. To the best of our knowledge, this is the first attempt to apply the well developed theory of regular variation to graph data. The new insights this yields are striking: the three above-mentioned data sets are shown to have a totally different dependence structure between different graph parameters, such as in-degree and PageRank. Based on the proposed methodology, we suggest a new measure for rank correlations. Unlike most known methods, this measure is especially sensitive to rank permutations for topranked nodes. Using this method, we demonstrate that the PageRank ranking is not sensitive to moderate changes in the damping factor

CWI's Institutional Repository

University of Twente Research Information

Asymptotic analysis for personalized Web search

Author: Litvak N.
Volkovich Y.V.
Publication venue: Department of Applied Mathematics, University of Twente
Publication date: 01/01/2008
Field of study

Personalized PageRank is used in Web search as an importance measure for Web documents. The goal of this paper is to characterize the tail behavior of the PageRank distribution in the Web and other complex networks characterized by power laws. To this end, we model the PageRank as a solution of a stochastic equation

R\stackrel{d}{=}\sum_{i=1}^NA_iR_i+B

, where

R_i

's are distributed as

R

. This equation is inspired by the original definition of the PageRank. In particular,

N

models the number of incoming links of a page, and

B

stays for the user preference. Assuming that

N

B

are heavy-tailed, we employ the theory of regular variation to obtain the asymptotic behavior of

R

under quite general assumptions on the involved random variables. Our theoretical predictions show a good agreement with experimental data

University of Twente Research Information

PageRank in scale-free random graphs

Author: G Alsmeyer
G Pandurangan
K Avrachenkov
M Olvera-Cravioto
N Chen
N Litvak
PR Jelenković
PR Jelenković
PR Jelenković
S Brin
Y Volkovich
Publication venue
Publication date: 15/08/2014
Field of study

We analyze the distribution of PageRank on a directed configuration model and show that as the size of the graph grows to infinity it can be closely approximated by the PageRank of the root node of an appropriately constructed tree. This tree approximation is in turn related to the solution of a linear stochastic fixed point equation that has been thoroughly studied in the recent literature

arXiv.org e-Print Archive

Crossref

University of Twente Research Information

Usage Bibliometrics

Author: Abt
Accomazzi
Aggarwal
Baldi
Bar-Ilan
Bensman
Bertot
Blecic
Bollen
Bollen
Bollen
Bollen
Bollen
Bollen
Bollen
Bonitz
Borgman
Boyack
Boyack
Brin
Broadus
Broadus
Brody
Brody
Brookes
Burton
Börner
Börner
Castellano
Chen
Cooper
Craig
Cronin
Cronin
Cronin
Darmoni
Davis
Davis
Davis
Davis
Davis
Drott
Duy
Eason
Egghe
Eichhorn
Eysenbach
Eysenbach
Fortunato
Freire
Galvin
Gardner
Garfield
Garfield
Garfield
Gargouri
Georgakopoulos
Ginsparg
Ginsparg
Ginsparg
Ginsparg
Goldberg
Gosnell
Grant
Gross
Hajjem
Harnad
Harnad
Harnad
He
Henneken
Henneken
Henneken
Hider
Hood
Huntington
Jamali
Jansen
Jansen
Kaplan
King
King
King
King
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Kurtz
Ladwig
Lawrence
Leydesdorff
Leydesdorff
Leydesdorff
Line
Line
Liu
Ludascher
Luther
MacRoberts
May
Mayr
McDonald
Meadows
Merton
Moed
Moed
Moed
Moya-Anegón
Nicholas
Norris
Pan
Parker
Peters
Pinski
Pirolli
Price
Price
Price
Rice
Rosvall
Rowlands
Rowlands
Scales
Shepherd
Small
Stankus
Szalay
Szalay
Tenopir
Tenopir
Tonta
Trimble
Trimble
Tsay
Tsay
Van de Sompel
Van de Sompel
Van de Sompel
Walter
Wang
Wasserman
White
Wilson
York
Publication venue: 'Wiley'
Publication date: 14/02/2011
Field of study

Scholarly usage data provides unique opportunities to address the known shortcomings of citation analysis. However, the collection, processing and analysis of usage data remains an area of active research. This article provides a review of the state-of-the-art in usage-based informetric, i.e. the use of usage data to study the scholarly process.Comment: Publisher's PDF (by permission). Publisher web site: books.infotoday.com/asist/arist44.shtm

arXiv.org e-Print Archive

Crossref