Search CORE

19,622 research outputs found

On sampling nodes in a network

Author: CHIERICHETTI FLAVIO
Dasgupta Anirban
Kumar Ravi
Lattanzi Silvio
Sarlós Tamás
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2016
Field of study

Random walk is an important tool in many graph mining applications including estimating graph parameters, sampling portions of the graph, and extracting dense communities. In this paper we consider the problem of sampling nodes from a large graph according to a prescribed distribution by using random walk as the basic primitive. Our goal is to obtain algorithms that make a small number of queries to the graph but output a node that is sampled according to the prescribed distribution. Focusing on the uniform distribution case, we study the query complexity of three algorithms and show a near-tight bound expressed in terms of the parameters of the graph such as average degree and the mixing time. Both theoretically and empirically, we show that some algorithms are preferable in practice than the others. We also extend our study to the problem of sampling nodes according to some polynomial function of their degrees; this has implications for designing efficient algorithms for applications such as triangle counting

IIT Gandhinagar

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Archivio della ricerca- Università di Roma La Sapienza

Estimating graph parameters with random walks

Author: Ben-Hamou Anna
Oliveira Roberto I.
Peres Yuval
Publication venue
Publication date: 17/08/2018
Field of study

An algorithm observes the trajectories of random walks over an unknown graph

G

, starting from the same vertex

x

, as well as the degrees along the trajectories. For all finite connected graphs, one can estimate the number of edges

m

up to a bounded factor in

O\left(t_{\mathrm{rel}}^{3/4}\sqrt{m/d}\right)

steps, where

t_{\mathrm{rel}}

is the relaxation time of the lazy random walk on

G

and

d

is the minimum degree in

G

. Alternatively,

m

can be estimated in

O\left(t_{\mathrm{unif}} +t_{\mathrm{rel}}^{5/6}\sqrt{n}\right)

, where

n

is the number of vertices and

t_{\mathrm{unif}}

is the uniform mixing time on

G

. The number of vertices

n

can then be estimated up to a bounded factor in an additional

O\left(t_{\mathrm{unif}}\frac{m}{n}\right)

steps. Our algorithms are based on counting the number of intersections of random walk paths

X,Y

, i.e. the number of pairs

(t,s)

such that

X_t=Y_s

. This improves on previous estimates which only consider collisions (i.e., times

t

with

X_t=Y_t

). We also show that the complexity of our algorithms is optimal, even when restricting to graphs with a prescribed relaxation time. Finally, we show that, given either

m

or the mixing time of

G

, we can compute the "other parameter" with a self-stopping algorithm

arXiv.org e-Print Archive

Hal-Diderot

Degree Ranking Using Local Information

Author: Gera Ralucca
Iyengar S. R. S.
Saxena Akrati
Publication venue
Publication date: 10/06/2017
Field of study

Most real world dynamic networks are evolved very fast with time. It is not feasible to collect the entire network at any given time to study its characteristics. This creates the need to propose local algorithms to study various properties of the network. In the present work, we estimate degree rank of a node without having the entire network. The proposed methods are based on the power law degree distribution characteristic or sampling techniques. The proposed methods are simulated on synthetic networks, as well as on real world social networks. The efficiency of the proposed methods is evaluated using absolute and weighted error functions. Results show that the degree rank of a node can be estimated with high accuracy using only

1\%

samples of the network size. The accuracy of the estimation decreases from high ranked to low ranked nodes. We further extend the proposed methods for random networks and validate their efficiency on synthetic random networks, that are generated using Erd\H{o}s-R\'{e}nyi model. Results show that the proposed methods can be efficiently used for random networks as well

arXiv.org e-Print Archive

Calhoun, Institutional Archive of the Naval Postgraduate School