Location of Repository

A stochastic model for the evolution of the Web

By Mark Levene, Trevor Fenner, George Loizou and Richard Wheeldon

Abstract

Recently several authors have proposed stochastic models of the growth of the Web graph that give rise to power-law distributions. These models are based on the notion of preferential attachment leading to the "rich get richer" phenomenon. However, these models fail to explain several distributions arising from empirical results, due to the fact that the predicted exponent is not consistent with the data. To address this problem, we extend the evolutionary model of the Web graph by including a non-preferential component, and we view the stochastic process in terms of an urn transfer model. By making this extension, we can now explain a wider variety of empirically discovered power-law distributions provided the exponent is greater than two. These include: the distribution of incoming links, the distribution of outgoing links, the distribution of pages in a Web site and the distribution of visitors to a Web site. A by-product of our results is a formal proof of the convergence of the standard stochastic model (first proposed by Simon)

Topics: csis
Publisher: Elsevier
Year: 2002
OAI identifier: oai:eprints.bbk.ac.uk.oai2:210

Suggested articles

Preview

Citations

  1. (1976). A general theory of bibliometric and other cumulative advantage processes. doi
  2. (1959). A note on a class of skew distribution functions: Analysis and critique of a paper by doi
  3. (1989). Bibliometric modeling processes and the empirical validity of Lotka’s law. doi
  4. (1999). Extracting large-scale knowledge bases from the web. In doi
  5. (1991). Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise. doi
  6. (2000). Graph strucutre in the web. doi
  7. (2001). Hyperlink analysis for the web. doi
  8. (1995). Lotka’s law, Price’s urn, and electronic publishing. doi
  9. (1999). Mean-field theory for scale free random networks. doi
  10. (1955). On a class of skew distribution functions. doi
  11. (1999). On power-law relationships of the internet topology. doi
  12. (2000). Scale-free characteristics of random networks: the topology of the world-wide web. doi
  13. (2001). Self-similarity in the Web. In doi
  14. (2000). Sizing the internet. White paper, Cyveillance,
  15. (1963). Some Monte Carlo estimates of the Yule distribution. doi
  16. (2000). Structure of growing networks with preferential linking. Physical Review Letters, doi
  17. (2000). The nature of markets in the World Wide Web. doi
  18. (2000). The web as a graph. doi
  19. (1977). Urn Models and their Application: An Approach to Modern Discrete Probability. doi
  20. (1955). World-Wide Web scaling exponent from Simon’s doi
  21. (0009). WWW and internet models from 1955 till our day and the “popularity is attractive” principle. Condensed Matter Archive,
  22. (2000). WWW and internet models from 1955 till our day and the “popularity is attractive” principle. Condensed Matter Archive, cond-mat/0009090,
  23. (1982). Zipf’s law revisited.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.