research

Large-scale behavior of the partial duplication random graph

Abstract

The following random graph model was introduced for the evolution of protein-protein interaction networks: Let G=(Gn)n=n0,n0+1,...\mathcal G = (G_n)_{n=n_0, n_0+1,...} be a sequence of random graphs, where Gn=(Vn,En)G_n = (V_n, E_n) is a graph with Vn=n|V_n|=n vertices, n=n0,n0+1,...n=n_0,n_0+1,... In state Gn=(Vn,En)G_n = (V_n, E_n), a vertex vVnv\in V_n is chosen from VnV_n uniformly at random and is partially duplicated. Upon such an event, a new vertex vVnv'\notin V_n is created and every edge {v,w}En\{v,w\} \in E_n is copied with probability~pp, i.e.\ En+1E_{n+1} has an edge {v,w}\{v',w\} with probability~pp, independently of all other edges. Within this graph, we study several aspects for large~nn. (i) The frequency of isolated vertices converges to~1 if pp0.567143p\leq p^* \approx 0.567143, the unique solution of pep=1pe^p=1. (ii) The number CkC_k of kk-cliques behaves like nkpk1n^{kp^{k-1}} in the sense that nkpk1Ckn^{-kp^{k-1}}C_k converges against a non-trivial limit, if the starting graph has at least one kk-clique. In particular, the average degree of a vertex (which equals the number of edges -- or 2-cliques -- divided by the size of the graph) converges to 00 iff p<0.5p<0.5 and we obtain that the transitivity ratio of the random graph is of the order n2p(1p)n^{-2p(1-p)}. (iii) The evolution of the degrees of the vertices in the initial graph can be described explicitly. Here, we obtain the full distribution as well as convergence results.Comment: 27 pages, 1 figur

    Similar works

    Full text

    thumbnail-image