Search CORE

18,778 research outputs found

Recommended from our members

Heterogeneous network embedding enabling accurate disease association predictions.

Author: Guo Mengjie
Kong Xiangnan
Ruan Lu
Tang Chunlei
Wang Wei
Xiong Yun
Zhu Yangyong
Publication venue: eScholarship, University of California
Publication date: 01/12/2019
Field of study

BackgroundIt is significant to identificate complex biological mechanisms of various diseases in biomedical research. Recently, the growing generation of tremendous amount of data in genomics, epigenomics, metagenomics, proteomics, metabolomics, nutriomics, etc., has resulted in the rise of systematic biological means of exploring complex diseases. However, the disparity between the production of the multiple data and our capability of analyzing data has been broaden gradually. Furthermore, we observe that networks can represent many of the above-mentioned data, and founded on the vector representations learned by network embedding methods, entities which are in close proximity but at present do not actually possess direct links are very likely to be related, therefore they are promising candidate subjects for biological investigation.ResultsWe incorporate six public biological databases to construct a heterogeneous biological network containing three categories of entities (i.e., genes, diseases, miRNAs) and multiple types of edges (i.e., the known relationships). To tackle the inherent heterogeneity, we develop a heterogeneous network embedding model for mapping the network into a low dimensional vector space in which the relationships between entities are preserved well. And in order to assess the effectiveness of our method, we conduct gene-disease as well as miRNA-disease associations predictions, results of which show the superiority of our novel method over several state-of-the-arts. Furthermore, many associations predicted by our method are verified in the latest real-world dataset.ConclusionsWe propose a novel heterogeneous network embedding method which can adequately take advantage of the abundant contextual information and structures of heterogeneous network. Moreover, we illustrate the performance of the proposed method on directing studies in biology, which can assist in identifying new hypotheses in biological investigation

eScholarship - University of California

Switcher-random-walks: a cognitive-inspired mechanism for network exploration

Author: Barabasi A.-L.
BERNAT COROMINAS-MURTRA
Erdös P.
GONZALO ARRONDO
IÑIGO MARTINCORENA
JOAQUÍN GOÑI
Lezak M.
PABLO VILLOSLADA
SERGIO ARDANZA-TREVIJANO
Squire L.
Troyer A. K.
Watts D.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 24/03/2009
Field of study

Semantic memory is the subsystem of human memory that stores knowledge of concepts or meanings, as opposed to life specific experiences. The organization of concepts within semantic memory can be understood as a semantic network, where the concepts (nodes) are associated (linked) to others depending on perceptions, similarities, etc. Lexical access is the complementary part of this system and allows the retrieval of such organized knowledge. While conceptual information is stored under certain underlying organization (and thus gives rise to a specific topology), it is crucial to have an accurate access to any of the information units, e.g. the concepts, for efficiently retrieving semantic information for real-time needings. An example of an information retrieval process occurs in verbal fluency tasks, and it is known to involve two different mechanisms: -clustering-, or generating words within a subcategory, and, when a subcategory is exhausted, -switching- to a new subcategory. We extended this approach to random-walking on a network (clustering) in combination to jumping (switching) to any node with certain probability and derived its analytical expression based on Markov chains. Results show that this dual mechanism contributes to optimize the exploration of different network models in terms of the mean first passage time. Additionally, this cognitive inspired dual mechanism opens a new framework to better understand and evaluate exploration, propagation and transport phenomena in other complex systems where switching-like phenomena are feasible.Comment: 9 pages, 3 figures. Accepted in "International Journal of Bifurcations and Chaos": Special issue on "Modelling and Computation on Complex Networks

arXiv.org e-Print Archive

Crossref

Estimating graph parameters with random walks

Author: Ben-Hamou Anna
Oliveira Roberto I.
Peres Yuval
Publication venue
Publication date: 17/08/2018
Field of study

An algorithm observes the trajectories of random walks over an unknown graph

G

, starting from the same vertex

x

, as well as the degrees along the trajectories. For all finite connected graphs, one can estimate the number of edges

m

up to a bounded factor in

O\left(t_{\mathrm{rel}}^{3/4}\sqrt{m/d}\right)

steps, where

t_{\mathrm{rel}}

is the relaxation time of the lazy random walk on

G

and

d

is the minimum degree in

G

. Alternatively,

m

can be estimated in

O\left(t_{\mathrm{unif}} +t_{\mathrm{rel}}^{5/6}\sqrt{n}\right)

, where

n

is the number of vertices and

t_{\mathrm{unif}}

is the uniform mixing time on

G

. The number of vertices

n

can then be estimated up to a bounded factor in an additional

O\left(t_{\mathrm{unif}}\frac{m}{n}\right)

steps. Our algorithms are based on counting the number of intersections of random walk paths

X,Y

, i.e. the number of pairs

(t,s)

such that

X_t=Y_s

. This improves on previous estimates which only consider collisions (i.e., times

t

with

X_t=Y_t

). We also show that the complexity of our algorithms is optimal, even when restricting to graphs with a prescribed relaxation time. Finally, we show that, given either

m

or the mixing time of

G

, we can compute the "other parameter" with a self-stopping algorithm

arXiv.org e-Print Archive

Hal-Diderot

Estimating and Sampling Graphs with Multidimensional Random Walks

Author: Ribeiro Bruno
Towsley Don
Publication venue
Publication date: 01/01/2010
Field of study

Estimating characteristics of large graphs via sampling is a vital part of the study of complex networks. Current sampling methods such as (independent) random vertex and random walks are useful but have drawbacks. Random vertex sampling may require too many resources (time, bandwidth, or money). Random walks, which normally require fewer resources per sample, can suffer from large estimation errors in the presence of disconnected or loosely connected graphs. In this work we propose a new

m

-dimensional random walk that uses

m

dependent random walkers. We show that the proposed sampling method, which we call Frontier sampling, exhibits all of the nice sampling properties of a regular random walk. At the same time, our simulations over large real world graphs show that, in the presence of disconnected or loosely connected components, Frontier sampling exhibits lower estimation errors than regular random walks. We also show that Frontier sampling is more suitable than random vertex sampling to sample the tail of the degree distribution of the graph

arXiv.org e-Print Archive

CiteSeerX