5,385 research outputs found
Benchmarking Network Embedding Models for Link Prediction: Are We Making Progress?
Network embedding methods map a network's nodes to vectors in an embedding
space, in such a way that these representations are useful for estimating some
notion of similarity or proximity between pairs of nodes in the network. The
quality of these node representations is then showcased through results of
downstream prediction tasks. Commonly used benchmark tasks such as link
prediction, however, present complex evaluation pipelines and an abundance of
design choices. This, together with a lack of standardized evaluation setups
can obscure the real progress in the field. In this paper, we aim to shed light
on the state-of-the-art of network embedding methods for link prediction and
show, using a consistent evaluation pipeline, that only thin progress has been
made over the last years. The newly conducted benchmark that we present here,
including 17 embedding methods, also shows that many approaches are
outperformed even by simple heuristics. Finally, we argue that standardized
evaluation tools can repair this situation and boost future progress in this
field
Connected Spatial Networks over Random Points and a Route-Length Statistic
We review mathematically tractable models for connected networks on random
points in the plane, emphasizing the class of proximity graphs which deserves
to be better known to applied probabilists and statisticians. We introduce and
motivate a particular statistic measuring shortness of routes in a network.
We illustrate, via Monte Carlo in part, the trade-off between normalized
network length and in a one-parameter family of proximity graphs. How close
this family comes to the optimal trade-off over all possible networks remains
an intriguing open question. The paper is a write-up of a talk developed by the
first author during 2007--2009.Comment: Published in at http://dx.doi.org/10.1214/10-STS335 the Statistical
Science (http://www.imstat.org/sts/) by the Institute of Mathematical
Statistics (http://www.imstat.org
- …