Search CORE

640,488 research outputs found

Mining Missing Hyperlinks from Human Navigation Traces: A Case Study of Wikipedia

Author: Clemesha A.
Milgram S.
Milne D.
Popescul A.
Singer P.
Taskar B.
West R.
West R.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 13/03/2015
Field of study

Hyperlinks are an essential feature of the World Wide Web. They are especially important for online encyclopedias such as Wikipedia: an article can often only be understood in the context of related articles, and hyperlinks make it easy to explore this context. But important links are often missing, and several methods have been proposed to alleviate this problem by learning a linking model based on the structure of the existing links. Here we propose a novel approach to identifying missing links in Wikipedia. We build on the fact that the ultimate purpose of Wikipedia links is to aid navigation. Rather than merely suggesting new links that are in tune with the structure of existing links, our method finds missing links that would immediately enhance Wikipedia's navigability. We leverage data sets of navigation paths collected through a Wikipedia-based human-computation game in which users must find a short path from a start to a target article by only clicking links encountered along the way. We harness human navigational traces to identify a set of candidates for missing links and then rank these candidates. Experiments show that our procedure identifies missing links of high quality

arXiv.org e-Print Archive

CiteSeerX

Crossref

Missing Links

Author: Agbinya Johnson
Publication venue: 'University of Technology, Sydney (UTS)'
Publication date: 01/03/2006
Field of study

There are no doubts that the African telecommunication sector has grown and made significant strides the last three years. The level of progress is not a fluke. However, one of the greatest problems facing affordable telecommunication access in many parts of Africa is monopoly of access, links and inter-connectivity between operators. In many countries, this monopoly is controlled by incumbents, legacies of state owned telecommunication companies failing to realise when their job is done and when relinquishing their hold on national structures is more nationally productive. Often the links in question have been paid for with tax payers’ money before such companies are privatised or sold. This problem is significant across the African continent and has kept communication access in the continent very expensive

UTS ePress

Uncovering missing links with cold ends

Author: Adamic
Albert
Amaral
Barabási
Biernacki
Boccaletti
Butts
Cohen
Costa
Dorogovtsev
Getoor
Guimerà
Hanely
Jaccard
Kossinets
Leicht
Liben-Nowell
Linyuan Lü
Liu
Liu
Liu
Lovász
Lü
Lü
Lü
Molloy
Neal
Newman
Newman
Newman
Newman
Ou
Qian-Ming Zhang
Ravasz
Salton
Stumpf
Sørensen
Tao Zhou
von Mering
Wang
Watts
Yan
Yu
Yu-Xiao Zhu
Zeng
Zhang
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date: 02/10/2011
Field of study

To evaluate the performance of prediction of missing links, the known data are randomly divided into two parts, the training set and the probe set. We argue that this straightforward and standard method may lead to terrible bias, since in real biological and information networks, missing links are more likely to be links connecting low-degree nodes. We therefore study how to uncover missing links with low-degree nodes, namely links in the probe set are of lower degree products than a random sampling. Experimental analysis on ten local similarity indices and four disparate real networks reveals a surprising result that the Leicht-Holme-Newman index [E. A. Leicht, P. Holme, and M. E. J. Newman, Phys. Rev. E 73, 026120 (2006)] performs the best, although it was known to be one of the worst indices if the probe set is a random sampling of all links. We further propose an parameter-dependent index, which considerably improves the prediction accuracy. Finally, we show the relevance of the proposed index on three real sampling methods.Comment: 16 pages, 5 figures, 6 table

arXiv.org e-Print Archive

Crossref

RERO DOC Digital Library

Missing Links in Multiple Trade Networks

Author: Foschi Rachele
Riccaboni Massimo
Schiavo Stefano
Publication venue: IMT Institute for Advanced Studies Lucca
Publication date: 01/01/2013
Field of study

In this paper we develop a network model of international trade which is able to replicate the concentrated and sparse nature of trade data. Our model extends the preferential attachment (PA) growth model to the case of multiple networks. Countries trade a variety of goods of different complexity. Every country progressively evolves from trading less sophisticated to high-tech goods. The probability to capture more trade opportunities at a given level of complexity and to start trading more complex goods are both proportional to the number of existing trade links. We provide a set of theoretical predictions and simulative results. A calibration exercise shows that our model replicates the same concentration level of world trade as well as the sparsity pattern of the trade matrix. Moreover, we find a lower bound for the share of genuine missing trade links. We also discuss a set of numerical solutions to deal with large multiple networks

Archivio della Ricerca - Università di Pisa

IMT Institutional Repository

Entropy-based approach to missing-links prediction

Author: Caldarelli Guido
Parisi Federica
Squartini Tiziano
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Link-prediction is an active research field within network theory, aiming at uncovering missing connections or predicting the emergence of future relationships from the observed network structure. This paper represents our contribution to the stream of research concerning missing links prediction. Here, we propose an entropy-based method to predict a given percentage of missing links, by identifying them with the most probable non-observed ones. The probability coefficients are computed by solving opportunely defined null-models over the accessible network structure. Upon comparing our likelihood-based, local method with the most popular algorithms over a set of economic, financial and food networks, we find ours to perform best, as pointed out by a number of statistical indicators (e.g. the precision, the area under the ROC curve, etc.). Moreover, the entropy-based formalism adopted in the present paper allows us to straightforwardly extend the link-prediction exercise to directed networks as well, thus overcoming one of the main limitations of current algorithms. The higher accuracy achievable by employing these methods - together with their larger flexibility - makes them strong competitors of available link-prediction algorithms

arXiv.org e-Print Archive

Directory of Open Access Journals

Archivio della ricerca della Scuola IMT Alti Studi Lucca

Missing Links: Referrer Behavior and Job Segregation

Author: Fernandez Roberto
Rubineau Brian
Publication venue: DigitalCommons@ILR
Publication date: 01/01/2013
Field of study

The importance of networks in labor markets is well-known, and their job segregating effects in organizations taken as granted. Conventional wisdom attributes this segregation to the homophilous nature of contact networks, and leaves little role for organizational influences. But employee referrals are necessarily initiated within a firm by employee referrers subject to organizational policies. We build theory regarding the role of referrers in the segregating effects of network recruitment. Using mathematical and computational models, we investigate how empirically-documented referrer behaviors affect job segregation. We show that referrer behaviors can segregate jobs beyond the effects of homophilous network recruitment. Further, and contrary to past understandings, we show that referrer behaviors can also mitigate most if not all of the segregating effects of network recruitment. Although largely neglected in previous labor market network scholarship, referrers are the missing links revealing opportunities for organizations to influence the effects of network recruitment

DigitalCommons@ILR

eCommons@Cornell