Quantifying and suppressing ranking bias in a large citation network

Author: Mariani Manuel Sebastian
Medo Matúš
Vaccario Giacomo
Wider Nicolas
Publication date: 01/01/2017

It is widely recognized that citation counts for papers from different fields cannot be directly compared because different scientific fields adopt different citation practices. Citation counts are also strongly biased by paper age since older papers have had more time to attract citations. Various procedures aim at suppressing these biases and give rise to new normalized indicators, such as the relative citation count. We use a large citation dataset from Microsoft Academic Graph and a new statistical framework based on the Mahalanobis distance to show that the rankings by well-known indicators, including the relative citation count and Google's PageRank score, are significantly biased by paper field and age. Our statistical framework to assess ranking bias allows us to exactly quantify the contributions of each individual field to the overall bias of a given ranking. We propose a general normalization procedure motivated by the z-score which produces much less biased rankings when applied to citation count and PageRank score.
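As a rough illustration of what a z-score-motivated normalization can look like, the sketch below rescales each paper's citation count by the mean and standard deviation of a comparison group. Grouping by exact field and publication year, the record layout, and the function name `zscore_normalize` are assumptions made for this example; the abstract does not specify how the comparison groups are built.

```python
from collections import defaultdict
from statistics import mean, pstdev


def zscore_normalize(papers):
    """Return {paper id: z-score of its citation count within its group}.

    `papers` is a list of dicts with keys 'id', 'field', 'year', 'citations'.
    Assumption: a paper's comparison group is all papers sharing its field
    and publication year.
    """
    # Collect citation counts per (field, year) group.
    groups = defaultdict(list)
    for p in papers:
        groups[(p["field"], p["year"])].append(p["citations"])

    # Mean and population standard deviation of each group.
    stats = {key: (mean(vals), pstdev(vals)) for key, vals in groups.items()}

    scores = {}
    for p in papers:
        mu, sigma = stats[(p["field"], p["year"])]
        # A group with zero spread carries no ranking information.
        scores[p["id"]] = (p["citations"] - mu) / sigma if sigma > 0 else 0.0
    return scores
```

With this rescaling, a paper with 20 citations in a low-citation field can score as high as a paper with 200 citations in a high-citation field, provided both sit equally far above their group means in units of the group's standard deviation.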

arXiv.org e-Print Archive

RERO DOC Digital Library

Bern Open Repository and Information System (BORIS)

Network-based ranking in social systems: three challenges

Author: Lü Linyuan
Mariani Manuel S.
Publication venue: 'IOP Publishing'
Publication date: 29/05/2020

Ranking algorithms are pervasive in our increasingly digitized societies, with important real-world applications including recommender systems, search engines, and influencer marketing practices. From a network science perspective, network-based ranking algorithms solve fundamental problems related to the identification of vital nodes for the stability and dynamics of a complex system. Despite the ubiquitous and successful applications of these algorithms, we argue that our understanding of their performance and their applications to real-world problems faces three fundamental challenges: (1) rankings might be biased by various factors; (2) their effectiveness might be limited to specific problems; and (3) agents' decisions driven by rankings might result in potentially vicious feedback mechanisms and unhealthy systemic consequences. Methods rooted in network science and agent-based modeling can help us to understand and overcome these challenges. Comment: Perspective article. 9 pages, 3 figures.

arXiv.org e-Print Archive

ZORA

Algorithmic bias amplification via temporal effects: The case of PageRank in evolving networks

Author: Cui Mengtian
Mariani Manuel
Medo Matúš
Publication venue: 'Elsevier BV'
Publication date: 01/01/2022

Biases impair the effectiveness of algorithms. For example, the age bias of the widely used PageRank algorithm impairs its ability to effectively rank nodes in growing networks. PageRank’s temporal bias cannot be fully explained by existing analytic results that predict a linear relation between the expected PageRank score and the indegree of a given node. We show that in evolving networks, under a mean-field approximation, the expected PageRank score of a node can be expressed as the product of the node’s indegree and a previously neglected age factor which can “amplify” the indegree’s age bias. We use two well-known empirical networks to show that our analytic results explain the observed age bias of PageRank and, when there is an age bias amplification, they enable estimates of the node PageRank score that are more accurate than estimates based solely on local structural information. Accuracy gains are larger in degree-degree correlated networks, as revealed by a growing directed network model with tunable assortativity. Our approach can be used to analytically study other kinds of ranking bias.

ZORA

Early identification of important patents through network centrality

Author: Lafond François
Mariani Manuel Sebastian
Medo Matúš
Publication venue: 'Elsevier BV'
Publication date: 25/10/2017

One of the most challenging problems in technological forecasting is to identify as early as possible those technologies that have the potential to lead to radical changes in our society. In this paper, we use the US patent citation network (1926-2010) to test our ability to identify a list of historically significant patents early through citation network analysis. We show that in order to effectively uncover these patents shortly after they are issued, we need to go beyond raw citation counts and take into account both the citation network topology and temporal information. In particular, an age-normalized measure of patent centrality, called rescaled PageRank, allows us to identify the significant patents earlier than citation count and PageRank score. In addition, we find that while high-impact patents tend to rely on other high-impact patents in a similar way as scientific papers, the citation dynamics of patents are significantly slower than those of papers, which makes the early identification of significant patents more challenging than that of significant papers. Comment: 14 pages.

arXiv.org e-Print Archive

ZORA

Ranking in evolving complex networks

Author: Hao Liao
Manuel Sebastian Mariani
Matúš Medo
Ming-Yang Zhou
Yi-Cheng Zhang
Publication date: 01/01/2017

Complex networks have emerged as a simple yet powerful framework to represent and analyze a wide range of complex systems. The problem of ranking the nodes and the edges in complex networks is critical for a broad range of real-world problems because it affects how we access online information and products, how success and talent are evaluated in human activities, and how scarce resources are allocated by companies and policymakers, among others. This calls for a deep understanding of how existing ranking algorithms perform, and of the possible biases that may impair their effectiveness. Many popular ranking algorithms (such as Google’s PageRank) are static in nature and, as a consequence, they exhibit important shortcomings when applied to real networks that rapidly evolve in time. At the same time, recent advances in the understanding and modeling of evolving networks have enabled the development of a wide and diverse range of ranking algorithms that take the temporal dimension into account. The aim of this review is to survey the existing ranking algorithms, both static and time-aware, and their applications to evolving networks. We emphasize both the impact of network evolution on well-established static algorithms and the benefits from including the temporal dimension for tasks such as prediction of network traffic, prediction of future links, and identification of significant nodes.

arXiv.org e-Print Archive

Crossref

RERO DOC Digital Library

Bern Open Repository and Information System (BORIS)

The long-term impact of ranking algorithms in growing networks

Author: Lü Linyuan
Mariani Manuel Sebastian
Medo Matúš
Zhang Shilun
Publication date: 19/11/2018

When users search online for content, they are constantly exposed to rankings. For example, web search results are presented as a ranking of relevant websites, and online bookstores often show us lists of best-selling books. While popularity-based ranking algorithms (like Google’s PageRank) have been extensively studied in previous works, we still lack a clear understanding of their potential systemic consequences. In this work, we fill this gap by introducing a new model of network growth that allows us to compare the properties of networks generated under the influence of different ranking algorithms. We show that by correcting for the omnipresent age bias of popularity-based ranking algorithms, the resulting networks exhibit a significantly larger agreement between the nodes’ inherent quality and their long-term popularity, and a less concentrated popularity distribution. To further promote popularity diversity, we introduce and validate a perturbation of the original rankings where a small number of randomly selected nodes are promoted to the top of the ranking. Our findings take the first steps toward a model-based understanding of the long-term impact of popularity-based ranking algorithms, and our novel framework could be used to design improved information filtering tools.
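The ranking perturbation mentioned in the abstract can be pictured with a minimal sketch like the one below, which moves a few randomly chosen nodes to the top of an existing ranking and keeps the relative order of all others. The function name `promote_random`, the parameter `k`, and the uniform sampling are assumptions for illustration only; the abstract does not state how the promoted nodes are selected or ordered.

```python
import random


def promote_random(ranking, k, rng=None):
    """Return a copy of `ranking` with k randomly chosen nodes moved to the top.

    `ranking` is a list of unique, hashable node ids, best first.
    Assumption: promoted nodes are drawn uniformly at random and placed
    ahead of everything else; the remaining nodes keep their order.
    """
    if rng is None:
        rng = random.Random()
    k = min(k, len(ranking))

    # Draw k distinct nodes without replacement.
    promoted = rng.sample(ranking, k)
    promoted_set = set(promoted)

    # Everything not promoted keeps its original relative order.
    rest = [node for node in ranking if node not in promoted_set]
    return promoted + rest
```

For example, `promote_random(["a", "b", "c", "d", "e"], 2, random.Random(0))` returns a list that starts with two sampled nodes followed by the other three in their original order. Passing a seeded `random.Random` makes the perturbation reproducible.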

arXiv.org e-Print Archive

ZORA

RERO DOC Digital Library

Big networks: a survey

Author: Bedru Hayat
Xia Feng
Xiao Xinru
Yu Shuo
Zhang Da
Publication venue: 'Elsevier BV'
Publication date: 01/01/2020

A network is a typical expressive form for representing complex systems in terms of vertices and links, in which the pattern of interactions amongst components of the network is intricate. A network can be static, meaning it does not change over time, or dynamic, meaning it evolves through time. The complexity of network analysis changes under the new circumstance of explosively increasing network size. In this paper, we introduce a new network science concept called a big network. A big network is generally large-scale, with a complicated and higher-order inner structure. This paper proposes a guideline framework that gives an insight into the major topics in the area of network science from the viewpoint of a big network. We first introduce the structural characteristics of big networks at three levels: micro-level, meso-level, and macro-level. We then discuss some state-of-the-art advanced topics of big network analysis. Big network models and related approaches, including ranking methods, partition approaches, and network embedding algorithms, are systematically introduced. Some typical applications in big networks are then reviewed, such as community detection, link prediction, and recommendation. Moreover, we also pinpoint some critical open issues that need to be investigated further. © 2020 Elsevier Inc.

Federation ResearchOnline