Search CORE

1,127 research outputs found

Scalable Mining of Common Routes in Mobile Communication Network Traffic Data

Author: A.Z. Broder
C. Song
C. Song
D.J. Patterson
G. Yavas
J. Hightower
K. Laasonen
L. Liao
M.C. González
T. Sohn
W. Massey
W. Rand
Publication venue
Publication date: 01/01/2012
Field of study

A probabilistic method for inferring common routes from mobile communication network traffic data is presented. Besides providing mobility information, valuable in a multitude of application areas, the method has the dual purpose of enabling efficient coarse-graining as well as anonymisation by mapping individual sequences onto common routes. The approach is to represent spatial trajectories by Cell ID sequences that are grouped into routes using locality-sensitive hashing and graph clustering. The method is demonstrated to be scalable, and to accurately group sequences using an evaluation set of GPS tagged data

Crossref

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

Viral antibody dynamics in a chiropteran host

Author: Baker K.S.
Barr J.
Boots M.
Broder C.C.
Cunningham A.A.
Durrant C.
Hayman D.T.S.
Horton D.L.
Murcia P.R.
Suu-Ire R.
Wood J.L.N.
Publication venue: 'Wiley'
Publication date: 01/03/2014
Field of study

1. Bats host many viruses that are significant for human and domestic animal health, but the dynamics of these infections in their natural reservoir hosts remain poorly elucidated. 2. In these, and other, systems, there is evidence that seasonal life-cycle events drive infection dynamics, directly impacting the risk of exposure to spillover hosts. Understanding these dynamics improves our ability to predict zoonotic spillover from the reservoir hosts. 3. To this end, we followed henipavirus antibody levels of >100 individual E. helvum in a closed, captive, breeding population over a 30-month period, using a powerful novel antibody quantitation method. 4. We demonstrate the presence of maternal antibodies in this system and accurately determine their longevity. We also present evidence of population-level persistence of viral infection and demonstrate periods of increased horizontal virus transmission associated with the pregnancy/lactation period. 5.The novel findings of infection persistence and the effect of pregnancy on viral transmission, as well as an accurate quantitation of chiropteran maternal antiviral antibody half-life, provide fundamental baseline data for the continued study of viral infections in these important reservoir hosts

Crossref

Enlighten

Surrey Research Insight

Fractal-like Distributions over the Rational Numbers in High-throughput Biological and Clinical Data

Author: A Broder
B Cowling
C Fraser
D Jamieson
E Mardis
G Bignell
J Salk
L Ding
L Pasqualucci
P Vlierberghe
R Johnston
S Shea
Publication venue
Publication date: 19/10/2010
Field of study

Recent developments in extracting and processing biological and clinical data are allowing quantitative approaches to studying living systems. High-throughput sequencing, expression profiles, proteomics, and electronic health records are some examples of such technologies. Extracting meaningful information from those technologies requires careful analysis of the large volumes of data they produce. In this note, we present a set of distributions that commonly appear in the analysis of such data. These distributions present some interesting features: they are discontinuous in the rational numbers, but continuous in the irrational numbers, and possess a certain self-similar (fractal-like) structure. The first set of examples which we present here are drawn from a high-throughput sequencing experiment. Here, the self-similar distributions appear as part of the evaluation of the error rate of the sequencing technology and the identification of tumorogenic genomic alterations. The other examples are obtained from risk factor evaluation and analysis of relative disease prevalence and co-mordbidity as these appear in electronic clinical data. The distributions are also relevant to identification of subclonal populations in tumors and the study of the evolution of infectious diseases, and more precisely the study of quasi-species and intrahost diversity of viral populations

arXiv.org e-Print Archive

Crossref

Columbia University Academic Commons

PubMed Central

Nature Precedings

You can't see what you can't see: Experimental evidence for how much relevant information may be missed due to Google's Web search personalisation

Author: A Broder
A Savoldelli
B Pan
C Hölscher
D Lewandowski
G Adomavicius
GJ Hardeveld van
J Ørmen
JT Du
M Haim
S Brin
W Webber
X Lu
Z Ebrahim
Z Lan
Publication venue
Publication date: 01/01/2019
Field of study

The influence of Web search personalisation on professional knowledge work is an understudied area. Here we investigate how public sector officials self-assess their dependency on the Google Web search engine, whether they are aware of the potential impact of algorithmic biases on their ability to retrieve all relevant information, and how much relevant information may actually be missed due to Web search personalisation. We find that the majority of participants in our experimental study are neither aware that there is a potential problem nor do they have a strategy to mitigate the risk of missing relevant information when performing online searches. Most significantly, we provide empirical evidence that up to 20% of relevant information may be missed due to Web search personalisation. This work has significant implications for Web research by public sector professionals, who should be provided with training about the potential algorithmic biases that may affect their judgments and decision making, as well as clear guidelines how to minimise the risk of missing relevant information.Comment: paper submitted to the 11th Intl. Conf. on Social Informatics; revision corrects error in interpretation of parameter Psi/p in RBO resulting from discrepancy between the documentation of the implementation in R (https://rdrr.io/bioc/gespeR/man/rbo.html) and the original definition (https://dl.acm.org/citation.cfm?id=1852106) as per 20/05/201

arXiv.org e-Print Archive

Victoria University of Wellington

Crossref

Complexity transitions in global algorithms for sparse linear systems over finite fields

Author: A Braunstein
Alava M J
Broder A Z
Cocco S
Cormen T H
Creignou N
F Ricci-Tersenghi
Garey M
Kolchin V F
Leone M
Leone M
M Leone
Mézard M
Papadimitriou C H
Pomerance C
R Zecchina
Rivest R L
Schaefer T J
Sourlas N
Publication venue: 'IOP Publishing'
Publication date: 01/01/2002
Field of study

We study the computational complexity of a very basic problem, namely that of finding solutions to a very large set of random linear equations in a finite Galois Field modulo q. Using tools from statistical mechanics we are able to identify phase transitions in the structure of the solution space and to connect them to changes in performance of a global algorithm, namely Gaussian elimination. Crossing phase boundaries produces a dramatic increase in memory and CPU requirements necessary to the algorithms. In turn, this causes the saturation of the upper bounds for the running time. We illustrate the results on the specific problem of integer factorization, which is of central interest for deciphering messages encrypted with the RSA cryptosystem.Comment: 23 pages, 8 figure

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Archivio della ricerca- Università di Roma La Sapienza

PORTO Publications Open Repository TOrino

Clustering and preferential attachment in growing networks

Author: A. Broder
A.-L. Barabási
C. Moore
D. J. Watts
D. J. Watts
L. A. N. Amaral
M. E. J. Newman
M. E. J. Newman
M. E. J. Newman
M. E. J. Newman
M. E. J. Newman
M. Faloutsos
P. L. Krapivsky
R. Albert
R. Monasson
S. H. Strogatz
S. N. Dorogovtsev
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2001
Field of study

We study empirically the time evolution of scientific collaboration networks in physics and biology. In these networks, two scientists are considered connected if they have coauthored one or more papers together. We show that the probability of scientists collaborating increases with the number of other collaborators they have in common, and that the probability of a particular scientist acquiring new collaborators increases with the number of his or her past collaborators. These results provide experimental evidence in favor of previously conjectured mechanisms for clustering and power-law degree distributions in networks.Comment: 13 pages, 2 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Minimizing energy below the glass thresholds

Author: A. J. Parkes
A. K. Hartmann
A. K. Hartmann
A. Kaporis
A. Z. Broder
B. Selman
C. H. Papadimitriou
D. Achlioptas
Demian Battaglia
F. R. Kschischang
H. Karlo
J.-P. Bouchaud
M. R. Garey
Michal Kolář
O. Dubois
R. Motwani
Riccardo Zecchina
T. Richardson
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2004
Field of study

Focusing on the optimization version of the random K-satisfiability problem, the MAX-K-SAT problem, we study the performance of the finite energy version of the Survey Propagation (SP) algorithm. We show that a simple (linear time) backtrack decimation strategy is sufficient to reach configurations well below the lower bound for the dynamic threshold energy and very close to the analytic prediction for the optimal ground states. A comparative numerical study on one of the most efficient local search procedures is also given.Comment: 12 pages, submitted to Phys. Rev. E, accepted for publicatio

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Network robustness and fragility: Percolation on random graphs

Author: A. Broder
B. A. Huberman
B. Bollobás
C. Moore
Duncan J. Watts
Duncan S. Callaway
F. Ball
H. S. Wilf
J. O. Kephart
L. A. N. Amaral
M. E. J. Newman
M. E. J. Newman
M. E. J. Newman
M. Faloutsos
M. Molloy
M. Molloy
R. Albert
R. Albert
R. Cohen
Steven H. Strogatz
W. Aiello
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2000
Field of study

Recent work on the internet, social networks, and the power grid has addressed the resilience of these networks to either random or targeted deletion of network nodes. Such deletions include, for example, the failure of internet routers or power transmission lines. Percolation models on random graphs provide a simple representation of this process, but have typically been limited to graphs with Poisson degree distribution at their vertices. Such graphs are quite unlike real world networks, which often possess power-law or other highly skewed degree distributions. In this paper we study percolation on graphs with completely general degree distribution, giving exact solutions for a variety of cases, including site percolation, bond percolation, and models in which occupation probabilities depend on vertex degree. We discuss the application of our theory to the understanding of network resilience.Comment: 4 pages, 2 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Minimum spanning trees on random networks

Author: A. A. Middleton
A. L. Barabasi
A. Maritan
A. P. Sheppard
A. Z. Broder
C. M. Newman
D. Stauffer
D. Wilkinson
J. C. Dyre
J. Chayes
J.-C. Anglès D'Auriac
M. Aizenman
M. Cieplak
M. Cieplak
M. Kardar
M. Marsili
M. Porto
M. R. Swift
P. M. Duxbury
P. M. Duxbury
R. Dobrin
S. N. Majumdar
S. Tyc̆
T. H. Cormen
T. Halpin-Healy
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2001
Field of study

We show that the geometry of minimum spanning trees (MST) on random graphs is universal. Due to this geometric universality, we are able to characterise the energy of MST using a scaling distribution (

P(\epsilon)

) found using uniform disorder. We show that the MST energy for other disorder distributions is simply related to

P(\epsilon)

. We discuss the relationship to invasion percolation (IP), to the directed polymer in a random media (DPRM) and the implications for the broader issue of universality in disordered systems.Comment: 4 pages, 3 figure

arXiv.org e-Print Archive

Crossref

Eldorado - Ressourcen aus und für Lehre, Studium und Forschung