Dynamics on expanding spaces: modeling the emergence of novelties
Novelties are part of our daily lives. We constantly adopt new technologies,
conceive new ideas, meet new people, experiment with new situations.
Occasionally, we as individuals, in a complicated cognitive and sometimes
fortuitous process, come up with something that is not only new to us, but to
our entire society, so that a personal novelty can turn into an
innovation at a global level. Innovations occur throughout social, biological
and technological systems and, though we perceive them as a very natural
ingredient of our human experience, little is known about the processes
determining their emergence. Still, the statistical occurrence of innovations
shows striking regularities that represent a starting point for gaining deeper
insight into the whole phenomenology. This paper represents a small step in that
direction, focusing on reviewing the scientific attempts to effectively model
the emergence of the new and its regularities, with an emphasis on more recent
contributions: from the plain Simon model, dating back to the 1950s, to the
most recent model of Polya's urn with triggering, in which the occurrence of one novelty triggers others. What
seems to be key in the successful modelling schemes proposed so far is the idea
of looking at evolution as a path in a complex space (physical, conceptual,
biological, technological) whose structure and topology are continuously
reshaped and expanded by the occurrence of the new. Mathematically, it is very
interesting to look at the consequences of the interplay between the "actual"
and the "possible", and this is the aim of this short review.
On the Representability of Complete Genomes by Multiple Competing Finite-Context (Markov) Models
A finite-context (Markov) model of order k yields the probability distribution of the next symbol in a sequence of symbols, given the recent past up to depth k. Markov modeling has long been applied to DNA sequences, for example to find gene-coding regions. With the first studies came the discovery that DNA sequences are non-stationary: distinct regions require distinct model orders. Since then, Markov and hidden Markov models have been extensively used to describe the gene structure of prokaryotes and eukaryotes. However, to our knowledge, a comprehensive study of the potential of Markov models to describe complete genomes is still lacking. We address this gap in this paper. Our approach relies on (i) multiple competing Markov models of different orders; (ii) careful programming techniques that allow orders as large as sixteen; (iii) adequate handling of inverted repeats; and (iv) probability estimates suited to the wide range of context depths used. To measure how well a model fits the data at a particular position in the sequence, we use the negative logarithm of the probability estimate at that position. This measure yields information profiles of the sequence, which are of independent interest. Its average over the entire sequence, which amounts to the average number of bits per base needed to describe the sequence, is used as a global performance measure. Our main conclusion is that, from the probabilistic or information-theoretic point of view and according to this performance measure, multiple competing Markov models explain entire genomes almost as well as or even better than state-of-the-art DNA compression methods, such as XM, which rely on very different statistical models. This is surprising because Markov models are local (short-range), in contrast with the statistical models underlying the other methods, which exploit the extensive repetitions in DNA sequences and therefore have a non-local character.
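
As a rough illustration of the competing-model idea, the Python sketch below computes a per-base information profile, -log2 of the probability estimate, using a few finite-context models of different orders. The additive-smoothing estimator, the winner-takes-all combination rule, and all names are assumptions made for the illustration; the paper's actual scheme differs (and additionally handles inverted repeats).

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"

class FiniteContextModel:
    """Order-k Markov model with additive smoothing (an illustrative estimator)."""
    def __init__(self, k, alpha=1.0):
        self.k, self.alpha = k, alpha
        self.counts = defaultdict(lambda: defaultdict(int))

    def prob(self, context, symbol):
        ctx = context[-self.k:] if self.k else ""
        c = self.counts[ctx]
        total = sum(c.values())
        return (c[symbol] + self.alpha) / (total + self.alpha * len(ALPHABET))

    def update(self, context, symbol):
        ctx = context[-self.k:] if self.k else ""
        self.counts[ctx][symbol] += 1

def information_profile(seq, orders=(2, 5, 8)):
    """Bits needed to encode each base: at every position, the competing
    model assigning the highest probability wins (a simplification)."""
    models = [FiniteContextModel(k) for k in orders]
    profile = []
    for i, sym in enumerate(seq):
        ctx = seq[:i]
        p = max(m.prob(ctx, sym) for m in models)   # best competing model
        profile.append(-math.log2(p))
        for m in models:
            m.update(ctx, sym)
    return profile

prof = information_profile("ACGTACGTACGGACGTACGT" * 50)
print(f"average bits per base: {sum(prof) / len(prof):.3f}")
```

Averaging the profile, as in the last line, gives the bits-per-base figure used as the global performance measure in the abstract.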
A frequentist framework of inductive reasoning
Reacting against the limitation of statistics to decision procedures, R. A.
Fisher proposed for inductive reasoning the use of the fiducial distribution, a
parameter-space distribution of epistemological probability transferred
directly from limiting relative frequencies rather than computed according to
the Bayes update rule. The proposal is developed as follows using the
confidence measure of a scalar parameter of interest. (With the restriction to
one-dimensional parameter space, a confidence measure is essentially a fiducial
probability distribution free of complications involving ancillary statistics.)
A betting game establishes a sense in which confidence measures are the only
reliable inferential probability distributions. The equality between the
probabilities encoded in a confidence measure and the coverage rates of the
corresponding confidence intervals ensures that the measure's rule for
assigning confidence levels to hypotheses is uniquely minimax in the game.
Although a confidence measure can be computed without any prior distribution,
previous knowledge can be incorporated into confidence-based reasoning. To
adjust a p-value or confidence interval for prior information, the confidence
measure from the observed data can be combined with one or more independent
confidence measures representing previous agent opinion. (The former confidence
measure may correspond to a posterior distribution with frequentist matching of
coverage probabilities.) The representation of subjective knowledge in terms of
confidence measures rather than prior probability distributions preserves
approximate frequentist validity.
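
To illustrate the coverage-matching property mentioned above, here is a small Python sketch of a confidence measure for a normal mean with known variance; the normal example, the one-sided 90% bound, and all names are illustrative assumptions, not the paper's construction.

```python
import math
import random

def phi(z):
    """Standard normal CDF."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def confidence_measure(theta, xbar, sigma, n):
    """C(theta) = confidence that the mean is <= theta: for a normal mean with
    known sigma, the confidence (fiducial) distribution is N(xbar, sigma^2/n)."""
    return phi((theta - xbar) / (sigma / math.sqrt(n)))

random.seed(1)
true_mean, sigma, n = 0.0, 1.0, 25
z90 = 1.2815515655446004            # 90% quantile of the standard normal
hits, trials = 0, 20_000
for _ in range(trials):
    xbar = random.gauss(true_mean, sigma / math.sqrt(n))
    # one-sided upper bound with confidence_measure(bound, xbar, sigma, n) == 0.9
    bound = xbar + z90 * sigma / math.sqrt(n)
    hits += true_mean <= bound
# the 0.9 encoded by the confidence measure matches the long-run coverage
print(f"empirical coverage of the 90% upper bound: {hits / trials:.3f}")
```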
The Fake News Vaccine - A Content-Agnostic System for Preventing Fake News from Becoming Viral.
While spreading fake news is an old phenomenon, today social media enables misinformation to instantaneously reach millions of people. Content-based approaches to detecting fake news, typically based on automatic text checking, are limited: it is difficult to come up with general checking criteria, and once the criteria are known to an adversary, the checking can easily be bypassed. On the other hand, it is practically impossible for humans to check every news item, let alone prevent it from going viral. We present Credulix, the first content-agnostic system to prevent fake news from going viral. Credulix is implemented as a plugin on top of a social media platform and acts as a vaccine. Human fact-checkers review a small number of popular news items, which helps us estimate the inclination of each user to share fake news. Using the resulting information, we automatically estimate the probability that an unchecked news item is fake. We use a Bayesian approach, reminiscent of Condorcet's Theorem, to compute this probability, and we show how the computation can be performed in an incremental, and hence fast, manner.
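
The flavor of the aggregation can be conveyed with a small naive-Bayes sketch in Python; the per-user sharing rates, the independence assumption, and every name below are illustrative assumptions rather than Credulix's actual algorithm.

```python
import math

def posterior_fake(prior_fake, user_rates, shared):
    """Naive-Bayes estimate of P(item is fake | who shared it).

    user_rates: one (p_share_if_fake, p_share_if_true) pair per exposed user,
                estimated from that user's behavior on fact-checked items
    shared:     whether each exposed user shared the item
    The independence assumption across users gives the Condorcet-like flavor.
    """
    log_fake = math.log(prior_fake)
    log_true = math.log(1.0 - prior_fake)
    for (p_f, p_t), s in zip(user_rates, shared):
        log_fake += math.log(p_f if s else 1.0 - p_f)
        log_true += math.log(p_t if s else 1.0 - p_t)
    m = max(log_fake, log_true)       # normalize in log space for stability
    return math.exp(log_fake - m) / (math.exp(log_fake - m) + math.exp(log_true - m))

# three exposed users: the first two share fake news readily, the third rarely
rates = [(0.6, 0.2), (0.5, 0.25), (0.1, 0.3)]
print(f"P(fake): {posterior_fake(0.01, rates, shared=[True, True, False]):.4f}")
```

Since each exposure contributes a single additive term to the two log-likelihoods, the estimate can be updated as users view or share the item, matching the incremental computation claimed in the abstract.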