Search CORE

43 research outputs found

Non-linear regression models for Approximate Bayesian Computation

Author: A. Butler
A. Gelman
B. Schölkopf
B.D. Ripley
C. Gourieroux
C.M. Bishop
C.P. Robert
D.A. Nix
D.E. Reich
E.A. Nadaraya
G. Weiss
G.E.P. Box
G.S. Watson
I.J. Wilson
J. Fan
J. Hey
J.H. Friedman
J.K. Pritchard
J.K. Pritchard
J.P. King
J.S. Liu
K. Heggland
L.A. Zhivotovsky
L.A. Zhivotovsky
M. Stephens
M. Tanaka
M.A. Beaumont
M.D. Shriver
M.K. Kuhner
N.J.R. Fagundes
O. Ratmann
P. Bortot
P. Marjoram
P. Marjoram
P.J. Diggle
S. Tavaré
S. Tavaré
S.A. Sisson
T. Ohta
T. Toni
V.N. Vapnik
W. Härdle
Y.-X. Fu
Y.-X. Fu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 23/02/2009
Field of study

Approximate Bayesian inference on the basis of summary statistics is well-suited to complex problems for which the likelihood is either mathematically or computationally intractable. However the methods that use rejection suffer from the curse of dimensionality when the number of summary statistics is increased. Here we propose a machine-learning approach to the estimation of the posterior density by introducing two innovations. The new method fits a nonlinear conditional heteroscedastic regression of the parameter on the summary statistics, and then adaptively improves estimation using importance sampling. The new algorithm is compared to the state-of-the-art approximate Bayesian methods, and achieves considerable reduction of the computational burden in two examples of inference in statistical genetics and in a queueing model.Comment: 4 figures; version 3 minor changes; to appear in Statistics and Computin

arXiv.org e-Print Archive

Crossref

How Fitch-Margoliash Algorithm can Benefit from Multi Dimensional Scaling

Author: Hitchcock E.
Darwin C.
Edwards A.W.F.
Sneath P.H.A.
Saitou N.
Salemi M.
Lespinats S.
Jolliffe I.
Kuhner M.K.
Zaretsky K.
Cavalli-Sforza L.L.
Matsuda H.
Swofford D.L.
Li J.
Press W.H.
Glover F.
Goldberg D.E.
Reeves C.R.
Dowsland K.A.
Chalmers M.
Gromov M.
Milman V.D.
Bulmer M.
Demartines P.
Fleiss J.L.
Publication venue: Libertas Academica
Publication date: 01/01/2011
Field of study

Whatever the phylogenetic method, genetic sequences are often described as strings of characters, thus molecular sequences can be viewed as elements of a multi-dimensional space. As a consequence, studying motion in this space (ie, the evolutionary process) must deal with the amazing features of high-dimensional spaces like concentration of measured phenomenon

Crossref

Hal - Université Grenoble Alpes

Directory of Open Access Journals

INRIA a CCSD electronic archive server

PubMed Central

Warwick Research Archives Portal Repository

Online Research Database In Technology

Phylogeography of the white-tailed eagle, a generalist with large dispersal capacity

Crossref

On the Use of Bootstrapped Topologies in Coalescent-Based Bayesian MCMC Inference: A Comparison of Estimation and Computational Efficiencies

Author: Drummond A.J.
Kuhner M.K.
Kuhner M.K.
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

The genetic code can cause systematic bias in simple phylogenetic models

Author: Fitch W.M
Kuhner M.K
Simon Whelan
Yang Z
Publication venue: The Royal Society
Publication date: 27/12/2008
Field of study

Phylogenetic analysis depends on inferential methodology estimating accurately the degree of divergence between sequences. Inaccurate estimates can lead to misleading evolutionary inferences, including incorrect tree topology estimates and poor dating of historical species divergence. Protein coding sequences are ubiquitous in phylogenetic inference, but many of the standard methods commonly used to describe their evolution do not explicitly account for the dependencies between sites in a codon induced by the genetic code. This study evaluates the performance of several standard methods on datasets simulated under a simple substitution model, describing codon evolution under a range of different types of selective pressures. This approach also offers insights into the relative performance of different phylogenetic methods when there are dependencies acting between the sites in the data. Methods based on statistical models performed well when there was no or limited purifying selection in the simulated sequences (low degree of dependency between sites in a codon), although more biologically realistic models tended to outperform simpler models. Phylogenetic methods exhibited greater variability in performance for sequences simulated under strong purifying selection (high degree of the dependencies between sites in a codon). Simple models substantially underestimate the degree of divergence between sequences, and underestimation was more pronounced on the internal branches of the tree. This underestimation resulted in some statistical methods performing poorly and exhibiting evidence for systematic bias in tree inference. Amino acid-based and nucleotide models that contained generic descriptions of spatial and temporal heterogeneity, such as mixture and temporal hidden Markov models, coped notably better, producing more accurate estimates of evolutionary divergence and the tree topology

Crossref

PubMed Central

The University of Manchester - Institutional Repository

A likelihood ratio test for species membership based on DNA sequence data

Author: Cornuet J.M
Cornuet J.M
Kuhner M.K
Mikhail V Matz
Nielsen R
Rasmus Nielsen
Publication venue: The Royal Society
Publication date: 01/01/2005
Field of study

DNA barcoding as an approach for species identification is rapidly increasing in popularity. However, it remains unclear which statistical procedures should accompany the technique to provide a measure of uncertainty. Here we describe a likelihood ratio test which can be used to test if a sampled sequence is a member of an a priori specified species. We investigate the performance of the test using coalescence simulations, as well as using the real data from butterflies and frogs representing two kinds of challenge for DNA barcoding: extremely low and extremely high levels of sequence variability

Crossref

PubMed Central

Copenhagen University Research Information System

TreSpEx–-Detection of Misleading Signal in Phylogenetic Reconstructions Based on Tree Information

Author: Brinkman H.
Dordel J.
Kuhner M.K.
Lockhart P.J.
Milinkovitch M.C.
Swofford D.L.
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

Phylogenomic Inference of Protein Molecular Function

Author: Galperin M.Y.
Kuhner M.K.
McClure M.A.
Saitou N.
Sjölander K.
Sjölander K.
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

Using the quantitative genetic threshold model for inferences between and within species

Author: Edwards A.W.F
Felsenstein J
Felsenstein J
Joseph Felsenstein
Kuhner M.K
Pagel M
Wright S
Wright S
Publication venue: The Royal Society
Publication date: 29/07/2005
Field of study

Sewall Wright's threshold model has been used in modelling discrete traits that may have a continuous trait underlying them, but it has proven difficult to make efficient statistical inferences with it. The availability of Markov chain Monte Carlo (MCMC) methods makes possible likelihood and Bayesian inference using this model. This paper discusses prospects for the use of the threshold model in morphological systematics to model the evolution of discrete all-or-none traits. There the threshold model has the advantage over 0/1 Markov process models in that it not only accommodates polymorphism within species, but can also allow for correlated evolution of traits with far fewer parameters that need to be inferred. The MCMC importance sampling methods needed to evaluate likelihood ratios for the threshold model are introduced and described in some detail

Crossref

PubMed Central

Distance Corrections on Recombinant Sequences

Author: A. Rambaut
A. Rzhetsky
C. Wiuf
D. Bryant
D. Posada
F. Rodriguez
H.-J. Bandelt
J. Chang
J. Maynard-Smith
K. Atteson
K. Strimmer
K. Tamura
M. Schierup
M.K. Kuhner
M.K. Kuhner
O. Gascuel
R.R. Hudson
T.H. Jukes
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2003
Field of study

Crossref