
    Time-varying Learning and Content Analytics via Sparse Factor Analysis

    We propose SPARFA-Trace, a new machine learning-based framework for time-varying learning and content analytics for education applications. We develop a novel message passing-based, blind, approximate Kalman filter for sparse factor analysis (SPARFA) that jointly (i) traces learner concept knowledge over time, (ii) analyzes learner concept knowledge state transitions (induced by interacting with learning resources, such as textbook sections and lecture videos, or by the forgetting effect), and (iii) estimates the content organization and intrinsic difficulty of the assessment questions. These quantities are estimated solely from binary-valued (correct/incorrect) graded learner response data and a summary of the specific actions each learner performs (e.g., answering a question or studying a learning resource) at each time instant. Experimental results on two online course datasets demonstrate that SPARFA-Trace is capable of tracing each learner's concept knowledge evolution over time, as well as analyzing the quality and content organization of learning resources, the question-concept associations, and the question intrinsic difficulties. Moreover, we show that SPARFA-Trace achieves comparable or better performance in predicting unobserved learner responses than existing collaborative filtering and knowledge tracing approaches for personalized education.
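    The Kalman-filtering idea behind learner tracing can be illustrated with a minimal sketch. This is a plain linear-Gaussian filter on a single scalar knowledge state, not SPARFA-Trace's blind, message passing-based approximation; the transition, noise, and prior parameters below are made up for illustration.

```python
import numpy as np

def kalman_trace(responses, a=0.95, q=0.05, r=0.5, m0=0.0, p0=1.0):
    """Trace a scalar latent 'concept knowledge' state over time.

    A minimal linear-Gaussian Kalman filter: the transition
    m_t = a * m_{t-1} + noise models gradual forgetting (a < 1), and
    each graded response (0/1) is treated as a noisy direct
    observation of the knowledge state.
    """
    m, p = m0, p0
    trace = []
    for y in responses:
        # Predict: apply the transition (forgetting) and inflate variance.
        m, p = a * m, a * a * p + q
        # Update: fold in the binary response as a noisy observation.
        k = p / (p + r)          # Kalman gain
        m = m + k * (y - m)
        p = (1 - k) * p
        trace.append(m)
    return np.array(trace)

# One learner: an incorrect answer followed by mostly correct ones.
knowledge = kalman_trace([0, 1, 1, 1, 0, 1])
```

The estimated knowledge rises after a run of correct responses and decays slightly between observations via the forgetting factor `a`.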

    A Geometric Variational Approach to Bayesian Inference

    We propose a novel Riemannian geometric framework for variational inference in Bayesian models based on the nonparametric Fisher-Rao metric on the manifold of probability density functions. Under the square-root density representation, the manifold can be identified with the positive orthant of the unit hypersphere in L2, and the Fisher-Rao metric reduces to the standard L2 metric. Exploiting this Riemannian structure, we formulate the task of approximating the posterior distribution as a variational problem on the hypersphere based on the alpha-divergence. Compared with approaches based on the Kullback-Leibler divergence, this yields a tighter lower bound on the marginal distribution, together with a corresponding upper bound that KL-based approaches cannot provide. We propose a novel gradient-based algorithm for the variational problem based on Fréchet derivative operators motivated by the geometry of the Hilbert sphere, and examine its properties. Through simulations and real-data applications, we demonstrate the utility of the proposed geometric framework and algorithm on several Bayesian models.
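    The square-root representation the abstract relies on is easy to state concretely for discrete distributions: mapping p to sqrt(p) lands on the unit sphere, and the Fisher-Rao geodesic distance becomes the spherical arc length, i.e. the arccosine of the inner product of the square roots. A minimal sketch (note that some references scale this distance by a factor of 2):

```python
import numpy as np

def fisher_rao_distance(p, q):
    """Geodesic distance between two discrete distributions under the
    Fisher-Rao metric. In the square-root representation, densities
    map to the unit sphere and the metric becomes the L2 metric, so
    the geodesic distance is the arc length on the sphere:
    d(p, q) = arccos(<sqrt(p), sqrt(q)>).
    """
    p, q = np.asarray(p, float), np.asarray(q, float)
    inner = np.sqrt(p * q).sum()          # Bhattacharyya coefficient
    return np.arccos(np.clip(inner, -1.0, 1.0))

d_same = fisher_rao_distance([0.5, 0.5], [0.5, 0.5])   # identical -> 0
d_far  = fisher_rao_distance([1.0, 0.0], [0.0, 1.0])   # disjoint -> pi/2
```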

    Learning a Factor Model via Regularized PCA

    We consider the problem of learning a linear factor model. We propose a regularized form of principal component analysis (PCA) and demonstrate, through experiments with synthetic and real data, the superiority of the resulting estimates to those produced by pre-existing factor analysis approaches. We also establish theoretical results that explain how our algorithm corrects the biases induced by conventional approaches. An important feature of our algorithm is that its computational requirements are similar to those of PCA, which enjoys wide use in large part due to its efficiency.
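    One common way to regularize PCA at essentially the same cost is to soft-threshold the singular values, which counteracts the tendency of plain PCA to overestimate signal strength in noise. The sketch below is a generic illustration of that idea, not the specific estimator proposed in the paper; the shrinkage level `lam` is arbitrary.

```python
import numpy as np

def regularized_pca(X, k, lam=1.0):
    """Rank-k reconstruction with singular-value soft-thresholding.

    Plain PCA keeps the top-k singular values unchanged; shrinking
    them by `lam` (and flooring at zero) is a simple bias correction
    with the same O(SVD) computational cost.
    """
    U, s, Vt = np.linalg.svd(X - X.mean(axis=0), full_matrices=False)
    s_shrunk = np.maximum(s[:k] - lam, 0.0)   # soft-threshold top-k
    return (U[:, :k] * s_shrunk) @ Vt[:k]

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 20))
X_hat = regularized_pca(X, k=3, lam=2.0)      # rank <= 3 estimate
```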

    MGMR: leveraging RNA-Seq population data to optimize expression estimation

    Background: RNA-Seq is a technique that uses Next Generation Sequencing to identify transcripts and estimate transcription levels. When applying this technique for quantification, one must contend with reads that align to multiple positions in the genome (multireads). Previous efforts to resolve multireads have shown that RNA-Seq expression estimation can be improved using probabilistic allocation of reads to genes. These methods use a probabilistic generative model for data generation and resolve ambiguity using likelihood-based approaches. In many instances, RNA-Seq experiments are performed in the context of a population. The generative models of current methods do not take such population information into account, and it is an open question whether this information can improve quantification of the individual samples. Results: In order to explore the contribution of population-level information to RNA-Seq quantification, we apply a hierarchical probabilistic generative model, which assumes that expression levels of different individuals are sampled from a Dirichlet distribution with parameters specific to the population, and reads are sampled from the distribution of expression levels. We introduce an optimization procedure for the estimation of the model parameters, and use HapMap data and simulated data to demonstrate that the model yields a significant improvement in the accuracy of expression levels of paralogous genes. Conclusions: We provide a proof of principle of the benefit of drawing on population commonalities to estimate expression. The results of our experiments demonstrate that this approach can be beneficial, primarily for estimation at the gene level.
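    The hierarchical model described above can be sketched as a two-stage sampler: population-level Dirichlet parameters, per-individual expression profiles drawn from them, then reads drawn per individual. The parameter values and dimensions below are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(42)

# Population-level Dirichlet parameters (hypothetical values) over
# three genes; larger totals mean less between-individual variation.
alpha = np.array([50.0, 30.0, 20.0])
n_individuals, n_reads = 5, 1000

# Stage 1: each individual's expression profile from the population.
theta = rng.dirichlet(alpha, size=n_individuals)

# Stage 2: reads for each individual from their expression profile.
reads = np.array([rng.multinomial(n_reads, t) for t in theta])

# Naive per-individual estimate: raw read proportions (the hierarchical
# model would instead pool information across individuals via alpha).
theta_hat = reads / n_reads
```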

    Distinguishing Asthma Phenotypes Using Machine Learning Approaches.

    Asthma is not a single disease, but an umbrella term for a number of distinct diseases, each of which is caused by a distinct underlying pathophysiological mechanism. These discrete disease entities are often labelled as asthma endotypes. The discovery of different asthma subtypes has moved from subjective approaches, in which putative phenotypes are assigned by experts, to data-driven ones which incorporate machine learning. This review focuses on the methodological developments of one such machine learning technique, latent class analysis, and how it has contributed to distinguishing asthma and wheezing subtypes in childhood. It also gives a clinical perspective, presenting the findings of studies from the past 5 years that used this approach. The identification of true asthma endotypes may be a crucial step towards understanding their distinct pathophysiological mechanisms, which could ultimately lead to more precise prevention strategies, identification of novel therapeutic targets and the development of effective personalized therapies.
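    Latent class analysis itself is straightforward to sketch: a mixture model over binary indicators (e.g. presence/absence of symptoms) fit with EM. The code below is a generic LCA sketch on synthetic data, not a reproduction of any study in the review.

```python
import numpy as np

def lca_em(X, n_classes=2, n_iter=100, seed=0):
    """EM for a latent class model with binary indicators.

    Each subject belongs to one of `n_classes` latent classes; within
    a class, the binary indicators are independent Bernoulli variables.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    pi = np.full(n_classes, 1.0 / n_classes)          # class weights
    theta = rng.uniform(0.25, 0.75, (n_classes, d))   # indicator probs
    for _ in range(n_iter):
        # E-step: posterior class responsibilities for each subject.
        log_lik = (X @ np.log(theta).T
                   + (1 - X) @ np.log(1 - theta).T + np.log(pi))
        log_lik -= log_lik.max(axis=1, keepdims=True)
        resp = np.exp(log_lik)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: update weights and per-class indicator probabilities.
        pi = resp.mean(axis=0)
        theta = np.clip((resp.T @ X) / resp.sum(axis=0)[:, None],
                        1e-6, 1 - 1e-6)
    return pi, theta, resp

# Two planted classes: indicators mostly present vs mostly absent.
rng = np.random.default_rng(1)
X = np.vstack([rng.binomial(1, 0.9, (40, 4)),
               rng.binomial(1, 0.1, (40, 4))]).astype(float)
pi, theta, resp = lca_em(X)
```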

    Tabular: A Schema-driven Probabilistic Programming Language

    We propose a new kind of probabilistic programming language for machine learning. We write programs simply by annotating existing relational schemas with probabilistic model expressions. We describe a detailed design of our language, Tabular, complete with formal semantics and type system. A rich series of examples illustrates the expressiveness of Tabular. We report an implementation, and show evidence of the succinctness of our notation relative to current best practice. Finally, we describe and verify a transformation of Tabular schemas so as to predict missing values in a concrete database. The ability to query for missing values provides a uniform interface to a wide variety of tasks, including classification, clustering, recommendation, and ranking.

    Piecewise Approximate Bayesian Computation: fast inference for discretely observed Markov models using a factorised posterior distribution

    Many modern statistical applications involve inference for complicated stochastic models for which the likelihood function is difficult or even impossible to calculate, and hence conventional likelihood-based inferential techniques cannot be used. In such settings, Bayesian inference can be performed using Approximate Bayesian Computation (ABC). However, in spite of many recent developments to ABC methodology, in many applications the computational cost of ABC necessitates the choice of summary statistics and tolerances that can potentially severely bias the estimate of the posterior. We propose a new “piecewise” ABC approach suitable for discretely observed Markov models that involves writing the posterior density of the parameters as a product of factors, each a function of only a subset of the data, and then using ABC within each factor. The approach has the advantage of side-stepping the need to choose a summary statistic and it enables a stringent tolerance to be set, making the posterior “less approximate”. We investigate two methods for estimating the posterior density based on ABC samples for each of the factors: the first is to use a Gaussian approximation for each factor, and the second is to use a kernel density estimate. Both methods have their merits. The Gaussian approximation is simple, fast, and probably adequate for many applications. On the other hand, using instead a kernel density estimate has the benefit of consistently estimating the true piecewise ABC posterior as the number of ABC samples tends to infinity. We illustrate the piecewise ABC approach with four examples; in each case, the approach offers fast and accurate inference.
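    The per-factor step is just rejection ABC conditioned on a single observed transition, which is why no summary statistic is needed and a tight tolerance stays affordable. The sketch below uses an illustrative AR(1)-style Markov chain, not a model from the paper; prior range, noise scale, and tolerance are made up.

```python
import numpy as np

rng = np.random.default_rng(0)

def abc_factor(x_prev, x_next, n_samples=20000, tol=0.05):
    """ABC rejection for a single Markov transition factor.

    Illustrative model: x_t = theta * x_{t-1} + N(0, 0.1^2), with a
    Uniform(0, 1) prior on theta. Each factor conditions on only one
    observed transition, so simulated and observed values are compared
    directly, with no summary statistic.
    """
    theta = rng.uniform(0.0, 1.0, n_samples)           # prior draws
    x_sim = theta * x_prev + rng.normal(0, 0.1, n_samples)
    return theta[np.abs(x_sim - x_next) < tol]         # accepted draws

# One observed transition, generated as if the true theta were 0.7.
accepted = abc_factor(x_prev=1.0, x_next=0.7)
theta_hat = accepted.mean()
```

In the full piecewise approach, one such sample is drawn per transition and the factor posteriors are combined via a Gaussian approximation or a kernel density estimate, as the abstract describes.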

    Probabilistic machine learning and artificial intelligence.

    How can a machine learn from experience? Probabilistic modelling provides a framework for understanding what learning is, and has therefore emerged as one of the principal theoretical and practical approaches for designing machines that learn from data acquired through experience. The probabilistic framework, which describes how to represent and manipulate uncertainty about models and predictions, has a central role in scientific data analysis, machine learning, robotics, cognitive science and artificial intelligence. This Review provides an introduction to this framework, and discusses some of the state-of-the-art advances in the field, namely, probabilistic programming, Bayesian optimization, data compression and automatic model discovery. The author acknowledges an EPSRC grant EP/I036575/1, the DARPA PPAML programme, a Google Focused Research Award for the Automatic Statistician and support from Microsoft Research. This is the author accepted manuscript; the final version is available from NPG at http://www.nature.com/nature/journal/v521/n7553/full/nature14541.html#abstract
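    The core of the probabilistic framework, representing uncertainty as a distribution and updating it with data, fits in a few lines. A Beta-Bernoulli conjugate update, with hypothetical coin-flip data:

```python
# Beta-Bernoulli conjugate update: the posterior over a coin's bias
# after observing flips. With a Beta(a, b) prior, each observed heads
# increments a and each tails increments b.
a, b = 1.0, 1.0                       # Beta(1, 1): uniform prior
flips = [1, 1, 0, 1, 1, 0, 1, 1]      # hypothetical data: 6 heads, 2 tails
a += sum(flips)
b += len(flips) - sum(flips)

# Posterior is Beta(7, 3); its mean is also the predictive
# probability that the next flip is heads.
posterior_mean = a / (a + b)          # 0.7
```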

    Independent Component Analysis of the Effect of L-dopa on fMRI of Language Processing

    L-dopa, which is a precursor for dopamine, acts to amplify strong signals and dampen weak signals, as suggested by previous studies. The effect of L-dopa has been demonstrated in language studies, suggesting restriction of the semantic network. In this study, we aimed to examine the effect of L-dopa on language processing with fMRI using Independent Component Analysis (ICA). Two types of language tasks (phonological and semantic categorization tasks) were tested under two drug conditions (placebo and L-dopa) in 16 healthy subjects. Probabilistic ICA (PICA), part of FSL, was implemented to generate Independent Components (ICs) for each subject for the four conditions, and the ICs were classified into task-relevant source groups by a correlation threshold criterion. Our key findings include: (i) the highly task-relevant brain regions, including the Left Inferior Frontal Gyrus (LIFG), Left Fusiform Gyrus (LFUS), Left Parietal Lobe (LPAR) and Superior Temporal Gyrus (STG), were activated with both L-dopa and placebo for both tasks, and (ii) as compared to placebo, L-dopa was associated with increased activity in posterior regions, including the superior temporal area (BA 22), and decreased activity in the thalamus (pulvinar) and inferior frontal gyrus (BA 11/47) for both tasks. These results raise the possibility that L-dopa may exert an indirect effect on posterior regions mediated by the thalamus (pulvinar).
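    ICA's unmixing step can be demonstrated on toy signals. The sketch below uses scikit-learn's FastICA on two synthetic sources as a stand-in only; the study used FSL's probabilistic ICA on fMRI volumes, which additionally estimates the number of components and models noise.

```python
import numpy as np
from sklearn.decomposition import FastICA

# Two independent sources mixed linearly (a toy stand-in for the
# mixed time series an fMRI scan records).
t = np.linspace(0, 8, 1000)
s1 = np.sin(2 * np.pi * t)                 # smooth oscillation
s2 = np.sign(np.sin(3 * np.pi * t))        # square wave
S = np.c_[s1, s2]
A = np.array([[1.0, 0.5], [0.4, 1.0]])     # mixing matrix
X = S @ A.T                                # observed mixtures

# ICA recovers the sources up to permutation and scaling.
ica = FastICA(n_components=2, random_state=0)
S_hat = ica.fit_transform(X)
```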