Search CORE

250 research outputs found

A Bayesian method for evaluating and discovering disease loci associations

Author: A Galvin
AB Moffa
B Kuschel
B Tycko
C Hoggart
D Heckerman
DF Easton
DJ Hunter
DR Velez
EM Reiman
GF Cooper
Gregory F. Cooper
H Shi
J Wakefield
J Wu
JD Storey
JD Storey
JD Storey
JS Barnholtz-Sloan
KD Coon
L Ding
LW Hahn
M McCarthy
M. Michael Barmada
MD Fallin
Michael J. Becich
N Bonifaci
N Risch
P Sebastiani
R Grose
RA Fisher
RA Fisher
RE Neapolitan
RE Neapolitan
RE Neapolitan
S Visweswaran
S Wacholder
Vladimir Brusic
X Jiang
X Jiang
X Jiang
X Liang
Xia Jiang
Y Benjamin
Y Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011

Background: A genome-wide association study (GWAS) typically involves examining representative SNPs in individuals from some population. A GWAS data set can concern a million SNPs and may soon concern billions. Researchers investigate the association of each SNP individually with a disease, and it is becoming increasingly commonplace to also analyze multi-SNP associations. Techniques for handling so many hypotheses include the Bonferroni correction and recently developed Bayesian methods. These methods can encounter problems. Most importantly, they are not applicable to a complex multi-locus hypothesis which has several competing hypotheses rather than only a null hypothesis. A method that computes the posterior probability of complex hypotheses is a pressing need. Methodology/Findings: We introduce the Bayesian network posterior probability (BNPP) method, which addresses these difficulties. The method represents the relationship between a disease and SNPs using a directed acyclic graph (DAG) model, and computes the likelihood of such models using a Bayesian network scoring criterion. The posterior probability of a hypothesis is computed based on the likelihoods of all competing hypotheses. The BNPP can be used not only to evaluate a hypothesis that has previously been discovered or suspected, but also to discover new disease loci associations. The results of experiments using simulated and real data sets are presented. Our results concerning simulated data sets indicate that the BNPP exhibits both better evaluation and better discovery performance than a p-value based method. For the real data sets, previous findings in the literature are confirmed and additional findings are reported. Conclusions/Significance: We conclude that the BNPP resolves a pressing problem by providing a way to compute the posterior probability of complex multi-locus hypotheses. A researcher can use the BNPP to determine the expected utility of investigating a hypothesis further. Furthermore, we conclude that the BNPP is a promising method for discovering disease loci associations. © 2011 Jiang et al.
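
The abstract states that the posterior probability of a hypothesis is computed from the likelihoods of all competing hypotheses. A minimal sketch of that normalization step follows. It is not the authors' implementation: the hypothesis names, log-likelihoods and priors are hypothetical values chosen for illustration, and the function `posterior_probabilities` is an assumed helper, not part of the BNPP software.

```python
import math

def posterior_probabilities(log_likelihoods, priors):
    """Normalize prior * likelihood over a set of competing hypotheses.

    Both arguments are dicts keyed by hypothesis name. Likelihoods are
    passed as natural logs, because model likelihoods for large data
    sets are far too small to represent directly.
    """
    log_joint = {h: math.log(priors[h]) + ll for h, ll in log_likelihoods.items()}
    # Log-sum-exp: subtract the maximum before exponentiating to avoid underflow.
    m = max(log_joint.values())
    log_total = m + math.log(sum(math.exp(v - m) for v in log_joint.values()))
    return {h: math.exp(v - log_total) for h, v in log_joint.items()}

# Hypothetical competing hypotheses for one two-SNP association (assumed values).
log_lik = {"no_association": -1050.0, "snp_a_only": -1046.0, "snp_a_and_b": -1042.0}
priors = {"no_association": 0.98, "snp_a_only": 0.01, "snp_a_and_b": 0.01}

post = posterior_probabilities(log_lik, priors)
for name, p in sorted(post.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {p:.4f}")
```

With these assumed numbers the two-SNP hypothesis receives most of the posterior mass despite its small prior, which illustrates why the competing hypotheses have to be scored together rather than each against a null alone.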

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

D-Scholarship@Pitt

The Francis Crick Institute

Inequality and violent crime: evidence from data on robbery and violent theft

Author: Alvarez
Bourguignon François
Di Tella Rafael
Eric Neumayer
Fleisher Belton
Gartner Rosemary
Gibney Mark
Gurr Ted Robert
Hagan John
Neapolitan Jerome L.
Prillaman William C.
Saridakis George
Soares Rodrigo Reis
UN
UN-WIDER
Van Dijk Jan
Wooldridge Jeffrey M.
World Bank
World Bank
World Health Organization
Publication venue: 'SAGE Publications'
Publication date: 03/12/2004

This article argues that the link between income inequality and violent property crime might be spurious, complementing a similar argument in prior analysis by the author on the determinants of homicide. In contrast, Fajnzylber, Lederman & Loayza (1998; 2002a, b) provide seemingly strong and robust evidence that inequality causes a higher rate of both homicide and robbery/violent theft even after controlling for country-specific fixed effects. Our results suggest that inequality is not a statistically significant determinant, unless either country-specific effects are not controlled for or the sample is artificially restricted to a small number of countries. The reason why the link between inequality and violent property crime might be spurious is that income inequality is likely to be strongly correlated with country-specific fixed effects such as cultural differences. A high degree of inequality might be socially undesirable for any number of reasons, but that it causes violent crime is far from proven.

Crossref

LSE Research Online

Micro-analysis of seriation skills

Author: Neapolitan Denise M.
Publication venue: The University of Edinburgh
Publication date: 01/01/1991

Edinburgh Research Archive

Uniform random generation of large acyclic digraphs

Author: B. Steinsky
B. Steinsky
B. Steinsky
B.D. McKay
D. Colombo
D. Madigan
D. Madigan
E.A. Bender
E.A. Bender
F. Emmert-Streib
G. Melançon
G. Melançon
Giusi Moffa
I. Alon
J.M. Peña
J.S. Ide
J.S. Ide
Jack Kuipers
M. Grzegorczyk
M. Kalisch
M. Kalisch
M. Scutari
N. Friedman
N. Friedman
R. Daly
R.E. Neapolitan
R.P. Stanley
R.W. Robinson
R.W. Robinson
R.W. Robinson
S.A. Andersson
S.B. Gillispie
S.L. Lauritzen
V. Liskovets
X. Jiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Directed acyclic graphs are the basic representation of the structure underlying Bayesian networks, which represent multivariate probability distributions. In many practical applications, such as the reverse engineering of gene regulatory networks, not only the estimation of model parameters but the reconstruction of the structure itself is of great interest. As well as for the assessment of different structure learning algorithms in simulation studies, a uniform sample from the space of directed acyclic graphs is required to evaluate the prevalence of certain structural features. Here we analyse how to sample acyclic digraphs uniformly at random through recursive enumeration, an approach previously thought too computationally involved. Based on complexity considerations, we discuss in particular how the enumeration directly provides an exact method, which avoids the convergence issues of the alternative Markov chain methods and is actually computationally much faster. The limiting behaviour of the distribution of acyclic digraphs then allows us to sample arbitrarily large graphs. Building on the ideas of recursive enumeration based sampling we also introduce a novel hybrid Markov chain with much faster convergence than current alternatives while still being easy to adapt to various restrictions. Finally we discuss how to include such restrictions in the combinatorial enumeration and the new hybrid Markov chain method for efficient uniform sampling of the corresponding graphs. Comment: 15 pages, 2 figures. To appear in Statistics and Computing.
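
The recursive enumeration mentioned in the abstract can be illustrated with the classical recurrence for counting labelled acyclic digraphs, usually attributed to Robinson: a_n = sum over k = 1..n of (-1)^(k+1) * C(n, k) * 2^(k(n-k)) * a_(n-k), with a_0 = 1. The sketch below only counts graphs; it assumes this is the kind of enumeration the abstract builds on and does not reproduce the paper's sampling procedure. The function name `count_dags` is chosen here for illustration.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def count_dags(n):
    """Number of labelled directed acyclic graphs on n nodes.

    Inclusion-exclusion over k, the size of a chosen set of nodes with
    no incoming edges: each of those k nodes may or may not point to
    each of the remaining n - k nodes, giving 2**(k * (n - k)) choices.
    """
    if n == 0:
        return 1
    return sum(
        (-1) ** (k + 1) * comb(n, k) * 2 ** (k * (n - k)) * count_dags(n - k)
        for k in range(1, n + 1)
    )

print([count_dags(n) for n in range(6)])  # [1, 1, 3, 25, 543, 29281]
```

The counts grow super-exponentially, which is why Python's arbitrary-precision integers are convenient here and why naive enumeration of all graphs is not an option for sampling.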

arXiv.org e-Print Archive

University of Regensburg Publication Server

Crossref

edoc

An evolutionary technique to approximate multiple optimal alignments

Author: A Adriansyah
B Dongen van
B Vázquez-Barreiros
D Reißner
D Ruppert
F Mannhardt
F Taymouri
F Taymouri
J Munoz-Gama
M Koorneef
M Leoni de
R Neapolitan
SB Needleman
SJJ Leemans
T Murata
WMP Aalst van der
WMP Aalst van der
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

The alignment of observed and modeled behavior is an essential aid for organizations, since it opens the door for root-cause analysis and enhancement of processes. The state-of-the-art technique for computing alignments has exponential time and space complexity, hindering its applicability for medium and large instances. Moreover, the fact that there may be multiple optimal alignments is perceived as a negative situation, while in reality it may provide a more comprehensive picture of the model’s explanation of observed behavior, from which other techniques may benefit. This paper presents a novel evolutionary technique for approximating multiple optimal alignments. Remarkably, the memory footprint of the proposed technique is bounded, representing an unprecedented guarantee with respect to the state-of-the-art methods for the same task. The technique is implemented into a tool, and experiments on several benchmarks are provided.

Crossref

UPCommons. Portal del coneixement obert de la UPC

UPCommons (Universitat Politècnica de Catalunya)

A genetic algorithm-Bayesian network approach for the analysis of metabolomics and spectroscopic data: application to the rapid detection of Bacillus spores and identification of Bacillus species

Author: A Atrih
AD Warth
AP Snyder
CD Havey
D Heckerman
DE Goldberg
DE Goldberg
DH Wolpert
DM Chickering
E Ghiamati
EC Lopez-Diez
Elon Correa
FV Jensen
IH Witten
J Opitz
J Pearl
JF Hair
JH Holland
L Breiman
LA Shute
LA Shute
M Barker
M Mitchell
M Seasholtz
MB Beverly
MW Tabor
N Sproch
NA Karp
P Zhang
R Goodacre
RE Neapolitan
Royston Goodacre
RR Bouckaert
SH Pendukar
SJ DeLuca
SL Lauritzen
Ss Huang
TV Inglesby
W Barnaby
X Zeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

Background: The rapid identification of Bacillus spores and bacterial identification are paramount because of their implications in food poisoning, pathogenesis and their use as potential biowarfare agents. Many automated analytical techniques such as Curie-point pyrolysis mass spectrometry (Py-MS) have been used to identify bacterial spores, giving rise to large amounts of analytical data. This high number of features makes interpretation of the data extremely difficult. We analysed Py-MS data from 36 different strains of aerobic endospore-forming bacteria encompassing seven different species. These bacteria were grown axenically on nutrient agar, and vegetative biomass and spores were analyzed by Curie-point Py-MS. Results: We develop a novel genetic algorithm-Bayesian network algorithm that accurately identifies and selects a small subset of key relevant mass spectra (biomarkers) to be further analysed. Once identified, this subset of relevant biomarkers was then used to identify Bacillus spores successfully and to identify Bacillus species via a Bayesian network model specifically built for this reduced set of features. Conclusions: This final compact Bayesian network classification model is parsimonious and computationally fast to run, and its graphical visualization allows easy interpretation of the probabilistic relationships among selected biomarkers. In addition, we compare the features selected by the genetic algorithm-Bayesian network approach with the features selected by partial least squares-discriminant analysis (PLS-DA). The classification accuracy results show that the set of features selected by the GA-BN is far superior to that selected by PLS-DA.

University of Salford Institutional Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The University of Manchester - Institutional Repository

Constraint solving in uncertain and dynamic environments - a survey

Author: A. Borning
A. Davenport
A. Mackworth
B. Faltings
B. Freeman-Benson
C. Boutilier
C. Lottaz
D. Fowler
E. Gelle
E. Hebrard
F. Fages
G. Verfaillie
Gérard Verfaillie
H. E. Sakkout
I. Miguel
J. Amilhastre
J. Doyle
J. Kleer de
J. Pearl
L. Bordeaux
M. Ginsberg
M. Littman
M. Puterman
M. Sannella
M. Yokoo
N. Jussien
N. Muscettola
Narendra Jussien
P. Berlandier
P. V. Hentenryck
R. Alami
R. Bryant
R. Debruyne
R. Debruyne
R. Dechter
R. Dechter
R. Neapolitan
R. Wallace
S. Bistarelli
S. Minton
T. Schiex
T. Vidal
T. Walsh
U. Montanari
W. Harvey
Y. Georget
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005

This article follows a tutorial given by the authors on dynamic constraint solving at CP 2003 (Ninth International Conference on Principles and Practice of Constraint Programming) in Kinsale, Ireland. It aims to offer an overview of the main approaches and techniques that have been proposed in the domain of constraint satisfaction to deal with uncertain and dynamic environments.

Crossref

INRIA a CCSD electronic archive server

HAL Mines Nantes

HAL: Hyper Article en Ligne

What is behind a summary-evaluation decision?

Author: A. B. Inoue
A. Bandura
A. L. Brown
Ana Arruarte
B. M. Taylor
B. Robinson
C. Glymour
C. S. Peirce
C. Sherrard
D. Cassany
D. E. Rumelhart
D. Heckerman
D. W. Hosmer
E. B. Page
E. Kozminsky
E. M. Glazer
F. C. Bartlett
F. Genesee
F. V. Jensen
G. H. Bower
G. J. Cizek
G. K. W. K. Chung
G. L. Goldberg
I. Mani
I. Zipitria
Iraide Zipitria
J. Burstein
J. Catlett
J. D. Bransford
J. Dougherty
J. Fitzgerald
J. H. Holland
J. Long
J. Pearl
J. Pearl
J. R. Kirby
J. Whittaker
Jon A. Elorriaga
L. Breiman
L. Magnani
L. Magnani
L. Manelis
M. Minsky
M. R. Elosúa
M. Stone
M. Virvou
N. Cristianini
N. Friedman
P. Clark
P. Langley
P. N. Winograd
P. Spirtes
P. W. Thorndyke
Pedro Larrañaga
R. A. Fisher
R. Blanco
R. C. Schank
R. Cook
R. E. Neapolitan
R. Garner
R. Garner
R. Kerber
Ruben Armañanzas
S. E. Shimony
S. L. Lauritzen
S. Symons
T. Bayes
T. K. Landauer
T. M. Cover
U. M. Fayyad
V. Dimitrova
W. G. Lehnert
W. H. Kruskal
W. Kintsch
W. S. McCulloch
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008

Research in psychology has reported that, among the variety of possibilities for assessment methodologies, summary evaluation offers a particularly adequate context for inferring text comprehension and topic understanding. However, grades obtained in this methodology are hard to quantify objectively. Therefore, we carried out an empirical study to analyze the decisions underlying human summary-grading behavior. The task consisted of expert evaluation of summaries produced in critically relevant contexts of summarization development, and the resulting data were modeled by means of Bayesian networks using an application called Elvira, which allows for graphically observing the predictive power (if any) of the resultant variables. Thus, in this article, we analyzed summary-evaluation decision making in a computational framework.

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM (Univ. Politécnica de Madrid)

Learning genetic epistasis using Bayesian network scoring criteria

Author: A Heidema
A Herbert
AJ Brookes
B Han
BA Logsdon
BM Armes
CJ Verzilli
D Brinza
D Heckerman
D Thomas
DR Velez
E Castillo
E Perrier
E Segal
EM Reiman
FV Jensen
FV Jensen
GF Cooper
HJ Cordell
J Pearl
J Rissanen
J Suzuki
J Wu
JC Lambert
JH Moore
K Korb
KD Coon
LW Hahn
M Chickering
M Fishelson
M Fishelson
M Michael Barmada
M Spinola
MD Ritchie
N Friedman
N Friedman
N Friedman
N Friedman
N Friedman
P Sebastiani
P Spirtes
RE Neapolitan
RE Neapolitan
RE Neapolitan
RI Nagel
Richard E Neapolitan
RW Robinson
S Visweswaran
Shyam Visweswaran
T Silander
TT Wu
W Bateson
W Wongseree
X Jiang
X Wan
X Zhang
Xia Jiang
Y Meng
Y Meng
YM Cho
Publication venue: BioMed Central
Publication date: 01/03/2011

Background: Gene-gene epistatic interactions likely play an important role in the genetic basis of many common diseases. Recently, machine-learning and data mining methods have been developed for learning epistatic relationships from data. A well-known combinatorial method that has been successfully applied for detecting epistasis is Multifactor Dimensionality Reduction (MDR). Jiang et al. created a combinatorial epistasis learning method called BNMBL to learn Bayesian network (BN) epistatic models. They compared BNMBL to MDR using simulated data sets. Each of these data sets was generated from a model that associates two SNPs with a disease and includes 18 unrelated SNPs. For each data set, BNMBL and MDR were used to score all 2-SNP models, and BNMBL learned significantly more correct models. In real data sets, we ordinarily do not know the number of SNPs that influence phenotype. BNMBL may not perform as well if we also scored models containing more than two SNPs. Furthermore, a number of other BN scoring criteria have been developed. They may detect epistatic interactions even better than BNMBL. Although BNs are a promising tool for learning epistatic relationships from data, we cannot confidently use them in this domain until we determine which scoring criteria work best or even well when we try learning the correct model without knowledge of the number of SNPs in that model. Results: We evaluated the performance of 22 BN scoring criteria using 28,000 simulated data sets and a real Alzheimer's GWAS data set. Our results were surprising in that the Bayesian scoring criterion with large values of a hyperparameter called α performed best. This score performed better than other BN scoring criteria and MDR at recall using simulated data sets, at detecting the hardest-to-detect models using simulated data sets, and at substantiating previous results using the real Alzheimer's data set. Conclusions: We conclude that representing epistatic interactions using BN models and scoring them using a BN scoring criterion holds promise for identifying epistatic genetic variants in data. In particular, the Bayesian scoring criterion with large values of a hyperparameter α appears more promising than a number of alternatives.
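
To make the role of the hyperparameter α concrete, the sketch below computes a Bayesian Dirichlet score for a single disease node given its SNP parents, in the commonly used likelihood-equivalent uniform (BDeu) form. This is an assumption about which Bayesian score the abstract means; the function `log_bdeu`, its arguments and the toy data are illustrative and are not taken from the paper.

```python
from collections import Counter
from math import lgamma, log

def log_bdeu(child, parents, r_child, parent_cards, alpha):
    """Log BDeu score of one node given its parents.

    child        -- list of observed child states (e.g. disease status)
    parents      -- list of tuples, one parent configuration per sample
    r_child      -- number of states of the child
    parent_cards -- number of states of each parent (empty list if none)
    alpha        -- equivalent sample size hyperparameter
    """
    q = 1
    for card in parent_cards:
        q *= card                      # number of parent configurations
    a_j = alpha / q                    # prior mass per parent configuration
    a_jk = alpha / (q * r_child)       # prior mass per (configuration, state)
    n_j = Counter(parents)
    n_jk = Counter(zip(parents, child))
    # Unobserved configurations contribute exactly zero, so only observed
    # counts need to be visited.
    score = sum(lgamma(a_j) - lgamma(a_j + n) for n in n_j.values())
    score += sum(lgamma(a_jk + n) - lgamma(a_jk) for n in n_jk.values())
    return score

# Toy data (assumed): disease status copies a binary SNP in 20 samples.
snp = [(0,)] * 10 + [(1,)] * 10
disease = [0] * 10 + [1] * 10
with_parent = log_bdeu(disease, snp, 2, [2], alpha=1.0)
no_parent = log_bdeu(disease, [()] * 20, 2, [], alpha=1.0)
print(with_parent > no_parent)  # True: the SNP model explains the data better
```

Larger values of α spread more prior mass over each parent configuration, which changes how strongly the score penalizes models with many SNP parents; the sketch lets that effect be explored by varying `alpha`.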

Crossref

Directory of Open Access Journals

PubMed Central

D-Scholarship@Pitt

Building and Testing of an Adaptive Optics System for Optical Microscopy

Author: Adams S.
Cole A.
Directorate of Defence Aviation and Air Force Safety
Durso F. T.
Gigerenzer G.
Klein G.
Marr D.
Mitchell T. M.
Neapolitan R.
Neisser U.
Taylor R. M.
Publication venue: eCommons
Publication date: 18/04/2012

Adaptive optics (AO), the technology of compensating for wavefront distortion, can significantly improve the performance of existing optical systems. An adaptive optics system corrects the wavefront distortion caused by imperfections in optical elements and in the environment. It was originally developed for military and astronomy applications to mitigate the adverse effect of wavefront distortions caused by turbulence in Earth's atmosphere. With a closed-loop AO system, distortions caused by the environment can be reduced dramatically. As the technology matures, AO systems can be integrated into a wide variety of optical systems to improve their performance. The goal of this project is to build such an AO system that can be integrated into high-resolution optical microscopy. A Thorlabs Adaptive Optics Kit was set up. A Shack-Hartmann wavefront sensor, a deformable mirror and other necessary optics hardware were combined on a breadboard, and the control software was implemented to form the feedback loop.

Crossref

University of Dayton

University of Melbourne Institutional Repository

UQ eSpace (University of Queensland)