54 research outputs found
Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies
Testing one SNP at a time does not fully realise the potential of genome-wide association studies to identify multiple causal variants, which is a plausible scenario for many complex diseases. We show that simultaneous analysis of the entire set of SNPs from a genome-wide study to identify the subset that best predicts disease outcome is now feasible, thanks to developments in stochastic search methods. We used a Bayesian-inspired penalised maximum likelihood approach in which every SNP can be considered for additive, dominant, and recessive contributions to disease risk. Posterior mode estimates were obtained for regression coefficients that were each assigned a prior with a sharp mode at zero. A non-zero coefficient estimate was interpreted as corresponding to a significant SNP. We investigated two prior distributions and show that the normal-exponential-gamma prior leads to improved SNP selection in comparison with single-SNP tests. We also derived an explicit approximation for type-I error that avoids the need to use permutation procedures. As well as genome-wide analyses, our method is well-suited to fine mapping with very dense SNP sets obtained from re-sequencing and/or imputation. It can accommodate quantitative as well as case-control phenotypes, covariate adjustment, and can be extended to search for interactions. Here, we demonstrate the power and empirical type-I error of our approach using simulated case-control data sets of up to 500 K SNPs, a real genome-wide data set of 300 K SNPs, and a sequence-based dataset, each of which can be analysed in a few hours on a desktop workstation
Comparison of evolutionary algorithms in gene regulatory network model inference
Background: The evolution of high throughput technologies that measure gene expression levels has created a
data base for inferring GRNs (a process also known as reverse engineering of GRNs). However, the nature of
these data has made this process very di±cult. At the moment, several methods of discovering qualitative
causal relationships between genes with high accuracy from microarray data exist, but large scale quantitative
analysis on real biological datasets cannot be performed, to date, as existing approaches are not suitable for real
microarray data which are noisy and insu±cient.
Results: This paper performs an analysis of several existing evolutionary algorithms for quantitative gene
regulatory network modelling. The aim is to present the techniques used and o®er a comprehensive comparison
of approaches, under a common framework. Algorithms are applied to both synthetic and real gene expression
data from DNA microarrays, and ability to reproduce biological behaviour, scalability and robustness to noise are assessed and compared.
Conclusions: Presented is a comparison framework for assessment of evolutionary algorithms, used to infer gene
regulatory networks. Promising methods are identi¯ed and a platform for development of appropriate model
formalisms is established
- …