102,801 research outputs found
Learning the optimal scale for GWAS through hierarchical SNP aggregation
Motivation: Genome-Wide Association Studies (GWAS) seek to identify causal
genomic variants associated with rare human diseases. The classical statistical
approach for detecting these variants is based on univariate hypothesis
testing, with healthy individuals being tested against affected individuals at
each locus. Given that an individual's genotype is characterized by up to one
million SNPs, this approach lacks precision, since it may yield a large number
of false positives that can lead to erroneous conclusions about genetic
associations with the disease. One way to improve the detection of true genetic
associations is to reduce the number of hypotheses to be tested by grouping
SNPs. Results: We propose a dimension-reduction approach which can be applied
in the context of GWAS by making use of the haplotype structure of the human
genome. We compare our method with standard univariate and multivariate
approaches on both synthetic and real GWAS data, and we show that reducing the
dimension of the predictor matrix by aggregating SNPs gives a greater precision
in the detection of associations between the phenotype and genomic regions
Matching Methods for Causal Inference: A Review and a Look Forward
When estimating causal effects using observational data, it is desirable to
replicate a randomized experiment as closely as possible by obtaining treated
and control groups with similar covariate distributions. This goal can often be
achieved by choosing well-matched samples of the original treated and control
groups, thereby reducing bias due to the covariates. Since the 1970s, work on
matching methods has examined how to best choose treated and control subjects
for comparison. Matching methods are gaining popularity in fields such as
economics, epidemiology, medicine and political science. However, until now the
literature and related advice has been scattered across disciplines.
Researchers who are interested in using matching methods---or developing
methods related to matching---do not have a single place to turn to learn about
past and current research. This paper provides a structure for thinking about
matching methods and guidance on their use, coalescing the existing research
(both old and new) and providing a summary of where the literature on matching
methods is now and where it should be headed.Comment: Published in at http://dx.doi.org/10.1214/09-STS313 the Statistical
Science (http://www.imstat.org/sts/) by the Institute of Mathematical
Statistics (http://www.imstat.org
- …