Search CORE

507 research outputs found

The EM Algorithm in Genetics, Genomics and Public Health

Author: Laird Nan M.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 11/04/2011
Field of study

The popularity of the EM algorithm owes much to the 1977 paper by Dempster, Laird and Rubin. That paper gave the algorithm its name, identified the general form and some key properties of the algorithm and established its broad applicability in scientific research. This review gives a nontechnical introduction to the algorithm for a general scientific audience, and presents a few examples characteristic of its application.Comment: Published in at http://dx.doi.org/10.1214/08-STS270 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

The Role of Family-Based Designs in Genome-Wide Association Studies

Author: Laird Nan M.
Lange Christoph
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2009
Field of study

Genome-Wide Association Studies (GWAS) offer an exciting and promising new research avenue for finding genes for complex diseases. Traditional case-control and cohort studies offer many advantages for such designs. Family-based association designs have long been attractive for their robustness properties, but robustness can mean a loss of power. In this paper we discuss some of the special features of family designs and their relevance in the era of GWAS.Comment: Published in at http://dx.doi.org/10.1214/08-STS280 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

fgui: A Method for Automatically Creating Graphical User Interfaces for Command-Line R Packages

Author: Nan M. Laird
Thomas J. Hoffmann
Publication venue
Publication date
Field of study

The fgui R package is designed for developers of R packages, to help rapidly, and sometimes fully automatically, create a graphical user interface for a command line R package. The interface is built upon the Tcl/Tk graphical interface included in R. The package further facilitates the developer by loading in the help files from the command line functions to provide context sensitive help to the user with no additional effort from the developer. Passing a function as the argument to the routines in the fgui package creates a graphical interface for the function, and further options are available to tweak this interface for those who want more flexibility.

Research Papers in Economics

fgui: A Method for Automatically Creating Graphical User Interfaces for Command-Line R Packages

Author: Hoffmann Thomas J.
Laird Nan M.
Publication venue: 'Foundation for Open Access Statistic'
Publication date: 01/04/2009
Field of study

Directory of Open Access Journals

PubMed Central

Journal of Statistical Software

The Sib Transmission/Disequilibrium Test is a Mantel-Haenszel Test

Author: Blacker Deborah
Laird Nan M.
Wilcox Marsha
Publication venue: The American Society of Human Genetics. Published by Elsevier Inc.
Publication date: 31/12/1998
Field of study

Elsevier - Publisher Connector

Fitting ACE Structural Equation Models to Case-Control Family Data

Author: Hudson James I
Javaras Kristin N
Laird Nan M
Publication venue: Collection of Biostatistics Research Archive
Publication date: 18/03/2009
Field of study

Investigators interested in whether a disease aggregates in families often collect case-control family data, which consist of disease status and covariate information for families selected via case or control probands. Here, we focus on the use of case-control family data to investigate the relative contributions to the disease of additive genetic effects (A), shared family environment (C), and unique environment (E). To this end, we describe a ACE model for binary family data and then introduce an approach to fitting the model to case-control family data. The structural equation model, which has been described previously, combines a general-family extension of the classic ACE twin model with a (possibly covariate-specific) liability-threshold model for binary outcomes. Our likelihood-based approach to fitting involves conditioning on the proband’s disease status, as well as setting prevalence equal to a pre-specified value that can be estimated from the data themselves if necessary. Simulation experiments suggest that our approach to fitting yields approximately unbiased estimates of the A, C, and E variance components, provided that certain commonly-made assumptions hold. These assumptions include: the usual assumptions for the classic ACE and liability-threshold models; assumptions about shared family environment for relative pairs; and assumptions about the case-control family sampling, including single ascertainment. When our approach is used to fit the ACE model to Austrian case-control family data on depression, the resulting estimate of heritability is very similar to those from previous analyses of twin data

PubMed Central

Collection Of Biostatistics Research Archive

Recommended from our members

Identifying causal rare variants of disease through family-based analysis of Genetics Analysis Workshop 17 data set

Author: De Gourab
Laird Nan
Raby Benjamin A
Yip Wai-Ki
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Linkage- and association-based methods have been proposed for mapping disease-causing rare variants. Based on the family information provided in the Genetic Analysis Workshop 17 data set, we formulate a two-pronged approach that combines both methods. Using the identity-by-descent information provided for eight extended pedigrees (n = 697) and the simulated quantitative trait Q1, we explore various traditional nonparametric linkage analysis methods; the best result is obtained by assuming between-family heterogeneity and applying the Haseman-Elston regression to each pedigree separately. We discover strong signals from two genes in two different families and weaker signals for a third gene from two other families. As an exploratory approach, we apply an association test based on a modified family-based association test statistic to all rare variants (frequency < 1% or < 3%) designated as causal for Q1. Family-based association tests correctly identified causal single-nucleotide polymorphisms for four genes (KDR, VEGFA, VEGFC, and FLT1). Our results suggest that both linkage and association tests with families show promise for identifying rare variants

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Rare Variant Analysis for Family-Based Design

Author: De Gourab
Ionita-Laza Iuliana
Laird Nan
Yip Wai-Ki
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2013
Field of study

Genome-wide association studies have been able to identify disease associations with many common variants; however most of the estimated genetic contribution explained by these variants appears to be very modest. Rare variants are thought to have larger effect sizes compared to common SNPs but effects of rare variants cannot be tested in the GWAS setting. Here we propose a novel method to test for association of rare variants obtained by sequencing in family-based samples by collapsing the standard family-based association test (FBAT) statistic over a region of interest. We also propose a suitable weighting scheme so that low frequency SNPs that may be enriched in functional variants can be upweighted compared to common variants. Using simulations we show that the family-based methods perform at par with the population-based methods under no population stratification. By construction, family-based tests are completely robust to population stratification; we show that our proposed methods remain valid even when population stratification is present

CiteSeerX

Public Library of Science (PLOS)

Crossref

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

FigShare

Estimating the Prevalence of Disease Using Relatives of Case and Control Probands

Author: Hudson James I
Javaras Kristin N
Laird Nan M
Ripley Brian D
Publication venue: Collection of Biostatistics Research Archive
Publication date: 06/08/2007
Field of study

We introduce a method for estimating the prevalence of disease using data from a case-control family study performed to investigate the aggregation of disease in families. The families are sampled via case and control probands, and the resulting data consist of information on disease status and covariates for the probands and their relatives. We introduce estimators for overall prevalence and for covariate stratum-specific prevalence (e.g., sex-specific prevalence) that yield approximately unbiased estimates of their population counterparts. We also introduce corresponding confidence intervals that have good coverage properties even for small prevalences. The estimators and intervals address the over-representation of diseased individuals in case-control family data by using only the relatives (of the probands) and by taking into account whether each relative was selected via a case or a control proband. Finally, we describe a simulation experiment in which the estimators and intervals were applied to case-control family datasets sampled from a fictional population that resembled the catchment area for an Austrian family study of major depressive disorder. The resulting estimates varied closely and symmetrically around their population counterparts, and the resulting intervals had good coverage properties

Collection Of Biostatistics Research Archive

A general semi-parametric approach to the analysis of genetic association studies in population-based designs

Author: Hokanson John
Laird Nan M.
Lange Christoph
Lutz Sharon
Yip Wai-Ki
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Background: For genetic association studies in designs of unrelated individuals, current statistical methodology typically models the phenotype of interest as a function of the genotype and assumes a known statistical model for the phenotype. In the analysis of complex phenotypes, especially in the presence of ascertainment conditions, the specification of such model assumptions is not straight-forward and is error-prone, potentially causing misleading results. Results: In this paper, we propose an alternative approach that treats the genotype as the random variable and conditions upon the phenotype. Thereby, the validity of the approach does not depend on the correctness of assumptions about the phenotypic model. Misspecification of the phenotypic model may lead to reduced statistical power. Theoretical derivations and simulation studies demonstrate both the validity and the advantages of the approach over existing methodology. In the COPDGene study (a GWAS for Chronic Obstructive Pulmonary Disease (COPD)), we apply the approach to a secondary, quantitative phenotype, the Fagerstrom nicotine dependence score, that is correlated with COPD affection status. The software package that implements this method is available. Conclusions: The flexibility of this approach enables the straight-forward application to quantitative phenotypes and binary traits in ascertained and unascertained samples. In addition to its robustness features, our method provides the platform for the construction of complex statistical models for longitudinal data, multivariate data, multi-marker tests, rare-variant analysis, and others

Crossref

Harvard University - DASH