18 research outputs found
A robust clustering algorithm for identifying problematic samples in genome-wide association studies
Summary: High-throughput genotyping arrays provide an efficient way to survey single nucleotide polymorphisms (SNPs) across the genome in large numbers of individuals. Downstream analysis of the data, for example in genome-wide association studies (GWAS), often involves statistical models of genotype frequencies across individuals. The complexities of the sample collection process and the potential for errors in the experimental assay can lead to biases and artefacts in an individual's inferred genotypes. Rather than attempting to model these complications, it has become a standard practice to remove individuals whose genome-wide data differ from the sample at large. Here we describe a simple, but robust, statistical algorithm to identify samples with atypical summaries of genome-wide variation. Its use as a semi-automated quality control tool is demonstrated using several summary statistics, selected to identify different potential problems, and it is applied to two different genotyping platforms and sample collections
Recommended from our members
Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci.
We simultaneously investigated the genetic landscape of ankylosing spondylitis, Crohn's disease, psoriasis, primary sclerosing cholangitis and ulcerative colitis to investigate pleiotropy and the relationship between these clinically related diseases. Using high-density genotype data from more than 86,000 individuals of European ancestry, we identified 244 independent multidisease signals, including 27 new genome-wide significant susceptibility loci and 3 unreported shared risk loci. Complex pleiotropy was supported when contrasting multidisease signals with expression data sets from human, rat and mouse together with epigenetic and expressed enhancer profiles. The comorbidities among the five immune diseases were best explained by biological pleiotropy rather than heterogeneity (a subgroup of cases genetically identical to those with another disease, possibly owing to diagnostic misclassification, molecular subtypes or excessive comorbidity). In particular, the strong comorbidity between primary sclerosing cholangitis and inflammatory bowel disease is likely the result of a unique disease, which is genetically distinct from classical inflammatory bowel disease phenotypes
Single-Point Haplotype Scores Telomeric to Human Leukocyte Antigen-C Give a High Susceptibility Major Histocompatability Complex Haplotype for Psoriasis in a Caucasian Population
Genetic predisposition, humans
The translation from genetic knowledge to a molecular understanding of disease is contributing to the development of improved diagnostic and therapeutic products. Here, we briefly cover the localisation of autoimmune disease-associated loci and examine recent progress in molecular understanding of autoimmune disease that has been facilitated by these approaches. We concentrate primarily on progress in the genetics of type 1 diabetes, multiple sclerosis and systemic lupus erythematosus, and draw attention to parallels and contrasts between these diseases, other autoimmune diseases and other immune diseases