123 research outputs found
GemTools: A fast and efficient approach to estimating genetic ancestry
To uncover the genetic basis of complex disease, individuals are often
measured at a large number of genetic variants (usually SNPs) across the
genome. GemTools provides computationally efficient tools for modeling genetic
ancestry based on SNP genotypes. The main algorithm creates an eigenmap based
on genetic similarities, and then clusters subjects based on their map
position. This process is continued iteratively until each cluster is
relatively homogeneous. For genetic association studies, GemTools matches cases
and controls based on genetic similarity.Comment: 5 pages, 1 figur
Refining genetically inferred relationships using treelet covariance smoothing
Recent technological advances coupled with large sample sets have uncovered
many factors underlying the genetic basis of traits and the predisposition to
complex disease, but much is left to discover. A common thread to most genetic
investigations is familial relationships. Close relatives can be identified
from family records, and more distant relatives can be inferred from large
panels of genetic markers. Unfortunately these empirical estimates can be
noisy, especially regarding distant relatives. We propose a new method for
denoising genetically - inferred relationship matrices by exploiting the
underlying structure due to hierarchical groupings of correlated individuals.
The approach, which we call Treelet Covariance Smoothing, employs a multiscale
decomposition of covariance matrices to improve estimates of pairwise
relationships. On both simulated and real data, we show that smoothing leads to
better estimates of the relatedness amongst distantly related individuals. We
illustrate our method with a large genome-wide association study and estimate
the "heritability" of body mass index quite accurately. Traditionally
heritability, defined as the fraction of the total trait variance attributable
to additive genetic effects, is estimated from samples of closely related
individuals using random effects models. We show that by using smoothed
relationship matrices we can estimate heritability using population-based
samples. Finally, while our methods have been developed for refining genetic
relationship matrices and improving estimates of heritability, they have much
broader potential application in statistics. Most notably, for
error-in-variables random effects models and settings that require
regularization of matrices with block or hierarchical structure.Comment: Published in at http://dx.doi.org/10.1214/12-AOAS598 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24.32 and a significant overlap with schizophrenia
Background Over the past decade genome-wide association studies (GWAS) have
been applied to aid in the understanding of the biology of traits. The success
of this approach is governed by the underlying effect sizes carried by the
true risk variants and the corresponding statistical power to observe such
effects given the study design and sample size under investigation. Previous
ASD GWAS have identified genome-wide significant (GWS) risk loci; however,
these studies were of only of low statistical power to identify GWS loci at
the lower effect sizes (odds ratio (OR) <1.15). Methods We conducted a large-
scale coordinated international collaboration to combine independent
genotyping data to improve the statistical power and aid in robust discovery
of GWS loci. This study uses genome-wide genotyping data from a discovery
sample (7387 ASD cases and 8567 controls) followed by meta-analysis of summary
statistics from two replication sets (7783 ASD cases and 11359 controls; and
1369 ASD cases and 137308 controls). Results We observe a GWS locus at
10q24.32 that overlaps several genes including PITX3, which encodes a
transcription factor identified as playing a role in neuronal differentiation
and CUEDC2 previously reported to be associated with social skills in an
independent population cohort. We also observe overlap with regions previously
implicated in schizophrenia which was further supported by a strong genetic
correlation between these disorders (Rg = 0.23; P = 9 × 10−6). We further
combined these Psychiatric Genomics Consortium (PGC) ASD GWAS data with the
recent PGC schizophrenia GWAS to identify additional regions which may be
important in a common neurodevelopmental phenotype and identified 12 novel GWS
loci. These include loci previously implicated in ASD such as FOXP1 at 3p13,
ATP2B2 at 3p25.3, and a ‘neurodevelopmental hub’ on chromosome 8p11.23.
Conclusions This study is an important step in the ongoing endeavour to
identify the loci which underpin the common variant signal in ASD. In addition
to novel GWS loci, we have identified a significant genetic correlation with
schizophrenia and association of ASD with several neurodevelopmental-related
genes such as EXT1, ASTN2, MACROD2, and HDAC4
Recommended from our members
Candidate gene analysis of femoral neck trabecular and cortical volumetric bone mineral density in older men.
In contrast to conventional dual-energy X-ray absorptiometry, quantitative computed tomography separately measures trabecular and cortical volumetric bone mineral density (vBMD). Little is known about the genetic variants associated with trabecular and cortical vBMD in humans, although both may be important for determining bone strength and osteoporotic risk. In the current analysis, we tested the hypothesis that there are genetic variants associated with trabecular and cortical vBMD at the femoral neck by genotyping 4608 tagging and potentially functional single-nucleotide polymorphisms (SNPs) in 383 bone metabolism candidate genes in 822 Caucasian men aged 65 years or older from the Osteoporotic Fractures in Men Study (MrOS). Promising SNP associations then were tested for replication in an additional 1155 men from the same study. We identified SNPs in five genes (IFNAR2, NFATC1, SMAD1, HOXA, and KLF10) that were robustly associated with cortical vBMD and SNPs in nine genes (APC, ATF2, BMP3, BMP7, FGF18, FLT1, TGFB3, THRB, and RUNX1) that were robustly associated with trabecular vBMD. There was no overlap between genes associated with cortical vBMD and trabecular vBMD. These findings identify novel genetic variants for cortical and trabecular vBMD and raise the possibility that some genetic loci may be unique for each bone compartment
Recommended from our members
Amino Acid Position 11 of HLA-DRβ1 is a Major Determinant of Chromosome 6p Association with Ulcerative Colitis
The major histocompatibility complex (MHC) on chromosome 6p is an established risk locus for ulcerative colitis (UC) and Crohn’s disease (CD). We aimed to better define MHC association signals in UC and CD by combining data from dense single nucleotide polymorphism (SNP) genotyping and from imputation of classical HLA types, their constituent SNPs and corresponding amino acids in 562 UC, 611 CD, and 1,428 control subjects. Univariate and multivariate association analyses were performed, controlling for ancestry. In univariate analyses, absence of the rs9269955 C allele was strongly associated with risk for UC (P = 2.67×). rs9269955 is a SNP in the codon for amino acid position 11 of HLA-DRβ1, located in the P6 pocket of the HLA-DR antigen binding cleft. This amino acid position was also the most significantly UC-associated amino acid in omnibus tests (P = 2.68×). Multivariate modeling identified rs9269955-C and 13 other variants in best predicting UC versus control status. In contrast, there was only suggestive association evidence between the MHC and CD. Taken together, these data demonstrate that variation at HLA-DRβ1, amino acid 11 in the P6 pocket of the HLA-DR complex antigen binding cleft is a major determinant of chromosome 6p association with ulcerative colitis
Recommended from our members
Ulcerative colitis-risk loci on chromosomes 1p36 and 12q15 found by genome-wide association study.
Ulcerative colitis is a chronic inflammatory disease of the colon that presents as diarrhea and gastrointestinal bleeding. We performed a genome-wide association study using DNA samples from 1,052 individuals with ulcerative colitis and preexisting data from 2,571 controls, all of European ancestry. In an analysis that controlled for gender and population structure, ulcerative colitis loci attaining genome-wide significance and subsequent replication in two independent populations were identified on chromosomes 1p36 (rs6426833, combined P = 5.1 x 10(-13), combined odds ratio OR = 0.73) and 12q15 (rs1558744, combined P = 2.5 x 10(-12), combined OR = 1.35). In addition, combined genome-wide significant evidence for association was found in a region spanning BTNL2 to HLA-DQB1 on chromosome 6p21 (rs2395185, combined P = 1.0 x 10(-16), combined OR = 0.66) and at the IL23R locus on chromosome 1p31 (rs11209026, combined P = 1.3 x 10(-8), combined OR = 0.56; rs10889677, combined P = 1.3 x 10(-8), combined OR = 1.29)
Rare deleterious mutations of the gene EFR3A in autism spectrum disorders
Background: Whole-exome sequencing studies in autism spectrum disorder (ASD) have identified de novo mutations in novel candidate genes, including the synaptic gene Eighty-five Requiring 3A (EFR3A). EFR3A is a critical component of a protein complex required for the synthesis of the phosphoinositide PtdIns4P, which has a variety of functions at the neural synapse. We hypothesized that deleterious mutations in EFR3A would be significantly associated with ASD. Methods: We conducted a large case/control association study by deep resequencing and analysis of whole-exome data for coding and splice site variants in EFR3A. We determined the potential impact of these variants on protein structure and function by a variety of conservation measures and analysis of the Saccharomyces cerevisiae Efr3 crystal structure. We also analyzed the expression pattern of EFR3A in human brain tissue. Results: Rare nonsynonymous mutations in EFR3A were more common among cases (16 / 2,196 = 0.73%) than matched controls (12 / 3,389 = 0.35%) and were statistically more common at conserved nucleotides based on an experiment-wide significance threshold (P = 0.0077, permutation test). Crystal structure analysis revealed that mutations likely to be deleterious were also statistically more common in cases than controls (P = 0.017, Fisher exact test). Furthermore, EFR3A is expressed in cortical neurons, including pyramidal neurons, during human fetal brain development in a pattern consistent with ASD-related genes, and it is strongly co-expressed (P < 2.2 × 10−16, Wilcoxon test) with a module of genes significantly associated with ASD. Conclusions: Rare deleterious mutations in EFR3A were found to be associated with ASD using an experiment-wide significance threshold. Synaptic phosphoinositide metabolism has been strongly implicated in syndromic forms of ASD. These data for EFR3A strengthen the evidence for the involvement of this pathway in idiopathic autism
Common Genetic Variants, Acting Additively, Are a Major Source of Risk for Autism
Background: Autism spectrum disorders (ASD) are early onset neurodevelopmental syndromes typified by impairments in reciprocal social interaction and communication, accompanied by restricted and repetitive behaviors. While rare and especially de novo genetic variation are known to affect liability, whether common genetic polymorphism plays a substantial role is an open question and the relative contribution of genes and environment is contentious. It is probable that the relative contributions of rare and common variation, as well as environment, differs between ASD families having only a single affected individual (simplex) versus multiplex families who have two or more affected individuals. Methods: By using quantitative genetics techniques and the contrast of ASD subjects to controls, we estimate what portion of liability can be explained by additive genetic effects, known as narrow-sense heritability. We evaluate relatives of ASD subjects using the same methods to evaluate the assumptions of the additive model and partition families by simplex/multiplex status to determine how heritability changes with status. Results: By analyzing common variation throughout the genome, we show that common genetic polymorphism exerts substantial additive genetic effects on ASD liability and that simplex/multiplex family status has an impact on the identified composition of that risk. As a fraction of the total variation in liability, the estimated narrow-sense heritability exceeds 60% for ASD individuals from multiplex families and is approximately 40% for simplex families. By analyzing parents, unaffected siblings and alleles not transmitted from parents to their affected children, we conclude that the data for simplex ASD families follow the expectation for additive models closely. The data from multiplex families deviate somewhat from an additive model, possibly due to parental assortative mating. Conclusions: Our results, when viewed in the context of results from genome-wide association studies, demonstrate that a myriad of common variants of very small effect impacts ASD liability
Common genetic variants, acting additively, are a major source of risk for autism
Abstract
Background
Autism spectrum disorders (ASD) are early onset neurodevelopmental syndromes typified by impairments in reciprocal social interaction and communication, accompanied by restricted and repetitive behaviors. While rare and especially de novo genetic variation are known to affect liability, whether common genetic polymorphism plays a substantial role is an open question and the relative contribution of genes and environment is contentious. It is probable that the relative contributions of rare and common variation, as well as environment, differs between ASD families having only a single affected individual (simplex) versus multiplex families who have two or more affected individuals.
Methods
By using quantitative genetics techniques and the contrast of ASD subjects to controls, we estimate what portion of liability can be explained by additive genetic effects, known as narrow-sense heritability. We evaluate relatives of ASD subjects using the same methods to evaluate the assumptions of the additive model and partition families by simplex/multiplex status to determine how heritability changes with status.
Results
By analyzing common variation throughout the genome, we show that common genetic polymorphism exerts substantial additive genetic effects on ASD liability and that simplex/multiplex family status has an impact on the identified composition of that risk. As a fraction of the total variation in liability, the estimated narrow-sense heritability exceeds 60% for ASD individuals from multiplex families and is approximately 40% for simplex families. By analyzing parents, unaffected siblings and alleles not transmitted from parents to their affected children, we conclude that the data for simplex ASD families follow the expectation for additive models closely. The data from multiplex families deviate somewhat from an additive model, possibly due to parental assortative mating.
Conclusions
Our results, when viewed in the context of results from genome-wide association studies, demonstrate that a myriad of common variants of very small effect impacts ASD liability.http://deepblue.lib.umich.edu/bitstream/2027.42/112370/1/13229_2012_Article_55.pd
Recommended from our members
Individual common variants exert weak effects on the risk for autism spectrum disorders.
While it is apparent that rare variation can play an important role in the genetic architecture of autism spectrum disorders (ASDs), the contribution of common variation to the risk of developing ASD is less clear. To produce a more comprehensive picture, we report Stage 2 of the Autism Genome Project genome-wide association study, adding 1301 ASD families and bringing the total to 2705 families analysed (Stages 1 and 2). In addition to evaluating the association of individual single nucleotide polymorphisms (SNPs), we also sought evidence that common variants, en masse, might affect the risk. Despite genotyping over a million SNPs covering the genome, no single SNP shows significant association with ASD or selected phenotypes at a genome-wide level. The SNP that achieves the smallest P-value from secondary analyses is rs1718101. It falls in CNTNAP2, a gene previously implicated in susceptibility for ASD. This SNP also shows modest association with age of word/phrase acquisition in ASD subjects, of interest because features of language development are also associated with other variation in CNTNAP2. In contrast, allele scores derived from the transmission of common alleles to Stage 1 cases significantly predict case status in the independent Stage 2 sample. Despite being significant, the variance explained by these allele scores was small (Vm< 1%). Based on results from individual SNPs and their en masse effect on risk, as inferred from the allele score results, it is reasonable to conclude that common variants affect the risk for ASD but their individual effects are modest
- …