94 research outputs found
Integration of GWAS SNPs and tissue specific expression profiling reveal discrete eQTLs for human traits in blood and brain
Our knowledge of the transcriptome has become much more complex since the days of
the central dogma of molecular biology. We now know that splicing takes place to
create potentially thousands of isoforms from a single gene, and we know that RNA
does not always faithfully recapitulate DNA if RNA editing occurs. Collectively, these
observations show that the transcriptome is amazingly rich with intricate regulatory
mechanisms for overall gene expression, splicing, and RNA editing.
Genetic variability can play a role in controlling gene expression, which can be
identified by examining expression quantitative trait loci (eQTLs). eQTLs are genomic
regions where genetic variants, including single nucleotide polymorphisms (SNPs)
show a statistical association with expression of mRNA transcripts. In humans, many
SNPs are also associated with disease, and have been identified using genome wide
association studies (GWAS) but the biological effects of those SNPs are usually not
known. If SNPs found in GWAS are also found in eQTLs, then one could hypothesize
that expression levels may contribute to disease risk. Performing eQTL analysis with
GWAS SNPs in both blood and brain, specifically the frontal cortex and the
cerebellum, we found both shared and tissue unique eQTLS. The identification of
tissue-unique eQTLs supports the argument that choice of tissue type is important in
eQTL studies (Paper I).
Aging is a complex process with the mechanisms underlying aging still being poorly
defined. There is evidence that the transcriptome changes with age, and hence we used
the brain dataset from our first paper as a discovery set, with an additional replication
dataset, to investigate any aging-gene expression associations. We found evidence that
many genes were associated with aging. We further found that there were more
statically significant expression changes in the frontal cortex versus the cerebellum,
indicating that brain regions may age at different rates. As the brain is a heterogeneous
tissue including both neurons and non-neuronal cells, we used LCM to capture Purkinje
cells as a representative neuronal type and repeated the age analysis. Looking at the
discovery, replication and Purkinje cell datasets we found five genes with strong,
replicated evidence of age-expression associations (Paper II).
Being able to capture and quantify the depth of the transcriptome has been a lengthy
process starting with methods that could only measure a single gene to genome-wide
techniques such as microarray. A recently developed technology, RNA-Seq, shows
promise in its ability to capture expression, splicing, and editing and with its broad
dynamic range quantification is accurate and reliable. RNA-Seq is, however, data
intensive and a great deal of computational expertise is required to fully utilize the
strengths of this method. We aimed to create a small, well-controlled, experiment in
order to test the performance of this relatively new technology in the brain. We chose
embryonic versus adult cerebral cortex, as mice are genetically homogenous and there
are many known differences in gene expression related to brain development that we
could use as benchmarks for analysis testing. We found a large number of differences
in total gene expression between embryonic and adult brain. Rigorous technical and
biological validation illustrated the accuracy and dynamic range of RNA-Seq. We were also able to interrogate differences in exon usage in the same dataset. Finally we
were able to identify and quantify both well-known and novel A-to-I edit sites. Overall
this project helped us develop the tools needed to build usable pipelines for RNA-Seq
data processing (Paper III).
Our studies in the developing brain (Paper III) illustrated that RNA-Seq was a useful
unbiased method for investigating RNA editing. To extend this further, we utilized a
genetically modified mouse model to study the transcriptomic role of the RNA editing
enzyme ADAR2. We found that ADAR2 was important for editing of the coding
region of mRNA as a large proportion of RNA editing sites in coding regions had a
statistically significant decrease in editing percentages in Adar2
-/-Gria2
R/R
mice versus
controls. However, despite indications in the literature that ADAR2 may also be
involved in splicing and expression regulatory machinery we found no changes in gene
expression or exon utilization in Adar2
-/-Gria2
R/R
mice as compared to their littermate
controls (Paper IV).
In our final study, based on the methods developed in Papers III and IV, we revisited
the idea of age related gene expression associations from Paper II. We used a subset of
human frontal cortices for RNA sequencing. Interestingly we found more gene
expression changes with aging compared to the previous data using microarrays in
Paper II. When the significant gene lists were analysed for gene ontology enrichment,
we found that there was a large number of downregulated genes involved in synaptic
function while those that were upregulated had enrichment in immune function. This
dataset illustrates that the aging brain may be predisposed to the processes found in
neurodegenerative diseases (Paper V)
Recommended from our members
Prion-like domain mutations in hnRNPs cause multisystem proteinopathy and ALS
Summary Algorithms designed to identify canonical yeast prions predict that ~250 human proteins, including several RNA-binding proteins associated with neurodegenerative disease, harbor a distinctive prion-like domain (PrLD) enriched in uncharged polar amino acids and glycine. PrLDs in RNA-binding proteins are essential for the assembly of ribonucleoprotein granules. However, the interplay between human PrLD function and disease is not understood. Here, we define pathogenic mutations in PrLDs of hnRNPA2/B1 and hnRNPA1 in families with inherited degeneration affecting muscle, brain, motor neuron and bone, and a case of familial ALS. Wild-type hnRNPA2 and hnRNPA1 display an intrinsic tendency to assemble into self-seeding fibrils, which is exacerbated by the disease mutations. Indeed, the pathogenic mutations strengthen a ‘steric zipper’ motif in the PrLD, which accelerates formation of self-seeding fibrils that cross-seed polymerization of wild-type hnRNP. Importantly, the disease mutations promote excess incorporation of hnRNPA2 and hnRNPA1 into stress granules and drive the formation of cytoplasmic inclusions in animal models that recapitulate the human pathology. Thus, dysregulated polymerization caused by a potent mutant ‘steric zipper’ motif in a PrLD can initiate degenerative disease. Related proteins with PrLDs must be considered candidates for initiating and perhaps propagating proteinopathies of muscle, brain, motor neuron and bone
Genotype, haplotype and copy-number variation in worldwide human populations
Genome-wide patterns of variation across individuals provide a powerful source of data for uncovering the history of migration, range expansion, and adaptation of the human species. However, high-resolution surveys of variation in genotype, haplotype and copy number have generally focused on a small number of population groups(1-3). Here we report the analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms ( SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations. Analysis of SNP genotypes yields strongly supported fine-scale inferences about population structure. Increasing linkage disequilibrium is observed with increasing geographic distance from Africa, as expected under a serial founder effect for the out-of-Africa spread of human populations. New approaches for haplotype analysis produce inferences about population structure that complement results based on unphased SNPs. Despite a difference from SNPs in the frequency spectrum of the copy-number variants (CNVs) detected-including a comparatively large number of CNVs in previously unexamined populations from Oceania and the Americas-the global distribution of CNVs largely accords with population structure analyses for SNP data sets of similar size. Our results produce new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human-genetic studies in diverse worldwide populations.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62552/1/nature06742.pd
A genome-wide association study of myasthenia gravis
IMPORTANCE: Myasthenia gravis is a chronic, autoimmune, neuromuscular disease characterized by fluctuating weakness of voluntary muscle groups. Although genetic factors are known to play a role in this neuroimmunological condition, the genetic etiology underlying myasthenia gravis is not well understood. OBJECTIVE: To identify genetic variants that alter susceptibility to myasthenia gravis, we performed a genome-wide association study. DESIGN, SETTING, AND PARTICIPANTS: DNA was obtained from 1032 white individuals from North America diagnosed as having acetylcholine receptor antibody–positive myasthenia gravis and 1998 race/ethnicity-matched control individuals from January 2010 to January 2011. These samples were genotyped on Illumina OmniExpress single-nucleotide polymorphism arrays. An independent cohort of 423 Italian cases and 467 Italian control individuals were used for replication. MAIN OUTCOMES AND MEASURES: We calculated P values for association between 8114394 genotyped and imputed variants across the genome and risk for developing myasthenia gravis using logistic regression modeling. A threshold P value of 5.0 × 10(−8) was set for genome-wide significance after Bonferroni correction for multiple testing. RESULTS: In the over all case-control cohort, we identified association signals at CTLA4 (rs231770; P = 3.98 × 10(−8); odds ratio, 1.37; 95% CI, 1.25–1.49), HLA-DQA1 (rs9271871; P = 1.08 × 10(−8); odds ratio, 2.31; 95% CI, 2.02 – 2.60), and TNFRSF11A (rs4263037; P = 1.60 × 10(−9); odds ratio, 1.41; 95% CI, 1.29–1.53). These findings replicated for CTLA4 and HLA-DQA1 in an independent cohort of Italian cases and control individuals. Further analysis revealed distinct, but overlapping, disease-associated loci for early- and late-onset forms of myasthenia gravis. In the late-onset cases, we identified 2 association peaks: one was located in TNFRSF11A (rs4263037; P = 1.32 × 10(−12); odds ratio, 1.56; 95% CI, 1.44–1.68) and the other was detected in the major histocompatibility complex on chromosome 6p21 (HLA-DQA1; rs9271871; P = 7.02 × 10(−18); odds ratio, 4.27; 95% CI, 3.92–4.62). Association within the major histocompatibility complex region was also observed in early-onset cases (HLA-DQA1; rs601006; P = 2.52 × 10(−11); odds ratio, 4.0; 95% CI, 3.57–4.43), although the set of single-nucleotide polymorphisms was different from that implicated among late-onset cases. CONCLUSIONS AND RELEVANCE: Our genetic data provide insights into aberrant cellular mechanisms responsible for this prototypical autoimmune disorder. They also suggest that clinical trials of immunomodulatory drugs related to CTLA4 and that are already Food and Drug Administration approved as therapies for other autoimmune diseases could be considered for patients with refractory disease
Abundant Quantitative Trait Loci Exist for DNA Methylation and Gene Expression in Human Brain
A fundamental challenge in the post-genome era is to understand and annotate the consequences of genetic variation, particularly within the context of human tissues. We present a set of integrated experiments that investigate the effects of common genetic variability on DNA methylation and mRNA expression in four human brain regions each from 150 individuals (600 samples total). We find an abundance of genetic cis regulation of mRNA expression and show for the first time abundant quantitative trait loci for DNA CpG methylation across the genome. We show peak enrichment for cis expression QTLs to be approximately 68,000 bp away from individual transcription start sites; however, the peak enrichment for cis CpG methylation QTLs is located much closer, only 45 bp from the CpG site in question. We observe that the largest magnitude quantitative trait loci occur across distinct brain tissues. Our analyses reveal that CpG methylation quantitative trait loci are more likely to occur for CpG sites outside of islands. Lastly, we show that while we can observe individual QTLs that appear to affect both the level of a transcript and a physically close CpG methylation site, these are quite rare. We believe these data, which we have made publicly available, will provide a critical step toward understanding the biological effects of genetic variation
Recommended from our members
Pathogenic Huntingtin Repeat Expansions in Patients with Frontotemporal Dementia and Amyotrophic Lateral Sclerosis.
We examined the role of repeat expansions in the pathogenesis of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS) by analyzing whole-genome sequence data from 2,442 FTD/ALS patients, 2,599 Lewy body dementia (LBD) patients, and 3,158 neurologically healthy subjects. Pathogenic expansions (range, 40-64 CAG repeats) in the huntingtin (HTT) gene were found in three (0.12%) patients diagnosed with pure FTD/ALS syndromes but were not present in the LBD or healthy cohorts. We replicated our findings in an independent collection of 3,674 FTD/ALS patients. Postmortem evaluations of two patients revealed the classical TDP-43 pathology of FTD/ALS, as well as huntingtin-positive, ubiquitin-positive aggregates in the frontal cortex. The neostriatal atrophy that pathologically defines Huntington's disease was absent in both cases. Our findings reveal an etiological relationship between HTT repeat expansions and FTD/ALS syndromes and indicate that genetic screening of FTD/ALS patients for HTT repeat expansions should be considered
Association of Variants in the SPTLC1 Gene With Juvenile Amyotrophic Lateral Sclerosis
Importance: Juvenile amyotrophic lateral sclerosis (ALS) is a rare form of ALS characterized by age of symptom onset less than 25 years and a variable presentation.Objective: To identify the genetic variants associated with juvenile ALS.Design, Setting, and Participants: In this multicenter family-based genetic study, trio whole-exome sequencing was performed to identify the disease-associated gene in a case series of unrelated patients diagnosed with juvenile ALS and severe growth retardation. The patients and their family members were enrolled at academic hospitals and a government research facility between March 1, 2016, and March 13, 2020, and were observed until October 1, 2020. Whole-exome sequencing was also performed in a series of patients with juvenile ALS. A total of 66 patients with juvenile ALS and 6258 adult patients with ALS participated in the study. Patients were selected for the study based on their diagnosis, and all eligible participants were enrolled in the study. None of the participants had a family history of neurological disorders, suggesting de novo variants as the underlying genetic mechanism.Main Outcomes and Measures: De novo variants present only in the index case and not in unaffected family members.Results: Trio whole-exome sequencing was performed in 3 patients diagnosed with juvenile ALS and their parents. An additional 63 patients with juvenile ALS and 6258 adult patients with ALS were subsequently screened for variants in the SPTLC1 gene. De novo variants in SPTLC1 (p.Ala20Ser in 2 patients and p.Ser331Tyr in 1 patient) were identified in 3 unrelated patients diagnosed with juvenile ALS and failure to thrive. A fourth variant (p.Leu39del) was identified in a patient with juvenile ALS where parental DNA was unavailable. Variants in this gene have been previously shown to be associated with autosomal-dominant hereditary sensory autonomic neuropathy, type 1A, by disrupting an essential enzyme complex in the sphingolipid synthesis pathway.Conclusions and Relevance: These data broaden the phenotype associated with SPTLC1 and suggest that patients presenting with juvenile ALS should be screened for variants in this gene.</p
Novel genetic loci associated with hippocampal volume
The hippocampal formation is a brain structure integrally involved in episodic memory, spatial navigation, cognition and stress responsiveness. Structural abnormalities in hippocampal volume and shape are found in several common neuropsychiatric disorders. To identify the genetic underpinnings of hippocampal structure here we perform a genome-wide association study (GWAS) of 33,536 individuals and discover six independent loci significantly associated with hippocampal volume, four of them novel. Of the novel loci, three lie within genes (ASTN2, DPP4 and MAST4) and one is found 200 kb upstream of SHH. A hippocampal subfield analysis shows that a locus within the MSRB3 gene shows evidence of a localized effect along the dentate gyrus, subiculum, CA1 and fissure. Further, we show that genetic variants associated with decreased hippocampal volume are also associated with increased risk for Alzheimer's disease (rg =-0.155). Our findings suggest novel biological pathways through which human genetic variation influences hippocampal volume and risk for neuropsychiatric illness
- …