Incorporating non-coding annotations into rare variant analysis

Author: Campbell Colin
Gaunt Tom
Richardson Thomas
Timpson Nicholas
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016

Background: The success of collapsing methods which investigate the combined effect of rare variants on complex traits has so far been limited. The manner in which variants within a gene are selected prior to analysis has a crucial impact on this success, which has resulted in analyses conventionally filtering variants according to their consequence. This study investigates whether an alternative approach to filtering, using annotations from recently developed bioinformatics tools, can aid these types of analyses in comparison to conventional approaches.
Methods & results: We conducted a candidate gene analysis using the UK10K sequence and lipids data, filtering according to functional annotations using the resource CADD (Combined Annotation-Dependent Depletion) and contrasting results with 'nonsynonymous' and 'loss of function' consequence analyses. Using CADD allowed the inclusion of potentially deleterious intronic variants, which was not possible when filtering by consequence. Overall, different filtering approaches provided similar evidence of association, although filtering according to CADD identified evidence of association between ANGPTL4 and High Density Lipoproteins (P = 0.02, N = 3,210) which was not observed in the other analyses. We also undertook genome-wide analyses to determine how filtering in this manner compared to conventional approaches for gene regions. Results suggested that filtering by annotations according to CADD, as well as other tools known as FATHMM-MKL and DANN, identified association signals not detected when filtering by variant consequence and vice versa.
Conclusion: Incorporating variant annotations from non-coding bioinformatics tools should prove to be a valuable asset for rare variant analyses in the future. Filtering by variant consequence is only possible in coding regions of the genome, whereas utilising non-coding bioinformatics annotations provides an opportunity to discover unknown causal variants in non-coding regions as well. This should allow studies to uncover a greater number of causal variants for complex traits and help elucidate their functional role in disease.
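The difference between the two filtering strategies described in this abstract can be illustrated with a minimal sketch. It is not the study's pipeline: the record fields (`id`, `consequence`, `score`), the example variants, and the score cutoff of 15 are all hypothetical choices made for illustration, and the abstract does not state which threshold was used.

```python
from typing import Dict, List

# Consequence categories named in the abstract; the string labels are assumptions.
CODING_CONSEQUENCES = {"nonsynonymous", "loss_of_function"}


def filter_by_consequence(variants: List[Dict]) -> List[Dict]:
    """Keep only variants whose predicted consequence is in the coding set."""
    return [v for v in variants if v["consequence"] in CODING_CONSEQUENCES]


def filter_by_score(variants: List[Dict], threshold: float = 15.0) -> List[Dict]:
    """Keep variants whose annotation score meets a cutoff, whatever their consequence.

    The default threshold is an illustrative value, not one taken from the study.
    """
    return [v for v in variants if v["score"] >= threshold]


# Hypothetical variants within one gene region.
EXAMPLE_VARIANTS = [
    {"id": "var1", "consequence": "nonsynonymous", "score": 22.0},
    {"id": "var2", "consequence": "intronic", "score": 18.5},
    {"id": "var3", "consequence": "synonymous", "score": 3.1},
    {"id": "var4", "consequence": "loss_of_function", "score": 35.0},
]

by_consequence = [v["id"] for v in filter_by_consequence(EXAMPLE_VARIANTS)]
by_score = [v["id"] for v in filter_by_score(EXAMPLE_VARIANTS)]
print(by_consequence)
print(by_score)
```

In this toy input the intronic variant `var2` survives the score-based filter but not the consequence-based one, which is the kind of inclusion the abstract attributes to filtering by CADD.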

Directory of Open Access Journals

PubMed Central

Explore Bristol Research

FigShare

Detecting functional rare variants by collapsing and incorporating functional annotation in Genetic Analysis Workshop 17 mini-exome data

Author: AP Morris
B Li
BE Madsen
C Dering
DE Reich
DJ Smith
F Han
Hongyu Zhao
J Graham
JK Pritchard
JN Hirschhorn
John Ferguson
Joon Sang Lee
LA Almasy
LA Hindorff
Lun Li
NJ Schork
P Donnelly
R Tibshirani
SK Iyengar
Wei Zheng
Xiting Yan
Publication venue: BioMed Central
Publication date: 01/01/2011

Association studies using tag SNPs have been successful in detecting disease-associated common variants. However, common variants, with rare exceptions, explain at most 5–10% of the heritability resulting from genetic factors, which leads to the common disease/rare variants assumption. Indeed, recent studies using sequencing technologies have demonstrated that common diseases can be due to rare variants that could not be systematically studied earlier. Unfortunately, methods for common variants are not optimal if applied to rare variants. To identify rare variants that affect disease risk, several investigators have designed new approaches based on the idea of collapsing different rare variants inside the same genomic block (e.g., the same gene or pathway) to enrich the signal. Here, we consider three different collapsing methods in the multimarker regression model and compare their performance on the Genetic Analysis Workshop 17 data using the consistency of results across different simulations and the cross-validation prediction error rate. The comparison shows that the proportion collapsing method seems to outperform the other two methods and can find both truly associated rare and common variants. Moreover, we explore one way of incorporating the functional annotations for the variants in the data: collapsing nonsynonymous and synonymous variants separately to allow for different penalties on them. The incorporation of functional annotations led to higher sensitivity and specificity levels when the detection results were compared with the answer sheet. The initial analysis was performed without knowledge of the simulating model.
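The idea of collapsing rare variants within a genomic block can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not define the proportion collapsing method, so the definition used here (the fraction of rare variant sites in the block at which an individual carries at least one minor allele) is an assumed, commonly used form, and the function names and the frequency threshold are hypothetical.

```python
from typing import List


def minor_allele_freq(genotypes: List[List[int]]) -> List[float]:
    """Per-variant minor allele frequency from 0/1/2 minor-allele counts.

    genotypes[i][j] is the minor-allele count of individual i at variant j.
    """
    n_individuals = len(genotypes)
    n_variants = len(genotypes[0])
    return [
        sum(row[j] for row in genotypes) / (2 * n_individuals)
        for j in range(n_variants)
    ]


def proportion_collapse(
    genotypes: List[List[int]], maf_threshold: float = 0.01
) -> List[float]:
    """Collapse the rare variants of one block into a single score per individual.

    Assumed definition: the proportion of rare variant sites at which the
    individual carries at least one minor allele.
    """
    mafs = minor_allele_freq(genotypes)
    rare = [j for j, f in enumerate(mafs) if 0 < f < maf_threshold]
    if not rare:
        return [0.0] * len(genotypes)
    return [sum(1 for j in rare if row[j] > 0) / len(rare) for row in genotypes]


# Toy block: 4 individuals x 3 variants. The threshold is raised to 0.2 only
# because the sample is tiny; the third variant is common and is not collapsed.
toy_block = [
    [1, 0, 2],
    [0, 0, 2],
    [0, 1, 1],
    [0, 0, 1],
]
print(proportion_collapse(toy_block, maf_threshold=0.2))
```

The collapsed score would then enter a regression model as a single predictor for the block, in place of one predictor per rare variant.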

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Improving dbNSFP

Author: Lu Mingyao
Publication venue: DigitalCommons@TMC
Publication date: 01/12/2018

Advisory Professor: Xiaoming Liu, Ph.D.
The analysis and interpretation of DNA variation are very important for whole exome sequencing (WES) studies. Genome research has focused on single nucleotide variants (SNVs). Indels are as important as SNVs, and indels in coding regions in particular are often candidate disease-causing variants, so it is necessary to expand the focus to include indel mutations. The goal of my project is to provide an automatic annotation pipeline for WES-based disease studies by extending dbNSFP with a tool for automated indel annotation and deleteriousness prediction. Current sequencing results typically include both SNVs and indels. Although many tools are available to integrate functional predictions and annotations for SNV effects, to my knowledge there are no such tools for indels. Therefore, the aim of this thesis was to add deleteriousness prediction scores, including CADD, SIFT, and PROVEAN, to indel annotation based on gene models. All of those scores can be calculated on the fly after installing resources locally. A Docker implementation of the indel annotation and deleteriousness prediction has been developed and is ready to be deployed from the cloud.

DigitalCommons@The Texas Medical Center

Using genomic annotations increases statistical power to detect eGenes.

Author: Duong Dat
Ernst Jason
Eskin Eleazar
Han Buhm
Hormozdiari Farhad
Sul Jae Hoon
Zou Jennifer
Publication venue: eScholarship, University of California
Publication date: 01/06/2016

Motivation: Expression quantitative trait loci (eQTLs) are genetic variants that affect gene expression. In eQTL studies, one important task is to find eGenes, that is, genes whose expression is associated with at least one eQTL. The standard statistical method to determine whether a gene is an eGene requires association testing at all nearby variants and a permutation test to correct for multiple testing. The standard method, however, does not consider genomic annotation of the variants. In practice, variants near gene transcription start sites (TSSs) or certain histone modifications are likely to regulate gene expression. In this article, we introduce a novel eGene detection method that considers this empirical evidence and thereby increases the statistical power.
Results: We applied our method to the liver Genotype-Tissue Expression (GTEx) data using distance from TSSs, DNase hypersensitivity sites, and six histone modifications as the genomic annotations for the variants. Each of these annotations helped us detect more candidate eGenes. Distance from TSS appears to be the most important annotation; specifically, using this annotation, our method discovered 50% more candidate eGenes than the standard permutation method.

SNU Open Repository and Archive

PubMed Central

eScholarship - University of California

Multiethnic meta-analysis identifies ancestry-specific and cross-ancestry loci for pulmonary function

Author: Feitosa Mary F.
Province Michael A.
Wojczynski Mary K.
Publication venue: Digital Commons@Becker
Publication date: 01/01/2018

Digital Commons@Becker

Deep-coverage whole genome sequences and blood lipids among 16,324 individuals.

Author: Abecasis Goncalo
Alver Maris
Bloom Jonathan M
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Engreitz Jesse M
Ernst Jason
Esko Tonu
Ganna Andrea
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Khera Amit V
Lander Eric S
Manichaikul Ani
Mitchell Braxton
Montasser May
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
O'Connell Jeffrey R
Peloso Gina M
Perry James A
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni E
Salomaa Veikko
Seed Cotton
Surakka Ida L
Vasan Ramachandran S
Willer Cristen J
Wilson James G
Zekavat Seyedeh Maryam
Zhou Wei
Publication venue: eScholarship, University of California
Publication date: 01/08/2018

Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits: plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for a few variants that were previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes, but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited, but WGS permits simultaneous assessment of monogenic and polygenic models of severe hypercholesterolemia.

DSpace@MIT

Directory of Open Access Journals

eScholarship - University of California

George Washington University: Health Sciences Research Commons (HSRC)

Allele-specific NKX2-5 binding underlies multiple genetic associations with human electrocardiographic traits.

Author: Benaglio Paola
D'Antonio Matteo
D'Antonio-Chronowska Agnieszka
DeBoever Christopher
Donovan Margaret KR
Drees Frauke
Frazer Kelly A
Gaulton Kyle J
Li He
Ma Wubin
Matsui Hiroko
Rosenfeld Michael G
Singhal Sanghamitra
Smith Erin N
Sotoodehnia Nona
van Setten Jessica
Yang Feng
Young Greenwald William W
Publication venue: eScholarship, University of California
Publication date: 01/10/2019

The cardiac transcription factor (TF) gene NKX2-5 has been associated with electrocardiographic (EKG) traits through genome-wide association studies (GWASs), but the extent to which differential binding of NKX2-5 at common regulatory variants contributes to these traits has not yet been studied. We analyzed transcriptomic and epigenomic data from induced pluripotent stem cell-derived cardiomyocytes from seven related individuals, and identified ~2,000 single-nucleotide variants associated with allele-specific effects (ASE-SNVs) on NKX2-5 binding. NKX2-5 ASE-SNVs were enriched for altered TF motifs, for heart-specific expression quantitative trait loci and for EKG GWAS signals. Using fine-mapping combined with epigenomic data from induced pluripotent stem cell-derived cardiomyocytes, we prioritized candidate causal variants for EKG traits, many of which were NKX2-5 ASE-SNVs. Experimentally characterizing two NKX2-5 ASE-SNVs (rs3807989 and rs590041) showed that they modulate the expression of target genes via differential protein binding in cardiac cells, indicating that they are functional variants underlying EKG GWAS signals. Our results show that differential NKX2-5 binding at numerous regulatory variants across the genome contributes to EKG phenotypes.

eScholarship - University of California