Search CORE

2,113 research outputs found

inPHAP: Interactive visualization of genotype and phased haplotype data

Author: Jäger Günter
Nieselt Kay
Peltzer Alexander
Publication venue
Publication date: 01/01/2014
Field of study

Background: To understand individual genomes it is necessary to look at the variations that lead to changes in phenotype and possibly to disease. However, genotype information alone is often not sufficient and additional knowledge regarding the phase of the variation is needed to make correct interpretations. Interactive visualizations, that allow the user to explore the data in various ways, can be of great assistance in the process of making well informed decisions. But, currently there is a lack for visualizations that are able to deal with phased haplotype data. Results: We present inPHAP, an interactive visualization tool for genotype and phased haplotype data. inPHAP features a variety of interaction possibilities such as zooming, sorting, filtering and aggregation of rows in order to explore patterns hidden in large genetic data sets. As a proof of concept, we apply inPHAP to the phased haplotype data set of Phase 1 of the 1000 Genomes Project. Thereby, inPHAP's ability to show genetic variations on the population as well as on the individuals level is demonstrated for several disease related loci. Conclusions: As of today, inPHAP is the only visual analytical tool that allows the user to explore unphased and phased haplotype data interactively. Due to its highly scalable design, inPHAP can be applied to large datasets with up to 100 GB of data, enabling users to visualize even large scale input data. inPHAP closes the gap between common visualization tools for unphased genotype data and introduces several new features, such as the visualization of phased data.Comment: BioVis 2014 conferenc

arXiv.org e-Print Archive

Springer - Publisher Connector

Publikationsserver der Universität Tübingen

PubMed Central

ATXN2 and its neighbouring gene SH2B3 are associated with increased ALS risk in the Turkish population

Author: Auburger Georg
Ağım Zeynep Sena
Başak Ayşe Nazlı
Deymeer Feza
Koç Filiz
Lahut Suna
Oflazer Piraye
Parman Yeşim
Uyan Özgün
Ömür Özgür
Özoğuz Aslihan
Özçelik Hilmi
Publication venue
Publication date: 20/08/2012
Field of study

Expansions of the polyglutamine (polyQ) domain (≥34) in Ataxin-2 (ATXN2) are the primary cause of spinocerebellar ataxia type 2 (SCA2). Recent studies reported that intermediate-length (27–33) expansions increase the risk of Amyotrophic Lateral Sclerosis (ALS) in 1–4% of cases in diverse populations. This study investigates the Turkish population with respect to ALS risk, genotyping 158 sporadic, 78 familial patients and 420 neurologically healthy controls. We re-assessed the effect of ATXN2 expansions and extended the analysis for the first time to cover the ATXN2 locus with 18 Single Nucleotide Polymorphisms (SNPs) and their haplotypes. In accordance with other studies, our results confirmed that 31–32 polyQ repeats in the ATXN2 gene are associated with risk of developing ALS in 1.7% of the Turkish ALS cohort (p = 0.0172). Additionally, a significant association of a 136 kb haplotype block across the ATXN2 and SH2B3 genes was found in 19.4% of a subset of our ALS cohort and in 10.1% of the controls (p = 0.0057, OR: 2.23). ATXN2 and SH2B3 encode proteins that both interact with growth receptor tyrosine kinases. Our novel observations suggest that genotyping of SNPs at this locus may be useful for the study of ALS risk in a high percentage of individuals and that ATXN2 and SH2B3 variants may interact in modulating the disease pathway

Directory of Open Access Journals

PubMed Central

Hochschulschriftenserver - Universität Frankfurt am Main

FigShare

Recommended from our members

Common DNA sequence variation influences 3-dimensional conformation of the human genome.

Author: Chiou Joshua
Fletez-Brant Kipper
Gaulton Kyle J
Gorkin David U
Hansen Kasper D
Hu Ming
Li Yun
Liu Tristin
Noor Amina
Qiu Yunjiang
Ren Bing
Schmitt Anthony D
Sebat Jonathan
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

BACKGROUND:The 3-dimensional (3D) conformation of chromatin inside the nucleus is integral to a variety of nuclear processes including transcriptional regulation, DNA replication, and DNA damage repair. Aberrations in 3D chromatin conformation have been implicated in developmental abnormalities and cancer. Despite the importance of 3D chromatin conformation to cellular function and human health, little is known about how 3D chromatin conformation varies in the human population, or whether DNA sequence variation between individuals influences 3D chromatin conformation. RESULTS:To address these questions, we perform Hi-C on lymphoblastoid cell lines from 20 individuals. We identify thousands of regions across the genome where 3D chromatin conformation varies between individuals and find that this variation is often accompanied by variation in gene expression, histone modifications, and transcription factor binding. Moreover, we find that DNA sequence variation influences several features of 3D chromatin conformation including loop strength, contact insulation, contact directionality, and density of local cis contacts. We map hundreds of quantitative trait loci associated with 3D chromatin features and find evidence that some of these same variants are associated at modest levels with other molecular phenotypes as well as complex disease risk. CONCLUSION:Our results demonstrate that common DNA sequence variants can influence 3D chromatin conformation, pointing to a more pervasive role for 3D chromatin conformation in human phenotypic variation than previously recognized

eScholarship - University of California

InPhaDel: integrative shotgun and proximity-ligation sequencing to phase deletions with single nucleotide polymorphisms.

Author: Bafna Vineet
Bansal Vikas
Edge Peter
Patel Anand
Selvaraj Siddarth
Publication venue: eScholarship, University of California
Publication date: 21/04/2016
Field of study

Phasing of single nucleotide (SNV), and structural variations into chromosome-wide haplotypes in humans has been challenging, and required either trio sequencing or restricting phasing to population-based haplotypes. Selvaraj et al demonstrated single individual SNV phasing is possible with proximity ligated (HiC) sequencing. Here, we demonstrate HiC can phase structural variants into phased scaffolds of SNVs. Since HiC data is noisy, and SV calling is challenging, we applied a range of supervised classification techniques, including Support Vector Machines and Random Forest, to phase deletions. Our approach was demonstrated on deletion calls and phasings on the NA12878 human genome. We used three NA12878 chromosomes and simulated chromosomes to train model parameters. The remaining NA12878 chromosomes withheld from training were used to evaluate phasing accuracy. Random Forest had the highest accuracy and correctly phased 86% of the deletions with allele-specific read evidence. Allele-specific read evidence was found for 76% of the deletions. HiC provides significant read evidence for accurately phasing 33% of the deletions. Also, eight of eight top ranked deletions phased by only HiC were validated using long range polymerase chain reaction and Sanger. Thus, deletions from a single individual can be accurately phased using a combination of shotgun and proximity ligation sequencing. InPhaDel software is available at: http://l337x911.github.io/inphadel/

PubMed Central

eScholarship - University of California

Localization of adaptive variants in human genomes using averaged one-dependence estimation.

Author: Atkinson Elizabeth G
Fischer Annie P
Henn Brenna M
Ramachandran Sohini
Rong Stephen
Sugden Lauren Alpert
Publication venue: eScholarship, University of California
Publication date: 01/02/2018
Field of study

Statistical methods for identifying adaptive mutations from population genetic data face several obstacles: assessing the significance of genomic outliers, integrating correlated measures of selection into one analytic framework, and distinguishing adaptive variants from hitchhiking neutral variants. Here, we introduce SWIF(r), a probabilistic method that detects selective sweeps by learning the distributions of multiple selection statistics under different evolutionary scenarios and calculating the posterior probability of a sweep at each genomic site. SWIF(r) is trained using simulations from a user-specified demographic model and explicitly models the joint distributions of selection statistics, thereby increasing its power to both identify regions undergoing sweeps and localize adaptive mutations. Using array and exome data from 45 ‡Khomani San hunter-gatherers of southern Africa, we identify an enrichment of adaptive signals in genes associated with metabolism and obesity. SWIF(r) provides a transparent probabilistic framework for localizing beneficial mutations that is extensible to a variety of evolutionary scenarios

Directory of Open Access Journals

eScholarship - University of California

Duquesne University: Digital Commons

A reference haplotype panel for genome-wide imputation of short tandem repeats.

Author: Fotsing Stephanie Feupe
Gymrek Melissa
Mitra Ileena
Mousavi Nima
Saini Shubham
Publication venue: eScholarship, University of California
Publication date: 01/10/2018
Field of study

Short tandem repeats (STRs) are involved in dozens of Mendelian disorders and have been implicated in complex traits. However, genotyping arrays used in genome-wide association studies focus on single nucleotide polymorphisms (SNPs) and do not readily allow identification of STR associations. We leverage next-generation sequencing (NGS) from 479 families to create a SNP + STR reference haplotype panel. Our panel enables imputing STR genotypes into SNP array data when NGS is not available for directly genotyping STRs. Imputed genotypes achieve mean concordance of 97% with observed genotypes in an external dataset compared to 71% expected under a naive model. Performance varies widely across STRs, with near perfect concordance at bi-allelic STRs vs. 70% at highly polymorphic repeats. Imputation increases power over individual SNPs to detect STR associations with gene expression. Imputing STRs into existing SNP datasets will enable the first large-scale STR association studies across a range of complex traits

Directory of Open Access Journals

eScholarship - University of California

The effects of common structural variants on 3D chromatin structure

Author: Noor Amina
Sebat Jonathan
Shanta Omar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/01/2020
Field of study

Background Three-dimensional spatial organization of chromosomes is defined by highly self-interacting regions 0.1-1 Mb in size termed Topological Associating Domains (TADs). Genetic factors that explain dynamic variation in TAD structure are not understood. We hypothesize that common structural variation (SV) in the human population can disrupt regulatory sequences and thereby influence TAD formation. To determine the effects of SVs on 3D chromatin organization, we performed chromosome conformation capture sequencing (Hi-C) of lymphoblastoid cell lines from 19 subjects for which SVs had been previously characterized in the 1000 genomes project. We tested the effects of common deletion polymorphisms on TAD structure by linear regression analysis of nearby quantitative chromatin interactions (contacts) within 240 kb of the deletion, and we specifically tested the hypothesis that deletions at TAD boundaries (TBs) could result in large-scale alterations in chromatin conformation. Results Large (> 10 kb) deletions had significant effects on long-range chromatin interactions. Deletions were associated with increased contacts that span the deleted region and this effect was driven by large deletions that were not located within a TAD boundary (nonTB). Some deletions at TBs, including a 80 kb deletion of the genes CFHR1 and CFHR3, had detectable effects on chromatin contacts. However for TB deletions overall, we did not detect a pattern of effects that was consistent in magnitude or direction. Large inversions in the population had a distinguishable signature characterized by a rearrangement of contacts that span its breakpoints. Conclusions Our study demonstrates that common SVs in the population impact long-range chromatin structure, and deletions and inversions have distinct signatures. However, the effects that we observe are subtle and variable between loci. Genome-wide analysis of chromatin conformation in large cohorts will be needed to quantify the influence of common SVs on chromatin structure.</p

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen