Search CORE

FigShare

Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data

Author: A Milosavljevic
A Pekowska
A Visel
AJ Saldanha
AM Mondul
Anjana Rao
Anna Gerasimova
AR Quinlan
BE Bernstein
BE Himes
BE Stranger
Bin Li
Bjoern Peters
C Ober
C Zang
Consortium The International HapMap
D Karolchik
DB Hancock
DG Torgerson
E Birney
E Noguchi
EK Miller
F Castro-Giner
G Hon
GE Zentner
Gregory Seumois
J Ernst
J Ernst
J Harrow
Jason Greenbaum
JH Kim
JJ Farrell
KM Ansel
LD Ward
Lukas Chavez
MA Schaub
MF Moffatt
MF Moffatt
MJ de Hoon
MP Creyghton
MT Maurano
ND Heintzman
NU Rashid
Pandurangan Vijayanand
PB Talbert
PM Sleiman
PM Visscher
R Bhandare
R Jaenisch
S Chanock
S Michel
S Weidinger
T Hirota
TH Pham
W Yu
Y Li
Y Zhang
Yi-Hsiang Hsu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Genome-wide association studies (GWASs) identify single nucleotide polymorphisms (SNPs) that are enriched in individuals suffering from a given disease. Most disease-associated SNPs fall into non-coding regions, so that it is not straightforward to infer phenotype or function; moreover, many SNPs are in tight genetic linkage, so that a SNP identified as associated with a particular disease may not itself be causal, but rather signify the presence of a linked SNP that is functionally relevant to disease pathogenesis. Here, we present an analysis method that takes advantage of the recent rapid accumulation of epigenomics data to address these problems for some SNPs. Using asthma as a prototypic example; we show that non-coding disease-associated SNPs are enriched in genomic regions that function as regulators of transcription, such as enhancers and promoters. Identifying enhancers based on the presence of the histone modification marks such as H3K4me1 in different cell types, we show that the location of enhancers is highly cell-type specific. We use these findings to predict which SNPs are likely to be directly contributing to disease based on their presence in regulatory regions, and in which cell types their effect is expected to be detectable. Moreover, we can also predict which cell types contribute to a disease based on overlap of the disease-associated SNPs with the locations of enhancers present in a given cell type. Finally, we suggest that it will be possible to re-analyze GWAS studies with much higher power by limiting the SNPs considered to those in coding or regulatory regions of cell types relevant to a given disease

CiteSeerX

Southampton (e-Prints Soton)

FigShare

Hybridization interactions between probesets in short oligo microarrays lead to spurious correlations

Author: A Nimgaonkar
Affymetrix
Affymetrix
AI Su
AJ Butte
AJ Butte
AT Adai
BH Mecham
BH Mecham
C Wu
CL Wilson
Crispin J Miller
E Birney
G Liu
G Sherlock
H Wang
HS Leong
J Harbig
J Stuart
KD Pruitt
L Gautier
L Gautier
M Dai
Michał J Okoniewski
O Teuffel
R Gentleman
R Irizarry
S Carter
S Zakharkin
T Attwood
W Shannon
Z Wu
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Microarrays measure the binding of nucleotide sequences to a set of sequence specific probes. This information is combined with annotation specifying the relationship between probes and targets and used to make inferences about transcript- and, ultimately, gene expression. In some situations, a probe is capable of hybridizing to more than one transcript, in others, multiple probes can target a single sequence. These 'multiply targeted' probes can result in non-independence between measured expression levels. RESULTS: An analysis of these relationships for Affymetrix arrays considered both the extent and influence of exact matches between probe and transcript sequences. For the popular HGU133A array, approximately half of the probesets were found to interact in this way. Both real and simulated expression datasets were used to examine how these effects influenced the expression signal. It was found not only to lead to increased signal strength for the affected probesets, but the major effect is to significantly increase their correlation, even in situations when only a single probe from a probeset was involved. By building a network of probe-probeset-transcript relationships, it is possible to identify families of interacting probesets. More than 10% of the families contain members annotated to different genes or even different Unigene clusters. Within a family, a mixture of genuine biological and artefactual correlations can occur. CONCLUSION: Multiple targeting is not only prevalent, but also significant. The ability of probesets to hybridize to more than one gene product can lead to false positives when analysing gene expression. Comprehensive annotation describing multiple targeting is required when interpreting array data

Complex exon-intron marking by histone modifications is not determined solely by nucleosome distribution

It has recently been shown that nucleosome distribution, histone modifications and RNA polymerase II (Pol II) occupancy show preferential association with exons (“exon-intron marking”), linking chromatin structure and function to co-transcriptional splicing in a variety of eukaryotes. Previous ChIP-sequencing studies suggested that these marking patterns reflect the nucleosomal landscape. By analyzing ChIP-chip datasets across the human genome in three cell types, we have found that this marking system is far more complex than previously observed. We show here that a range of histone modifications and Pol II are preferentially associated with exons. However, there is noticeable cell-type specificity in the degree of exon marking by histone modifications and, surprisingly, this is also reflected in some histone modifications patterns showing biases towards introns. Exon-intron marking is laid down in the absence of transcription on silent genes, with some marking biases changing or becoming reversed for genes expressed at different levels. Furthermore, the relationship of this marking system with splicing is not simple, with only some histone modifications reflecting exon usage/inclusion, while others mirror patterns of exon exclusion. By examining nucleosomal distributions in all three cell types, we demonstrate that these histone modification patterns cannot solely be accounted for by differences in nucleosome levels between exons and introns. In addition, because of inherent differences between ChIP-chip array and ChIP-sequencing approaches, these platforms report different nucleosome distribution patterns across the human genome. Our findings confound existing views and point to active cellular mechanisms which dynamically regulate histone modification levels and account for exon-intron marking. We believe that these histone modification patterns provide links between chromatin accessibility, Pol II movement and co-transcriptional splicing

UCL Discovery

Enlighten

Including all voices in international data-sharing governance

Author: A Rowhani-Farid
A-W Chan
AJ Goldenberg
CJ Haug
D Field
E Birney
E Pizzo
G Yoshizawa
Global Alliance for Genomics and Health
I Chalmers
J de Vries
J de Vries
J de Vries
J Kaye
J Kaye
J Kaye
J Kaye
J Kaye
J-M Coicaud
L Mabile
LL Haak
M Mort
M Parker
M Shabani
M Swan
MJ Murtagh
MJ Murtagh
MR Macleod
P De Castro
RE Upshur
Y Joly
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Background Governments, funding bodies, institutions, and publishers have developed a number of strategies to encourage researchers to facilitate access to datasets. The rationale behind this approach is that this will bring a number of benefits and enable advances in healthcare and medicine by allowing the maximum returns from the investment in research, as well as reducing waste and promoting transparency. As this approach gains momentum, these data-sharing practices have implications for many kinds of research as they become standard practice across the world. Main text The governance frameworks that have been developed to support biomedical research are not well equipped to deal with the complexities of international data sharing. This system is nationally based and is dependent upon expert committees for oversight and compliance, which has often led to piece-meal decisionmaking. This system tends to perpetuate inequalities by obscuring the contributions and the important role of different data providers along the data stream, whether they be low- or middle-income country researchers, patients, research participants, groups, or communities. As research and data-sharing activities are largely publicly funded, there is a strong moral argument for including the people who provide the data in decision-making and to develop governance systems for their continued participation. Conclusions We recommend that governance of science becomes more transparent, representative, and responsive to the voices of many constituencies by conducting public consultations about data-sharing addressing issues of access and use; including all data providers in decision-making about the use and sharing of data along the whole of the data stream; and using digital technologies to encourage accessibility, transparency, and accountability. We anticipate that this approach could enhance the legitimacy of the research process, generate insights that may otherwise be overlooked or ignored, and help to bring valuable perspectives into the decision-making around international data sharing.</p

Ghent University Academic Bibliography

Edinburgh Research Explorer

Carolina Digital Repository

University of Tasmania Open Access Repository

University of Miami: Scholarship Miami

Oxford University Research Archive

University of Melbourne Institutional Repository

Species Specificity in Major Urinary Proteins by Parallel Evolution

Species-specific chemosignals, pheromones, regulate social behaviors such as aggression, mating, pup-suckling, territory establishment, and dominance. The identity of these cues remains mostly undetermined and few mammalian pheromones have been identified. Genetically-encoded pheromones are expected to exhibit several different mechanisms for coding 1) diversity, to enable the signaling of multiple behaviors, 2) dynamic regulation, to indicate age and dominance, and 3) species-specificity. Recently, the major urinary proteins (Mups) have been shown to function themselves as genetically-encoded pheromones to regulate species-specific behavior. Mups are multiple highly related proteins expressed in combinatorial patterns that differ between individuals, gender, and age; which are sufficient to fulfill the first two criteria. We have now characterized and fully annotated the mouse Mup gene content in detail. This has enabled us to further analyze the extent of Mup coding diversity and determine their potential to encode species-specific cues

CNV-seq, a new method to detect copy number variation using high-throughput sequencing

Author: A Mortazavi
A Valouev
AJ Iafrate
B Ewing
BT Wilhelm
Chao Xie
CP Van Tassell
D Pinkel
DA Wheeler
DR Bentley
DS Johnson
DV Hinkley
E Birney
E Sherwood
F Sanger
J Hayya
J Marioni
J Sebat
J Shendure
LW Hillier
M Margulies
MA Quail
Martti T Tammi
MT Tammi
NP Carter
R Development Core Team
R Redon
S Levy
S Solinas-Toldo
SC Schuster
SJ Cokus
U Nagalakshmi
W Chen
WJ Kent
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background DNA copy number variation (CNV) has been recognized as an important source of genetic variation. Array comparative genomic hybridization (aCGH) is commonly used for CNV detection, but the microarray platform has a number of inherent limitations. Results Here, we describe a method to detect copy number variation using shotgun sequencing, CNV-seq. The method is based on a robust statistical model that describes the complete analysis procedure and allows the computation of essential confidence values for detection of CNV. Our results show that the number of reads, not the length of the reads is the key factor determining the resolution of detection. This favors the next-generation sequencing methods that rapidly produce large amount of short reads. Conclusion Simulation of various sequencing methods with coverage between 0.1× to 8× show overall specificity between 91.7 – 99.9%, and sensitivity between 72.2 – 96.5%. We also show the results for assessment of CNV between two individual human genomes.</p

ScholarBank@NUS

A Novel RNA Transcript with Antiapoptotic Function Is Silenced in Fragile X Syndrome

Author: A Fire
Ahmad M. Khalil
AJ Verkerk
BA Oostra
BA Oostra
BR Migeon
C Chureau
C Wahlestedt
Claes Wahlestedt
DP Bartel
E Birney
F Tassone
F Tassone
Farzaneh Modarresi
G Waldstein
HL Hinds
I Okamoto
J Cheng
J Ponjavic
JL Rinn
K Garber
KC Pang
KV Prasanth
L He
LS Davidow
MA Carmell
MF Mehler
Mohammad Ali Faghihi
N Sreeram
P Carninci
PC Scacheri
PD Ladd
PG Engstrom
S Houwing
S Ikeda
S Katayama
Shaun P. Brothers
TA Hore
TB Nesterova
Vladimir Bajic
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Several genome-wide transcriptomics efforts have shown that a large percentage of the mammalian genome is transcribed into RNAs, however, only a small percentage (1–2%) of these RNAs is translated into proteins. Currently there is an intense interest in characterizing the function of the different classes of noncoding RNAs and their relevance to human disease. Using genomic approaches we discovered FMR4, a primate-specific noncoding RNA transcript (2.4 kb) that resides upstream and likely shares a bidirectional promoter with FMR1. FMR4 is a product of RNA polymerase II and has a similar half-life to FMR1. The CGG expansion in the 5′ UTR of FMR1 appears to affect transcription in both directions as we found FMR4, similar to FMR1, to be silenced in fragile X patients and up-regulated in premutation carriers. Knockdown of FMR4 by several siRNAs did not affect FMR1 expression, nor vice versa, suggesting that FMR4 is not a direct regulatory transcript for FMR1. However, FMR4 markedly affected human cell proliferation in vitro; siRNAs knockdown of FMR4 resulted in alterations in the cell cycle and increased apoptosis, while the overexpression of FMR4 caused an increase in cell proliferation. Collectively, our results demonstrate an antiapoptotic function of FMR4 and provide evidence that a well-studied genomic locus can show unexpected functional complexity. It cannot be excluded that altered FMR4 expression might contribute to aspects of the clinical presentation of fragile X syndrome and/or related disorders

University of Miami: Scholarship Miami

Complex nature of SNP genotype effects on gene expression in primary human leucocytes

Author: AJ Myers
AL Dixon
BE Stranger
BE Stranger
BM Bolstad
Cisca Wijmenga
DA van Heel
DA van Heel
DA van Heel
David A vanHeel
E Birney
E Petretto
E Potokina
EE Schadt
FJ Steemers
Gosia Trynka
Graham A Heap
HH Goring
I Vastrik
JJ Keurentjes
JJ Keurentjes
K Karell
KA Hunt
KA Hunt
Karen A Hunt
L Greco
L Nistico
Lotte C Dinesen
Lude Franke
Marcel Bruinenberg
Morris A Swertz
N Hubner
PJ Ciclitira
R Alberts
R Alberts
RC Jansen
Ritsert C Jansen
RP Anderson
T Kwan
V Emilsson
VG Cheung
Y Hochberg
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown. Methods We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease – a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects. Results In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (< 250 kb from SNP, at FDR = 0.05, <it>cis </it>expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed <it>cis</it>-eQTLs. Celiac associated risk variants from two regions, containing genes <it>IL18RAP </it>and <it>CCR3</it>, showed significant <it>cis </it>genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected. Conclusion In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.</p

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Rice-Map: a new-generation rice genome browser

Author: A Shomura
A Siepel
AJ Garris
C Liang
C Trapnell
CA Brosnan
D Blankenberg
D Karolchik
D Kennedy
DA Ntanos
E Birney
E Bonnet
FA Feltus
G Bejerano
Ge Gao
He Zhang
International Rice Genome Sequencing Project
J Ni
J Yu
Jingchu Luo
Jun Wang
KH Jung
LD Stein
Lei Kong
Liang Tang
M Kellis
N Namiki
Q Yuan
Q Yuan
S Kurtz
SA Goff
SA Keiko Hatae
SDKaDA Ntanos
Shuqi Zhao
T Hubbard
T Sang
TD Wu
V Singh
WJ Kent
WJ Kent
X Wang
X Wang
Xiaocheng Gu
Y Xing
Y Zhang
Zhe Li
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The concurrent release of rice genome sequences for two subspecies (<it>Oryza sativa </it>L. ssp. <it>japonica </it>and <it>Oryza sativa </it>L. ssp. <it>indica</it>) facilitates rice studies at the whole genome level. Since the advent of high-throughput analysis, huge amounts of functional genomics data have been delivered rapidly, making an integrated online genome browser indispensable for scientists to visualize and analyze these data. Based on next-generation web technologies and high-throughput experimental data, we have developed Rice-Map, a novel genome browser for researchers to navigate, analyze and annotate rice genome interactively. Description More than one hundred annotation tracks (81 for <it>japonica </it>and 82 for <it>indica</it>) have been compiled and loaded into Rice-Map. These pre-computed annotations cover gene models, transcript evidences, expression profiling, epigenetic modifications, inter-species and intra-species homologies, genetic markers and other genomic features. In addition to these pre-computed tracks, registered users can interactively add comments and research notes to Rice-Map as User-Defined Annotation entries. By smoothly scrolling, dragging and zooming, users can browse various genomic features simultaneously at multiple scales. On-the-fly analysis for selected entries could be performed through dedicated bioinformatic analysis platforms such as WebLab and Galaxy. Furthermore, a BioMart-powered data warehouse "Rice Mart" is offered for advanced users to fetch bulk datasets based on complex criteria. Conclusions Rice-Map delivers abundant up-to-date <it>japonica </it>and <it>indica </it>annotations, providing a valuable resource for both computational and bench biologists. Rice-Map is publicly accessible at <url>http://www.ricemap.org/</url>, with all data available for free downloading.</p