Archivio istituzionale della Ricerca - Scuola Normale Superiore

Archivio istituzionale della ricerca - Università di Palermo

Exploiting expression patterns across multiple tissues to map expression quantitative trait loci

Author: A Ramaswamy
AA Shabalin
AL Price
Andrew S. Allen
Chaitanya R. Acharya
DJ Liu
DM Gatti
DY Lin
FE Satterthwaite
J Lonsdale
Janice M. McCarthy
Kouros Owzar
KW Broman
MC Wu
P Duchesne
PJ Harrison
RB Brem
RR Wilcox
S Purcell
W Cookson
X Lin
Y Benjamini
YT Huang
Publication venue: Springer Science and Business Media LLC

Application of machine learning methods to histone methylation ChIP-Seq data reveals H4R3me2 globally represses gene expression

Author: A Barski
AI Su
AJ Ruthenburg
B Li
BD Strahl
BE Bernstein
BM Turner
DY Lee
E Fabbrizio
FD Gibbons
H Yu
J Wysocka
JA Latham
K Ancelin
L Wang
M Ku
M Litt
Marty W Mayo
MC Yu
MJ Buck
MT Bedford
Q Zhao
R Karlic
S Pal
SL Berger
Stefan Bekiranov
Stephen Hoang
T Hastie
T Jenuwein
T Suganuma
TJ Hubbard
TS Mikkelsen
X Le Guezennec
Xiaojiang Xu
Y Benjamini
Y Zhang
Z Hou
ZJ Wu
Publication venue: BioMed Central
Publication date: 01/07/2010

Background: In the last decade, biochemical studies have revealed that epigenetic modifications, including histone modifications, histone variants and DNA methylation, form a complex network that regulates the state of chromatin and the processes that depend on it, including transcription and DNA replication. Currently, a large number of these epigenetic modifications are being mapped in a variety of cell lines at different stages of development using high-throughput sequencing by members of the ENCODE consortium, the NIH Roadmap Epigenomics Program and the Human Epigenome Project. An extremely promising and underexplored area of research is the application of machine learning methods, which are designed to construct predictive network models, to these large-scale epigenomic data sets. Results: Using a ChIP-Seq data set of 20 histone lysine and arginine methylations and histone variant H2A.Z in human CD4+ T-cells, we built predictive models of gene expression as a function of histone modification/variant levels using Multilinear (ML) Regression and Multivariate Adaptive Regression Splines (MARS). Along with extensive crosstalk among the 20 histone methylations, we found that H4R3me2 was the most and second most globally repressive histone methylation among the 20 studied in the ML and MARS models, respectively. In support of our finding, a number of experimental studies show that PRMT5-catalyzed symmetric dimethylation of H4R3 is associated with repression of gene expression. This includes a recent study, which demonstrated that H4R3me2 is required for DNMT3A-mediated DNA methylation, a known global repressor of gene expression. Conclusion: In stark contrast to univariate analysis of the relationship between H4R3me2 and gene expression levels, our study showed that the regulatory role of some modifications like H4R3me2 is masked by confounding variables, but can be elucidated by multivariate/systems-level approaches.

Impaired Resting-State Functional Integrations within Default Mode Network of Generalized Tonic-Clonic Seizures Epilepsy

Author: A Vanhaudenhuyse
AB Waites
AE Cavanna
AM Morcom
B Liu
B Mazoyer
Bing Hou
BJ He
CE Elger
DA Fair
DY Zhang
E Bullmore
G Bettus
GL Shulman
Guocai Wu
H Blumenfeld
H Laufs
Hanjian Du
Hua Feng
J Gotman
Jian Wang
JL Vincent
K Hamandi
KJ Friston
L Tian
LJ Larson-Prior
M Song
Marcus Kaiser
MD Fox
ME Raichle
Ming Song
Nan Wu
NU Dosenbach
QF Li
RL Buckner
SG Horovitz
Tianzi Jiang
Y Benjamini
ZQ Zhang
Publication venue: Public Library of Science
Publication date: 25/02/2011

Generalized tonic-clonic seizures (GTCS) are characterized by unresponsiveness and convulsions, which cause complete loss of consciousness. Many recent studies have found that the ictal alterations in brain activity of GTCS epilepsy patients are focal, involving brain regions including the thalamus, upper brainstem, medial prefrontal cortex, posterior midbrain regions, and lateral parietal cortex. Notably, many of these affected regions overlap considerably with the components of the so-called default mode network (DMN). Here, we hypothesize that the brain activity of the DMN in GTCS epilepsy patients differs from that of normal controls, even in the resting state. To test this hypothesis, we compared the DMN of GTCS epilepsy patients and controls using resting-state functional magnetic resonance imaging. Thirteen brain areas in the DMN were extracted, and a complete undirected weighted graph was used to model the DMN for each participant. When directly comparing the edges of the graph, we found significantly decreased functional connectivity within the DMN of the GTCS epilepsy patients compared with the controls. As for the nodes of the graph, we found that the degree of some brain areas within the DMN was significantly reduced in the GTCS epilepsy patients, including the anterior medial prefrontal cortex, the bilateral superior frontal cortex, and the posterior cingulate cortex. We then investigated possible mechanisms by which GTCS epilepsy could reduce the functional integration of the DMN. Our results suggest that the functional integration of the DMN is impaired in GTCS epilepsy patients even during the resting state, which could help in understanding the neural correlates of impaired consciousness in these patients.
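As a rough illustration of the graph model mentioned in this abstract, the sketch below builds a complete undirected weighted graph over a few regions and computes a weighted node degree. The abstract does not say how edge weights or degree were defined, so two assumptions are made here: edge weights are Pearson correlations between regional time series, and degree is the sum of the weights of a node's edges. The region labels and signal values are made-up toy data, not data from the study.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def build_graph(series):
    """Complete undirected weighted graph: one edge per unordered region pair.

    Assumption: the edge weight is the Pearson correlation of the two signals.
    """
    return {
        frozenset((a, b)): pearson(series[a], series[b])
        for a, b in combinations(sorted(series), 2)
    }


def weighted_degree(graph, node):
    """Assumed definition of degree: sum of weights of edges touching the node."""
    return sum(weight for edge, weight in graph.items() if node in edge)


# Hypothetical toy signals for three regions (the study used thirteen).
series = {
    "PCC": [1, 2, 3, 4],
    "aMPFC": [2, 4, 6, 8],
    "SFC": [1, 3, 2, 4],
}
graph = build_graph(series)
degrees = {region: weighted_degree(graph, region) for region in series}
```

With one such graph per participant, edges or degrees could then be compared between groups; the statistical test used for that comparison is not given in the abstract and is not sketched here.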

GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers

Author: A Baross
AB Olshen
AJ Bass
AJ Holland
B Nilsson
BA Weir
Barbara Hill
BM Bolstad
BS Taylor
C Greenman
C Li
Craig H Mermel
D Chiang
D Etemadmoghadam
D Hanahan
DY Chiang
E Pleasance
ED Pleasance
ES Venkatraman
F Sanchez-Garcia
G Schwarz
Gad Getz
GR Bignell
HS Dahlback
LM Merlo
M Guttman
M Metzker
Matthew L Meyerson
MR Stratton
Network CGAR
NT Leach
P Hupé
PA Northcott
PJ Stephens
R Beroukhim
R Firestein
R McLendon
Rameen Beroukhim
SA McCarroll
SJ Diskin
SP Shah
Steven E Schumacher
T Santarius
T Sjoblom
WM Lin
Y Benjamini
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2011

We describe methods with enhanced power and specificity to identify genes targeted by somatic copy-number alterations (SCNAs) that drive cancer growth. By separating SCNA profiles into underlying arm-level and focal alterations, we improve the estimation of background rates for each category. We additionally describe a probabilistic method for defining the boundaries of selected-for SCNA regions with user-defined confidence. Here we detail this revised computational approach, GISTIC2.0, and validate its performance in real and simulated datasets

DSpace@MIT

Detection of recurrent rearrangement breakpoints from copy number data

Author: A Baras
A Ben-Dor
AB Olshen
AJ Iafrate
Anna Ritz
Benjamin J Raphael
C Erdman
Colin Collins
D Barry
D Pinkel
D Pinto
D St Clair
DY Chiang
F Meng
F Picard
GD Eley
H David
H Lian
JS Liu
K Huse
K Nakabayashi
KW Choy
Michael M Ittmann
MM Carrasquillo
MT Barrett
NR Zhang
Pamela L Paris
PJ Campbell
PL Paris
Q Zhang
R Beroukhim
R Lucito
R McLendon
S Guha
S Yoon
SA Tomlins
SJ Diskin
V Vladimirova
WG Christen
WR Lai
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Copy number variants (CNVs), including deletions, amplifications, and other rearrangements, are common in human and cancer genomes. Copy number data from array comparative genome hybridization (aCGH) and next-generation DNA sequencing are widely used to measure copy number variants. Comparison of copy number data from multiple individuals reveals recurrent variants. Typically, the interior of a recurrent CNV is examined for genes or other loci associated with a phenotype. However, in some cases, such as gene truncations and fusion genes, the target of the variant lies at the boundary of the variant. Results: We introduce Neighborhood Breakpoint Conservation (NBC), an algorithm for identifying rearrangement breakpoints that are highly conserved at the same locus in multiple individuals. NBC detects recurrent breakpoints at varying levels of resolution, including breakpoints whose location is exactly conserved and breakpoints whose location varies within a gene. NBC also identifies pairs of recurrent breakpoints such as those that result from fusion genes. We apply NBC to aCGH data from 36 primary prostate tumors and identify 12 novel rearrangements, one of which is the well-known TMPRSS2-ERG fusion gene. We also apply NBC to 227 glioblastoma tumors and predict 93 novel rearrangements, which we further classify as gene truncations, germline structural variants, and fusion genes. A number of these variants involve the protein phosphatase PTPN12, suggesting that deregulation of PTPN12, via a variety of rearrangements, is common in glioblastoma. Conclusions: We demonstrate that NBC is useful for detection of recurrent breakpoints resulting from copy number variants or other structural variants, and in particular identifies recurrent breakpoints that result in gene truncations or fusion genes. Software is available at http://cs.brown.edu/people/braphael/software.html.

eScholarship - University of California

Genetic variation in PRL and PRLR, and relationships with serum prolactin levels and breast cancer risk: results from a population-based case-control study in Poland

Rapid and Accurate Multiple Testing Correction and Power Estimation for Millions of Correlated Markers

Author: A Genz
B Devlin
B Han
BL Browning
Buhm Han
D Altshuler
DA Williams
DJ Schaid
DL Nicolae
DR Nyholt
DY Lin
E Eskin
E Jorgenson
Eleazar Eskin
F Dudbridge
F Yates
FS Collins
G Kimmel
GU Yule
Hyun Min Kang
I Pe'er
J Li
J Marchini
JD Storey
JK Pritchard
JM Cheverud
John D. Storey
KN Conneely
LA Wasserman
N Risch
N Zaitlen
NA Zaitlen
P de Bakker
PD Sasieni
PH Westfall
RJ Klein
S Purcell
SR Browning
SR Seaman
TA Louis
TR Bhangale
V Hajivassiliou
V Moskvina
Y Benjamini
Publication venue: Public Library of Science
Publication date: 01/04/2009

With the development of high-throughput sequencing and genotyping technologies, the number of markers collected in genetic association studies is growing rapidly, increasing the importance of methods for correcting for multiple hypothesis testing. The permutation test is widely considered the gold standard for accurate multiple testing correction, but it is often computationally impractical for these large datasets. Recently, several studies proposed efficient alternative approaches to the permutation test based on the multivariate normal distribution (MVN). However, they cannot accurately correct for multiple testing in genome-wide association studies for two reasons. First, these methods require partitioning of the genome into many disjoint blocks and ignore all correlations between markers from different blocks. Second, the true null distribution of the test statistic often fails to follow the asymptotic distribution at the tails of the distribution. We propose an accurate and efficient method for multiple testing correction in genome-wide association studies—SLIDE. Our method accounts for all correlation within a sliding window and corrects for the departure of the true null distribution of the statistic from the asymptotic distribution. In simulations using the Wellcome Trust Case Control Consortium data, the error rate of SLIDE's corrected p-values is more than 20 times smaller than the error rate of the previous MVN-based methods' corrected p-values, while SLIDE is orders of magnitude faster than the permutation test and other competing methods. We also extend the MVN framework to the problem of estimating the statistical power of an association study with correlated markers and propose an efficient and accurate power estimation method SLIP. SLIP and SLIDE are available at http://slide.cs.ucla.edu
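For orientation, the sketch below illustrates the baseline permutation approach that this abstract calls the gold standard, not SLIDE or SLIP themselves. It is a minimal max-statistic permutation correction under stated assumptions: the per-marker statistic (absolute difference in mean allele count between cases and controls), the function names, and the toy genotype data are all hypothetical choices made for illustration.

```python
import random


def marker_stats(genotypes, labels):
    """Per-marker statistic (an assumed, simple choice):
    |mean allele count in cases - mean allele count in controls|."""
    cases = [row for row, y in zip(genotypes, labels) if y == 1]
    controls = [row for row, y in zip(genotypes, labels) if y == 0]
    n_markers = len(genotypes[0])
    return [
        abs(
            sum(row[j] for row in cases) / len(cases)
            - sum(row[j] for row in controls) / len(controls)
        )
        for j in range(n_markers)
    ]


def permutation_corrected_pvalues(genotypes, labels, n_perm=200, seed=0):
    """Corrected p-value per marker: how often the maximum statistic over all
    markers, under shuffled case/control labels, reaches the observed value."""
    rng = random.Random(seed)
    observed = marker_stats(genotypes, labels)
    exceed = [0] * len(observed)
    shuffled = list(labels)
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any genotype-phenotype association
        max_stat = max(marker_stats(genotypes, shuffled))
        for j, obs in enumerate(observed):
            if max_stat >= obs:
                exceed[j] += 1
    # Add-one convention so that no estimated p-value is exactly zero.
    return [(count + 1) / (n_perm + 1) for count in exceed]


# Toy data: 8 individuals x 2 markers; marker 0 tracks case status, marker 1 does not.
genotypes = [[2, 1], [2, 0], [2, 1], [2, 0], [0, 1], [0, 0], [0, 1], [0, 0]]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
corrected = permutation_corrected_pvalues(genotypes, labels)
```

Because every permutation recomputes the statistic at every marker, the cost grows with the number of markers times the number of permutations, which is the practical limitation the abstract describes for large datasets.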

SNU Open Repository and Archive

eScholarship - University of California

Genetically-Based Olfactory Signatures Persist Despite Dietary Variation

Author: A Willse
AG Singer
AL Sherborne
Alan Willse
CL Arthur
D Restrepo
DJ Penn
DY Lin
F Röck
FJ Schwende
Gary K. Beauchamp
George Preti
GK Beauchamp
H van den Dool
Hiroaki Matsunami
HM Liebich
HM Schellinck
J Kwak
Jae Kwak
JG Valenta
JL Hurst
K Yamazaki
Koichi Matsumura
Kunio Yamazaki
M Gallagher
M Leon
M Lynch
M Yamaguchi
Maryanne Curran Opiekun
MH Ferkin
MN Kayali-Sayadi
MV Novotny
P Legendre
PA Brennan
RE Brown
S Saini
SA Cheetham
T Boehm
T Hastie
V Walker
Weiguang Yi
WK Potts
Y Benjamini
Publication venue: Public Library of Science
Publication date: 01/01/2008

Individual mice have a unique odor, or odortype, that facilitates individual recognition. Odortypes, like other phenotypes, can be influenced by genetic and environmental variation. The genetic influence derives in part from genes of the major histocompatibility complex (MHC). A major environmental influence is diet, which could obscure the genetic contribution to odortype. Because odortype stability is a prerequisite for individual recognition under normal behavioral conditions, we investigated whether MHC-determined urinary odortypes of inbred mice can be identified in the face of large diet-induced variation. Mice trained to discriminate urines from panels of mice that differed both in diet and MHC type found the diet odor more salient in generalization trials. Nevertheless, when mice were trained to discriminate mice with only MHC differences (but on the same diet), they recognized the MHC difference when tested with urines from mice on a different diet. This indicates that MHC odor profiles remain despite large dietary variation. Chemical analyses of urinary volatile organic compounds (VOCs) extracted by solid phase microextraction (SPME) and analyzed by gas chromatography/mass spectrometry (GC/MS) are consistent with this inference. Although diet influenced VOC variation more than MHC, with algorithmic training (supervised classification) MHC types could be accurately discriminated across different diets. Thus, although there are clear diet effects on urinary volatile profiles, they do not obscure MHC effects

CiteSeerX

Combined analysis of transcriptome and metabolite data reveals extensive differences between black and brown nearly-isogenic soybean (Glycine max) seed coats enabling the identification of pigment isogenes

Author: A Brazma
AD Boveris
Ammar Saleem
B Winkel-Shirley
BA Snyder
Brian Miki
CM Woodworth
DY Xie
E Butelli
F Paolocci
G Zabala
GJ Peel
GJ Tanner
H Liu
H-M Ku
I Nagai
J Hughes
J Schmutz
J Todd
JA Kennedy
JH Lee
JH Tuteja
JJ Todd
John T Arnason
K Ranathunge
K Toda
K Yang
M Choung
MO Downey
MY Hirai
N Kovinich
Nik Kovinich
PI Mackenzie
QJ Song
R Bernard
R Buzzell
RA Dixon
RA Irizarry
RC Gentleman
S Dhaubhadel
S Pienkny
T Sugimoto
T Tohge
Y Benjamini
Y Pang
Y Tanaka
Y-G Li
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The R locus controls the color of pigmented soybean (Glycine max) seeds. However, information about its control over seed coat biochemistry and gene expressions remains limited. The seed coats of nearly-isogenic black (iRT) and brown (irT) soybean (Glycine max) were known to differ by the presence or absence of anthocyanins, respectively, with genes for only a single enzyme (anthocyanidin synthase) found to be differentially expressed between isolines. We recently identified and characterized a UDP-glycose:flavonoid-3-O-glycosyltransferase (UGT78K1) from the seed coat of black (iRT) soybean with the aim to engineer seed coat color by suppression of an anthocyanin-specific gene. However, it remained to be investigated whether UGT78K1 was overexpressed with anthocyanin biosynthesis in the black (iRT) seed coat compared to the nearly-isogenic brown (irT) tissue. In this study, we performed a combined analysis of transcriptome and metabolite data to elucidate the control of the R locus over seed coat biochemistry and to identify pigment biosynthesis genes. Two differentially expressed late-stage anthocyanin biosynthesis isogenes were further characterized, as they may serve as useful targets for the manipulation of soybean grain color while minimizing the potential for unintended effects on the plant system. Results: Metabolite composition differences were found not to be limited to anthocyanins, with specific proanthocyanidins, isoflavones, and phenylpropanoids present exclusively in the black (iRT) or the brown (irT) seed coat. A global analysis of gene expressions identified UGT78K1 and 19 other anthocyanin, (iso)flavonoid, and phenylpropanoid isogenes to be differentially expressed between isolines. A combined analysis of metabolite and gene expression data enabled the assignment of putative functions to biosynthesis and transport isogenes. The recombinant enzymes of two genes were validated to catalyze late-stage steps in anthocyanin biosynthesis in vitro, and expression profiles of the corresponding genes were shown to parallel anthocyanin biosynthesis during black (iRT) seed coat development. Conclusion: Metabolite composition and gene expression differences between black (iRT) and brown (irT) seed coats are far more extensive than previously thought. Putative anthocyanin, proanthocyanidin, (iso)flavonoid, and phenylpropanoid isogenes were differentially expressed between black (iRT) and brown (irT) seed coats, and UGT78K2 and OMT5 were validated to code UDP-glycose:flavonoid-3-O-glycosyltransferase and anthocyanin 3'-O-methyltransferase proteins in vitro, respectively. Duplicate gene copies for several enzymes were overexpressed in the black (iRT) seed coat, suggesting that more than one isogene may have to be silenced to engineer seed coat color using RNA interference.