205 research outputs found

A complete tool set for molecular QTL discovery and analysis

Author: AA Shabalin
AC Nica
D Welter
H Ongen
HJ Westra
J Ernst
JD Storey
JK Pickrell
M Gutierrez-Arcelus
O Canela-Xandri
P Picotti
PA Hoen
S Purcell
S Waszak
SS Rao
T Lappalainen
WE Kraus
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Crossref

Serveur académique lausannois

University of Dundee Online Publications

Archive ouverte UNIGE

Meta-eQTL: a tool set for flexible eQTL meta-analysis

Author: AA Shabalin
Antonio Fabio Di Narzo
AP Boyle
B Howie
CJ Willer
DM Greenawalt
EE Schadt
Haoxiang Cheng
Jianwei Lu
K Hao
Ke Hao
L Liang
LA Hindorff
S Sanna
V Emilsson
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Widespread sex differences in gene expression and splicing in the adult human brain

Author: A Aleman
A Fung
A Subramanian
AA Shabalin
AK Vaags
AM Craig
BG Weinshenker
C Ober
CJ Newschaffer
CS Weickert
D Trabzuni
DG Hernandez
DH Skuse
E Heard
E Jazin
F Simunovic
H Skaletsky
HJ Kang
I Cantuti-Castelvetri
I Kato
IN Miller
JB Berletch
JR Gibbs
K Andersen
KP Cosgrove
L Cahill
L Matthews
M Jiang
M Kanehisa
MA Nalls
MP Vawter
MT Ross
NL Barbosa-Morais
P Parma
PA McCombe
R Gater
R Schmidt
RA Irizarry
S Amor
S Jamain
S Tang
T Millar
TG Beach
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

There is strong evidence to show that men and women differ in terms of neurodevelopment, neurochemistry and susceptibility to neurodegenerative and neuropsychiatric disease. The molecular basis of these differences remains unclear. Progress in this field has been hampered by the lack of genome-wide information on sex differences in gene expression and in particular splicing in the human brain. Here we address this issue by using post-mortem adult human brain and spinal cord samples originating from 137 neuropathologically confirmed control individuals to study whole-genome gene expression and splicing in 12 CNS regions. We show that sex differences in gene expression and splicing are widespread in adult human brain, being detectable in all major brain regions and involving 2.5% of all expressed genes. We give examples of genes where sex-biased expression is both disease-relevant and likely to have functional consequences, and provide evidence suggesting that sex biases in expression may reflect sex-biased gene regulatory structures.

Crossref

UCL Discovery

PubMed Central

Edinburgh Research Explorer

King's Research Portal

UNCLES: Method for the identification of genes differentially consistently co-expressed in a specific subset of datasets

Author: A Huber
A Prelić
AA Shabalin
AP Gasch
Asoke K. Nandi
B Abu-Jamous
Basel Abu-Jamous
C Koch
CH Wade
CT Harbison
D Dikicioglu
D Liu
DA Orlando
David J. Roberts
IS Dhillon
J Bahler
J Yang
JK Choi
JK Limb
JM Pena
JM Stuart
KC Li
KY Yeung
L Lazzeroni
LP Zhao
MB Eisen
P Cahan
P Grandi
PC Roberts
PT Spellman
R Fa
R Lletı́a
R Nilsson
RJ Cho
RM Piro
Rui Fa
S Chu
S Fujii
S Sharma
S Vega-Pons
T Hayata
T Murali
T Pramila
TC Fleischer
VA Gennarino
X Liu
Y Cheng
Y Kluger
Z Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/06/2015

Background: Collective analysis of the increasingly emerging gene expression datasets is required. The recently proposed binarisation of consensus partition matrices (Bi-CoPaM) method can combine clustering results from multiple datasets to identify the subsets of genes which are consistently co-expressed in all of the provided datasets in a tuneable manner. However, results validation and parameter setting are issues that complicate the design of such methods. Moreover, although it is a common practice to test methods by application to synthetic datasets, the mathematical models used to synthesise such datasets are usually based on approximations which may not always be sufficiently representative of real datasets. Results: Here, we propose an unsupervised method for the unification of clustering results from multiple datasets using external specifications (UNCLES). This method has the ability to identify the subsets of genes consistently co-expressed in a subset of datasets while being poorly co-expressed in another subset of datasets, and to identify the subsets of genes consistently co-expressed in all given datasets. We also propose the M-N scatter plots validation technique and adopt it to set the parameters of UNCLES, such as the number of clusters, automatically. Additionally, we propose an approach for the synthesis of gene expression datasets using real data profiles in a way which combines the ground-truth knowledge of synthetic data and the realistic expression values of real data, and therefore overcomes the problem of faithfulness of synthetic expression data modelling. By application to those datasets, we validate UNCLES while comparing it with other conventional clustering methods and, of particular relevance, biclustering methods. We further validate UNCLES by application to a set of 14 real genome-wide yeast datasets, as it produces focused clusters that conform well to known biological facts. Furthermore, in-silico-based hypotheses regarding the function of a few previously unknown genes in those focused clusters are drawn. Conclusions: The UNCLES method, the M-N scatter plots technique, and the expression data synthesis approach will have wide application for the comprehensive analysis of genomic and other sources of multiple complex biological datasets. Moreover, the derived in-silico-based biological hypotheses represent subjects for future functional studies.
The National Institute for Health Research (NIHR) under its Programme Grants for Applied Research Programme (Grant Reference Number RP-PG-0310-1004).

Jyväskylä University Digital Archive

Crossref

Springer - Publisher Connector

PubMed Central

Brunel University Research Archive

Exploiting expression patterns across multiple tissues to map expression quantitative trait loci

Author: A Ramaswamy
AA Shabalin
AL Price
Andrew S. Allen
Chaitanya R. Acharya
DJ Liu
DM Gatti
DY Lin
FE Satterthwaite
J Lonsdale
Janice M. McCarthy
Kouros Owzar
KW Broman
MC Wu
P Duchesne
PJ Harrison
RB Brem
RR Wilcox
S Purcell
W Cookson
X Lin
Y Benjamini
YT Huang
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Cross-Platform Microarray Data Normalisation for Regulatory Network Inference

Author: AA Shabalin
Alina Sîrbu
C Li
C Spieth
D Donoho
DA Orlando
G Alterovitz
G Kaiser
GK Smyth
Heather J. Ruskin
JH Do
K Shakya
KF Aoki-Kinoshita
M Hecker
MA Savageau
Martin Crane
N Noman
PT Spellman
R Xulvi-Brunet
Raya Khanin
T Pramila
TM Przytycka
WE Johnson
WK Lim
Y Fomekong-Nanfack
Y Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010

Background: Inferring Gene Regulatory Networks (GRNs) from time course microarray data suffers from the dimensionality problem created by the short length of available time series compared to the large number of genes in the network. To overcome this, data integration from diverse sources is mandatory. Microarray data from different sources and platforms are publicly available, but integration is not straightforward, due to platform and experimental differences. Methods: We analyse here different normalisation approaches for microarray data integration, in the context of reverse engineering of GRN quantitative models. We introduce two preprocessing approaches based on existing normalisation techniques and provide a comprehensive comparison of normalised datasets. Conclusions: Results identify a method based on a combination of Loess normalisation and iterative K-means as best for time series normalisation for this problem.

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Archivio della Ricerca - Università di Pisa

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Exaggerated CpH methylation in the autism-affected brain

Author: A Akalin
A Meissner
A Nguyen
AA Shabalin
AE Jaffe
AK Smith
B Kinde
CG Bell
D Zhang
ET Wiles
F Krueger
I Voineagu
JM Lasalle
JT Leek
JU Guo
K Day
L de la Torre-Ubieta
M Münzel
MJ Aryee
NT Vijayakumar
R Lister
R Nagarajan
Roadmap Epigenomics Consortium
S De Rubeis
S Gupta
S Nardone
SE Ellis
SG Gregory
T Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/08/2016

BACKGROUND: The etiology of autism, a complex, heritable, neurodevelopmental disorder, remains largely unexplained. Given the unexplained risk and recent evidence supporting a role for epigenetic mechanisms in the development of autism, we explored the role of CpG and CpH (H = A, C, or T) methylation within the autism-affected cortical brain tissue. METHODS: Reduced representation bisulfite sequencing (RRBS) was completed, and analysis was carried out in 63 post-mortem cortical brain samples (Brodmann area 19) from 29 autism-affected and 34 control individuals. Analyses to identify single sites that were differentially methylated and to identify any global methylation alterations at either CpG or CpH sites throughout the genome were carried out. RESULTS: We report that while no individual site or region of methylation was significantly associated with autism after multi-test correction, methylated CpH dinucleotides were markedly enriched in autism-affected brains (~2-fold enrichment at p < 0.05 cutoff, p = 0.002). CONCLUSIONS: These results further implicate epigenetic alterations in pathobiological mechanisms that underlie autism. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13229-017-0119-y) contains supplementary material, which is available to authorized users.

Crossref

PubMed Central

eScholarship - University of California

Detection of regulator genes and eQTLs in gene networks

Author: A Butte
A Chatr-Aryamontri
A Clauset
A Joshi
A Kundaje
AA Shabalin
AJ Enright
AJ Walhout
AS Dimas
B Schwanhausser
B Zhang
C Cenik
CO Daub
D Koller
DA Cusanovich
DM Greenawalt
E Bonnet
E Ravasz
E Segal
EC Neto
EE Schadt
EJ Foss
F Grubert
F Yue
FA Cubillos
FW Albert
G Hemani
G Nicholson
GD Smith
GH Golub
H Foroughi Asl
H Talukdar
HN Kadarmideen
J Millstein
J Qi
J Zhu
JE Aten
JF Ayroles
JJ Faith
JL Björkegren
JS Liu
K Basso
K Qu
KG Ardlie
L Wu
LA Hindorff
LH Hartwell
LS Chen
M Ashburner
M Civelek
M Georges
M Gerstein
M Medvedovic
M Schmidt
M Scutari
MA Schaub
MB Eisen
MD Ritchie
ME Goddard
MEJ Newman
MV Rockman
N Friedman
N Laird
O Stegle
P Langfelder
P Lu
R Sharan
RB Brem
RW Williams
S Lee
S Roy
S Tavazoie
SI Lee
SM Waszak
SS Rao
T Lappalainen
T Michoel
TA Manolio
TF Mackay
The ENCODE
TS Furey
VG Cheung
W Cookson
W Zhang
Y Chen
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2016

Genetic differences between individuals associated with quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated with differences in expression levels of nearby genes (they are "expression quantitative trait loci", or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and software to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico.

arXiv.org e-Print Archive

Crossref

Genetic determinants of co-accessible chromatin regions in activated T cells across humans.

Author: A Barrie
A Battle
A Franke
AA Shabalin
AM Klein
AR Quinlan
Atsede Siba
Aviv Regev
Aviva P. Aiden
B Li
BE Stranger
C Hou
Christine S. Cheng
Christophe Benoist
Chun J. Ye
CJ Ye
CK Stroud
D Hnisz
D Lee
D Sakata
DE Speiser
Dmytro Lituiev
E Elinav
E Splinter
EM Schmidt
Erez Lieberman Aiden
EZ Macosko
G Jun
G McVicker
H Kilpinen
H Li
HK Finucane
HM Kang
Howard Y. Chang
Ido Machol
Ivo Wortman
J Yang
JD Buenrostro
JD Storey
JE Phillips
JF Degner
JN Hirschhorn
JS Delisle
K Enjyoji
Kendrick L. Hougen
KK Farh
L Chen
L Plesner
M Feuerer
M Ghandi
M Kasowski
M Kronenberg
M Kurachi
M. Grace Gordon
Marcin Tabaka
MB Gerstein
Meena Subramaniam
MI Love
MI McCarthy
Michael A. Beer
MN Lee
MT Maurano
Muhammad Shamim
MY Donath
N Kumasaka
NC Durand
Neva C. Durand
NP Restifo
P Cauchy
P Li
PC Hollenhorst
Philip L. De Jager
PM Visscher
PS Ohashi
R Satija
Rachel E. Gate
RE Thurman
RM Samstein
Roadmap Epigenomics Consortium
S Deaglio
S Heinz
S Neph
SM Waszak
SS Rao
Su-Chen Huang
T Lappalainen
T Raj
The ENCODE Project Consortium.
Ting Feng
TL Murphy
UM Marigorta
WA Whyte
WJ Astle
X Chen
X Sun
Y Belkaid
Y Zhang
YY Fan
Publication venue: eScholarship, University of California
Publication date: 01/08/2018

Over 90% of genetic variants associated with complex human traits map to non-coding regions, but little is understood about how they modulate gene regulation in health and disease. One possible mechanism is that genetic variants affect the activity of one or more cis-regulatory elements leading to gene expression variation in specific cell types. To identify such cases, we analyzed ATAC-seq and RNA-seq profiles from stimulated primary CD4+ T cells in up to 105 healthy donors. We found that regions of accessible chromatin (ATAC-peaks) are co-accessible at kilobase and megabase resolution, consistent with the three-dimensional chromatin organization measured by in situ Hi-C in T cells. Fifteen percent of genetic variants located within ATAC-peaks affected the accessibility of the corresponding peak (local-ATAC-QTLs). Local-ATAC-QTLs have the largest effects on co-accessible peaks, are associated with gene expression and are enriched for autoimmune disease variants. Our results provide insights into how natural genetic variants modulate cis-regulatory elements, in isolation or in concert, to influence gene expression.

Crossref

eScholarship - University of California

Regularized gene selection in cancer microarray meta-analysis

Author: A Maynard
AA Shabalin
B Fung
CA Iacobuzio-Donahue
CD Logsdon
D Ghosh
DD Smith
EM Conlon
F Hong
H Friess
H Jiang
H Zhang
J Choi
J Friedman
J Gui
J Wang
Jian Huang
JR Stevens
M Bloomston
P Warnet
R Grutzmann
R Guerra
S Kim
S Ma
Shuangge Ma
SK Johnson
T Crnogorac-Jurcevic
WP Kuo
Y Jung
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: In cancer studies, it is common that multiple microarray experiments are conducted to measure the same clinical outcome and expressions of the same set of genes. An important goal of such experiments is to identify a subset of genes that can potentially serve as predictive markers for cancer development and progression. Analyses of individual experiments may lead to unreliable gene selection results because of the small sample sizes. Meta-analysis can be used to pool multiple experiments, increase statistical power, and achieve more reliable gene selection. The meta-analysis of cancer microarray data is challenging because of the high dimensionality of gene expressions and the differences in experimental settings amongst different experiments. Results: We propose a Meta Threshold Gradient Descent Regularization (MTGDR) approach for gene selection in the meta-analysis of cancer microarray data. The MTGDR has many advantages over existing approaches. It allows different experiments to have different experimental settings. It can account for the joint effects of multiple genes on cancer, and it can select the same set of cancer-associated genes across multiple experiments. Simulation studies and analyses of multiple pancreatic and liver cancer experiments demonstrate the superior performance of the MTGDR. Conclusion: The MTGDR provides an effective way of analyzing multiple cancer microarray studies and selecting reliable cancer-associated genes.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central