
124 research outputs found

Use of GenMAPP and MAPPFinder to analyse pathways involved in chickens infected with the protozoan parasite Eimeria

Author: AR Pico
Dennis Prickett
F Al-Shahrour
J Hedegaard
M van Iersel
MD Prickett
Michael Watson
N Salomonis
N Yeung
RB Williams
SW Doniger
Publication venue: BioMed Central
Publication date: 01/07/2009

Abstract. Background: Microarrays allow genome-wide assays of gene expression. There is a need for user-friendly software to visualise and analyse these data. Analysing microarray data in the context of biological pathways is now common, and several tools exist. Results: We describe the use of MAPPFinder, a component of GenMAPP, to characterise the biological pathways affected in chickens infected with the protozoan parasite Eimeria. Several pathways were significantly affected based on the unadjusted p-value, including several immune-system pathways. Conclusion: GenMAPP/MAPPFinder provides a means to rapidly visualise pathways affected in microarray studies. However, it relies on good genome annotation and having genes reliably linked to pathway objects. We show that GenMAPP/MAPPFinder can produce useful results, and as the annotation of the chicken genome improves, so will the level of information gained.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models

Author: A Subramanian
B Schölkopf
D Eisenberg
D Liu
D Zhang
Dawei Liu
Debashis Ghosh
G Kimeldorf
JJ Goeman
KD Dahlquist
M Raponi
N Breslow
P Grosu
P McCullagh
R Davies
S Dhanasekaran
S le Cessie
SG Self
SW Doniger
V Vapnik
Xihong Lin
Z Wei
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: Growing interest on biological pathways has called for new statistical methods for modeling and testing a genetic pathway effect on a health outcome. The fact that genes within a pathway tend to interact with each other and relate to the outcome in a complicated way makes nonparametric methods more desirable. The kernel machine method provides a convenient, powerful and unified method for multi-dimensional parametric and nonparametric modeling of the pathway effect. Results: In this paper we propose a logistic kernel machine regression model for binary outcomes. This model relates the disease risk to covariates parametrically, and to genes within a genetic pathway parametrically or nonparametrically using kernel machines. The nonparametric genetic pathway effect allows for possible interactions among the genes within the same pathway and a complicated relationship of the genetic pathway and the outcome. We show that kernel machine estimation of the model components can be formulated using a logistic mixed model. Estimation hence can proceed within a mixed model framework using standard statistical software. A score test based on a Gaussian process approximation is developed to test for the genetic pathway effect. The methods are illustrated using a prostate cancer data set and evaluated using simulations. An extension to continuous and discrete outcomes using generalized kernel machine models and its connection with generalized linear mixed models is discussed. Conclusion: Logistic kernel machine regression and its extension generalized kernel machine regression provide a novel and flexible statistical tool for modeling pathway effects on discrete and continuous outcomes. Their close connection to mixed models and attractive performance make them have promising wide applications in bioinformatics and other biomedical areas.

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Collection Of Biostatistics Research Archive

Harvard Dataverse Network

Evidence for Pervasive Adaptive Protein Evolution in Wild Mice

Author: A Eyre-Walker
A Geraldes
Adam Eyre-Walker
AR Boyko
B Weir
Bettina Harr
C Haag-Liautard
D Bachtrog
DA Hinds
Daniel L. Halligan
DJ Begun
F Bonhomme
F Tajima
FH Bronson
Fiona Oliver
G Coop
G Liti
G McVicker
GA Watterson
J Charlesworth
J Gojobori
JA Shapiro
JC Fay
JF Baines
JH McDonald
JK Pritchard
JL Caswell
JM Akey
JM Macpherson
JP Foxe
L Zhang
M Kimura
Michael W. Nachman
MW Nachman
N Bray
N Takahata
N Yu
NG Smith
P Andolfatto
PD Keightley
Peter D. Keightley
R Burgess
R Haygood
R Nielsen
RJ Livingston
S Rozen
SF Altschul
SW Doniger
T Salcedo
W Din
X Maside
Z Patwa
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2010

The relative contributions of neutral and adaptive substitutions to molecular evolution has been one of the most controversial issues in evolutionary biology for more than 40 years. The analysis of within-species nucleotide polymorphism and between-species divergence data supports a widespread role for adaptive protein evolution in certain taxa. For example, estimates of the proportion of adaptive amino acid substitutions (alpha) are 50% or more in enteric bacteria and Drosophila. In contrast, recent estimates of alpha for hominids have been at most 13%. Here, we estimate alpha for protein sequences of murid rodents based on nucleotide polymorphism data from multiple genes in a population of the house mouse subspecies Mus musculus castaneus, which inhabits the ancestral range of the Mus species complex and nucleotide divergence between M. m. castaneus and M. famulus or the rat. We estimate that 57% of amino acid substitutions in murids have been driven by positive selection. Hominids, therefore, are exceptional in having low apparent levels of adaptive protein evolution. The high frequency of adaptive amino acid substitutions in wild mice is consistent with their large effective population size, leading to effective natural selection at the molecular level. Effective natural selection also manifests itself as a paucity of effectively neutral nonsynonymous mutations in M. m. castaneus compared to humans.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Sussex Research Online

MPG.PuRe

Mining Biological Pathways Using WikiPathways Web Services

Author: A Doerr
AL Tarca
Alexander R. Pico
AR Pico
Bruce R. Conklin
Chris Evelo
D Nam
JW Huss 3rd
K Tarassov
Kristina Hanspers
L Matthews
LD Stein
M Kanehisa
Martijn P. van Iersel
MP van Iersel
MS Cline
N Salomonis
O Keskin
P Fisher
P Shannon
SW Doniger
T Ideker
T Oinn
Thomas Kelder
Winston Hide
Y Li
Publication venue: Public Library of Science
Publication date: 01/01/2009

WikiPathways is a platform for creating, updating, and sharing biological pathways [1]. Pathways can be edited and downloaded using the wiki-style website. Here we present a SOAP web service that provides programmatic access to WikiPathways that is complementary to the website. We describe the functionality that this web service offers and discuss several use cases in detail. Exposing WikiPathways through a web service opens up new ways of utilizing pathway information and assisting the community curation process.

CiteSeerX

Public Library of Science (PLOS)

Maastricht University Research Portal

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Formation of regulatory modules by local sequence duplication

Author: A Stark
A Tanay
AL Halpern
AM Moses
Amos Tanay
Armita Nourmohammad
B Ondek
BP Berman
CM Bergman
CT Harbison
D Gruen
D Stanojevic
DN Arnosti
DS Fields
E Segal
EE Hare
EH Davidson
G Badis
G Benson
G Leung
GD Stormo
I Abnizova
J Berg
J Monod
JM Hancock
K Thornton
L Li
M Kimura
M Levine
M Lynch
M Lässig
M Markstein
M Pachkov
M Ptashne
MC King
MD Vinces
Michael Lässig
MM Kulkarni
MS Halfon
MV Katti
MZ Ludwig
N Rajewsky
NE Buchler
O Berg
PW Messer
R Durbin
RJ Britten
RW Lusk
S Kullback
S Mukherjee
S Sinha
S Small
SJ Maerkl
SM Gallo
SW Doniger
V Boeva
V Mustonen
Z Wunderlich
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2011

Turnover of regulatory sequence and function is an important part of molecular evolution. But what are the modes of sequence evolution leading to rapid formation and loss of regulatory sites? Here, we show that a large fraction of neighboring transcription factor binding sites in the fly genome have formed from a common sequence origin by local duplications. This mode of evolution is found to produce regulatory information: duplications can seed new sites in the neighborhood of existing sites. Duplicate seeds evolve subsequently by point mutations, often towards binding a different factor than their ancestral neighbor sites. These results are based on a statistical analysis of 346 cis-regulatory modules in the Drosophila melanogaster genome, and a comparison set of intergenic regulatory sequence in Saccharomyces cerevisiae. In fly regulatory modules, pairs of binding sites show significantly enhanced sequence similarity up to distances of about 50 bp. We analyze these data in terms of an evolutionary model with two distinct modes of site formation: (i) evolution from independent sequence origin and (ii) divergent evolution following duplication of a common ancestor sequence. Our results suggest that pervasive formation of binding sites by local sequence duplications distinguishes the complex regulatory architecture of higher eukaryotes from the simpler architecture of unicellular organisms.

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Kölner UniversitätsPublikationsServer

Directory of Open Access Journals

PubMed Central

Integrated Genome-Scale Prediction of Detrimental Mutations in Transcription Networks

Author: A Tanay
AM Moses
AP Gasch
B Prud'homme
Ben Lehner
C Zhu
C-S Chin
CA Brown
CS Chan
CT Harbison
D Schmidt
DM Gelperin
DT Odom
E Segal
ET Dermitzakis
G Giaever
G Liti
GD Stormo
HC Mak
I Lee
I Tirosh
J Gagneur
J Gerke
J Gertz
J Ihmels
J Kim
J Zheng
J Zhu
JD Lieb
JH McDonald
JI Semple
Joshua M. Akey
K Chen
KD MacIsaac
L Giorgetti
L Peña-Castillo
L Teytelman
LA Boyer
LA Hindorff
M Dreze
M Kellis
MC King
Mirko Francesconi
NN Batada
Q Zhong
R Johnson
R Jothi
R Sopko
Rob Jelier
S MacArthur
S Marcand
S Ohno
S Zeiser
SB Carroll
SW Doniger
T Vavouri
U Nagalakshmi
V Mustonen
X yong Li
Y Bilu
Y Field
Z Ouyang
Z Wunderlich
Publication venue: Public Library of Science
Publication date: 01/05/2011

A central challenge in genetics is to understand when and why mutations alter the phenotype of an organism. The consequences of gene inhibition have been systematically studied and can be predicted reasonably well across a genome. However, many sequence variants important for disease and evolution may alter gene regulation rather than gene function. The consequences of altering a regulatory interaction (or “edge”) rather than a gene (or “node”) in a network have not been as extensively studied. Here we use an integrative analysis and evolutionary conservation to identify features that predict when the loss of a regulatory interaction is detrimental in the extensively mapped transcription network of budding yeast. Properties such as the strength of an interaction, location and context in a promoter, regulator and target gene importance, and the potential for compensation (redundancy) associate to some extent with interaction importance. Combined, however, these features predict quite well whether the loss of a regulatory interaction is detrimental across many promoters and for many different transcription factors. Thus, despite the potential for regulatory diversity, common principles can be used to understand and predict when changes in regulation are most harmful to an organism.

Lirias

Crossref

Directory of Open Access Journals

PubMed Central

Genome Expression Pathway Analysis Tool – Analysis and visualization of microarray gene expression data under genomic, proteomic and metabolic context

Author: A Rosenwald
AA Alizadeh
AI Saeed
B Mlecnik
B Zhang
BM Bolstad
C von Mering
F Al-Shahrour
Gene Ontology Consortium
GJ Dennis
GK Smyth
J Rainer
JM Vaquerizas
Julia C Engelmann
Jörg Schultz
M Kanehisa
M Kapushesky
M Kotera
M Masseroli
M Pelizzola
Markus Weniger
O Troyanskaya
P Khatri
P Lichter
P Shannon
R Gentleman
R Shamir
S Bea
SW Doniger
TJP Hubbard
W Huber
YH Yang
Publication venue: BioMed Central
Publication date: 01/06/2007

Abstract. Background: Regulation of gene expression is relevant to many areas of biology and medicine, in the study of treatments, diseases, and developmental stages. Microarrays can be used to measure the expression level of thousands of mRNAs at the same time, allowing insight into or comparison of different cellular conditions. The data derived out of microarray experiments is highly dimensional and often noisy, and interpretation of the results can get intricate. Although programs for the statistical analysis of microarray data exist, most of them lack an integration of analysis results and biological interpretation. Results: We have developed GEPAT, Genome Expression Pathway Analysis Tool, offering an analysis of gene expression data under genomic, proteomic and metabolic context. We provide an integration of statistical methods for data import and data analysis together with a biological interpretation for subsets of probes or single probes on the chip. GEPAT imports various types of oligonucleotide and cDNA array data formats. Different normalization methods can be applied to the data, afterwards data annotation is performed. After import, GEPAT offers various statistical data analysis methods, as hierarchical, k-means and PCA clustering, a linear model based t-test or chromosomal profile comparison. The results of the analysis can be interpreted by enrichment of biological terms, pathway analysis or interaction networks. Different biological databases are included, to give various information for each probe on the chip. GEPAT offers no linear work flow, but allows the usage of any subset of probes and samples as a start for a new data analysis. GEPAT relies on established data analysis packages, offers a modular approach for an easy extension, and can be run on a computer grid to allow a large number of users. It is freely available under the LGPL open source license for academic and commercial users at http://gepat.sourceforge.net. Conclusion: GEPAT is a modular, scalable and professional-grade software integrating analysis and interpretation of microarray gene expression data. An installation available for academic users can be found at http://gepat.bioapps.biozentrum.uni-wuerzburg.de.

Crossref

University of Regensburg Publication Server

Directory of Open Access Journals

PubMed Central

Genetic Architecture of Highly Complex Chemical Resistance Traits across Four Yeast Strains

Author: AH Tong
Audrey P. Gasch
D Gresham
DJ Kvitek
DM Ruderfer
DS Falconer
EO Perlstein
F Storici
FA Cubillos
G Liti
HA Orr
HB Fraser
HS Kim
Ian M. Ehrenreich
IM Ehrenreich
J Gerke
J Schacherer
J Warringer
JC Fay
JH McCusker
Joshua Bloom
L Parts
Leonid Kruglyak
LM Steinmetz
Noorossadat Torabi
RB Brem
RK Bradley
SW Doniger
TA Manolio
TF Mackay
W Wei
Xin Wang
Yue Jia
Z Gu
Publication venue: Public Library of Science
Publication date: 01/01/2012

Many questions about the genetic basis of complex traits remain unanswered. This is in part due to the low statistical power of traditional genetic mapping studies. We used a statistically powerful approach, extreme QTL mapping (X-QTL), to identify the genetic basis of resistance to 13 chemicals in all 6 pairwise crosses of four ecologically and genetically diverse yeast strains, and we detected a total of more than 800 loci. We found that the number of loci detected in each experiment was primarily a function of the trait (explaining 46% of the variance) rather than the cross (11%), suggesting that the level of genetic complexity is a consistent property of a trait across different genetic backgrounds. Further, we observed that most loci had trait-specific effects, although a small number of loci with effects in many conditions were identified. We used the patterns of resistance and susceptibility alleles in the four parent strains to make inferences about the allele frequency spectrum of functional variants. We also observed evidence of more complex allelic series at a number of loci, as well as strain-specific signatures of selection. These results improve our understanding of complex traits in yeast and have implications for study design in other organisms.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Prioritization of gene regulatory interactions from large-scale modules in yeast

Author: A Tanay
A-L Barabasi
CT Harbison
D Greenbaum
E Schweizer
E Segal
F Gao
F Rolland
G Lesage
Ho-Joon Lee
I Simon
J Ihmels
K Lemmens
LH Hartwell
LL Newcomb
M Kellis
M Koranda
Martin Vingron
N Zhang
P Cliften
P Prochasson
PT Spellman
R Siddharthan
Ricardo Bringas
S Rahmann
S Tavazoie
SW Doniger
T Manke
T Yu
Thomas Manke
TI Lee
V Matys
VR Iyer
W-S Wu
X Xu
Y Pilpel
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: The identification of groups of co-regulated genes and their transcription factors, called transcriptional modules, has been a focus of many studies about biological systems. While methods have been developed to derive numerous modules from genome-wide data, individual links between regulatory proteins and target genes still need experimental verification. In this work, we aim to prioritize regulator-target links within transcriptional modules based on three types of large-scale data sources. Results: Starting with putative transcriptional modules from ChIP-chip data, we first derive modules in which target genes show both expression and function coherence. The most reliable regulatory links between transcription factors and target genes are established by identifying intersection of target genes in coherent modules for each enriched functional category. Using a combination of genome-wide yeast data in normal growth conditions and two different reference datasets, we show that our method predicts regulatory interactions with significantly higher predictive power than ChIP-chip binding data alone. A comparison with results from other studies highlights that our approach provides a reliable and complementary set of regulatory interactions. Based on our results, we can also identify functionally interacting target genes, for instance, a group of co-regulated proteins related to cell wall synthesis. Furthermore, we report novel conserved binding sites of a glycoprotein-encoding gene, CIS3, regulated by Swi6-Swi4 and Ndd1-Fkh2-Mcm1 complexes. Conclusion: We provide a simple method to prioritize individual TF-gene interactions from large-scale transcriptional modules. In comparison with other published works, we predict a complementary set of regulatory interactions which yields a similar or higher prediction accuracy at the expense of sensitivity. Therefore, our method can serve as an alternative approach to prioritization for further experimental studies.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Nucleosome-coupled expression differences in closely-related species

Author: AL Olins
AM Tsankov
BE Bernstein
C Koch
Corey Nislow
CT Harbison
DE Schones
E Segal
EA Sekinger
F Ozsolak
G Badis
G Zhu
GC Yuan
GJ Hogan
H Li
I Tirosh
IP Ioshikhes
JD Hughes
KA Zawadzki
Kyle Tsui
L Bai
Maitreya J Dunham
Marinella Gebbia
N Kaplan
N Morohashi
O Elemento
O Troyanskaya
OC Martin
Olga G Troyanskaya
P Cliften
P Clifton
RD Kornberg
S Mahony
S Shivaswamy
S Washietl
SW Doniger
T Owen-Hughes
T Pramila
Victoria Yao
W Lee
X Liu
Y Guan
Y Zhang
Yuanfang Guan
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Genome-wide nucleosome occupancy is negatively related to the average level of transcription factor motif binding based on studies in yeast and several other model organisms. The degree to which nucleosome-motif interactions relate to phenotypic changes across species is, however, unknown. Results: We address this challenge by generating nucleosome positioning and cell cycle expression data for Saccharomyces bayanus and show that differences in nucleosome occupancy reflect cell cycle expression divergence between two yeast species, S. bayanus and S. cerevisiae. Specifically, genes with nucleosome-depleted MBP1 motifs upstream of their coding sequence show periodic expression during the cell cycle, whereas genes with nucleosome-shielded motifs do not. In addition, conserved cell cycle regulatory motifs across these two species are more nucleosome-depleted compared to those that are not conserved, suggesting that the degree of conservation of regulatory sites varies, and is reflected by nucleosome occupancy patterns. Finally, many changes in cell cycle gene expression patterns across species can be correlated to changes in nucleosome occupancy on motifs (rather than to the presence or absence of motifs). Conclusions: Our observations suggest that alteration of nucleosome occupancy is a previously uncharacterized feature related to the divergence of cell cycle expression between species.

University of Toronto Research Repository

Princeton University Open Access Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central