Search CORE

Brunel University Research Archive

DI-fusion

An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs

Author: A Sandelin
A Sandelin
A Sharov
A Tomovic
Adrian J Shepherd
Armando Blanco
C Lawrence
D Denning
E Baker
E Szmidt
E Wingender
F Garcia
F Lam
F Lopez
F Offner
F Zare-Mirakabad
Fernando Garcia-Alcalde
G Chamilos
G Diop
G Hertz
J Hanley
J Hughes
J Sainz
J Van Helden
J Zhao
K Atanassov
K Atanassov
K Atanassov
K Atanassov
K Won
L Liang
L Zadeh
M Bulyk
M Das
M Eisen
N Dror
N Kim
P Benos
P Bochud
P Schling
R Gordan
S De
T Bailey
T Fawcett
T Hehlgans
T Tamura
T Tamura
V Khatibi
W Hung
W Wasserman
X Chen
Y Haudry
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: Transcription factors (TFs) control transcription by binding to specific regions of DNA called transcription factor binding sites (TFBSs). The identification of TFBSs is a crucial problem in computational biology and includes the subtask of predicting the location of known TFBS motifs in a given DNA sequence. It has previously been shown that, when scoring matches to known TFBS motifs, interdependencies between positions within a motif should be taken into account. However, this remains a challenging task owing to the fact that sequences similar to those of known TFBSs can occur by chance with a relatively high frequency. Here we present a new method for matching sequences to TFBS motifs based on intuitionistic fuzzy sets (IFS) theory, an approach that has been shown to be particularly appropriate for tackling problems that embody a high degree of uncertainty. Results: We propose SCintuit, a new scoring method for measuring sequence-motif affinity based on IFS theory. Unlike existing methods that consider dependencies between positions, SCintuit is designed to prevent overestimation of less conserved positions of TFBSs. For a given pair of bases, SCintuit is computed not only as a function of their combined probability of occurrence, but also taking into account the individual importance of each single base at its corresponding position. We used SCintuit to identify known TFBSs in DNA sequences. Our method provides excellent results when dealing with both synthetic and real data, outperforming the sensitivity and the specificity of two existing methods in all the experiments we performed. Conclusions: The results show that SCintuit improves the prediction quality for TFs of the existing approaches without compromising sensitivity. In addition, we show how SCintuit can be successfully applied to real research problems. In this study the reliability of the IFS theory for motif discovery tasks is proven

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Repositorio Institucional Universidad de Granada

UCL Discovery

Birkbeck Institutional Research Online

A cAMP-binding ectoprotein in the yeast Saccharomyces cerevisiae

Author: Achstetter T.
Baroni M. D.
Behrens M. M.
Biilow R.
Bregman D. B.
Brunton L. L.
Cannon J. F.
Caras I. W.
Caras I. W.
Carr S. A.
Clegg C.
Conzelmann A.
Conzelmann A.
Corbin J. D.
Cross G. A. M.
Dallner G.
Davitz M. A.
Davitz M. A.
DeCamilli P.
Doering T. L.
Edelman A. M.
Ferguson M. A. J.
Ferguson M. A. J.
Ferguson R.
Fersht A.
Flockart D. A.
Gruber W.
Hixson C. S.
Ishihara M.
Jaynes P. K.
Johnson K. E.
Kamps M. P.
Kiibler D.
Korc-Grodzicki B.
Kunisawa R.
Lang B.
Lohmann S. M.
Lohmann S. M.
Low M. G.
Low M. G.
Matsumoto K.
Matsumoto K.
Matsumoto K.
Matsumoto K.
Matsumoto K.
Merino A.
Mitts M. R.
Mostov K. E.
Muller G.
Muller G.
Muller G.
Muller G„
Nairn A. C.
Nigam S. K.
Nigg E. A.
Nigg E. A.
Olson S.
Pall G.
Rhee T.
Rodel G.
Sakar D.
Salomon Y.
Smith M. E.
Srere P. A.
Stieger A.
Takami N.
Thorner J.
Toda T.
Trams E. G.
Uno I.
Vai M.
Wen T. C.
Wingender-Drissen R.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/1991
Field of study

tides 10, 593-595

CiteSeerX

Open Access LMU

Quantitative model for inferring dynamic regulation of the tumour suppressor gene p53

Author: A Chipperfield
A Conesa
AR Joyce
AT Kwon
AW Braithwaite
C Moorman
CG Moles
CL Wei
D Chen
DG Sedding
E Wingender
G Liu
H de Jong
J Aach
J Goutsias
J Goutsias
J Wang
J Wang
J Wang
J Wang
JC Liao
JM Espinosa
Junbai Wang
K Zhu
KB Spurgers
KH Vousden
L Ma
M Barenco
MK Yeung
MR Bhonde
MV Karamouzis
N Sun
PS Kho
Q Wei
Q Wu
R Rahman-Roblick
RB Zhao
RC Gentleman
RS Erb
S Liu
S Rogers
SA Johnson
T Tian
Tianhai Tian
TS Gardner
TT Vu
WS el Deiry
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: The availability of various "omics" datasets creates a prospect of performing the study of genome-wide genetic regulatory networks. However, one of the major challenges of using mathematical models to infer genetic regulation from microarray datasets is the lack of information for protein concentrations and activities. Most of the previous researches were based on an assumption that the mRNA levels of a gene are consistent with its protein activities, though it is not always the case. Therefore, a more sophisticated modelling framework together with the corresponding inference methods is needed to accurately estimate genetic regulation from "omics" datasets. Results: This work developed a novel approach, which is based on a nonlinear mathematical model, to infer genetic regulation from microarray gene expression data. By using the p53 network as a test system, we used the nonlinear model to estimate the activities of transcription factor (TF) p53 from the expression levels of its target genes, and to identify the activation/inhibition status of p53 to its target genes. The predicted top 317 putative p53 target genes were supported by DNA sequence analysis. A comparison between our prediction and the other published predictions of p53 targets suggests that most of putative p53 targets may share a common depleted or enriched sequence signal on their upstream non-coding region. Conclusions: The proposed quantitative model can not only be used to infer the regulatory relationship between TF and its down-stream genes, but also be applied to estimate the protein activities of TF from the expression levels of its target genes

Directory of Open Access Journals

Enlighten

Efficient and accurate P-value computation for Position Weight Matrices

Author: A Liefooghe
C Pizzi
E Wingender
G Bejerano
GE Crooks
GZ Hertz
H Huang
Hélène Touzet
J Zhang
Jean-Stéphane Varré
JM Claverie
K Malde
M Beckstette
M Garey
R Staden
S Mount
S Rahmann
TD Wu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Position Weight Matrices (PWMs) are probabilistic representations of signals in sequences. They are widely used to model approximate patterns in DNA or in protein sequences. The usage of PWMs needs as a prerequisite to knowing the statistical significance of a word according to its score. This is done by defining the P-value of a score, which is the probability that the background model can achieve a score larger than or equal to the observed value. This gives rise to the following problem: Given a P-value, find the corresponding score threshold. Existing methods rely on dynamic programming or probability generating functions. For many examples of PWMs, they fail to give accurate results in a reasonable amount of time. Results The contribution of this paper is two fold. First, we study the theoretical complexity of the problem, and we prove that it is NP-hard. Then, we describe a novel algorithm that solves the P-value problem efficiently. The main idea is to use a series of discretized score distributions that improves the final result step by step until some convergence criterion is met. Moreover, the algorithm is capable of calculating the exact P-value without any error, even for matrices with non-integer coefficient values. The same approach is also used to devise an accurate algorithm for the reverse problem: finding the P-value for a given score. Both methods are implemented in a software called TFM-PVALUE, that is freely available. Conclusion We have tested TFM-PVALUE on a large set of PWMs representing transcription factor binding sites. Experimental results show that it achieves better performance in terms of computational time and precision than existing tools.</p

HAL - Lille 3

Directory of Open Access Journals

INRIA a CCSD electronic archive server

mirConnX: condition-specific mRNA-microRNA network integrator

Author: Ashburner
C. Athanassiou
Chan
Chen
Corcoran
Corsten
Enright
Fang
Furnari
G. T. Huang
Gene Ontology Consortium
Huang
Kertesz
Krek
Kruger
Langfelder
Marson
P. V. Benos
Papagiannakopoulos
Plikus
Rebholz-Schuhmann
Schmid
Silber
Stormo
Wingender
Publication venue: Oxford University Press
Publication date
Field of study

mirConnX is a user-friendly web interface for inferring, displaying and parsing mRNA and microRNA (miRNA) gene regulatory networks. mirConnX combines sequence information with gene expression data analysis to create a disease-specific, genome-wide regulatory network. A prior, static network has been constructed for all human and mouse genes. It consists of computationally predicted transcription factor (TF)-gene associations and miRNA target predictions. The prior network is supplemented with known interactions from the literature. Dynamic TF- and miRNA-gene associations are inferred from user-provided expression data using an association measure of choice. The static and dynamic networks are then combined using an integration function with user-specified weights. Visualization of the network and subsequent analysis are provided via a very responsive graphic user interface. Two organisms are currently supported: Homo sapiens and Mus musculus. The intuitive user interface and large database make mirConnX a useful tool for clinical scientists for hypothesis generation and explorations. mirConnX is freely available for academic use at http://www.benoslab.pitt.edu/mirconnx

Vitamin D receptor ChIP-seq in primary CD4+ cells: relationship to serum 25-hydroxyvitamin D levels and autoimmune disease

Author: A Sandelin
A Sanyal
Adam E Handel
AE Handel
Antonio J Berlanga-Taylor
AP Boyle
B Langmead
B Lehmann
BE Bernstein
C Carlberg
CE Grant
CS Ross-Innes
CY McLean
D Berglund
E Wingender
F Birzele
Finn Drabløs
G Pavesi
Gavin Giovannoni
Geir K Sandve
George C Ebers
Giulio Disanto
Giuseppe Gallone
GK Sandve
Heather Hanwell
IV Kulakovskiy
J Orgaz-Molina
J-C Souberbielle
JHA Martens
K Li
KL Munger
LA Hindorff
LL Issa
M Ashburner
M Caliskan
M Lutz
M Thomas-Chollier
MA Kriegel
MD Shirley
ML McCullough
NU Rashid
O Weth
PA Fujita
PA Marshall
R Salehi-Tabar
RM Tolón
S Gundersen
S Heikkinen
Sreeram V Ramagopalan
SV Ramagopalan
T Liu
TA Owen
TL Bailey
TL Bailey
TL Bailey
Y Zhang
Y-C Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

PMCID: PMC3710212This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Queen Mary Research Online

NORA - Norwegian Open Research Archives

FlyFactorSurvey: a database of Drosophila transcription factor binding specificities determined using the bacterial one-hybrid system

Author: Bailey
Bailey
Bailey
Berger
Bergman
Berman
Blatti
Bosch
Christopher J. Hull
Cong Zhu
Crooks
David S. Lapointe
Donlin
Drysdale
Gupta
Halfon
Janssens
Jessie A. Brasefield
Johnson
Jolma
Kheradpour
Kinzler
Li
Liang
Lihua Julie Zhu
Lyne
Majid Kazemian
Matthew D. Basciotta
Matys
Meng
Metewo Selase Enuameh
Michael H. Brodsky
Newburger
Noyes
Noyes
Portales-Casamar
Ren
Roulet
Ryan G. Christensen
Saurabh Sinha
Schneider
Schroeder
Scot A. Wolfe
Segal
Sinha
Stormo
The UniProt Consortium
Tuerk
Tweedie
Wingender
Wingender
Yuna Asriyan
Zeitlinger
Zhao
Zykovich
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

FlyFactorSurvey (http://pgfe.umassmed.edu/TFDBS/) is a database of DNA binding specificities for Drosophila transcription factors (TFs) primarily determined using the bacterial one-hybrid system. The database provides community access to over 400 recognition motifs and position weight matrices for over 200 TFs, including many unpublished motifs. Search tools and flat file downloads are provided to retrieve binding site information (as sequences, matrices and sequence logos) for individual TFs, groups of TFs or for all TFs with characterized binding specificities. Linked analysis tools allow users to identify motifs within our database that share similarity to a query matrix or to view the distribution of occurrences of an individual motif throughout the Drosophila genome. Together, this database and its associated tools provide computational and experimental biologists with resources to predict interactions between Drosophila TFs and target cis-regulatory sequences

Digital Commons@Becker

eScholarship@UMMS

UCbase & miRfunc: a database of ultraconserved sequences and microRNA function

Author: A. Bottoni
Altschul
Barrett
Brazma
C. M. Croce
C. Piovan
C. Taccioli
Calin
Calin
Cheng
E. Fabbri
G. A. Calin
G. Romano
Griffiths-Jones
Hamosh
Iorio
Iorio
J. Hagan
Karp
L. Y. Fong
Lai
Lee
Lee
M. Acunzo
M. V. Iorio
Nam
Pruitt
R. Gambari
R. Visone
Rajewsky
Ruvkun
S. Volinia
Stenson
Takai
Visone
Wingender
Xu
Zhi
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

Four hundred and eighty-one ultraconserved sequences (UCRs) longer than 200 bases were discovered in the genomes of human, mouse and rat. These are DNA sequences showing 100% identity among the three species. UCRs are frequently located at genomic regions involved in cancer, differentially expressed in human leukemias and carcinomas and in some instances regulated by microRNAs (miRNAs). Here we present UCbase & miRfunc, the first database which provides ultraconserved sequences data and shows miRNA function. Also, it links UCRs and miRNAs with the related human disorders and genomic properties. The current release contains over 2000 sequences from three species (human, mouse and rat). As a web application, UCbase & miRfunc is platform independent and it is accessible at http://microrna.osu.edu/.UCbase4