Search CORE

436 research outputs found

JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Author: Arenillas DJ
Chen CY
Denay G
Fornes O
Lee J
Lenhard B
Mathelier A
Parcy F
Sandelin A
Shi W
Shyr C
Tan G
Wasserman WW
Worsley-Hunt R
Zhang AW
Publication venue: 'Oxford University Press (OUP)'
Publication date: 22/10/2015
Field of study

JASPAR (http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups. For this 2016 release, we expanded the JASPAR CORE collection with 494 new TF binding profiles (315 in vertebrates, 11 in nematodes, 3 in insects, 1 in fungi and 164 in plants) and updated 59 profiles (58 in vertebrates and 1 in fungi). The introduced profiles represent an 83% expansion and 10% update when compared to the previous release. We updated the structural annotation of the TF DNA binding domains (DBDs) following a published hierarchical structural classification. In addition, we introduced 130 transcription factor flexible models trained on ChIP-seq data for vertebrates, which capture dinucleotide dependencies within TF binding sites. This new JASPAR release is accompanied by a new web tool to infer JASPAR TF binding profiles recognized by a given TF protein sequence. Moreover, we provide the users with a Ruby module complementing the JASPAR API to ease programmatic access and use of the JASPAR collection of profiles. Finally, we provide the JASPAR2016 R/Bioconductor data package with the data of this release

PubMed Central

Copenhagen University Research Information System

Spiral - Imperial College Digital Repository

Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome.

Author: Andersson R.
Gilbert Thomas
Hoover C.
Jones P.
Kelly T.
Krogh A.
Lilje B.
Lindgreen S.
Orlando L.
Parker B.
Pedersen J.
Prokhortchouk E.
Rasmussen M.
Rubin E.
Sandelin A.
Tikhonov A.
Tobin D.
Valen E.
Vang S.
Velazquez A.
Willerslev E.
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/01/2014
Field of study

yesEpigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo- Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics

Crossref

Copenhagen University Research Information System

PubMed Central

The Australian National University

Bradford Scholars

espace@Curtin

Identification of TNF-alpha-Responsive Promoters and Enhancers in the Intestinal Epithelial Cell Model Caco-2

Author: A. Sandelin
Andersson
B. Lilje
Beraud
Bernstein
Carninci
Coskun
Coskun
Dahan
Davuluri
Degen
Derrien
Engstrom
Ernst
Frith
Ger
Hidalgo
Hoffmann
I. Hoof
Imura
J. B. Seidelin
J. Bornholdt
J. Olsen
J. T. Bjerrum
J. T. Troelsen
K. Dahlgaard
Koch
Kodzius
Lenhard
Li
M. Boyd
M. Coskun
M. Vitezic
Ma
Mechtcheriakova
Micheau
O. H. Nielsen
Olsen
Ordas
R. Andersson
RIKEN Genome Exploration Research Group and Genome
Sandberg
Sandelin
Sethu
SINGER
Suzuki
The FANTOM Consortium
Thurman
Wice
Zboralski
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2014
Field of study

The Caco-2 cell line is one of the most important in vitro models for enterocytes, and is used to study drug absorption and disease, including inflammatory bowel disease and cancer. In order to use the model optimally, it is necessary to map its functional entities. In this study, we have generated genome-wide maps of active transcription start sites (TSSs), and active enhancers in Caco-2 cells with or without tumour necrosis factor (TNF)-α stimulation to mimic an inflammatory state. We found 520 promoters that significantly changed their usage level upon TNF-α stimulation; of these, 52% are not annotated. A subset of these has the potential to confer change in protein function due to protein domain exclusion. Moreover, we locate 890 transcribed enhancer candidates, where ∼50% are changing in usage after TNF-α stimulation. These enhancers share motif enrichments with similarly responding gene promoters. As a case example, we characterize an enhancer regulating the laminin-5 γ2-chain (LAMC2) gene by nuclear factor (NF)-κB binding. This report is the first to present comprehensive TSS and enhancer maps over Caco-2 cells, and highlights many novel inflammation-specific promoters and enhancers

CiteSeerX

Crossref

Roskilde Universitet

Copenhagen University Research Information System

PubMed Central

Vitamin D receptor ChIP-seq in primary CD4+ cells: relationship to serum 25-hydroxyvitamin D levels and autoimmune disease

Author: A Sandelin
A Sanyal
Adam E Handel
AE Handel
Antonio J Berlanga-Taylor
AP Boyle
B Langmead
B Lehmann
BE Bernstein
C Carlberg
CE Grant
CS Ross-Innes
CY McLean
D Berglund
E Wingender
F Birzele
Finn Drabløs
G Pavesi
Gavin Giovannoni
Geir K Sandve
George C Ebers
Giulio Disanto
Giuseppe Gallone
GK Sandve
Heather Hanwell
IV Kulakovskiy
J Orgaz-Molina
J-C Souberbielle
JHA Martens
K Li
KL Munger
LA Hindorff
LL Issa
M Ashburner
M Caliskan
M Lutz
M Thomas-Chollier
MA Kriegel
MD Shirley
ML McCullough
NU Rashid
O Weth
PA Fujita
PA Marshall
R Salehi-Tabar
RM Tolón
S Gundersen
S Heikkinen
Sreeram V Ramagopalan
SV Ramagopalan
T Liu
TA Owen
TL Bailey
TL Bailey
TL Bailey
Y Zhang
Y-C Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

PMCID: PMC3710212This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited

Crossref

Springer - Publisher Connector

PubMed Central

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Queen Mary Research Online

NORA - Norwegian Open Research Archives

Transcriptional and epigenomic profiling identifies YAP signaling as a key regulator of intestinal epithelium maturation

Author: Benes Vladimir
Bornholdt Jette
Bressan Raul B.
Chen Yun
Guiu Jordi
Hansen Stine L.
Jensen Kim B.
Larsen Hjalte L.
Lõhmussaar Kadi
Maciag Grzegorz J.
Maimets Martti
Mayer Daniela
Pedersen Marianne Terndrup
Pikkupeura Laura M.
Sandelin Albin
Schweiger Pawel J.
Teves Joji M. Yap
Publication venue: American Association for the Advancement of Science (AAAS)
Publication date: 04/09/2023
Field of study

During intestinal organogenesis, equipotent epithelial progenitors mature into phenotypically distinct stem cells that are responsible for lifelong maintenance of the tissue. While the morphological changes associated with the transition are well characterized, the molecular mechanisms underpinning the maturation process are not fully understood. Here, we leverage intestinal organoid cultures to profile transcriptional, chromatin accessibility, DNA methylation, and three-dimensional (3D) chromatin conformation landscapes in fetal and adult epithelial cells. We observed prominent differences in gene expression and enhancer activity, which are accompanied by local changes in 3D organization, DNA accessibility, and methylation between the two cellular states. Using integrative analyses, we identified sustained Yes-Associated Protein (YAP) transcriptional activity as a major gatekeeper of the immature fetal state. We found the YAP-associated transcriptional network to be regulated at various levels of chromatin organization and likely to be coordinated by changes in extracellular matrix composition. Together, our work highlights the value of unbiased profiling of regulatory landscapes for the identification of key mechanisms underlying tissue maturation

Diposit Digital de la Universitat de Barcelona

Limitations and potentials of current motif discovery algorithms

Author: B. Li
Banerjee
Blanchette
Brazma
Buhler
Burset
D. Kihara
Day
Duret
Ellrott
Gelfand
Helden
Hertz
Hertz
Huang
J. Hu
Kanehisa
Kellis
Lawrence
Liu
Liu
McGuire
Ohler
Pevzner
Qin
Rogic
Roth
Salgado
Sandelin
Simon
Sinha
Sinha
Spellman
Stormo
Thijs
Tompa
van Helden
Wang
Wyrick
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

Computational methods for de novo identification of gene regulation elements, such as transcription factor binding sites, have proved to be useful for deciphering genetic regulatory networks. However, despite the availability of a large number of algorithms, their strengths and weaknesses are not sufficiently understood. Here, we designed a comprehensive set of performance measures and benchmarked five modern sequence-based motif discovery algorithms using large datasets generated from Escherichia coli RegulonDB. Factors that affect the prediction accuracy, scalability and reliability are characterized. It is revealed that the nucleotide and the binding site level accuracy are very low, while the motif level accuracy is relatively high, which indicates that the algorithms can usually capture at least one correct motif in an input sequence. To exploit diverse predictions from multiple runs of one or more algorithms, a consensus ensemble algorithm has been developed, which achieved 6–45% improvement over the base algorithms by increasing both the sensitivity and specificity. Our study illustrates limitations and potentials of existing sequence-based motif discovery algorithms. Taking advantage of the revealed potentials, several promising directions for further improvements are discussed. Since the sequence-based algorithms are the baseline of most of the modern motif discovery algorithms, this paper suggests substantial improvements would be possible for them

CiteSeerX

Crossref

PubMed Central

Scholar Commons - Institutional Repository of the University of South Carolina

Transcription factor site dependencies in human, mouse and rat genomes

Author: A Di Cara
A Gyenesei
A Sandelin
A Sandelin
A Tomovic
A Tomovic
AG Jegga
AH Brivanlou
AJ Walhout
Andrija Tomovic
AV Morozov
B Lenhard
C Kunsch
CC Liu
D Choi
D GuhaThakurta
DC King
DE Schones
DH Crouch
Edward J Oakeley
G Caretti
G Robertson
G Zhao
H Klein
H Wang
IJ Donaldson
IJ Donaldson
J Carabana
J Karlseder
L Narlikar
L Narlikar
M Blanchette
M Defrance
Michael Stadler
O Puig
PR van Ginkel
R Sharan
R Sharan
S Impey
S Mahony
SJ Ho Sui
SM Kielbasa
T Mahmoudi
V Ferretti
W Thompson
WB Alkema
WW Wasserman
X Yan
X Zhang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background It is known that transcription factors frequently act together to regulate gene expression in eukaryotes. In this paper we describe a computational analysis of transcription factor site dependencies in human, mouse and rat genomes. Results Our approach for quantifying tendencies of transcription factor binding sites to co-occur is based on a binding site scoring function which incorporates dependencies between positions, the use of information about the structural class of each transcription factor (major/minor groove binder), and also considered the possible implications of varying GC content of the sequences. Significant tendencies (dependencies) have been detected by non-parametric statistical methodology (permutation tests). Evaluation of obtained results has been performed in several ways: reports from literature (many of the significant dependencies between transcription factors have previously been confirmed experimentally); dependencies between transcription factors are not biased due to similarities in their DNA-binding sites; the number of dependent transcription factors that belong to the same functional and structural class is significantly higher than would be expected by chance; supporting evidence from GO clustering of targeting genes. Based on dependencies between two transcription factor binding sites (second-order dependencies), it is possible to construct higher-order dependencies (networks). Moreover results about transcription factor binding sites dependencies can be used for prediction of groups of dependent transcription factors on a given promoter sequence. Our results, as well as a scanning tool for predicting groups of dependent transcription factors binding sites are available on the Internet. Conclusion We show that the computational analysis of transcription factor site dependencies is a valuable complement to experimental approaches for discovering transcription regulatory interactions and networks. Scanning promoter sequences with dependent groups of transcription factor binding sites improve the quality of transcription factor predictions.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The Novartis Repository

WordCluster: detecting clusters of DNA words and genomic elements

Author: A Sandelin
A Siepel
AR Quinlan
B Giardine
D Durand
D Karolchik
Guillermo Barturen
José L Oliver
KD Pruitt
M Ashburner
M Gardiner-Garden
M Hackenberg
M Hackenberg
M Hackenberg
M Hackenberg
Michael Hackenberg
P Carpena
Pedro Bernaola-Galván
Pedro Carpena
R Aloni
R Lister
TJ Hubbard
VJ Makeev
Ángel M Alganza
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Many <it>k-</it>mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well established examples are the genes, TFBSs, CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results We introduce here an algorithm to detect clusters of DNA words (<it>k-</it>mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method into a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation vary drastically between inside and outside of the clusters. As another example, we used <it>WordCluster </it>to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions <it>WordCluster </it>seems to predict biological meaningful clusters of DNA words (<it>k-</it>mers) and genomic entities. The implementation of the method into a web server is available at <url>http://bioinfo2.ugr.es/wordCluster/wordCluster.php</url> including additional features like the detection of co-localization with gene regions or the annotation enrichment tool for functional analysis of overlapped genes.</p

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Repositorio Institucional Universidad de Granada

A computational evaluation of over-representation of regulatory motifs in the promoter regions of differentially expressed genes

BACKGROUND: Observed co-expression of a group of genes is frequently attributed to co-regulation by shared transcription factors. This assumption has led to the hypothesis that promoters of co-expressed genes should share common regulatory motifs, which forms the basis for numerous computational tools that search for these motifs. While frequently explored for yeast, the validity of the underlying hypothesis has not been assessed systematically in mammals. This demonstrates the need for a systematic and quantitative evaluation to what degree co-expressed genes share over-represented motifs for mammals. RESULTS: We identified 33 experiments for human and mouse in the ArrayExpress Database where transcription factors were manipulated and which exhibited a significant number of differentially expressed genes. We checked for over-representation of transcription factor binding sites in up- or down-regulated genes using the over-representation analysis tool oPOSSUM. In 25 out of 33 experiments, this procedure identified the binding matrices of the affected transcription factors. We also carried out de novo prediction of regulatory motifs shared by differentially expressed genes. Again, the detected motifs shared significant similarity with the matrices of the affected transcription factors. CONCLUSIONS: Our results support the claim that functional regulatory motifs are over-represented in sets of differentially expressed genes and that they can be detected with computational methods

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

MDM2 Promoter SNP344T>A (rs1196333) Status Does Not Affect Cancer Risk

Author: A Sandelin
Anne Dørum
Caroline Seynaeve
Erik Løkkevik
GL Bond
GL Bond
GL Bond
Helga B. Salvesen
J Momand
J Momand
Jan Sommerfelt-Pettersen
JD Oliner
JE Landers
Jone Trovik
Klaus Roemer
KP Economopoulos
Kristian Hveem
Lars Vatten
Liv B. Gansmo
Merete Bjørnslett
MS Sheikh
Per E. Lønning
Peter Devilee
Pål Romundstad
R Chrisanthar
R Chrisanthar
R Trotta
Rob A. E. M. Tollenaar
S Geisler
S Geisler
S Knappskog
S Knappskog
S Knappskog
Stian Knappskog
Z Hu
ZX Xiao
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The MDM2 proto-oncogene plays a key role in central cellular processes like growth control and apoptosis, and the gene locus is frequently amplified in sarcomas. Two polymorphisms located in the MDM2 promoter P2 have been shown to affect cancer risk. One of these polymorphisms (SNP309T>G; rs2279744) facilitates Sp1 transcription factor binding to the promoter and is associated with increased cancer risk. In contrast, SNP285G>C (rs117039649), located 24 bp upstream of rs2279744, and in complete linkage disequilibrium with the SNP309G allele, reduces Sp1 recruitment and lowers cancer risk. Thus, fine tuning of MDM2 expression has proven to be of significant importance with respect to tumorigenesis. We assessed the potential functional effects of a third MDM2 promoter P2 polymorphism (SNP344T>A; rs1196333) located on the SNP309T allele. While in silico analyses indicated SNP344A to modulate TFAP2A, SPIB and AP1 transcription factor binding, we found no effect of SNP344 status on MDM2 expression levels. Assessing the frequency of SNP344A in healthy Caucasians (n = 2,954) and patients suffering from ovarian (n = 1,927), breast (n = 1,271), endometrial (n = 895) or prostatic cancer (n = 641), we detected no significant difference in the distribution of this polymorphism between any of these cancer forms and healthy controls (6.1% in healthy controls, and 4.9%, 5.0%, 5.4% and 7.2% in the cancer groups, respectively). In conclusion, our findings provide no evidence indicating that SNP344A may affect MDM2 transcription or cancer risk

Public Library of Science (PLOS)

University of Bergen

Crossref

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Leiden University Scholary Publications

Erasmus University Digital Repository

NORA - Norwegian Open Research Archives