Search CORE

12 research outputs found

Recommended from our members

Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo

Author: Badis Gwenael
Berger Michael F.
Bonke Martin
Bulyk Martha L.
Enge Martin
Gehrke Andrew R.
Hughes Timothy R.
Jolma Arttu
Kivioja Teemu
Palin Kimmo
Stunnenberg Hendrik G.
Taipale Jussi
Taipale Mikko
Talukder Shaheynoor
Turunen Mikko
Ukkonen Esko
Varjosalo Markku
Wei Gong-Hong
Yan Jian
Publication venue
Publication date: 01/01/2010
Field of study

Peer reviewe

Harvard University - DASH

Julkari

PubMed Central

Helsingin yliopiston digitaalinen arkisto

Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences

Author: Addou
Altschul
Altschul
Andreeva
Andrei L. Turinsky
Andrew R. Gehrke
Badis
Benos
Berger
Berger
Berman
Brohee
Brown
Brown
Brown
Bulyk
Bulyk
Carbon
Donald
Dudley
Edgar
Enright
Enright
Finn
Furukubo-Tokunaga
Garcia-Fernandez
Gascuel
Gwenael Badis
Harbison
Hayashi
Hoffmann
Holland
Hunter
Itzkovitz
Jennifer Tsai
Katoh
Kawaji
Krishnamurthy
Lee
Lees
Li
Li
Luscombe
Mackay
Man
Martha L. Bulyk
Matys
Meila
Michael F. Berger
Miguel A. Santos
Mukherjee
Ochagavia
Peregrin-Alvarez
Ravasi
Remm
Scott
Serene Ong
Shaheynoor Talukder
Shoshana J. Wodak
Sjolander
Storm
Takatori
Thompson
Timothy R. Hughes
Tsai
van Dongen
Vlasblom
Vlieghe
Weston
Wicker
Wilson
Zhong
Zmasek
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/07/2010
Field of study

Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by experts using mainly qualitative descriptions of molecular function. Recently, in vitro DNA-binding preferences to all possible 8-nt DNA sequences have been measured for 178 mouse homeodomains using protein-binding microarrays, offering the unprecedented opportunity of evaluating the classification methods against quantitative measures of molecular function. To this end, we automatically derive homeodomain subtypes from the DNA-binding data and independently group the same domains using sequence information alone. We test five sequence-based methods, which use different sequence-similarity measures and algorithms to group sequences. Results show that methods that optimize the classification robustness reflect well the detailed functional specificity revealed by the experimental data. In some of these classifications, 73–83% of the subfamilies exactly correspond to, or are completely contained in, the function-based subtypes. Our findings demonstrate that certain sequence-based classifications are capable of yielding very specific molecular function annotations. The availability of quantitative descriptions of molecular function, such as DNA-binding data, will be a key factor in exploiting this potential in the future.Canadian Institutes of Health Research (MOP#82940)Sickkids FoundationOntario Research FundNational Science Foundation (U.S.)National Human Genome Research Institute (U.S.) (R01 HG003985

DSpace@MIT

Crossref

Harvard University - DASH

PubMed Central

A Library of Yeast Transcription Factor Motifs Reveals a Widespread Function for Rsc3 in Targeting Nucleosome Exclusion at Promoters

Author: Ansari Aseem Z.
Badis Gwenael
Carlson Clayton D.
Chan Esther T.
Clarke Neil D.
Coburn David
Gebbia Marinella
Gossett Andrea J.
Hasinoff Michael J.
Hughes Timothy R.
Li Yeo Ai
Lieb Jason D.
Mnaimneh Sanie
Nislow Corey
Pena-Castillo Lourdes
Talukder Shaheynoor
Terterov Dimitri
Tillo Desiree
Tsui Kyle
van Bakel Harm
Warren Christopher L.
Yang Ally
Yeo Zhen Xuan
Publication venue
Publication date: 01/01/2008
Field of study

The sequence specificity of DNA-binding proteins is the primary mechanism by which the cell recognizes genomic features. Here, we describe systematic determination of yeast transcription factor DNA-binding specificities. We obtained binding specificities for 112 DNA-binding proteins representing 19 distinct structural classes, one-third of which have not been previously reported. Several newly discovered binding sequences have striking genomic distributions relative to transcription start sites, supporting their biological relevance and suggesting a role in promoter architecture. Among these are Rsc3 binding sequences, containing the core CGCG, which are found preferentially ~100 bp upstream of transcription start sites. Mutation of RSC3 results in a dramatic increase in nucleosome occupancy in hundreds of proximal promoters containing a Rsc3 binding element, but has little impact on promoters lacking Rsc3 binding sequences, indicating that Rsc3 plays a broad role in targeting nucleosome exclusion at yeast promoters

Carolina Digital Repository

Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities

Author: Bonke Martin
Cheng Lu
Enge Martin
Hughes Timothy R.
Jolma Arttu
Kivioja Teemu
Luscombe Nicholas M.
Palin Kimmo
Sillanpää Mikko J.
Taipale Jussi
Taipale Mikko
Talukder Shaheynoor
Toivonen Jarkko
Ukkonen Esko
Vaquerizas Juan M.
Wei Gonghong
Yan Jian
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 08/04/2010
Field of study

The genetic code—the binding specificity of all transfer-RNAs—defines how protein primary structure is determined by DNA sequence. DNA also dictates when and where proteins are expressed, and this information is encoded in a pattern of specific sequence motifs that are recognized by transcription factors. However, the DNA-binding specificity is only known for a small fraction of the ∼1400 human transcription factors (TFs). We describe here a high-throughput method for analyzing transcription factor binding specificity that is based on systematic evolution of ligands by exponential enrichment (SELEX) and massively parallel sequencing. The method is optimized for analysis of large numbers of TFs in parallel through the use of affinity-tagged proteins, barcoded selection oligonucleotides, and multiplexed sequencing. Data are analyzed by a new bioinformatic platform that uses the hundreds of thousands of sequencing reads obtained to control the quality of the experiments and to generate binding motifs for the TFs. The described technology allows higher throughput and identification of much longer binding profiles than current microarray-based methods. In addition, as our method is based on proteins expressed in mammalian cells, it can also be used to characterize DNA-binding preferences of full-length proteins or proteins requiring post-translational modifications. We validate the method by determining binding specificities of 14 different classes of TFs and by confirming the specificities for NFATC1 and RFX3 using ChIP-seq. Our results reveal unexpected dimeric modes of binding for several factors that were thought to preferentially bind DNA as monomers

Crossref

Online Research @ Cardiff

Publications from Karolinska Institutet

PubMed Central

UCL Discovery