584 research outputs found
Enhanced photochemical hydrogen production by a molecular diiron catalyst incorporated into a metal-organic framework.
A molecular proton reduction catalyst [FeFe](dcbdt)(CO)6 (1, dcbdt = 1,4-dicarboxylbenzene-2,3-dithiolate) with structural similarities to [FeFe]-hydrogenase active sites has been incorporated into a highly robust Zr(IV)-based metal-organic framework (MOF) by postsynthetic exchange (PSE). The PSE protocol is crucial as direct solvothermal synthesis fails to produce the functionalized MOF. The molecular integrity of the organometallic site within the MOF is demonstrated by a variety of techniques, including X-ray absorption spectroscopy. In conjunction with [Ru(bpy)3](2+) as a photosensitizer and ascorbate as an electron donor, MOF-[FeFe](dcbdt)(CO)6 catalyzes photochemical hydrogen evolution in water at pH 5. The immobilized catalyst shows substantially improved initial rates and overall hydrogen production when compared to a reference system of complex 1 in solution. Improved catalytic performance is ascribed to structural stabilization of the complex when incorporated in the MOF as well as the protection of reduced catalysts 1(-) and 1(2-) from undesirable charge recombination with oxidized ascorbate
Transcriptional programs: modelling higher order structure in transcriptional control.
BACKGROUND: Transcriptional regulation is an important part of regulatory control in eukaryotes. Even if binding motifs for transcription factors are known, the task of finding binding sites by scanning sequences is plagued by false positives. One way to improve the detection of binding sites from motifs is by taking cooperativity of transcription factor binding into account. We propose a non-parametric probabilistic model, similar to a document topic model, for detecting transcriptional programs, groups of cooperative transcription factors and co-regulated genes. The analysis results in transcriptional programs which generalise both transcriptional modules and TF-target gene incidence matrices and provide a higher-level summary of these structures. The method is independent of prior specification of training sets of genes, for example, via gene expression data. The analysis is based on known binding motifs. RESULTS: We applied our method to putative regulatory regions of 18,445 Mus musculus genes. We discovered just 68 transcriptional programs that effectively summarised the action of 149 transcription factors on these genes. Several of these programs were significantly enriched for known biological processes and signalling pathways. One transcriptional program has a significant overlap with a reference set of cell cycle specific transcription factors. CONCLUSION: Our method is able to pick out higher order structure from noisy sequence analyses. The transcriptional programs it identifies potentially represent common mechanisms of regulatory control across the genome. It simultaneously predicts which genes are co-regulated and which sets of transcription factors cooperate to achieve this co-regulation. The programs we discovered enable biologists to choose new genes and transcription factors to study in specific transcriptional regulatory systems.RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are
Wellington : a novel method for the accurate identification of digital genomic footprints from DNase-seq data
The expression of eukaryotic genes is regulated by cis-regulatory elements such as promoters and enhancers, which bind sequence-specific DNA-binding proteins. One of the great challenges in the gene regulation field is to characterise these elements. This involves the identification of transcription factor (TF) binding sites within regulatory elements that are occupied in a defined regulatory context. Digestion with DNase and the subsequent analysis of regions protected from cleavage (DNase footprinting) has for many years been used to identify specific binding sites occupied by TFs at individual cis-elements with high resolution. This methodology has recently been adapted for high-throughput sequencing (DNase-seq). In this study, we describe an imbalance in the DNA strand-specific alignment information of DNase-seq data surrounding protein–DNA interactions that allows accurate prediction of occupied TF binding sites. Our study introduces a novel algorithm, Wellington, which considers the imbalance in this strand-specific information to efficiently identify DNA footprints. This algorithm significantly enhances specificity by reducing the proportion of false positives and requires significantly fewer predictions than previously reported methods to recapitulate an equal amount of ChIP-seq data. We also provide an open-source software package, pyDNase, which implements the Wellington algorithm to interface with DNase-seq data and expedite analyses
Single-cell transcriptomics : a high-resolution avenue for plant functional genomics
Plant function is the result of the concerted action of single cells in different tissues. Advances in RNA-seq technologies and tissue processing allow us now to capture transcriptional changes at single-cell resolution. The incredible potential of single-cell RNA-seq lies in the novel ability to study and exploit regulatory processes in complex tissues based on the behaviour of single cells. Importantly, the independence from reporter lines allows the analysis of any given tissue in any plant. While there are challenges associated with the handling and analysis of complex datasets, the opportunities are unique to generate knowledge of tissue functions in unprecedented detail and to facilitate the application of such information by mapping cellular functions and interactions in a plant cell atlas. [Abstract copyright: Copyright © 2019 The Authors. Published by Elsevier Ltd.. All rights reserved.
Cellular mRNAs access second ORFs using a novel amino acid sequence-dependent coupled translation termination-reinitiation mechanism
Polycistronic transcripts are considered rare in the human genome. Initiation of translation of internal ORFs of eukaryotic genes has been shown to use either leaky scanning or highly structured IRES regions to access initiation codons. Studies on mammalian viruses identified a mechanism of coupled translation termination-reinitiation that allows translation of an additional ORF. Here, the ribosome terminating translation of ORF-1 translocates upstream to reinitiate translation of ORF-2. We have devised an algorithm to identify mRNAs in the human transcriptome in which the major ORF-1 overlaps a second ORF capable of encoding a product of at least 50 aa in length. This identified 4368 transcripts representing 2214 genes. We investigated 24 transcripts, 22 of which were shown to express a protein from ORF-2 highlighting that 3' UTRs contain protein-coding potential more frequently than previously suspected. Five transcripts accessed ORF-2 using a process of coupled translation termination-reinitiation. Analysis of one transcript, encoding the CASQ2 protein, showed that the mechanism by which the coupling process of the cellular mRNAs was achieved was novel. This process was not directed by the mRNA sequence but required an aspartate-rich repeat region at the carboxyl terminus of the terminating ORF-1 protein. Introduction of wobble mutations for the aspartate codon had no effect, whereas replacing aspartate for glutamate repeats eliminated translational coupling. This is the first description of a coordinated expression of two proteins from cellular mRNAs using a coupled translation termination-reinitiation process and is the first example of such a process being determined at the amino acid level
A comparative study of S/MAR prediction tools
BACKGROUND: S/MARs are regions of the DNA that are attached to the nuclear matrix. These regions are known to affect substantially the expression of genes. The computer prediction of S/MARs is a highly significant task which could contribute to our understanding of chromatin organisation in eukaryotic cells, the number and distribution of boundary elements, and the understanding of gene regulation in eukaryotic cells. However, while a number of S/MAR predictors have been proposed, their accuracy has so far not come under scrutiny. RESULTS: We have selected S/MARs with sufficient experimental evidence and used these to evaluate existing methods of S/MAR prediction. Our main results are: 1.) all existing methods have little predictive power, 2.) a simple rule based on AT-percentage is generally competitive with other methods, 3.) in practice, the different methods will usually identify different sub-sequences as S/MARs, 4.) more research on the H-Rule would be valuable. CONCLUSION: A new insight is needed to design a method which will predict S/MARs well. Our data, including the control data, has been deposited as additional material and this may help later researchers test new predictors
MEME-LaB : motif analysis in clusters
Genome-wide expression analysis can result in large numbers of clusters of co-expressed genes. While there are tools for ab initio discovery of transcription factor binding sites, most do not provide a quick and easy way to study large numbers of clusters. To address this, we introduce a web-tool called MEME-LaB. The tool wraps MEME (an ab initio motif finder), providing an interface for users to input multiple gene clusters, retrieve promoter sequences, run motif finding, and then easily browse and condense the results, facilitating better interpretation of the results from large-scale datasets
- …