57 research outputs found

    In Silico Evaluation of Predicted Regulatory Interactions in Arabidopsis thaliana

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Prediction of transcriptional regulatory mechanisms in <it>Arabidopsis </it>has become increasingly critical with the explosion of genomic data now available for both gene expression and gene sequence composition. We have shown in previous work <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, that a combination of correlation measurements and <it>cis</it>-regulatory element (CRE) detection methods are effective in predicting targets for candidate transcription factors for specific case studies which were validated. However, to date there has been no quantitative assessment as to which correlation measures or CRE detection methods used alone or in combination are most effective in predicting TF→target relationships on a genome-wide scale.</p> <p>Results</p> <p>We tested several widely used methods, based on correlation (Pearson and Spearman Rank correlation) and <it>cis-</it>regulatory element (CRE) detection (≥1 CRE or CRE over-representation), to determine which of these methods individually or in combination is the most effective by various measures for making regulatory predictions. To predict the regulatory targets of a transcription factor (TF) of interest, we applied these methods to microarray expression data for genes that were regulated over treatment and control conditions in wild type (WT) plants. Because the chosen data sets included identical experimental conditions used on TF over-expressor or T-DNA knockout plants, we were able to test the TF→target predictions made using microarray data from WT plants, with microarray data from mutant/transgenic plants. For each method, or combination of methods, we computed sensitivity, specificity, positive and negative predictive value and the F-measure of balance between sensitivity and positive predictive value (precision). This analysis revealed that the ≥1 CRE and Spearman correlation (used alone or in combination) were the most balanced CRE detection and correlation methods, respectively with regard to their power to accurately predict regulatory-target interactions.</p> <p>Conclusion</p> <p>These findings provide an approach and guidance for researchers interested in predicting transcriptional regulatory mechanisms using microarray data that they generate (or microarray data that is publically available) combined with CRE detection in promoter sequence data.</p

    Predictive network modeling of the high-resolution dynamic plant transcriptome in response to nitrate

    Get PDF
    International audienceABSTRACT: BACKGROUND: Nitrate, acting as both a nitrogen source and a signaling molecule, controls many aspects of plant development. However, gene networks involved in plant adaptation to fluctuating nitrate environments have not yet been identified. RESULTS: Here we use time-series transcriptome data to decipher gene relationships and consequently to build core regulatory networks involved in Arabidopsis root adaptation to nitrate provision. The experimental approach has been to monitor genome-wide responses to nitrate at 3, 6, 9, 12, 15 and 20 minutes, using Affymetrix ATH1 gene chips. This high-resolution time course analysis demonstrated that the previously known primary nitrate response is actually preceded by a very fast gene expression modulation, involving genes and functions needed to prepare plants to use or reduce nitrate. A state-space model inferred from this microarray time-series data successfully predicts gene behavior in unlearnt conditions. CONCLUSIONS: The experiments and methods allow us to propose a temporal working model for nitrate-driven gene networks. This network model is tested both in silico and experimentally. For example, the over-expression of a predicted gene hub encoding a transcription factor induced early in the cascade indeed leads to the modification of the kinetic nitrate response of sentinel genes such as NIR, NIA2, and NRT1.1, and several other transcription factors. The potential nitrate /hormone connections implicated by this time-series data is also evaluated

    Modeling the global effect of the basic-leucine zipper transcription factor 1 (bZIP1) on nitrogen and light regulation in Arabidopsis

    Get PDF
    Background: Nitrogen and light are two major regulators of plant metabolism and development. While genes involved in the control of each of these signals have begun to be identified, regulators that integrate gene responses to nitrogen and light signals have yet to be determined. Here, we evaluate the role of bZIP1, a transcription factor involved in light and nitrogen sensing, by exposing wild-type (WT) and bZIP1 T-DNA null mutant plants to a combinatorial space of nitrogen (N) and light (L) treatment conditions and performing transcriptome analysis. We use ANOVA analysis combined with clustering and Boolean modeling, to evaluate the role of bZIP1 in mediating L and N signaling genome-wide. Results: This transcriptome analysis demonstrates that a mutation in the bZIP1 gene can alter the L and/or N-regulation of several gene clusters. More surprisingly, the bZIP1 mutation can also trigger N and/or L regulation of genes that are not normally controlled by these signals in WT plants. This analysis also reveals that bZIP1 can, to a large extent, invert gene regulation (e. g., several genes induced by N in WT plants are repressed by N in the bZIP1 mutant). Conclusion: These findings demonstrate that the bZIP1 mutation triggers a genome-wide de-regulation in response to L and/or N signals that range from i) a reduction of the L signal effect, to ii) unlocking gene regulation in response to L and N combinations. This systems biology approach demonstrates that bZIP1 tunes L and N signaling relationships genome-wide, and can suppress regulatory mechanisms hypothesized to be needed at different developmental stages and/or environmental conditions

    Genome-wide investigation of light and carbon signaling interactions in Arabidopsis

    Get PDF
    BACKGROUND: Light and carbon are two essential signals influencing plant growth and development. Little is known about how carbon and light signaling pathways intersect or influence one another to affect gene expression. RESULTS: Microarrays are used to investigate carbon and light signaling interactions at a genome-wide level in Arabidopsis thaliana. A classification system, 'InterAct Class', is used to classify genes on the basis of their expression profiles. InterAct classes and the genes within them are placed into theoretical models describing interactions between carbon and light signaling. Within InterAct classes there are genes regulated by carbon (201 genes), light (77 genes) or through carbon and light interactions (1,247 genes). We determined whether genes involved in specific biological processes are over-represented in the population of genes regulated by carbon and/or light signaling. Of 29 primary functional categories identified by the Munich Information Center for Protein Sequences, five show over-representation of genes regulated by carbon and/or light. Metabolism has the highest representation of genes regulated by carbon and light interactions and includes the secondary functional categories of carbon-containing-compound/carbohydrate metabolism, amino-acid metabolism, lipid metabolism, fatty-acid metabolism and isoprenoid metabolism. Genes that share a similar InterAct class expression profile and are involved in the same biological process are used to identify putative cis elements possibly involved in responses to both carbon and light signals. CONCLUSIONS: The work presented here represents a method to organize and classify microarray datasets, enabling one to investigate signaling interactions and to identify putative cis elements in silico through the analysis of genes that share a similar expression profile and biological function

    ESTimating plant phylogeny: lessons from partitioning

    Get PDF
    BACKGROUND: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. RESULTS: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. CONCLUSION: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products

    Transient genome-wide interactions of the master transcription factor NLP7 initiate a rapid nitrogen-response cascade

    Get PDF
    Dynamic reprogramming of gene regulatory networks (GRNs) enables organisms to rapidly respond to environmental perturbation. However, the underlying transient interactions between transcription factors (TFs) and genome-wide targets typically elude biochemical detection. Here, we capture both stable and transient TF-target interactions genome-wide within minutes after controlled TF nuclear import using time-series chromatin immunoprecipitation (ChIP-seq) and/or DNA adenine methyltransferase identification (DamID-seq). The transient TF-target interactions captured uncover the early mode-of-action of NIN-LIKE PROTEIN 7 (NLP7), a master regulator of the nitrogen signaling pathway in plants. These transient NLP7 targets captured in root cells using temporal TF perturbation account for 50% of NLP7-regulated genes not detectably bound by NLP7 in planta. Rapid and transient NLP7 binding activates early nitrogen response TFs, which we validate to amplify the NLP7-initiated transcriptional cascade. Our approaches to capture transient TF-target interactions genome-wide can be applied to validate dynamic GRN models for any pathway or organism of interest. Conventional methods cannot reveal transient transcription factors (TFs) and targets interactions. Here, Alvarez et al. capture both stable and transient TF-target interactions by time-series ChIP-seq and/or DamID-seq in a cell-based TF perturbation system and show NLP7 as a master TF to initiate a rapid nitrogen-response cascade

    EST analysis in Ginkgo biloba: an assessment of conserved developmental regulators and gymnosperm specific genes

    Get PDF
    BACKGROUND: Ginkgo biloba L. is the only surviving member of one of the oldest living seed plant groups with medicinal, spiritual and horticultural importance worldwide. As an evolutionary relic, it displays many characters found in the early, extinct seed plants and extant cycads. To establish a molecular base to understand the evolution of seeds and pollen, we created a cDNA library and EST dataset from the reproductive structures of male (microsporangiate), female (megasporangiate), and vegetative organs (leaves) of Ginkgo biloba. RESULTS: RNA from newly emerged male and female reproductive organs and immature leaves was used to create three distinct cDNA libraries from which 6,434 ESTs were generated. These 6,434 ESTs from Ginkgo biloba were clustered into 3,830 unigenes. A comparison of our Ginkgo unigene set against the fully annotated genomes of rice and Arabidopsis, and all available ESTs in Genbank revealed that 256 Ginkgo unigenes match only genes among the gymnosperms and non-seed plants – many with multiple matches to genes in non-angiosperm plants. Conversely, another group of unigenes in Gingko had highly significant homology to transcription factors in angiosperms involved in development, including MADS box genes as well as post-transcriptional regulators. Several of the conserved developmental genes found in Ginkgo had top BLAST homology to cycad genes. We also note here the presence of ESTs in G. biloba similar to genes that to date have only been found in gymnosperms and an additional 22 Ginkgo genes common only to genes from cycads. CONCLUSION: Our analysis of an EST dataset from G. biloba revealed genes potentially unique to gymnosperms. Many of these genes showed homology to fully sequenced clones from our cycad EST dataset found in common only with gymnosperms. Other Ginkgo ESTs are similar to developmental regulators in higher plants. This work sets the stage for future studies on Ginkgo to better understand seed and pollen evolution, and to resolve the ambiguous phylogenetic relationship of G. biloba among the gymnosperms
    corecore