87 research outputs found

    Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Microarray technologies produced large amount of data. In a previous study, we have shown the interest of <it>k-Nearest Neighbour </it>approach for restoring the missing gene expression values, and its positive impact of the gene clustering by hierarchical algorithm. Since, numerous replacement methods have been proposed to impute missing values (MVs) for microarray data. In this study, we have evaluated twelve different usable methods, and their influence on the quality of gene clustering. Interestingly we have used several datasets, both kinetic and non kinetic experiments from yeast and human.</p> <p>Results</p> <p>We underline the excellent efficiency of approaches proposed and implemented by Bo and co-workers and especially one based on expected maximization (<it>EM_array</it>). These improvements have been observed also on the imputation of extreme values, the most difficult predictable values. We showed that the imputed MVs have still important effects on the stability of the gene clusters. The improvement on the clustering obtained by hierarchical clustering remains limited and, not sufficient to restore completely the correct gene associations. However, a common tendency can be found between the quality of the imputation method and the gene cluster stability. Even if the comparison between clustering algorithms is a complex task, we observed that <it>k-means </it>approach is more efficient to conserve gene associations.</p> <p>Conclusions</p> <p>More than 6.000.000 independent simulations have assessed the quality of 12 imputation methods on five very different biological datasets. Important improvements have so been done since our last study. The <it>EM_array </it>approach constitutes one efficient method for restoring the missing expression gene values, with a lower estimation error level. Nonetheless, the presence of MVs even at a low rate is a major factor of gene cluster instability. Our study highlights the need for a systematic assessment of imputation methods and so of dedicated benchmarks. A noticeable point is the specific influence of some biological dataset.</p

    MitoGenesisDB: an expression data mining tool to explore spatio-temporal dynamics of mitochondrial biogenesis

    Get PDF
    Mitochondria constitute complex and flexible cellular entities, which play crucial roles in normal and pathological cell conditions. The database MitoGenesisDB focuses on the dynamic of mitochondrial protein formation through global mRNA analyses. Three main parameters confer a global view of mitochondrial biogenesis: (i) time-course of mRNA production in highly synchronized yeast cell cultures, (ii) microarray analyses of mRNA localization that define translation sites and (iii) mRNA transcription rate and stability which characterize genes that are more dependent on post-transcriptional regulation processes. MitoGenesisDB integrates and establishes cross-comparisons between these data. Several model organisms can be analyzed via orthologous relationships between interspecies genes. More generally this database supports the ‘post-transcriptional operon’ model, which postulates that eukaryotes co-regulate related mRNAs based on their functional organization in ribonucleoprotein complexes. MitoGenesisDB allows identifying such groups of post-trancriptionally regulated genes and is thus a useful tool to analyze the complex relationships between transcriptional and post-transcriptional regulation processes. The case of respiratory chain assembly factors illustrates this point. The MitoGenesisDB interface is available at http://www.dsimb.inserm.fr/dsimb_tools/mitgene/

    Evidential Clustering: A Review

    Get PDF
    International audienceIn evidential clustering, uncertainty about the assignment of objects to clusters is represented by Dempster-Shafer mass functions. The resulting clustering structure, called a credal partition, is shown to be more general than hard, fuzzy, possibilistic and rough partitions, which are recovered as special cases. Three algorithms to generate a credal partition are reviewed. Each of these algorithms is shown to implement a decision-directed clustering strategy. Their relative merits are discussed

    A Combined Approach of High-Throughput Sequencing and Degradome Analysis Reveals Tissue Specific Expression of MicroRNAs and Their Targets in Cucumber

    Get PDF
    MicroRNAs (miRNAs) are endogenous small RNAs playing an important regulatory function in plant development and stress responses. Among them, some are evolutionally conserved in plant and others are only expressed in certain species, tissue or developmental stages. Cucumber is among the most important greenhouse species in the world, but only a limited number of miRNAs from cucumber have been identified and the experimental validation of the related miRNA targets is still lacking. In this study, two independent small RNA libraries from cucumber leaves and roots were constructed, respectively, and sequenced with the high-throughput Illumina Solexa system. Based on sequence similarity and hairpin structure prediction, a total of 29 known miRNA families and 2 novel miRNA families containing a total of 64 miRNA were identified. QRT-PCR analysis revealed that some of the cucumber miRNAs were preferentially expressed in certain tissues. With the recently developed ‘high throughput degradome sequencing’ approach, 21 target mRNAs of known miRNAs were identified for the first time in cucumber. These targets were associated with development, reactive oxygen species scavenging, signaling transduction and transcriptional regulation. Our study provides an overview of miRNA expression profile and interaction between miRNA and target, which will help further understanding of the important roles of miRNAs in cucumber plants

    Repression of Mitochondrial Translation, Respiration and a Metabolic Cycle-Regulated Gene, SLF1, by the Yeast Pumilio-Family Protein Puf3p

    Get PDF
    Synthesis and assembly of the mitochondrial oxidative phosphorylation (OXPHOS) system requires genes located both in the nuclear and mitochondrial genomes, but how gene expression is coordinated between these two compartments is not fully understood. One level of control is through regulated expression mitochondrial ribosomal proteins and other factors required for mitochondrial translation and OXPHOS assembly, which are all products of nuclear genes that are subsequently imported into mitochondria. Interestingly, this cadre of genes in budding yeast has in common a 3′-UTR element that is bound by the Pumilio family protein, Puf3p, and is coordinately regulated under many conditions, including during the yeast metabolic cycle. Multiple functions have been assigned to Puf3p, including promoting mRNA degradation, localizing nucleus-encoded mitochondrial transcripts to the outer mitochondrial membrane, and facilitating mitochondria-cytoskeletal interactions and motility. Here we show that Puf3p has a general repressive effect on mitochondrial OXPHOS abundance, translation, and respiration that does not involve changes in overall mitochondrial biogenesis and largely independent of TORC1-mitochondrial signaling. We also identified the cytoplasmic translation factor Slf1p as yeast metabolic cycle-regulated gene that is repressed by Puf3p at the post-transcriptional level and promotes respiration and extension of yeast chronological life span when over-expressed. Altogether, these results should facilitate future studies on which of the many functions of Puf3p is most relevant for regulating mitochondrial gene expression and the role of nuclear-mitochondrial communication in aging and longevity

    Divergence of the Yeast Transcription Factor FZF1 Affects Sulfite Resistance

    Get PDF
    Changes in gene expression are commonly observed during evolution. However, the phenotypic consequences of expression divergence are frequently unknown and difficult to measure. Transcriptional regulators provide a mechanism by which phenotypic divergence can occur through multiple, coordinated changes in gene expression during development or in response to environmental changes. Yet, some changes in transcriptional regulators may be constrained by their pleiotropic effects on gene expression. Here, we use a genome-wide screen for promoters that are likely to have diverged in function and identify a yeast transcription factor, FZF1, that has evolved substantial differences in its ability to confer resistance to sulfites. Chimeric alleles from four Saccharomyces species show that divergence in FZF1 activity is due to changes in both its coding and upstream noncoding sequence. Between the two closest species, noncoding changes affect the expression of FZF1, whereas coding changes affect the expression of SSU1, a sulfite efflux pump activated by FZF1. Both coding and noncoding changes also affect the expression of many other genes. Our results show how divergence in the coding and promoter region of a transcription factor alters the response to an environmental stress

    Comparative transcriptomics of drought responses in Populus: a meta-analysis of genome-wide expression profiling in mature leaves and root apices across two genotypes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Comparative genomics has emerged as a promising means of unravelling the molecular networks underlying complex traits such as drought tolerance. Here we assess the genotype-dependent component of the drought-induced transcriptome response in two poplar genotypes differing in drought tolerance. Drought-induced responses were analysed in leaves and root apices and were compared with available transcriptome data from other <it>Populus </it>species.</p> <p>Results</p> <p>Using a multi-species designed microarray, a genomic DNA-based selection of probesets provided an unambiguous between-genotype comparison. Analyses of functional group enrichment enabled the extraction of processes physiologically relevant to drought response. The drought-driven changes in gene expression occurring in root apices were consistent across treatments and genotypes. For mature leaves, the transcriptome response varied weakly but in accordance with the duration of water deficit. A differential clustering algorithm revealed similar and divergent gene co-expression patterns among the two genotypes. Since moderate stress levels induced similar physiological responses in both genotypes, the genotype-dependent transcriptional responses could be considered as intrinsic divergences in genome functioning. Our meta-analysis detected several candidate genes and processes that are differentially regulated in root and leaf, potentially under developmental control, and preferentially involved in early and long-term responses to drought.</p> <p>Conclusions</p> <p>In poplar, the well-known drought-induced activation of sensing and signalling cascades was specific to the early response in leaves but was found to be general in root apices. Comparing our results to what is known in arabidopsis, we found that transcriptional remodelling included signalling and a response to energy deficit in roots in parallel with transcriptional indices of hampered assimilation in leaves, particularly in the drought-sensitive poplar genotype.</p

    Identification of drought-responsive microRNAs in Medicago truncatula by genome-wide high-throughput sequencing

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>MicroRNAs (miRNAs) are small, endogenous RNAs that play important regulatory roles in development and stress response in plants by negatively affecting gene expression post-transcriptionally. Identification of miRNAs at the global genome-level by high-throughout sequencing is essential to functionally characterize miRNAs in plants. Drought is one of the common environmental stresses limiting plant growth and development. To understand the role of miRNAs in response of plants to drought stress, drought-responsive miRNAs were identified by high-throughput sequencing in a legume model plant, <it>Medicago truncatula</it>.</p> <p>Results</p> <p>Two hundreds eighty three and 293 known miRNAs were identified from the control and drought stress libraries, respectively. In addition, 238 potential candidate miRNAs were identified, and among them 14 new miRNAs and 15 new members of known miRNA families whose complementary miRNA*s were also detected. Both high-throughput sequencing and RT-qPCR confirmed that 22 members of 4 miRNA families were up-regulated and 10 members of 6 miRNA families were down-regulated in response to drought stress. Among the 29 new miRNAs/new members of known miRNA families, 8 miRNAs were responsive to drought stress with both 4 miRNAs being up- and down-regulated, respectively. The known and predicted targets of the drought-responsive miRNAs were found to be involved in diverse cellular processes in plants, including development, transcription, protein degradation, detoxification, nutrient status and cross adaptation.</p> <p>Conclusions</p> <p>We identified 32 known members of 10 miRNA families and 8 new miRNAs/new members of known miRNA families that were responsive to drought stress by high-throughput sequencing of small RNAs from <it>M. truncatula</it>. These findings are of importance for our understanding of the roles played by miRNAs in response of plants to abiotic stress in general and drought stress in particular.</p

    The Yin and Yang of Yeast Transcription: Elements of a Global Feedback System between Metabolism and Chromatin

    Get PDF
    When grown in continuous culture, budding yeast cells tend to synchronize their respiratory activity to form a stable oscillation that percolates throughout cellular physiology and involves the majority of the protein-coding transcriptome. Oscillations in batch culture and at single cell level support the idea that these dynamics constitute a general growth principle. The precise molecular mechanisms and biological functions of the oscillation remain elusive. Fourier analysis of transcriptome time series datasets from two different oscillation periods (0.7 h and 5 h) reveals seven distinct co-expression clusters common to both systems (34% of all yeast ORF), which consolidate into two superclusters when correlated with a compilation of 1,327 unrelated transcriptome datasets. These superclusters encode for cell growth and anabolism during the phase of high, and mitochondrial growth, catabolism and stress response during the phase of low oxygen uptake. The promoters of each cluster are characterized by different nucleotide contents, promoter nucleosome configurations, and dependence on ATP-dependent nucleosome remodeling complexes. We show that the ATP:ADP ratio oscillates, compatible with alternating metabolic activity of the two superclusters and differential feedback on their transcription via activating (RSC) and repressive (Isw2) types of promoter structure remodeling. We propose a novel feedback mechanism, where the energetic state of the cell, reflected in the ATP:ADP ratio, gates the transcription of large, but functionally coherent groups of genes via differential effects of ATP-dependent nucleosome remodeling machineries. Besides providing a mechanistic hypothesis for the delayed negative feedback that results in the oscillatory phenotype, this mechanism may underpin the continuous adaptation of growth to environmental conditions
    corecore