48 research outputs found

    Permutation-validated principal components analysis of microarray data

    Get PDF
    BACKGROUND: In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been published concerning the assessment of the reliability of gene-selection procedures. Here we describe a method to assess reliability in multivariate microarray data analysis using permutation-validated principal components analysis (PCA). The approach is designed for microarray data with a group structure. RESULTS: We used PCA to detect the major sources of variance underlying the hybridization conditions followed by gene selection based on PCA-derived and permutation-based test statistics. We validated our method by applying it to well characterized yeast cell-cycle data and to two datasets from our laboratory. We could describe the major sources of variance, select informative genes and visualize the relationship of genes and arrays. We observed differences in the level of the explained variance and the interpretability of the selected genes. CONCLUSIONS: Combining data visualization and permutation-based gene selection, permutation-validated PCA enables one to illustrate gene-expression variance between several conditions and to select genes by taking into account the relationship of between-group to within-group variance of genes. The method can be used to extract the leading sources of variance from microarray data, to visualize relationships between genes and hybridizations and to select informative genes in a statistically reliable manner. This selection accounts for the level of reproducibility of replicates or group structure as well as gene-specific scatter. Visualization of the data can support a straightforward biological interpretation

    Deducing corticotropin-releasing hormone receptor type 1 signaling networks from gene expression data by usage of genetic algorithms and graphical Gaussian models

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Dysregulation of the hypothalamic-pituitary-adrenal (HPA) axis is a hallmark of complex and multifactorial psychiatric diseases such as anxiety and mood disorders. About 50-60% of patients with major depression show HPA axis dysfunction, i.e. hyperactivity and impaired negative feedback regulation. The neuropeptide corticotropin-releasing hormone (CRH) and its receptor type 1 (CRHR1) are key regulators of this neuroendocrine stress axis. Therefore, we analyzed CRH/CRHR1-dependent gene expression data obtained from the pituitary corticotrope cell line AtT-20, a well-established <it>in vitro </it>model for CRHR1-mediated signal transduction. To extract significantly regulated genes from a genome-wide microarray data set and to deduce underlying CRHR1-dependent signaling networks, we combined supervised and unsupervised algorithms.</p> <p>Results</p> <p>We present an efficient variable selection strategy by consecutively applying univariate as well as multivariate methods followed by graphical models. First, feature preselection was used to exclude genes not differentially regulated over time from the dataset. For multivariate variable selection a maximum likelihood (MLHD) discriminant function within GALGO, an R package based on a genetic algorithm (GA), was chosen. The topmost genes representing major nodes in the expression network were ranked to find highly separating candidate genes. By using groups of five genes (chromosome size) in the discriminant function and repeating the genetic algorithm separately four times we found eleven genes occurring at least in three of the top ranked result lists of the four repetitions. In addition, we compared the results of GA/MLHD with the alternative optimization algorithms greedy selection and simulated annealing as well as with the state-of-the-art method random forest. In every case we obtained a clear overlap of the selected genes independently confirming the results of MLHD in combination with a genetic algorithm.</p> <p>With two unsupervised algorithms, principal component analysis and graphical Gaussian models, putative interactions of the candidate genes were determined and reconstructed by literature mining. Differential regulation of six candidate genes was validated by qRT-PCR.</p> <p>Conclusions</p> <p>The combination of supervised and unsupervised algorithms in this study allowed extracting a small subset of meaningful candidate genes from the genome-wide expression data set. Thereby, variable selection using different optimization algorithms based on linear classifiers as well as the nonlinear random forest method resulted in congruent candidate genes. The calculated interacting network connecting these new target genes was bioinformatically mapped to known CRHR1-dependent signaling pathways. Additionally, the differential expression of the identified target genes was confirmed experimentally.</p

    Microarray Analysis in the Archaeon Halobacterium salinarum Strain R1

    Get PDF
    Background: Phototrophy of the extremely halophilic archaeon Halobacterium salinarum was explored for decades. The research was mainly focused on the expression of bacteriorhodopsin and its functional properties. In contrast, less is known about genome wide transcriptional changes and their impact on the physiological adaptation to phototrophy. The tool of choice to record transcriptional profiles is the DNA microarray technique. However, the technique is still rarely used for transcriptome analysis in archaea. Methodology/Principal Findings: We developed a whole-genome DNA microarray based on our sequence data of the Hbt. salinarum strain R1 genome. The potential of our tool is exemplified by the comparison of cells growing under aerobic and phototrophic conditions, respectively. We processed the raw fluorescence data by several stringent filtering steps and a subsequent MAANOVA analysis. The study revealed a lot of transcriptional differences between the two cell states. We found that the transcriptional changes were relatively weak, though significant. Finally, the DNA microarray data were independently verified by a real-time PCR analysis. Conclusion/Significance: This is the first DNA microarray analysis of Hbt. salinarum cells that were actually grown under phototrophic conditions. By comparing the transcriptomics data with current knowledge we could show that our DNA microarray tool is well applicable for transcriptome analysis in the extremely halophilic archaeon Hbt. salinarum. The reliability of our tool is based on both the high-quality array of DNA probes and the stringent data handling including MAANOVA analysis. Among the regulated genes more than 50% had unknown functions. This underlines the fact that haloarchaeal phototrophy is still far away from being completely understood. Hence, the data recorded in this study will be subject to future systems biology analysis

    A composite transcriptional signature differentiates responses towards closely related herbicides in Arabidopsis thaliana and Brassica napus

    Get PDF
    In this study, genome-wide expression profiling based on Affymetrix ATH1 arrays was used to identify discriminating responses of Arabidopsis thaliana to five herbicides, which contain active ingredients targeting two different branches of amino acid biosynthesis. One herbicide contained glyphosate, which targets 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS), while the other four herbicides contain different acetolactate synthase (ALS) inhibiting compounds. In contrast to the herbicide containing glyphosate, which affected only a few transcripts, many effects of the ALS inhibiting herbicides were revealed based on transcriptional changes related to ribosome biogenesis and translation, secondary metabolism, cell wall modification and growth. The expression pattern of a set of 101 genes provided a specific, composite signature that was distinct from other major stress responses and differentiated among herbicides targeting the same enzyme (ALS) or containing the same chemical class of active ingredient (sulfonylurea). A set of homologous genes could be identified in Brassica napus that exhibited a similar expression pattern and correctly distinguished exposure to the five herbicides. Our results show the ability of a limited number of genes to classify and differentiate responses to closely related herbicides in A. thaliana and B. napus and the transferability of a complex transcriptional signature across species

    Dynamics of bacterial communities during the ripening process of different Croatian cheese types derived from raw ewe's milk cheeses.

    Get PDF
    Microbial communities play an important role in cheese ripening and determine the flavor and taste of different cheese types to a large extent. However, under adverse conditions human pathogens may colonize cheese samples during ripening and may thus cause severe outbreaks of diarrhoea and other diseases. Therefore in the present study we investigated the bacterial community structure of three raw ewe's milk cheese types, which are produced without the application of starter cultures during ripening from two production sites based on fingerprinting in combination with next generation sequencing of 16S rRNA gene amplicons. Overall a surprisingly high diversity was found in the analyzed samples and overall up to 213 OTU97 could be assigned. 20 of the major OTUs were present in all samples and include mostly lactic acid bacteria (LAB), mainly Lactococcus, and Enterococcus species. Abundance and diversity of these genera differed to a large extent between the 3 investigated cheese types and in response to the ripening process. Also a large number of non LAB genera could be identified based on phylogenetic alignments including mainly Enterobacteriaceae and Staphylococcacae. Some species belonging to these two families could be clearly assigned to species which are known as potential human pathogens like Staphylococcus saprophyticus or Salmonella spp. However, during cheese ripening their abundance was reduced. The bacterial genera, namely Lactobacillus, Streptococcus, Leuconostoc, Bifidobacterium, Brevibacterium, Corynebacterium, Clostridium, Staphylococcus, Thermoanerobacterium, E. coli, Hafnia, Pseudomonas, Janthinobacterium, Petrotoga, Kosmotoga, Megasphaera, Macrococcus, Mannheimia, Aerococcus, Vagococcus, Weissella and Pediococcus were identified at a relative low level and only in selected samples. Overall the microbial composition of the used milk and the management of the production units determined the bacterial community composition for all cheese types to a large extend, also at the late time points of cheese ripening

    Comparison of Barley Succession and Take-All Disease as Environmental Factors Shaping the Rhizobacterial Community during Take-All Decline▿

    No full text
    The root disease take-all, caused by Gaeumannomyces graminis var. tritici, can be managed by monoculture-induced take-all decline (TAD). This natural biocontrol mechanism typically occurs after a take-all outbreak and is believed to arise from an enrichment of antagonistic populations in the rhizosphere. However, it is not known whether these changes are induced by the monoculture or by ecological rhizosphere conditions due to a disease outbreak and subsequent attenuation. This question was addressed by comparing the rhizosphere microflora of barley, either inoculated with the pathogen or noninoculated, in a microcosm experiment in five consecutive vegetation cycles. TAD occurred in soil inoculated with the pathogen but not in noninoculated soil. Bacterial community analysis using terminal restriction fragment length polymorphism of 16S rRNA showed pronounced population shifts in the successive vegetation cycles, but pathogen inoculation had little effect. To elucidate rhizobacterial dynamics during TAD development, a 16S rRNA-based taxonomic microarray was used. Actinobacteria were the prevailing indicators in the first vegetation cycle, whereas the third cycle—affected most severely by take-all—was characterized by Proteobacteria, Bacteroidetes, Chloroflexi, Planctomycetes, and Acidobacteria. Indicator taxa for the last cycle (TAD) belonged exclusively to Proteobacteria, including several genera with known biocontrol traits. Our results suggest that TAD involves monoculture-induced enrichment of plant-beneficial taxa
    corecore