294 research outputs found

    Harnessing gene expression to identify the genetic basis of drug resistance

    Get PDF
    The advent of cost-effective genotyping and sequencing methods have recently made it possible to ask questions that address the genetic basis of phenotypic diversity and how natural variants interact with the environment. We developed Camelot (CAusal Modelling with Expression Linkage for cOmplex Traits), a statistical method that integrates genotype, gene expression and phenotype data to automatically build models that both predict complex quantitative phenotypes and identify genes that actively influence these traits. Camelot integrates genotype and gene expression data, both generated under a reference condition, to predict the response to entirely different conditions. We systematically applied our algorithm to data generated from a collection of yeast segregants, using genotype and gene expression data generated under drug-free conditions to predict the response to 94 drugs and experimentally confirmed 14 novel gene–drug interactions. Our approach is robust, applicable to other phenotypes and species, and has potential for applications in personalized medicine, for example, in predicting how an individual will respond to a previously unseen drug

    Simple integrative preprocessing preserves what is shared in data sources

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Bioinformatics data analysis toolbox needs general-purpose, fast and easily interpretable preprocessing tools that perform data integration during exploratory data analysis. Our focus is on vector-valued data sources, each consisting of measurements of the same entity but on different variables, and on tasks where source-specific variation is considered noisy or not interesting. Principal components analysis of all sources combined together is an obvious choice if it is not important to distinguish between data source-specific and shared variation. Canonical Correlation Analysis (CCA) focuses on mutual dependencies and discards source-specific "noise" but it produces a separate set of components for each source.</p> <p>Results</p> <p>It turns out that components given by CCA can be combined easily to produce a linear and hence fast and easily interpretable feature extraction method. The method fuses together several sources, such that the properties they share are preserved. Source-specific variation is discarded as uninteresting. We give the details and implement them in a software tool. The method is demonstrated on gene expression measurements in three case studies: classification of cell cycle regulated genes in yeast, identification of differentially expressed genes in leukemia, and defining stress response in yeast. The software package is available at <url>http://www.cis.hut.fi/projects/mi/software/drCCA/</url>.</p> <p>Conclusion</p> <p>We introduced a method for the task of data fusion for exploratory data analysis, when statistical dependencies between the sources and not within a source are interesting. The method uses canonical correlation analysis in a new way for dimensionality reduction, and inherits its good properties of being simple, fast, and easily interpretable as a linear projection.</p

    Serum microRNA array analysis identifies miR-140-3p, miR-33b-3p and miR-671-3p as potential osteoarthritis biomarkers involved in metabolic processes.

    Get PDF
    Background: MicroRNAs (miRNAs) in circulation have emerged as promising biomarkers. In this study, we aimed to identify a circulating miRNA signature for osteoarthritis (OA) patients and in combination with bioinformatics analysis to evaluate the utility of selected differentially expressed miRNAs in the serum as potential OA biomarkers. Methods: Serum samples were collected from 12 primary OA patients, and 12 healthy individuals were screened using the Agilent Human miRNA Microarray platform interrogating 2549 miRNAs. Receiver Operating Characteristic (ROC) curves were constructed to evaluate the diagnostic performance of the deregulated miRNAs. Expression levels of selected miRNAs were validated by quantitative real-time PCR (qRT-PCR) in all serum and in articular cartilage samples from OA patients (n = 12) and healthy individuals (n = 7). Bioinformatics analysis was used to investigate the involved pathways and target genes for the above miRNAs. Results: We identified 279 differentially expressed miRNAs in the serum of OA patients compared to controls. Two hundred and five miRNAs (73.5%) were upregulated and 74 (26.5%) downregulated. ROC analysis revealed that 77 miRNAs had area under the curve (AUC) > 0.8 and p < 0.05. Bioinformatics analysis in the 77 miRNAs revealed that their target genes were involved in multiple signaling pathways associated with OA, among which FoxO, mTOR, Wnt, pI3K/akt, TGF-β signaling pathways, ECM-receptor interaction, and fatty acid biosynthesis. qRT-PCR validation in seven selected out of the 77 miRNAs revealed 3 significantly downregulated miRNAs (hsa-miR-33b-3p, hsa-miR-671-3p, and hsa-miR-140-3p) in the serum of OA patients, which were in silico predicted to be enriched in pathways involved in metabolic processes. Target-gene analysis of hsa-miR-140-3p, hsa-miR-33b-3p, and hsa-miR-671-3p revealed that InsR and IGFR1 were common targets of all three miRNAs, highlighting their involvement in regulation of metabolic processes that contribute to OA pathology. Hsa-miR-140-3p and hsa-miR-671-3p expression levels were consistently downregulated in articular cartilage of OA patients compared to healthy individuals. Conclusions: A serum miRNA signature was established for the first time using high density resolution miR-arrays in OA patients. We identified a three-miRNA signature, hsa-miR-140-3p, hsa-miR-671-3p, and hsa-miR-33b-3p, in the serum of OA patients, predicted to regulate metabolic processes, which could serve as a potential biomarker for the evaluation of OA risk and progression.Peer reviewedFinal Published versio

    Evaluation of regression models in metabolic physiology: predicting fluxes from isotopic data without knowledge of the pathway

    Get PDF
    This study explores the ability of regression models, with no knowledge of the underlying physiology, to estimate physiological parameters relevant for metabolism and endocrinology. Four regression models were compared: multiple linear regression (MLR), principal component regression (PCR), partial least-squares regression (PLS) and regression using artificial neural networks (ANN). The pathway of mammalian gluconeogenesis was analyzed using [U−(13)C]glucose as tracer. A set of data was simulated by randomly selecting physiologically appropriate metabolic fluxes for the 9 steps of this pathway as independent variables. The isotope labeling patterns of key intermediates in the pathway were then calculated for each set of fluxes, yielding 29 dependent variables. Two thousand sets were created, allowing independent training and test data. Regression models were asked to predict the nine fluxes, given only the 29 isotopomers. For large training sets (>50) the artificial neural network model was superior, capturing 95% of the variability in the gluconeogenic flux, whereas the three linear models captured only 75%. This reflects the ability of neural networks to capture the inherent non-linearities of the metabolic system. The effect of error in the variables and the addition of random variables to the data set was considered. Model sensitivities were used to find the isotopomers that most influenced the predicted flux values. These studies provide the first test of multivariate regression models for the analysis of isotopomer flux data. They provide insight for metabolomics and the future of isotopic tracers in metabolic research where the underlying physiology is complex or unknown

    Hsp90 orchestrates transcriptional regulation by Hsf1 and cell wall remodelling by MAPK signalling during thermal adaptation in a pathogenic yeast

    Get PDF
    Acknowledgments We thank Rebecca Shapiro for creating CaLC1819, CaLC1855 and CaLC1875, Gillian Milne for help with EM, Aaron Mitchell for generously providing the transposon insertion mutant library, Jesus Pla for generously providing the hog1 hst7 mutant, and Cathy Collins for technical assistance.Peer reviewedPublisher PD

    Experimental Demonstration of the Fitness Consequences of an Introduced Parasite of Darwin's Finches

    Get PDF
    Introduced parasites are a particular threat to small populations of hosts living on islands because extinction can occur before hosts have a chance to evolve effective defenses. An experimental approach in which parasite abundance is manipulated in the field can be the most informative means of assessing a parasite's impact on the host. The parasitic fly Philornis downsi, recently introduced to the Galápagos Islands, feeds on nestling Darwin's finches and other land birds. Several correlational studies, and one experimental study of mixed species over several years, reported that the flies reduce host fitness. Here we report the results of a larger scale experimental study of a single species at a single site over a single breeding season.We manipulated the abundance of flies in the nests of medium ground finches (Geospiza fortis) and quantified the impact of the parasites on nestling growth and fledging success. We used nylon nest liners to reduce the number of parasites in 24 nests, leaving another 24 nests as controls. A significant reduction in mean parasite abundance led to a significant increase in the number of nests that successfully fledged young. Nestlings in parasite-reduced nests also tended to be larger prior to fledging.Our results confirm that P. downsi has significant negative effects on the fitness of medium ground finches, and they may pose a serious threat to other species of Darwin's finches. These data can help in the design of management plans for controlling P. downsi in Darwin's finch breeding populations

    Ptc6 is required for proper rapamycin-induced down-regulation of the genes coding for ribosomal and rRNA processing proteins in S. cerevisiae

    Get PDF
    Ptc6 is one of the seven components (Ptc1-Ptc7) of the protein phosphatase 2C family in the yeast Saccharomyces cerevisiae. In contrast to other type 2C phosphatases, the cellular role of this isoform is poorly understood. We present here a comprehensive characterization of this gene product. Cells lacking Ptc6 are sensitive to zinc ions, and somewhat tolerant to cell-wall damaging agents and to Li+. Ptc6 mutants are sensitive to rapamycin, albeit to lesser extent than ptc1 cells. This phenotype is not rescued by overexpression of PTC1 and mutation of ptc6 does not reproduce the characteristic geneti

    Thermosensitivity of the Saccharomyces cerevisiae gpp1gpp2 double deletion strain can be reduced by overexpression of genes involved in cell wall maintenance

    Get PDF
    A Saccharomyces cerevisiae strain in which the GPP1 and GPP2 genes, both encoding glycerol-3-phosphate phosphatase isoforms, are deleted, displays both osmo- and thermosensitive (ts) phenotypes. We isolated genes involved in cell wall maintenance as multicopy suppressors of the gpp1gpp2 ts phenotype. We found that the gpp1gpp2 strain is hypersensitive to cell wall stress such as treatment with β-1,3-glucanase containing cocktail Zymolyase and chitin-binding dye Calcofluor-white (CFW). Sensitivity to Zymolyase was rescued by overexpression of SSD1, while CFW sensitivity was rescued by SSD1, FLO8 and WSC3-genes isolated as multicopy suppressors of the gpp1gpp2 ts phenotype. Some of the isolated suppressor genes (SSD1, FLO8) also rescued the lytic phenotype of slt2 deletion strain. Additionally, the sensitivity to CFW was reduced when the cells were supplied with glycerol. Both growth on glycerol-based medium and overexpression of SSD1, FLO8 or WSC3 had additive suppressing effect on CFW sensitivity of the gpp1gpp2 mutant strain. We also confirmed that the internal glycerol level changed in cells exposed to cell wall perturbation. © 2007 Springer-Verlag

    Exploring the use of internal and externalcontrols for assessing microarray technical performance

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The maturing of gene expression microarray technology and interest in the use of microarray-based applications for clinical and diagnostic applications calls for quantitative measures of quality. This manuscript presents a retrospective study characterizing several approaches to assess technical performance of microarray data measured on the Affymetrix GeneChip platform, including whole-array metrics and information from a standard mixture of external spike-in and endogenous internal controls. Spike-in controls were found to carry the same information about technical performance as whole-array metrics and endogenous "housekeeping" genes. These results support the use of spike-in controls as general tools for performance assessment across time, experimenters and array batches, suggesting that they have potential for comparison of microarray data generated across species using different technologies.</p> <p>Results</p> <p>A layered PCA modeling methodology that uses data from a number of classes of controls (spike-in hybridization, spike-in polyA+, internal RNA degradation, endogenous or "housekeeping genes") was used for the assessment of microarray data quality. The controls provide information on multiple stages of the experimental protocol (e.g., hybridization, RNA amplification). External spike-in, hybridization and RNA labeling controls provide information related to both assay and hybridization performance whereas internal endogenous controls provide quality information on the biological sample. We find that the variance of the data generated from the external and internal controls carries critical information about technical performance; the PCA dissection of this variance is consistent with whole-array quality assessment based on a number of quality assurance/quality control (QA/QC) metrics.</p> <p>Conclusions</p> <p>These results provide support for the use of both external and internal RNA control data to assess the technical quality of microarray experiments. The observed consistency amongst the information carried by internal and external controls and whole-array quality measures offers promise for rationally-designed control standards for routine performance monitoring of multiplexed measurement platforms.</p

    Minimization of Biosynthetic Costs in Adaptive Gene Expression Responses of Yeast to Environmental Changes

    Get PDF
    Yeast successfully adapts to an environmental stress by altering physiology and fine-tuning metabolism. This fine-tuning is achieved through regulation of both gene expression and protein activity, and it is shaped by various physiological requirements. Such requirements impose a sustained evolutionary pressure that ultimately selects a specific gene expression profile, generating a suitable adaptive response to each environmental change. Although some of the requirements are stress specific, it is likely that others are common to various situations. We hypothesize that an evolutionary pressure for minimizing biosynthetic costs might have left signatures in the physicochemical properties of proteins whose gene expression is fine-tuned during adaptive responses. To test this hypothesis we analyze existing yeast transcriptomic data for such responses and investigate how several properties of proteins correlate to changes in gene expression. Our results reveal signatures that are consistent with a selective pressure for economy in protein synthesis during adaptive response of yeast to various types of stress. These signatures differentiate two groups of adaptive responses with respect to how cells manage expenditure in protein biosynthesis. In one group, significant trends towards downregulation of large proteins and upregulation of small ones are observed. In the other group we find no such trends. These results are consistent with resource limitation being important in the evolution of the first group of stress responses
    corecore