95 research outputs found
EuroDia: a beta-cell gene expression resource
Type 2 diabetes mellitus (T2DM) is a major disease affecting nearly 280 million people worldwide. Whilst the pathophysiological mechanisms leading to disease are poorly understood, dysfunction of the insulin-producing pancreatic beta-cells is key event for disease development. Monitoring the gene expression profiles of pancreatic beta-cells under several genetic or chemical perturbations has shed light on genes and pathways involved in T2DM. The EuroDia database has been established to build a unique collection of gene expression measurements performed on beta-cells of three organisms, namely human, mouse and rat. The Gene Expression Data Analysis Interface (GEDAI) has been developed to support this database. The quality of each dataset is assessed by a series of quality control procedures to detect putative hybridization outliers. The system integrates a web interface to several standard analysis functions from R/Bioconductor to identify differentially expressed genes and pathways. It also allows the combination of multiple experiments performed on different array platforms of the same technology. The design of this system enables each user to rapidly design a custom analysis pipeline and thus produce their own list of genes and pathways. Raw and normalized data can be downloaded for each experiment. The flexible engine of this database (GEDAI) is currently used to handle gene expression data from several laboratory-run projects dealing with different organisms and platforms
Using Ribosomal Protein Genes as Reference: A Tale of Caution
Background: Housekeeping genes are needed in every tissue as their expression is required for survival, integrity or duplication of every cell. Housekeeping genes commonly have been used as reference genes to normalize gene expression data, the underlying assumption being that they are expressed in every cell type at approximately the same level. Often, the terms "reference genes'' and "housekeeping genes'' are used interchangeably. In this paper, we would like to distinguish between these terms. Consensus is growing that housekeeping genes which have traditionally been used to normalize gene expression data are not good reference genes. Recently, ribosomal protein genes have been suggested as reference genes based on a meta-analysis of publicly available microarray data.
Methodology/Principal Findings: We have applied several statistical tools on a dataset of 70 microarrays representing 22 different tissues, to assess and visualize expression stability of ribosomal protein genes. We confirmed the housekeeping status of these genes, but further estimated expression stability across tissues in order to assess their potential as reference genes. One- and two-way ANOVA revealed that all ribosomal protein genes have significant expression variation across tissues and exhibit tissue-dependent expression behavior as a group. Via multidimensional unfolding analysis, we visualized this tissue-dependency. In addition, we explored mechanisms that may cause tissue dependent effects of individual ribosomal protein genes.
Conclusions/Significance: Here we provide statistical and biological evidence that ribosomal protein genes exhibit important tissue-dependent variation in mRNA expression. Though these genes are most stably expressed of all investigated genes in a meta-analysis they cannot be considered true reference genes
Network Analysis of Differential Expression for the Identification of Disease-Causing Genes
Genetic studies (in particular linkage and association studies) identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to identify (prioritize) the most promising candidates within a region have been proposed, but they are usually not applicable to cases where little is known about the phenotype (no or few confirmed disease genes, fragmentary understanding of the biological cascades involved). We seek to overcome this limitation by replacing knowledge about the biological process by experimental data on differential gene expression between affected and healthy individuals. Considering the problem from the perspective of a gene/protein network, we assess a candidate gene by considering the level of differential expression in its neighborhood under the assumption that strong candidates will tend to be surrounded by differentially expressed neighbors. We define a notion of soft neighborhood where each gene is given a contributing weight, which decreases with the distance from the candidate gene on the protein network. To account for multiple paths between genes, we define the distance using the Laplacian exponential diffusion kernel. We score candidates by aggregating the differential expression of neighbors weighted as a function of distance. Through a randomization procedure, we rank candidates by p-values. We illustrate our approach on four monogenic diseases and successfully prioritize the known disease causing genes
Revealing New Mouse Epicardial Cell Markers through Transcriptomics
The epicardium has key functions during myocardial development, by contributing to the formation of coronary endothelial and smooth muscle cells, cardiac fibroblasts, and potentially cardiomyocytes. The epicardium plays a morphogenetic role by emitting signals to promote and maintain cardiomyocyte proliferation. In a regenerative context, the adult epicardium might comprise a progenitor cell population that can be induced to contribute to cardiac repair. Although some genes involved in epicardial function have been identified, a detailed molecular profile of epicardial gene expression has not been available.Using laser capture microscopy, we isolated the epicardial layer from the adult murine heart before or after cardiac infarction in wildtype mice and mice expressing a transgenic IGF-1 propeptide (mIGF-1) that enhances cardiac repair, and analyzed the transcription profile using DNA microarrays.Expression of epithelial genes such as basonuclin, dermokine, and glycoprotein M6A are highly enriched in the epicardial layer, which maintains expression of selected embryonic genes involved in epicardial development in mIGF-1 transgenic hearts. After myocardial infarct, a subset of differentially expressed genes are down-regulated in the epicardium representing an epicardium-specific signature that responds to injury.This study presents the description of the murine epicardial transcriptome obtained from snap frozen tissues, providing essential information for further analysis of this important cardiac cell layer
Placental lactogens induce serotonin biosynthesis in a subset of mouse beta cells during pregnancy
AIMS/HYPOTHESIS: Upregulation of the functional beta cell mass is required to match the physiological demands of mother and fetus during pregnancy. This increase is dependent on placental lactogens (PLs) and prolactin receptors, but the mechanisms underlying these events are only partially understood. We studied the mRNA expression profile of mouse islets during pregnancy to gain a better insight into these changes. METHODS: RNA expression was measured ex vivo via microarrays and quantitative RT-PCR. In vivo observations were extended by in vitro models in which ovine PL was added to cultured mouse islets and MIN6 cells. RESULTS: mRNA encoding both isoforms of the rate-limiting enzyme of serotonin biosynthesis, tryptophan hydroxylase (TPH), i.e. Tph1 and Tph2, were strongly induced (fold change 25- to 200-fold) during pregnancy. This induction was mimicked by exposing islets or MIN6 cells to ovine PLs for 24 h and was dependent on janus kinase 2 and signal transducer and activator of transcription 5. Parallel to Tph1 mRNA and protein induction, islet serotonin content increased to a peak level that was 200-fold higher than basal. Interestingly, only a subpopulation of the beta cells was serotonin-positive in vitro and in vivo. The stored serotonin pool in pregnant islets and PL-treated MIN6 cells was rapidly released (turnover once every 2 h). CONCLUSIONS/INTERPRETATION: A very strong lactogen-dependent upregulation of serotonin biosynthesis occurs in a subpopulation of mouse islet beta cells during pregnancy. Since the newly formed serotonin is rapidly released, this lactogen-induced beta cell function may serve local or endocrine tasks, the nature of which remains to be identified
DISCO-SCA and Properly Applied GSVD as Swinging Methods to Find Common and Distinctive Processes
BACKGROUND: In systems biology it is common to obtain for the same set of biological entities information from multiple sources. Examples include expression data for the same set of orthologous genes screened in different organisms and data on the same set of culture samples obtained with different high-throughput techniques. A major challenge is to find the important biological processes underlying the data and to disentangle therein processes common to all data sources and processes distinctive for a specific source. Recently, two promising simultaneous data integration methods have been proposed to attain this goal, namely generalized singular value decomposition (GSVD) and simultaneous component analysis with rotation to common and distinctive components (DISCO-SCA). RESULTS: Both theoretical analyses and applications to biologically relevant data show that: (1) straightforward applications of GSVD yield unsatisfactory results, (2) DISCO-SCA performs well, (3) provided proper pre-processing and algorithmic adaptations, GSVD reaches a performance level similar to that of DISCO-SCA, and (4) DISCO-SCA is directly generalizable to more than two data sources. The biological relevance of DISCO-SCA is illustrated with two applications. First, in a setting of comparative genomics, it is shown that DISCO-SCA recovers a common theme of cell cycle progression and a yeast-specific response to pheromones. The biological annotation was obtained by applying Gene Set Enrichment Analysis in an appropriate way. Second, in an application of DISCO-SCA to metabolomics data for Escherichia coli obtained with two different chemical analysis platforms, it is illustrated that the metabolites involved in some of the biological processes underlying the data are detected by one of the two platforms only; therefore, platforms for microbial metabolomics should be tailored to the biological question. CONCLUSIONS: Both DISCO-SCA and properly applied GSVD are promising integrative methods for finding common and distinctive processes in multisource data. Open source code for both methods is provided
- …