17,179 research outputs found

    A trend pattern assessment approach to microarray gene expression profiling data analysis

    Get PDF
    We study the problem of how to assess the reliability of a statistical measurement on data set containing unknown quantity of noises, inconsistencies, and outliers. A practical approach that analyzes the dynamical patterns (trends) of the statistical measurements through a sequential extreme-boundary-points (EBP) weed-out process is explored. We categorize the weed-out trend patterns (WOTP) and examine their relation to the reliability of the measurement. The approach is applied to the processes of extracting genes that are predictive to BCL2 translocations and to clinical survival outcomes of diffuse large B-cell lymphoma (DLBCL) from DNA Microarray gene expression profiling data sets. Fisher’s Discriminate Criterion (FDC) is used as a statistical measurement in the processes. It is found that the weed-out trend analysis (WOTA) approach is effective for qualitatively assessing the statistics-based measurements in the experimentations conducted

    Statistical modelling of transcript profiles of differentially regulated genes

    Get PDF
    Background: The vast quantities of gene expression profiling data produced in microarray studies, and the more precise quantitative PCR, are often not statistically analysed to their full potential. Previous studies have summarised gene expression profiles using simple descriptive statistics, basic analysis of variance (ANOVA) and the clustering of genes based on simple models fitted to their expression profiles over time. We report the novel application of statistical non-linear regression modelling techniques to describe the shapes of expression profiles for the fungus Agaricus bisporus, quantified by PCR, and for E. coli and Rattus norvegicus, using microarray technology. The use of parametric non-linear regression models provides a more precise description of expression profiles, reducing the "noise" of the raw data to produce a clear "signal" given by the fitted curve, and describing each profile with a small number of biologically interpretable parameters. This approach then allows the direct comparison and clustering of the shapes of response patterns between genes and potentially enables a greater exploration and interpretation of the biological processes driving gene expression. Results: Quantitative reverse transcriptase PCR-derived time-course data of genes were modelled. "Splitline" or "broken-stick" regression identified the initial time of gene up-regulation, enabling the classification of genes into those with primary and secondary responses. Five-day profiles were modelled using the biologically-oriented, critical exponential curve, y(t) = A + (B + Ct)Rt + ε. This non-linear regression approach allowed the expression patterns for different genes to be compared in terms of curve shape, time of maximal transcript level and the decline and asymptotic response levels. Three distinct regulatory patterns were identified for the five genes studied. Applying the regression modelling approach to microarray-derived time course data allowed 11% of the Escherichia coli features to be fitted by an exponential function, and 25% of the Rattus norvegicus features could be described by the critical exponential model, all with statistical significance of p < 0.05. Conclusion: The statistical non-linear regression approaches presented in this study provide detailed biologically oriented descriptions of individual gene expression profiles, using biologically variable data to generate a set of defining parameters. These approaches have application to the modelling and greater interpretation of profiles obtained across a wide range of platforms, such as microarrays. Through careful choice of appropriate model forms, such statistical regression approaches allow an improved comparison of gene expression profiles, and may provide an approach for the greater understanding of common regulatory mechanisms between genes

    Integration of microRNA changes in vivo identifies novel molecular features of muscle insulin resistance in type 2 diabetes

    Get PDF
    Skeletal muscle insulin resistance (IR) is considered a critical component of type II diabetes, yet to date IR has evaded characterization at the global gene expression level in humans. MicroRNAs (miRNAs) are considered fine-scale rheostats of protein-coding gene product abundance. The relative importance and mode of action of miRNAs in human complex diseases remains to be fully elucidated. We produce a global map of coding and non-coding RNAs in human muscle IR with the aim of identifying novel disease biomarkers. We profiled &gt;47,000 mRNA sequences and &gt;500 human miRNAs using gene-chips and 118 subjects (n = 71 patients versus n = 47 controls). A tissue-specific gene-ranking system was developed to stratify thousands of miRNA target-genes, removing false positives, yielding a weighted inhibitor score, which integrated the net impact of both up- and down-regulated miRNAs. Both informatic and protein detection validation was used to verify the predictions of in vivo changes. The muscle mRNA transcriptome is invariant with respect to insulin or glucose homeostasis. In contrast, a third of miRNAs detected in muscle were altered in disease (n = 62), many changing prior to the onset of clinical diabetes. The novel ranking metric identified six canonical pathways with proven links to metabolic disease while the control data demonstrated no enrichment. The Benjamini-Hochberg adjusted Gene Ontology profile of the highest ranked targets was metabolic (P &lt; 7.4 × 10-8), post-translational modification (P &lt; 9.7 × 10-5) and developmental (P &lt; 1.3 × 10-6) processes. Protein profiling of six development-related genes validated the predictions. Brain-derived neurotrophic factor protein was detectable only in muscle satellite cells and was increased in diabetes patients compared with controls, consistent with the observation that global miRNA changes were opposite from those found during myogenic differentiation. We provide evidence that IR in humans may be related to coordinated changes in multiple microRNAs, which act to target relevant signaling pathways. It would appear that miRNAs can produce marked changes in target protein abundance in vivo by working in a combinatorial manner. Thus, miRNA detection represents a new molecular biomarker strategy for insulin resistance, where micrograms of patient material is needed to monitor efficacy during drug or life-style interventions

    Integrative analyses identify modulators of response to neoadjuvant aromatase inhibitors in patients with early breast cancer

    Get PDF
    Introduction Aromatase inhibitors (AIs) are a vital component of estrogen receptor positive (ER+) breast cancer treatment. De novo and acquired resistance, however, is common. The aims of this study were to relate patterns of copy number aberrations to molecular and proliferative response to AIs, to study differences in the patterns of copy number aberrations between breast cancer samples pre- and post-AI neoadjuvant therapy, and to identify putative biomarkers for resistance to neoadjuvant AI therapy using an integrative analysis approach. Methods Samples from 84 patients derived from two neoadjuvant AI therapy trials were subjected to copy number profiling by microarray-based comparative genomic hybridisation (aCGH, n = 84), gene expression profiling (n = 47), matched pre- and post-AI aCGH (n = 19 pairs) and Ki67-based AI-response analysis (n = 39). Results Integrative analysis of these datasets identified a set of nine genes that, when amplified, were associated with a poor response to AIs, and were significantly overexpressed when amplified, including CHKA, LRP5 and SAPS3. Functional validation in vitro, using cell lines with and without amplification of these genes (SUM44, MDA-MB134-VI, T47D and MCF7) and a model of acquired AI-resistance (MCF7-LTED) identified CHKA as a gene that when amplified modulates estrogen receptor (ER)-driven proliferation, ER/estrogen response element (ERE) transactivation, expression of ER-regulated genes and phosphorylation of V-AKT murine thymoma viral oncogene homolog 1 (AKT1). Conclusions These data provide a rationale for investigation of the role of CHKA in further models of de novo and acquired resistance to AIs, and provide proof of concept that integrative genomic analyses can identify biologically relevant modulators of AI response

    Transcriptomic effects of the non-steroidal anti-inflammatory drug Ibuprofen in the marine bivalve Mytilus galloprovincialis Lam

    Get PDF
    The transcriptomic effects of Ibuprofen (IBU) in the digestive gland tissue of Mytilus galloprovincialis Lam. specimens exposed at low environmental concentrations (250 ng L-1) are presented. Using a 1.7 K feature cDNA microarray along with linear models and empirical Bayes statistical methods 225 differentially expressed genes were identified in mussels treated with IBU across a 15-day period. Transcriptional dynamics were typical of an adaptive response with a peak of gene expression change at day 7 (177 features, representing about 11% of sequences available for analysis) and an almost full recovery at the end of the exposure period. Functional genomics by means of Gene Ontology term analysis unraveled typical mussel stress responses i.e. aminoglycan (chitin) metabolic processes but also more specific effects such as the regulation of NF-kappa B transcription factor activity. (C) 2016 Elsevier Ltd. All rights reserved

    Expression cartography of human tissues using self organizing maps

    Get PDF
    Background: The availability of parallel, high-throughput microarray and sequencing experiments poses a challenge how to best arrange and to analyze the obtained heap of multidimensional data in a concerted way. Self organizing maps (SOM), a machine learning method, enables the parallel sample- and gene-centered view on the data combined with strong visualization and second-level analysis capabilities. The paper addresses aspects of the method with practical impact in the context of expression analysis of complex data sets.&#xd;&#xa;Results: The method was applied to generate a SOM characterizing the whole genome expression profiles of 67 healthy human tissues selected from ten tissue categories (adipose, endocrine, homeostasis, digestion, exocrine, epithelium, sexual reproduction, muscle, immune system and nervous tissues). SOM mapping reduces the dimension of expression data from ten thousands of genes to a few thousands of metagenes where each metagene acts as representative of a minicluster of co-regulated single genes. Tissue-specific and common properties shared between groups of tissues emerge as a handful of localized spots in the tissue maps collecting groups of co-regulated and co-expressed metagenes. The functional context of the spots was discovered using overrepresentation analysis with respect to pre-defined gene sets of known functional impact. We found that tissue related spots typically contain enriched populations of gene sets well corresponding to molecular processes in the respective tissues. Analysis techniques normally used at the gene-level such as two-way hierarchical clustering provide a better signal-to-noise ratio and a better representativeness of the method if applied to the metagenes. Metagene-based clustering analyses aggregate the tissues into essentially three clusters containing nervous, immune system and the remaining tissues. &#xd;&#xa;Conclusions: The global view on the behavior of a few well-defined modules of correlated and differentially expressed genes is more intuitive and more informative than the separate discovery of the expression levels of hundreds or thousands of individual genes. The metagene approach is less sensitive to a priori selection of genes. It can detect a coordinated expression pattern whose components would not pass single-gene significance thresholds and it is able to extract context-dependent patterns of gene expression in complex data sets.&#xd;&#xa
    corecore