
    Discovering Temporal Associations among Significant Changes in Gene Expression

    Abstract: One of the most demanding problems in mining temporal data is to identify how multivariate change associations might be discovered and used to better understand data interactions and dependencies. This paper introduces a framework to mine associations among significant changes in multivariate time-series data. Building on statistical methods, we detect significant changes in time-series data and use marginal change rates to qualify the direction of change at significant change points. Furthermore, a propositional confirmation-guided rule discovery method is used to discover associations among these significant changes. We apply our approach to gene expression data measured in yeast cell cycles and demonstrate that our method can learn novel and high-quality significant change associations among different genes. Such associations can be used to cluster genes and build gene interaction networks.

    Discovering Graphical Granger Causality Using the Truncating Lasso Penalty

    Components of biological systems interact with each other in order to carry out vital cell functions. Such information can be used to improve estimation and inference, and to obtain better insights into the underlying cellular mechanisms. Discovering regulatory interactions among genes is therefore an important problem in systems biology. Whole-genome expression data over time provides an opportunity to determine how the expression levels of genes are affected by changes in transcription levels of other genes, and can therefore be used to discover regulatory interactions among genes. In this paper, we propose a novel penalization method, called truncating lasso, for estimation of causal relationships from time-course gene expression data. The proposed penalty can correctly determine the order of the underlying time series, and improves the performance of the lasso-type estimators. Moreover, the resulting estimate provides information on the time lag between activation of transcription factors and their effects on regulated genes. We provide an efficient algorithm for estimation of model parameters, and show that the proposed method can consistently discover causal relationships in the large p, small n setting. The performance of the proposed model is evaluated favorably in simulated, as well as real, data examples. The proposed truncating lasso method is implemented in the R-package grangerTlasso and is available at http://www.stat.lsa.umich.edu/~shojaie. Comment: 12 pages, 4 figures, 1 table

    Integrated analysis of gene expression by association rules discovery

    BACKGROUND: Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products in the analysis of expression data. However, most current approaches to analyzing microarray datasets focus mainly on the experimental data, and external biological information is incorporated only afterwards. RESULTS: In this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, many of which are clearly supported by recently reported work. CONCLUSION: The integration of external biological information and gene expression data can provide insights about the biological processes associated with gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package.
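
As a rough illustration of the co-occurrence idea in the abstract above, the following minimal sketch treats each gene as a "transaction" whose items mix annotations and expression patterns, then keeps single-item rules that pass support and confidence thresholds. It is not the Engene implementation; the function name `mine_rules`, the item labels, and the thresholds are all hypothetical.

```python
from itertools import permutations


def mine_rules(transactions, min_support=0.5, min_confidence=0.8):
    """Return (antecedent, consequent, support, confidence) tuples.

    Only one-item -> one-item rules are considered, which keeps the
    sketch short; real rule miners also handle larger itemsets.
    """
    sets = [frozenset(t) for t in transactions]
    n = len(sets)
    items = sorted(set().union(*sets))
    rules = []
    for a, b in permutations(items, 2):
        with_a = sum(1 for t in sets if a in t)
        with_both = sum(1 for t in sets if a in t and b in t)
        if with_a == 0:
            continue
        support = with_both / n          # fraction of genes with both items
        confidence = with_both / with_a  # fraction of a-genes that also have b
        if support >= min_support and confidence >= min_confidence:
            rules.append((a, b, support, confidence))
    return rules


# Hypothetical toy data: one set of labels per gene.
genes = [
    {"GO:cell_cycle", "up@t1"},
    {"GO:cell_cycle", "up@t1", "pathway:glycolysis"},
    {"GO:cell_cycle", "up@t1"},
    {"pathway:glycolysis", "down@t1"},
]

rules = mine_rules(genes)
for rule in rules:
    print(rule)
```

On this toy input, the annotation `GO:cell_cycle` and the pattern `up@t1` co-occur in three of four genes, so rules in both directions pass the default thresholds, while the rarer glycolysis label does not.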

    Identifying progressive imaging genetic patterns via multi-task sparse canonical correlation analysis: a longitudinal study of the ADNI cohort

    Motivation: Identifying the genetic basis of brain structure, function and disorder by using imaging quantitative traits (QTs) as endophenotypes is an important task in brain science. Brain QTs often change over time as the disorder progresses, so understanding how genetic factors influence progressive brain QT changes is of great importance. Most existing imaging genetics methods analyze only the baseline neuroimaging data, and thus omit longitudinal imaging data across multiple time points that contain important information on disease progression. Results: We propose a novel temporal imaging genetic model that performs multi-task sparse canonical correlation analysis (T-MTSCCA). Our model uses longitudinal neuroimaging data to uncover how single nucleotide polymorphisms (SNPs) affect brain QTs over time. By incorporating the relationships within the longitudinal imaging data and within the SNPs, T-MTSCCA can identify a trajectory of progressive imaging genetic patterns over time. We propose an efficient algorithm to solve the problem and show its convergence. We evaluate T-MTSCCA on 408 subjects from the Alzheimer’s Disease Neuroimaging Initiative database with longitudinal magnetic resonance imaging data and genetic data available. The experimental results show that T-MTSCCA performs better than or equally to the state-of-the-art methods. In particular, T-MTSCCA identifies higher canonical correlation coefficients and captures clearer canonical weight patterns. This suggests that T-MTSCCA identifies time-consistent and time-dependent SNPs and imaging QTs, which further helps in understanding the genetic basis of brain QT changes over time during disease progression. Availability and implementation: The software and simulation data are publicly available at https://github.com/dulei323/TMTSCCA. Supplementary information: Supplementary data are available at Bioinformatics online.

    Growing Up Toxic: Chemical Exposures and Increases in Developmental Disease

    Explains how exposure to toxic chemicals can harm health and impair development, causing premature birth, learning disabilities, behavioral disorders, asthma and allergies, and/or other problems. Suggests policy reforms

    Neuroblastoma patient outcomes, tumor differentiation, and ERK activation are correlated with expression levels of the ubiquitin ligase UBE4B.

    Background: UBE4B is an E3/E4 ubiquitin ligase whose gene is located in chromosome 1p36.22. We analyzed the associations of UBE4B gene and protein expression with neuroblastoma patient outcomes and with tumor prognostic features and histology. Methods: We evaluated the association of UBE4B gene expression with neuroblastoma patient outcomes using the R2 Platform. We screened neuroblastoma tumor samples for UBE4B protein expression using immunohistochemistry. FISH for UBE4B and 1p36 deletion was performed on tumor samples. We then evaluated UBE4B expression for associations with prognostic factors and with levels of phosphorylated ERK in neuroblastoma tumors and cell lines. Results: Low UBE4B gene expression is associated with poor outcomes in patients with neuroblastoma and with worse outcomes in all patient subgroups. UBE4B protein expression was associated with neuroblastoma tumor differentiation, and decreased UBE4B protein levels were associated with high-risk features. UBE4B protein levels were also associated with levels of phosphorylated ERK. Conclusions: We have demonstrated associations between UBE4B gene expression and neuroblastoma patient outcomes and prognostic features. Reduced UBE4B protein expression in neuroblastoma tumors was associated with high-risk features, a lack of differentiation, and with ERK activation. These results suggest UBE4B may contribute to the poor prognosis of neuroblastoma tumors with 1p36 deletions and that UBE4B expression may mediate neuroblastoma differentiation.

    SWIM: A computational tool to unveiling crucial nodes in complex biological networks

    SWItchMiner (SWIM) is a wizard-like software implementation of a previously described procedure for extracting information contained in complex networks. Specifically, SWIM reveals a new class of hubs, called "fight-club hubs", characterized by a marked negative correlation with their first nearest neighbors. Among them, a special subset of genes, called "switch genes", appears to be characterized by an unusual pattern of intra- and inter-module connections that confers on them a crucial topological role, interestingly mirrored by evidence of their clinical and biological relevance. Here, we applied SWIM to a large panel of cancer datasets from The Cancer Genome Atlas, in order to highlight switch genes that could be critically associated with the drastic changes in the physiological state of cells or tissues induced by cancer development. We discovered that switch genes are found in all cancers we studied and that they encompass protein-coding genes and non-coding RNAs, recovering many known key cancer players but also many new potential biomarkers not yet characterized in a cancer context. Furthermore, SWIM can detect switch genes in different organisms and cell conditions, with the potential to uncover important players in biologically relevant scenarios, including but not limited to human cancer.
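
The notion of a hub that is negatively correlated with its first nearest neighbors can be illustrated with a small sketch. This is not SWIM's procedure: it only averages the Pearson correlation between one node's expression profile and those of its direct neighbors, and the helper names, gene labels, and expression values are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def average_neighbor_correlation(expr, edges, node):
    """Mean correlation between `node` and its first nearest neighbors.

    `expr` maps gene -> expression profile; `edges` is a list of
    undirected (gene, gene) pairs.
    """
    neighbors = {v for u, v in edges if u == node}
    neighbors |= {u for u, v in edges if v == node}
    total = sum(pearson(expr[node], expr[m]) for m in neighbors)
    return total / len(neighbors)


# Hypothetical toy network: "hub" rises while its neighbors fall.
expr = {
    "hub": [1.0, 2.0, 3.0, 4.0],
    "a": [4.0, 3.0, 2.0, 1.0],
    "b": [8.0, 6.0, 4.0, 2.0],
    "c": [1.0, 2.0, 3.0, 5.0],
}
edges = [("hub", "a"), ("hub", "b"), ("a", "c")]

hub_score = average_neighbor_correlation(expr, edges, "hub")
print(f"average neighbor correlation for hub: {hub_score:.2f}")
```

A strongly negative average, as for `hub` here, is the kind of signal the abstract describes; how SWIM thresholds it and combines it with module structure is not specified in this listing.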