263 research outputs found

    Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The wide use of high-throughput DNA microarray technology provide an increasingly detailed view of human transcriptome from hundreds to thousands of genes. Although biomedical researchers typically design microarray experiments to explore specific biological contexts, the relationships between genes are hard to identified because they are complex and noisy high-dimensional data and are often hindered by low statistical power. The main challenge now is to extract valuable biological information from the colossal amount of data to gain insight into biological processes and the mechanisms of human disease. To overcome the challenge requires mathematical and computational methods that are versatile enough to capture the underlying biological features and simple enough to be applied efficiently to large datasets.</p> <p>Methods</p> <p>Unsupervised machine learning approaches provide new and efficient analysis of gene expression profiles. In our study, two unsupervised knowledge-based matrix factorization methods, independent component analysis (ICA) and nonnegative matrix factorization (NMF) are integrated to identify significant genes and related pathways in microarray gene expression dataset of Alzheimer’s disease. The advantage of these two approaches is they can be performed as a biclustering method by which genes and conditions can be clustered simultaneously. Furthermore, they can group genes into different categories for identifying related diagnostic pathways and regulatory networks. The difference between these two method lies in ICA assume statistical independence of the expression modes, while NMF need positivity constrains to generate localized gene expression profiles.</p> <p>Results</p> <p>In our work, we performed FastICA and non-smooth NMF methods on DNA microarray gene expression data of Alzheimer’s disease respectively. The simulation results shows that both of the methods can clearly classify severe AD samples from control samples, and the biological analysis of the identified significant genes and their related pathways demonstrated that these genes play a prominent role in AD and relate the activation patterns to AD phenotypes. It is validated that the combination of these two methods is efficient.</p> <p>Conclusions</p> <p>Unsupervised matrix factorization methods provide efficient tools to analyze high-throughput microarray dataset. According to the facts that different unsupervised approaches explore correlations in the high-dimensional data space and identify relevant subspace base on different hypotheses, integrating these methods to explore the underlying biological information from microarray dataset is an efficient approach. By combining the significant genes identified by both ICA and NMF, the biological analysis shows great efficient for elucidating the molecular taxonomy of Alzheimer’s disease and enable better experimental design to further identify potential pathways and therapeutic targets of AD.</p

    Post hoc pattern matching: assigning significance to statistically defined expression patterns in single channel microarray data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Researchers using RNA expression microarrays in experimental designs with more than two treatment groups often identify statistically significant genes with ANOVA approaches. However, the ANOVA test does not discriminate which of the multiple treatment groups differ from one another. Thus, <it>post hoc </it>tests, such as linear contrasts, template correlations, and pairwise comparisons are used. Linear contrasts and template correlations work extremely well, especially when the researcher has <it>a priori </it>information pointing to a particular pattern/template among the different treatment groups. Further, all pairwise comparisons can be used to identify particular, treatment group-dependent patterns of gene expression. However, these approaches are biased by the researcher's assumptions, and some treatment-based patterns may fail to be detected using these approaches. Finally, different patterns may have different probabilities of occurring by chance, importantly influencing researchers' conclusions about a pattern and its constituent genes.</p> <p>Results</p> <p>We developed a four step, <it>post hoc </it>pattern matching (PPM) algorithm to automate single channel gene expression pattern identification/significance. First, 1-Way Analysis of Variance (ANOVA), coupled with <it>post hoc </it>'all pairwise' comparisons are calculated for all genes. Second, for each ANOVA-significant gene, all pairwise contrast results are encoded to create unique pattern ID numbers. The # genes found in each pattern in the data is identified as that pattern's 'actual' frequency. Third, using Monte Carlo simulations, those patterns' frequencies are estimated in random data ('random' gene pattern frequency). Fourth, a Z-score for overrepresentation of the pattern is calculated ('actual' against 'random' gene pattern frequencies). We wrote a Visual Basic program (StatiGen) that automates PPM procedure, constructs an Excel workbook with standardized graphs of overrepresented patterns, and lists of the genes comprising each pattern. The visual basic code, installation files for StatiGen, and sample data are available as supplementary material.</p> <p>Conclusion</p> <p>The PPM procedure is designed to augment current microarray analysis procedures by allowing researchers to incorporate all of the information from post hoc tests to establish unique, overarching gene expression patterns in which there is no overlap in gene membership. In our hands, PPM works well for studies using from three to six treatment groups in which the researcher is interested in treatment-related patterns of gene expression. Hardware/software limitations and extreme number of theoretical expression patterns limit utility for larger numbers of treatment groups. Applied to a published microarray experiment, the StatiGen program successfully flagged patterns that had been manually assigned in prior work, and further identified other gene expression patterns that may be of interest. Thus, over a moderate range of treatment groups, PPM appears to work well. It allows researchers to assign statistical probabilities to patterns of gene expression that fit <it>a priori </it>expectations/hypotheses, it preserves the data's ability to show the researcher interesting, yet unanticipated gene expression patterns, and assigns the majority of ANOVA-significant genes to non-overlapping patterns.</p

    Concurrent use of prescription drugs and herbal medicinal products in older adults: A systematic review

    Get PDF
    This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.The use of herbal medicinal products (HMPs) is common among older adults. However, little is known about concurrent use with prescription drugs as well as the potential interactions associated with such combinations. Objective Identify and evaluate the literature on concurrent prescription and HMPs use among older adults to assess prevalence, patterns, potential interactions and factors associated with this use. Methods Systematic searches in MEDLINE, PsycINFO, EMBASE, CINAHL, AMED, Web of Science and Cochrane from inception to May 2017 for studies reporting concurrent use of prescription medicines with HMPs in adults (≥65 years). Quality was assessed using the Joanna Briggs Institute checklists. The Evidence for Policy and Practice Information and Co-ordinating Centre (EPPI-Centre) three stage approach to mixed method research was used to synthesise data. Results Twenty-two studies were included. A definition of HMPs or what was considered HMP was frequently missing. Prevalence of concurrent use by older adults varied widely between 5.3% and 88.3%. Prescription medicines most combined with HMPs were antihypertensive drugs, beta blockers, diuretics, antihyperlipidemic agents, anticoagulants, analgesics, antihistamines, antidiabetics, antidepressants and statins. The HMPs most frequently used were: ginkgo, garlic, ginseng, St John’s wort, Echinacea, saw palmetto, evening primrose oil and ginger. Potential risks of bleeding due to use of ginkgo, garlic or ginseng with aspirin or warfarin was the most reported herb-drug interaction. Some data suggests being female, a lower household income and less than high school education were associated with concurrent use. Conclusion Prevalence of concurrent prescription drugs and HMPs use among older adults is substantial and potential interactions have been reported. Knowledge of the extent and manner in which older adults combine prescription drugs will aid healthcare professionals can appropriately identify and manage patients at risk.Peer reviewedFinal Published versio

    Coordinated Expression Domains in Mammalian Genomes

    Get PDF
    Gene order in eukaryotic genomes is not random. Genes showing similar expression (coexpression) patterns are often clustered along the genome. The goal of this study is to characterize coexpression clustering in mammalian genomes and to investigate the underlying mechanisms.We detect clustering of coexpressed genes across multiple scales, from neighboring genes to chromosomal domains that span tens of megabases and, in some cases, entire chromosomes. Coexpression domains may be positively or negatively correlated with other domains, within and between chromosomes. We find that long-range expression domains are associated with gene density, which in turn is related to physical organization of the chromosomes within the nucleus. We show that gene expression changes between healthy and diseased tissue samples occur in a gene density-dependent manner.We demonstrate that coexpression domains exist across multiple scales. We identify potential mechanisms for short-range as well as long-range coexpression domains. We provide evidence that the three-dimensional architecture of the chromosomes may underlie long-range coexpression domains. Chromosome territory reorganization may play a role in common human diseases such as Alzheimer's disease and psoriasis

    Temporal transcriptome changes induced by MDV in marek's disease-resistant and -susceptible inbred chickens

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Marek's disease (MD) is a lymphoproliferative disease in chickens caused by Marek's disease virus (MDV) and characterized by T cell lymphoma and infiltration of lymphoid cells into various organs such as liver, spleen, peripheral nerves and muscle. Resistance to MD and disease risk have long been thought to be influenced both by genetic and environmental factors, the combination of which contributes to the observed outcome in an individual. We hypothesize that after MDV infection, genes related to MD-resistance or -susceptibility may exhibit different trends in transcriptional activity in chicken lines having a varying degree of resistance to MD.</p> <p>Results</p> <p>In order to study the mechanisms of resistance and susceptibility to MD, we performed genome-wide temporal expression analysis in spleen tissues from MD-resistant line 6<sub>3</sub>, susceptible line 7<sub>2 </sub>and recombinant congenic strain M (RCS-M) that has a phenotype intermediate between lines 6<sub>3 </sub>and 7<sub>2 </sub>after MDV infection. Three time points of the MDV life cycle in chicken were selected for study: 5 days post infection (dpi), 10dpi and 21dpi, representing the early cytolytic, latent and late cytolytic stages, respectively. We observed similar gene expression profiles at the three time points in line 6<sub>3 </sub>and RCS-M chickens that are both different from line 7<sub>2</sub>. Pathway analysis using Ingenuity Pathway Analysis (IPA) showed that MDV can broadly influence the chickens irrespective of whether they are resistant or susceptible to MD. However, some pathways like cardiac arrhythmia and cardiovascular disease were found to be affected only in line 7<sub>2</sub>; while some networks related to cell-mediated immune response and antigen presentation were enriched only in line 6<sub>3 </sub>and RCS-M. We identified 78 and 30 candidate genes associated with MD resistance, at 10 and 21dpi respectively, by considering genes having the same trend of expression change after MDV infection in lines 6<sub>3 </sub>and RCS-M. On the other hand, by considering genes with the same trend of expression change after MDV infection in lines 7<sub>2 </sub>and RCS-M, we identified 78 and 43 genes at 10 and 21dpi, respectively, which may be associated with MD-susceptibility.</p> <p>Conclusions</p> <p>By testing temporal transcriptome changes using three representative chicken lines with different resistance to MD, we identified 108 candidate genes for MD-resistance and 121 candidate genes for MD-susceptibility over the three time points. Genes included in our resistance or susceptibility genes lists that are also involved in more than 5 biofunctions, such as <it>CD8α</it>, <it>IL8</it>, <it>USP18</it>, and <it>CTLA4</it>, are considered to be important genes involved in MD-resistance or -susceptibility. We were also able to identify several biofunctions related with immune response that we believe play an important role in MD-resistance.</p

    Gene Expression in the Rodent Brain is Associated with Its Regional Connectivity

    Get PDF
    The putative link between gene expression of brain regions and their neural connectivity patterns is a fundamental question in neuroscience. Here this question is addressed in the first large scale study of a prototypical mammalian rodent brain, using a combination of rat brain regional connectivity data with gene expression of the mouse brain. Remarkably, even though this study uses data from two different rodent species (due to the data limitations), we still find that the connectivity of the majority of brain regions is highly predictable from their gene expression levels–the outgoing (incoming) connectivity is successfully predicted for 73% (56%) of brain regions, with an overall fairly marked accuracy level of 0.79 (0.83). Many genes are found to play a part in predicting both the incoming and outgoing connectivity (241 out of the 500 top selected genes, p-value<1e-5). Reassuringly, the genes previously known from the literature to be involved in axon guidance do carry significant information about regional brain connectivity. Surveying the genes known to be associated with the pathogenesis of several brain disorders, we find that those associated with schizophrenia, autism and attention deficit disorder are the most highly enriched in the connectivity-related genes identified here. Finally, we find that the profile of functional annotation groups that are associated with regional connectivity in the rodent is significantly correlated with the annotation profile of genes previously found to determine neural connectivity in C. elegans (Pearson correlation of 0.24, p<1e-6 for the outgoing connections and 0.27, p<1e-5 for the incoming). Overall, the association between connectivity and gene expression in a specific extant rodent species' brain is likely to be even stronger than found here, given the limitations of current data

    Molecular Insights into the Pathogenesis of Alzheimer's Disease and Its Relationship to Normal Aging

    Get PDF
    Alzheimer's disease (AD) is a complex neurodegenerative disorder that diverges from the process of normal brain aging by unknown mechanisms. We analyzed the global structure of age- and disease-dependent gene expression patterns in three regions from more than 600 brains. Gene expression variation could be almost completely explained by four transcriptional biomarkers that we named BioAge (biological age), Alz (Alzheimer), Inflame (inflammation), and NdStress (neurodegenerative stress). BioAge captures the first principal component of variation and includes genes statistically associated with neuronal loss, glial activation, and lipid metabolism. Normally BioAge increases with chronological age, but in AD it is prematurely expressed as if some of the subjects were 140 years old. A component of BioAge, Lipa, contains the AD risk factor APOE and reflects an apparent early disturbance in lipid metabolism. The rate of biological aging in AD patients, which cannot be explained by BioAge, is associated instead with NdStress, which includes genes related to protein folding and metabolism. Inflame, comprised of inflammatory cytokines and microglial genes, is broadly activated and appears early in the disease process. In contrast, the disease-specific biomarker Alz was selectively present only in the affected areas of the AD brain, appears later in pathogenesis, and is enriched in genes associated with the signaling and cell adhesion changes during the epithelial to mesenchymal (EMT) transition. Together these biomarkers provide detailed description of the aging process and its contribution to Alzheimer's disease progression

    A computational procedure for functional characterization of potential marker genes from molecular data: Alzheimer's as a case study

    Get PDF
    Abstract Background A molecular characterization of Alzheimer's Disease (AD) is the key to the identification of altered gene sets that lead to AD progression. We rely on the assumption that candidate marker genes for a given disease belong to specific pathogenic pathways, and we aim at unveiling those pathways stable across tissues, treatments and measurement systems. In this context, we analyzed three heterogeneous datasets, two microarray gene expression sets and one protein abundance set, applying a recently proposed feature selection method based on regularization. Results For each dataset we identified a signature that was successively evaluated both from the computational and functional characterization viewpoints, estimating the classification error and retrieving the most relevant biological knowledge from different repositories. Each signature includes genes already known to be related to AD and genes that are likely to be involved in the pathogenesis or in the disease progression. The integrated analysis revealed a meaningful overlap at the functional level. Conclusions The identification of three gene signatures showing a relevant overlap of pathways and ontologies, increases the likelihood of finding potential marker genes for AD.</p

    Neuronal hemoglobin affects dopaminergic cells' response to stress

    Get PDF
    Hemoglobin (Hb) is the major protein in erythrocytes and carries oxygen (O2) throughout the body. Recently, Hb has been found synthesized in atypical sites, including the brain. Hb is highly expressed in A9 dopaminergic (DA) neurons of the substantia nigra (SN), whose selective degeneration leads to Parkinson's disease (PD). Here we show that Hb confers DA cells' susceptibility to 1-methyl-4-phenylpyridinium (MPP(+)) and rotenone, neurochemical cellular models of PD. The toxic property of Hb does not depend on O2 binding and is associated with insoluble aggregate formation in the nucleolus. Neurochemical stress induces epigenetic modifications, nucleolar alterations and autophagy inhibition that depend on Hb expression. When adeno-associated viruses carrying \u3b1- and \u3b2-chains of Hb are stereotaxically injected into mouse SN, Hb forms aggregates and causes motor learning impairment. These results position Hb as a potential player in DA cells' homeostasis and dysfunction in PD. Copyright The Author(s) 201

    Distinct Transcriptome Expression of the Temporal Cortex of the Primate Microcebus murinus during Brain Aging versus Alzheimer's Disease-Like Pathology

    Get PDF
    Aging is the primary risk factor of neurodegenerative disorders such as Alzheimer's disease (AD). However, the molecular events occurring during brain aging are extremely complex and still largely unknown. For a better understanding of these age-associated modifications, animal models as close as possible to humans are needed. We thus analyzed the transcriptome of the temporal cortex of the primate Microcebus murinus using human oligonucleotide microarrays (Affymetrix). Gene expression profiles were assessed in the temporal cortex of 6 young adults, 10 healthy old animals and 2 old, “AD-like” animals that presented ß-amyloid plaques and cortical atrophy, which are pathognomonic signs of AD in humans. Gene expression data of the 14,911 genes that were detected in at least 3 samples were analyzed. By SAM (significance analysis of microarrays), we identified 47 genes that discriminated young from healthy old and “AD-like” animals. These findings were confirmed by principal component analysis (PCA). ANOVA of the expression data from the three groups identified 695 genes (including the 47 genes previously identified by SAM and PCA) with significant changes of expression in old and “AD-like” in comparison to young animals. About one third of these genes showed similar changes of expression in healthy aging and in “AD-like” animals, whereas more than two thirds showed opposite changes in these two groups in comparison to young animals. Hierarchical clustering analysis of the 695 markers indicated that each group had distinct expression profiles which characterized each group, especially the “AD-like” group. Functional categorization showed that most of the genes that were up-regulated in healthy old animals and down-regulated in “AD-like” animals belonged to metabolic pathways, particularly protein synthesis. These data suggest the existence of compensatory mechanisms during physiological brain aging that disappear in “AD-like” animals. These results open the way to new exploration of physiological and “AD-like” aging in primates
    corecore