716 research outputs found

    Human cancers over express genes that are specific to a variety of normal human tissues

    Full text link
    We have analyzed gene expression data from 3 different kinds of samples: normal human tissues, human cancer cell lines and leukemic cells from lymphoid and myeloid leukemia pediatric patients. We have searched for genes that are over expressed in human cancer and also show specific patterns of tissue-dependent expression in normal tissues. Using the expression data of the normal tissues we identified 4346 genes with a high variability of expression, and clustered these genes according to their relative expression level. Of 91 stable clusters obtained, 24 clusters included genes preferentially expressed either only in hematopoietic tissues or in hematopoietic and 1-2 other tissues; 28 clusters included genes preferentially expressed in various non-hematopoietic tissues such as neuronal, testis, liver, kidney, muscle, lung, pancreas and placenta. Analysis of the expression levels of these 2 groups of genes in the human cancer cell lines and leukemias, identified genes that were highly expressed in cancer cells but not in their normal counterparts, and were thus over expressed in the cancers. The different cancer cell lines and leukemias varied in the number and identity of these over expressed genes. The results indicate that many genes that are over expressed in human cancer cells are specific to a variety of normal tissues, including normal tissues other than those from which the cancer originated. It is suggested that this general property of cancer cells plays a major role in determining the behavior of the cancers, including their metastatic potential.Comment: To appear in PNA

    A Minimum-Labeling Approach for Reconstructing Protein Networks across Multiple Conditions

    Get PDF
    The sheer amounts of biological data that are generated in recent years have driven the development of network analysis tools to facilitate the interpretation and representation of these data. A fundamental challenge in this domain is the reconstruction of a protein-protein subnetwork that underlies a process of interest from a genome-wide screen of associated genes. Despite intense work in this area, current algorithmic approaches are largely limited to analyzing a single screen and are, thus, unable to account for information on condition-specific genes, or reveal the dynamics (over time or condition) of the process in question. Here we propose a novel formulation for network reconstruction from multiple-condition data and devise an efficient integer program solution for it. We apply our algorithm to analyze the response to influenza infection in humans over time as well as to analyze a pair of ER export related screens in humans. By comparing to an extant, single-condition tool we demonstrate the power of our new approach in integrating data from multiple conditions in a compact and coherent manner, capturing the dynamics of the underlying processes.Comment: Peer-reviewed and presented as part of the 13th Workshop on Algorithms in Bioinformatics (WABI2013

    Cycle-centrality in complex networks

    Full text link
    Networks are versatile representations of the interactions between entities in complex systems. Cycles on such networks represent feedback processes which play a central role in system dynamics. In this work, we introduce a measure of the importance of any individual cycle, as the fraction of the total information flow of the network passing through the cycle. This measure is computationally cheap, numerically well-conditioned, induces a centrality measure on arbitrary subgraphs and reduces to the eigenvector centrality on vertices. We demonstrate that this measure accurately reflects the impact of events on strategic ensembles of economic sectors, notably in the US economy. As a second example, we show that in the protein-interaction network of the plant Arabidopsis thaliana, a model based on cycle-centrality better accounts for pathogen activity than the state-of-art one. This translates into pathogen-targeted-proteins being concentrated in a small number of triads with high cycle-centrality. Algorithms for computing the centrality of cycles and subgraphs are available for download

    Induction in myeloid leukemic cells of genes that are expressed in different normal tissues

    Full text link
    Using DNA microarray and cluster analysis of expressed genes in a cloned line (M1-t-p53) of myeloid leukemic cells, we have analyzed the expression of genes that are preferentially expressed in different normal tissues. Clustering of 547 highly expressed genes in these leukemic cells showed 38 genes preferentially expressed in normal hematopoietic tissues and 122 other genes preferentially expressed in different normal non-hematopoietic tissues including neuronal tissues, muscle, liver and testis. We have also analyzed the genes whose expression in the leukemic cells changed after activation of wild-type p53 and treatment with the cytokine interleukin 6 (IL-6) or the calcium mobilizer thapsigargin (TG). Out of 620 such genes in the leukemic cells that were differentially expressed in normal tissues, clustering showed 80 genes that were preferentially expressed in hematopoietic tissues and 132 genes in different normal non-hematopietic tissues that also included neuronal tissues, muscle, liver and testis. Activation of p53 and treatment with IL-6 or TG induced different changes in the genes preferentially expressed in these normal tissues. These myeloid leukemic cells thus express genes that are expressed in normal non-hematopoietic tissues, and various treatments can reprogram these cells to induce other such non-hematopoietic genes. The results indicate that these leukemic cells share with normal hematopoietic stem cells the plasticity of differentiation to different cell types. It is suggested that this reprogramming to induce in malignant cells genes that are expressed in different normal tissues may be of clinical value in therapy

    CRISPR-Cas9 screens in human cells and primary neurons identify modifiers of C9ORF72 dipeptide-repeat-protein toxicity.

    Get PDF
    Hexanucleotide-repeat expansions in the C9ORF72 gene are the most common cause of amyotrophic lateral sclerosis and frontotemporal dementia (c9ALS/FTD). The nucleotide-repeat expansions are translated into dipeptide-repeat (DPR) proteins, which are aggregation prone and may contribute to neurodegeneration. We used the CRISPR-Cas9 system to perform genome-wide gene-knockout screens for suppressors and enhancers of C9ORF72 DPR toxicity in human cells. We validated hits by performing secondary CRISPR-Cas9 screens in primary mouse neurons. We uncovered potent modifiers of DPR toxicity whose gene products function in nucleocytoplasmic transport, the endoplasmic reticulum (ER), proteasome, RNA-processing pathways, and chromatin modification. One modifier, TMX2, modulated the ER-stress signature elicited by C9ORF72 DPRs in neurons and improved survival of human induced motor neurons from patients with C9ORF72 ALS. Together, our results demonstrate the promise of CRISPR-Cas9 screens in defining mechanisms of neurodegenerative diseases

    Compounds from an Unbiased Chemical Screen Reverse Both Er-to-Golgi Trafficking Defects and Mitochondrial Dysfunction in Parkinson's Disease Models

    Get PDF
    α-Synuclein (α-syn) is a small lipid-binding protein involved in vesicle trafficking whose function is poorly characterized. It is of great interest to human biology and medicine because α-syn dysfunction is associated with several neurodegenerative disorders, including Parkinson’s disease (PD). We previously created a yeast model of α-syn pathobiology, which established vesicle trafficking as a process that is particularly sensitive to α-syn expression. We also uncovered a core group of proteins with diverse activities related to α-syn toxicity that is conserved from yeast to mammalian neurons. Here, we report that a yeast strain expressing a somewhat higher level of α-syn also exhibits strong defects in mitochondrial function. Unlike our previous strain, genetic suppression of endoplasmic reticulum (ER)-to-Golgi trafficking alone does not suppress α-syn toxicity in this strain. In an effort to identify individual compounds that could simultaneously rescue these apparently disparate pathological effects of α-syn, we screened a library of 115,000 compounds. We identified a class of small molecules that reduced α-syn toxicity at micromolar concentrations in this higher toxicity strain. These compounds reduced the formation of α-syn foci, re-established ER-to-Golgi trafficking and ameliorated α-syn-mediated damage to mitochondria. They also corrected the toxicity of α-syn in nematode neurons and in primary rat neuronal midbrain cultures. Remarkably, the compounds also protected neurons against rotenone-induced toxicity, which has been used to model the mitochondrial defects associated with PD in humans. That single compounds are capable of rescuing the diverse toxicities of α-syn in yeast and neurons suggests that they are acting on deeply rooted biological processes that connect these toxicities and have been conserved for a billion years of eukaryotic evolution. Thus, it seems possible to develop novel therapeutic strategies to simultaneously target the multiple pathological features of PD.MGH/MIT Morris Udall Center of Excellence in Parkinson Disease Research (NS038372)Michael J. Fox Foundation for Parkinson's ResearchHoward Hughes Medical InstituteUnited States. National Institutes of Health (NS049221)American Parkinson Disease Association, Inc

    Genome-wide differentiation in closely related populations: the roles of selection and geographic isolation.

    Get PDF
    Population divergence in geographic isolation is due to a combination of factors. Natural and sexual selection may be important in shaping patterns of population differentiation, a pattern referred to as 'Isolation by Adaptation' (IBA). IBA can be complementary to the well-known pattern of 'Isolation by Distance' (IBD), in which the divergence of closely related populations (via any evolutionary process) is associated with geographic isolation. The barn swallow Hirundo rustica complex comprises six closely related subspecies, where divergent sexual selection is associated with phenotypic differentiation among allopatric populations. To investigate the relative contributions of selection and geographic distance to genome-wide differentiation, we compared genotypic and phenotypic variation from 350 barn swallows sampled across eight populations (28 pairwise comparisons) from four different subspecies. We report a draft whole genome sequence for H. rustica, to which we aligned a set of 9,493 single nucleotide polymorphisms (SNPs). Using statistical approaches to control for spatial autocorrelation of phenotypic variables and geographic distance, we find that divergence in traits related to migratory behavior and sexual signaling, as well as geographic distance together, explain over 70% of genome-wide divergence among populations. Controlling for IBD, we find 42% of genome-wide divergence is attributable to IBA through pairwise differences in traits related to migratory behavior and sexual signaling alone. By (i) combining these results with prior studies of how selection shapes morphological differentiation and (ii) accounting for spatial autocorrelation, we infer that morphological adaptation plays a large role in shaping population-level differentiation in this group of closely related populations. This article is protected by copyright. All rights reserved

    From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks

    Full text link
    Compositionality has long been considered a key explanatory property underlying human intelligence: arbitrary concepts can be composed into novel complex combinations, permitting the acquisition of an open ended, potentially infinite expressive capacity from finite learning experiences. Influential arguments have held that neural networks fail to explain this aspect of behavior, leading many to dismiss them as viable models of human cognition. Over the last decade, however, modern deep neural networks (DNNs), which share the same fundamental design principles as their predecessors, have come to dominate artificial intelligence, exhibiting the most advanced cognitive behaviors ever demonstrated in machines. In particular, large language models (LLMs), DNNs trained to predict the next word on a large corpus of text, have proven capable of sophisticated behaviors such as writing syntactically complex sentences without grammatical errors, producing cogent chains of reasoning, and even writing original computer programs -- all behaviors thought to require compositional processing. In this chapter, we survey recent empirical work from machine learning for a broad audience in philosophy, cognitive science, and neuroscience, situating recent breakthroughs within the broader context of philosophical arguments about compositionality. In particular, our review emphasizes two approaches to endowing neural networks with compositional generalization capabilities: (1) architectural inductive biases, and (2) metalearning, or learning to learn. We also present findings suggesting that LLM pretraining can be understood as a kind of metalearning, and can thereby equip DNNs with compositional generalization abilities in a similar way. We conclude by discussing the implications that these findings may have for the study of compositionality in human cognition and by suggesting avenues for future research.Comment: 32 pages (50 pages including references), 8 figure

    The bottleneck may be the solution, not the problem

    Get PDF
    As a highly consequential biological trait, a memory \u201cbottleneck\u201d cannot escape selection pressures. It must therefore co-evolve with other cognitive mechanisms rather than act as an independent constraint. Recent theory and an implemented model of language acquisition suggest that a limit on working memory may evolve to help learning. Furthermore, it need not hamper the use of language for communication

    Ratings of age of acquisition of 299 words across 25 languages: Is there a cross-linguistic order of words?

    Get PDF
    We present a new set of subjective age-of-acquisition (AoA) ratings for 299 words (158 nouns, 141 verbs) in 25 languages from five language families (Afro-Asiatic: Semitic languages; Altaic: one Turkic language: Indo-European: Baltic, Celtic, Germanic, Hellenic, Slavic, and Romance languages; Niger-Congo: one Bantu language; Uralic: Finnic and Ugric languages). Adult native speakers reported the age at which they had learned each word. We present a comparison of the AoA ratings across all languages by contrasting them in pairs. This comparison shows a consistency in the orders of ratings across the 25 languages. The data were then analyzed (1) to ascertain how the demographic characteristics of the participants influenced AoA estimations and (2) to assess differences caused by the exact form of the target question (when did you learn vs. when do children learn this word); (3) to compare the ratings obtained in our study to those of previous studies; and (4) to assess the validity of our study by comparison with quasi-objective AoA norms derived from the MacArthur–Bates Communicative Development Inventories (MB-CDI). All 299 words were judged as being acquired early (mostly before the age of 6 years). AoA ratings were associated with the raters’ social or language status, but not with the raters’ age or education. Parents reported words as being learned earlier, and bilinguals reported learning them later. Estimations of the age at which children learn the words revealed significantly lower ratings of AoA. Finally, comparisons with previous AoA and MB-CDI norms support the validity of the present estimations. Our AoA ratings are available for research or other purposes
    corecore