716 research outputs found
Human cancers over express genes that are specific to a variety of normal human tissues
We have analyzed gene expression data from 3 different kinds of samples:
normal human tissues, human cancer cell lines and leukemic cells from lymphoid
and myeloid leukemia pediatric patients. We have searched for genes that are
over expressed in human cancer and also show specific patterns of
tissue-dependent expression in normal tissues. Using the expression data of the
normal tissues we identified 4346 genes with a high variability of expression,
and clustered these genes according to their relative expression level. Of 91
stable clusters obtained, 24 clusters included genes preferentially expressed
either only in hematopoietic tissues or in hematopoietic and 1-2 other tissues;
28 clusters included genes preferentially expressed in various
non-hematopoietic tissues such as neuronal, testis, liver, kidney, muscle,
lung, pancreas and placenta. Analysis of the expression levels of these 2
groups of genes in the human cancer cell lines and leukemias, identified genes
that were highly expressed in cancer cells but not in their normal
counterparts, and were thus over expressed in the cancers. The different cancer
cell lines and leukemias varied in the number and identity of these over
expressed genes. The results indicate that many genes that are over expressed
in human cancer cells are specific to a variety of normal tissues, including
normal tissues other than those from which the cancer originated. It is
suggested that this general property of cancer cells plays a major role in
determining the behavior of the cancers, including their metastatic potential.Comment: To appear in PNA
A Minimum-Labeling Approach for Reconstructing Protein Networks across Multiple Conditions
The sheer amounts of biological data that are generated in recent years have
driven the development of network analysis tools to facilitate the
interpretation and representation of these data. A fundamental challenge in
this domain is the reconstruction of a protein-protein subnetwork that
underlies a process of interest from a genome-wide screen of associated genes.
Despite intense work in this area, current algorithmic approaches are largely
limited to analyzing a single screen and are, thus, unable to account for
information on condition-specific genes, or reveal the dynamics (over time or
condition) of the process in question. Here we propose a novel formulation for
network reconstruction from multiple-condition data and devise an efficient
integer program solution for it. We apply our algorithm to analyze the response
to influenza infection in humans over time as well as to analyze a pair of ER
export related screens in humans. By comparing to an extant, single-condition
tool we demonstrate the power of our new approach in integrating data from
multiple conditions in a compact and coherent manner, capturing the dynamics of
the underlying processes.Comment: Peer-reviewed and presented as part of the 13th Workshop on
Algorithms in Bioinformatics (WABI2013
Cycle-centrality in complex networks
Networks are versatile representations of the interactions between entities
in complex systems. Cycles on such networks represent feedback processes which
play a central role in system dynamics. In this work, we introduce a measure of
the importance of any individual cycle, as the fraction of the total
information flow of the network passing through the cycle. This measure is
computationally cheap, numerically well-conditioned, induces a centrality
measure on arbitrary subgraphs and reduces to the eigenvector centrality on
vertices. We demonstrate that this measure accurately reflects the impact of
events on strategic ensembles of economic sectors, notably in the US economy.
As a second example, we show that in the protein-interaction network of the
plant Arabidopsis thaliana, a model based on cycle-centrality better accounts
for pathogen activity than the state-of-art one. This translates into
pathogen-targeted-proteins being concentrated in a small number of triads with
high cycle-centrality. Algorithms for computing the centrality of cycles and
subgraphs are available for download
Induction in myeloid leukemic cells of genes that are expressed in different normal tissues
Using DNA microarray and cluster analysis of expressed genes in a cloned line
(M1-t-p53) of myeloid leukemic cells, we have analyzed the expression of genes
that are preferentially expressed in different normal tissues. Clustering of
547 highly expressed genes in these leukemic cells showed 38 genes
preferentially expressed in normal hematopoietic tissues and 122 other genes
preferentially expressed in different normal non-hematopoietic tissues
including neuronal tissues, muscle, liver and testis. We have also analyzed the
genes whose expression in the leukemic cells changed after activation of
wild-type p53 and treatment with the cytokine interleukin 6 (IL-6) or the
calcium mobilizer thapsigargin (TG). Out of 620 such genes in the leukemic
cells that were differentially expressed in normal tissues, clustering showed
80 genes that were preferentially expressed in hematopoietic tissues and 132
genes in different normal non-hematopietic tissues that also included neuronal
tissues, muscle, liver and testis. Activation of p53 and treatment with IL-6 or
TG induced different changes in the genes preferentially expressed in these
normal tissues. These myeloid leukemic cells thus express genes that are
expressed in normal non-hematopoietic tissues, and various treatments can
reprogram these cells to induce other such non-hematopoietic genes. The results
indicate that these leukemic cells share with normal hematopoietic stem cells
the plasticity of differentiation to different cell types. It is suggested that
this reprogramming to induce in malignant cells genes that are expressed in
different normal tissues may be of clinical value in therapy
CRISPR-Cas9 screens in human cells and primary neurons identify modifiers of C9ORF72 dipeptide-repeat-protein toxicity.
Hexanucleotide-repeat expansions in the C9ORF72 gene are the most common cause of amyotrophic lateral sclerosis and frontotemporal dementia (c9ALS/FTD). The nucleotide-repeat expansions are translated into dipeptide-repeat (DPR) proteins, which are aggregation prone and may contribute to neurodegeneration. We used the CRISPR-Cas9 system to perform genome-wide gene-knockout screens for suppressors and enhancers of C9ORF72 DPR toxicity in human cells. We validated hits by performing secondary CRISPR-Cas9 screens in primary mouse neurons. We uncovered potent modifiers of DPR toxicity whose gene products function in nucleocytoplasmic transport, the endoplasmic reticulum (ER), proteasome, RNA-processing pathways, and chromatin modification. One modifier, TMX2, modulated the ER-stress signature elicited by C9ORF72 DPRs in neurons and improved survival of human induced motor neurons from patients with C9ORF72 ALS. Together, our results demonstrate the promise of CRISPR-Cas9 screens in defining mechanisms of neurodegenerative diseases
Compounds from an Unbiased Chemical Screen Reverse Both Er-to-Golgi Trafficking Defects and Mitochondrial Dysfunction in Parkinson's Disease Models
α-Synuclein (α-syn) is a small lipid-binding protein involved in vesicle trafficking whose function is poorly characterized. It is of great interest to human biology and medicine because α-syn dysfunction is associated with several neurodegenerative disorders, including Parkinson’s disease (PD). We previously created a yeast model of α-syn pathobiology, which established vesicle trafficking as a process that is particularly sensitive to α-syn expression. We also uncovered a core group of proteins with diverse activities related to α-syn toxicity that is conserved from yeast to mammalian neurons. Here, we report that a yeast strain expressing a somewhat higher level of α-syn also exhibits strong defects in mitochondrial function. Unlike our previous strain, genetic suppression of endoplasmic reticulum (ER)-to-Golgi trafficking alone does not suppress α-syn toxicity in this strain. In an effort to identify individual compounds that could simultaneously rescue these apparently disparate pathological effects of α-syn, we screened a library of 115,000 compounds. We identified a class of small molecules that reduced α-syn toxicity at micromolar concentrations in this higher toxicity strain. These compounds reduced the formation of α-syn foci, re-established ER-to-Golgi trafficking and ameliorated α-syn-mediated damage to mitochondria. They also corrected the toxicity of α-syn in nematode neurons and in primary rat neuronal midbrain cultures. Remarkably, the compounds also protected neurons against rotenone-induced toxicity, which has been used to model the mitochondrial defects associated with PD in humans. That single compounds are capable of rescuing the diverse toxicities of α-syn in yeast and neurons suggests that they are acting on deeply rooted biological processes that connect these toxicities and have been conserved for a billion years of eukaryotic evolution. Thus, it seems possible to develop novel therapeutic strategies to simultaneously target the multiple pathological features of PD.MGH/MIT Morris Udall Center of Excellence in Parkinson Disease Research (NS038372)Michael J. Fox Foundation for Parkinson's ResearchHoward Hughes Medical InstituteUnited States. National Institutes of Health (NS049221)American Parkinson Disease Association, Inc
Genome-wide differentiation in closely related populations: the roles of selection and geographic isolation.
Population divergence in geographic isolation is due to a combination of factors. Natural and sexual selection may be important in shaping patterns of population differentiation, a pattern referred to as 'Isolation by Adaptation' (IBA). IBA can be complementary to the well-known pattern of 'Isolation by Distance' (IBD), in which the divergence of closely related populations (via any evolutionary process) is associated with geographic isolation. The barn swallow Hirundo rustica complex comprises six closely related subspecies, where divergent sexual selection is associated with phenotypic differentiation among allopatric populations. To investigate the relative contributions of selection and geographic distance to genome-wide differentiation, we compared genotypic and phenotypic variation from 350 barn swallows sampled across eight populations (28 pairwise comparisons) from four different subspecies. We report a draft whole genome sequence for H. rustica, to which we aligned a set of 9,493 single nucleotide polymorphisms (SNPs). Using statistical approaches to control for spatial autocorrelation of phenotypic variables and geographic distance, we find that divergence in traits related to migratory behavior and sexual signaling, as well as geographic distance together, explain over 70% of genome-wide divergence among populations. Controlling for IBD, we find 42% of genome-wide divergence is attributable to IBA through pairwise differences in traits related to migratory behavior and sexual signaling alone. By (i) combining these results with prior studies of how selection shapes morphological differentiation and (ii) accounting for spatial autocorrelation, we infer that morphological adaptation plays a large role in shaping population-level differentiation in this group of closely related populations. This article is protected by copyright. All rights reserved
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Compositionality has long been considered a key explanatory property
underlying human intelligence: arbitrary concepts can be composed into novel
complex combinations, permitting the acquisition of an open ended, potentially
infinite expressive capacity from finite learning experiences. Influential
arguments have held that neural networks fail to explain this aspect of
behavior, leading many to dismiss them as viable models of human cognition.
Over the last decade, however, modern deep neural networks (DNNs), which share
the same fundamental design principles as their predecessors, have come to
dominate artificial intelligence, exhibiting the most advanced cognitive
behaviors ever demonstrated in machines. In particular, large language models
(LLMs), DNNs trained to predict the next word on a large corpus of text, have
proven capable of sophisticated behaviors such as writing syntactically complex
sentences without grammatical errors, producing cogent chains of reasoning, and
even writing original computer programs -- all behaviors thought to require
compositional processing. In this chapter, we survey recent empirical work from
machine learning for a broad audience in philosophy, cognitive science, and
neuroscience, situating recent breakthroughs within the broader context of
philosophical arguments about compositionality. In particular, our review
emphasizes two approaches to endowing neural networks with compositional
generalization capabilities: (1) architectural inductive biases, and (2)
metalearning, or learning to learn. We also present findings suggesting that
LLM pretraining can be understood as a kind of metalearning, and can thereby
equip DNNs with compositional generalization abilities in a similar way. We
conclude by discussing the implications that these findings may have for the
study of compositionality in human cognition and by suggesting avenues for
future research.Comment: 32 pages (50 pages including references), 8 figure
The bottleneck may be the solution, not the problem
As a highly consequential biological trait, a memory \u201cbottleneck\u201d cannot escape selection pressures. It must therefore co-evolve with other cognitive mechanisms rather than act as an independent constraint. Recent theory and an implemented model of language acquisition suggest that a limit on working memory may evolve to help learning. Furthermore, it need not hamper the use of language for communication
Ratings of age of acquisition of 299 words across 25 languages: Is there a cross-linguistic order of words?
We present a new set of subjective age-of-acquisition (AoA) ratings for 299 words (158 nouns, 141 verbs) in 25 languages from five language families (Afro-Asiatic: Semitic languages; Altaic: one Turkic language: Indo-European: Baltic, Celtic, Germanic, Hellenic, Slavic, and Romance languages; Niger-Congo: one Bantu language; Uralic: Finnic and Ugric languages). Adult native speakers reported the age at which they had learned each word. We present a comparison of the AoA ratings across all languages by contrasting them in pairs. This comparison shows a consistency in the orders of ratings across the 25 languages. The data were then analyzed (1) to ascertain how the demographic characteristics of the participants influenced AoA estimations and (2) to assess differences caused by the exact form of the target question (when did you learn vs. when do children learn this word); (3) to compare the ratings obtained in our study to those of previous studies; and (4) to assess the validity of our study by comparison with quasi-objective AoA norms derived from the MacArthur–Bates Communicative Development Inventories (MB-CDI). All 299 words were judged as being acquired early (mostly before the age of 6 years). AoA ratings were associated with the raters’ social or language status, but not with the raters’ age or education. Parents reported words as being learned earlier, and bilinguals reported learning them later. Estimations of the age at which children learn the words revealed significantly lower ratings of AoA. Finally, comparisons with previous AoA and MB-CDI norms support the validity of the present estimations. Our AoA ratings are available for research or other purposes
- …
