    Topic Similarity Networks: Visual Analytics for Large Document Sets

    We investigate ways to improve the interpretability of LDA topic models by better analyzing and visualizing their outputs. We focus on examining what we refer to as topic similarity networks: graphs in which nodes represent latent topics in text collections and links represent similarity among topics. We describe efficient and effective approaches to both building and labeling such networks. Visualizations of topic models based on these networks are shown to be a powerful means of exploring, characterizing, and summarizing large collections of unstructured text documents. They help to "tease out" non-obvious connections among different sets of documents and provide insights into how topics form larger themes. We demonstrate the efficacy and practicality of these approaches through two case studies: 1) NSF grants for basic research spanning a 14-year period and 2) the entire English portion of Wikipedia. Comment: 9 pages; 2014 IEEE International Conference on Big Data (IEEE BigData 2014).
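As a rough illustration of the idea of a topic similarity network, the sketch below treats each topic as a sparse word-weight vector and links two topics when their cosine similarity passes a threshold. The similarity measure, the threshold value, and the toy topics are assumptions made for illustration; the abstract does not say which measure or construction the authors use.

```python
import math
from itertools import combinations


def cosine(p, q):
    """Cosine similarity between two sparse word-weight dicts."""
    dot = sum(w * q.get(term, 0.0) for term, w in p.items())
    norm_p = math.sqrt(sum(w * w for w in p.values()))
    norm_q = math.sqrt(sum(w * w for w in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def build_topic_network(topics, threshold=0.3):
    """Return {(topic_a, topic_b): similarity} for pairs at or above threshold.

    `topics` maps a topic id to its word-weight dict. The threshold is a
    hypothetical tuning knob, not a value taken from the paper.
    """
    edges = {}
    for a, b in combinations(sorted(topics), 2):
        sim = cosine(topics[a], topics[b])
        if sim >= threshold:
            edges[(a, b)] = sim
    return edges


# Toy topics (made up): t0 and t1 share vocabulary, t2 does not.
topics = {
    "t0": {"protein": 0.5, "enzyme": 0.3, "cell": 0.2},
    "t1": {"enzyme": 0.4, "protein": 0.4, "binding": 0.2},
    "t2": {"network": 0.6, "graph": 0.4},
}
edges = build_topic_network(topics)
```

With these toy inputs only the pair t0 and t1 is linked, because t2 shares no words with the other two topics.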

    Variation in the organization and subunit composition of the mammalian pyruvate dehydrogenase complex E2/E3BP core assembly

    The final version of this article is available at the link below. Crucial to glucose homoeostasis in humans, the hPDC (human pyruvate dehydrogenase complex) is a massive molecular machine comprising multiple copies of three distinct enzymes (E1–E3) and an accessory subunit, E3BP (E3-binding protein). Its icosahedral E2/E3BP 60-meric ‘core’ provides the central structural and mechanistic framework ensuring favourable E1 and E3 positioning and enzyme co-operativity. Current core models indicate either a 48E2+12E3BP or a 40E2+20E3BP subunit composition. In the present study, we demonstrate clear differences in subunit content and organization between the recombinant hPDC core (rhPDC; 40E2+20E3BP), generated under defined conditions where E3BP is produced in excess, and its native bovine (48E2+12E3BP) counterpart. The results of the present study provide a rational basis for resolving apparent differences between previous models, both obtained using rhE2/E3BP core assemblies where no account was taken of relative E2 and E3BP expression levels. Mathematical modelling predicts that an ‘average’ 48E2+12E3BP core arrangement allows maximum flexibility in assembly, while providing the appropriate balance of bound E1 and E3 enzymes for optimal catalytic efficiency and regulatory fine-tuning. We also show that the rhE2/E3BP and bovine E2/E3BP cores bind E3s with a 2:1 stoichiometry, and propose that mammalian PDC comprises a heterogeneous population of assemblies incorporating a network of E3 (and possibly E1) cross-bridges above the core surface. This work was partly supported by EPSRC (under grants GR/R99393/01 and EP/C015452/1).

    Epidemic Information Diffusion: A Simple Solution to Support Community-based Recommendations in P2P Overlays

    Epidemic protocols have proved to be very efficient solutions for supporting dynamic and complex information diffusion in highly distributed computing infrastructures, such as P2P environments. They are useful building blocks for constructing and maintaining virtual network topologies in the form of overlay networks, as well as for supporting pervasive diffusion of information once it is injected into the network. This paper proposes a simple architecture that exploits the features of epidemic approaches to foster collaborative percolation of information between the computing nodes of the network, with the aim of building a system that groups similar users and spreads useful information among them. Comment: 8 pages, 2 figures.
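To make the notion of epidemic diffusion concrete, here is a minimal simulation of a generic push-style gossip round, in which every informed node forwards the item to a few randomly chosen peers. It is a textbook-style sketch and not the architecture proposed in the paper; the function name, the fanout, and the uniform peer selection are all assumptions.

```python
import random


def push_gossip(n_nodes, fanout=2, seed=0, max_rounds=100):
    """Simulate push gossip and return the informed-node count per round.

    Node 0 starts with the information. In each round every informed node
    pushes it to `fanout` peers drawn uniformly at random (assumed policy;
    a real overlay would draw peers from a local partial view).
    """
    rng = random.Random(seed)
    informed = {0}
    history = [len(informed)]
    for _ in range(max_rounds):
        if len(informed) == n_nodes:
            break
        newly_reached = set()
        for _node in informed:
            newly_reached.update(rng.sample(range(n_nodes), fanout))
        informed |= newly_reached
        history.append(len(informed))
    return history


history = push_gossip(n_nodes=50)
```

The returned list typically shows the characteristic epidemic curve: rapid growth while few nodes are informed, then a slower tail as the last uninformed nodes are reached.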

    Corporate Social Responsibility and the Environment: A Theoretical Perspective

    We survey the growing theoretical literature on the motives for and welfare effects of corporate greening. We show how both market and political forces are making environmental CSR profitable, and we also discuss morally-motivated or altruistic CSR. Welfare effects of CSR are subtle and situation-contingent, and there is no guarantee that CSR enhances social welfare. We identify numerous areas in which additional theoretical work is needed. Keywords: corporate social responsibility, environment, self-regulation, preemption, private politics.

    MINTmap: fast and exhaustive profiling of nuclear and mitochondrial tRNA fragments from short RNA-seq data.

    Transfer RNA fragments (tRFs) are an established class of constitutive regulatory molecules that arise from precursor and mature tRNAs. RNA deep sequencing (RNA-seq) has greatly facilitated the study of tRFs. However, the repeat nature of the tRNA templates and the idiosyncrasies of tRNA sequences necessitate the development and use of methodologies that differ markedly from those used to analyze RNA-seq data when studying microRNAs (miRNAs) or messenger RNAs (mRNAs). Here we present MINTmap (for MItochondrial and Nuclear TRF mapping), a method and a software package that was developed specifically for the quick, deterministic and exhaustive identification of tRFs in short RNA-seq datasets. In addition to identifying them, MINTmap is able to unambiguously calculate and report both raw and normalized abundances for the discovered tRFs. Furthermore, to ensure specificity, MINTmap identifies the subset of discovered tRFs that could originate outside of tRNA space and flags them as candidate false positives. Our comparative analysis shows that MINTmap exhibits superior sensitivity and specificity to other available methods while also being exceptionally fast. The MINTmap code is available through https://github.com/TJU-CMC-Org/MINTmap/ under an open source GNU GPL v3.0 license.
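The difference between raw and normalized abundances can be illustrated with a small sketch that counts reads matching a table of known fragment sequences exactly and reports reads per million (RPM). This is not MINTmap's code or interface: the function name, the exact-match lookup, the choice of total read count as the denominator, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter


def profile_fragments(reads, known_fragments):
    """Return {sequence: (raw_count, reads_per_million)} for matching reads.

    `known_fragments` is a hypothetical lookup table of fragment sequences.
    A read counts only if it matches a table entry exactly, which keeps the
    result deterministic. RPM here divides by the total number of input
    reads; other denominators are possible.
    """
    raw = Counter(read for read in reads if read in known_fragments)
    total = len(reads)
    return {seq: (count, count * 1_000_000 / total) for seq, count in raw.items()}


# Toy data (made up): four reads, two known fragment sequences.
known = {"ACGT": "frag-A", "GGCC": "frag-B"}
reads = ["ACGT", "ACGT", "GGCC", "TTTT"]
profile = profile_fragments(reads, known)
```

In this toy run the read "TTTT" is ignored because it is not in the lookup table, but it still counts toward the total used for normalization.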

    The Cinderella Complex: Word Embeddings Reveal Gender Stereotypes in Movies and Books

    Our analysis of thousands of movies and books reveals how these cultural products weave stereotypical gender roles into morality tales and perpetuate gender inequality through storytelling. Using word embedding techniques, we reveal the constructed emotional dependency of female characters on male characters in stories.