Topic Similarity Networks: Visual Analytics for Large Document Sets
We investigate ways in which to improve the interpretability of LDA topic
models by better analyzing and visualizing their outputs. We focus on examining
what we refer to as topic similarity networks: graphs in which nodes represent
latent topics in text collections and links represent similarity among topics.
We describe efficient and effective approaches to both building and labeling
such networks. Visualizations of topic models based on these networks are shown
to be a powerful means of exploring, characterizing, and summarizing large
collections of unstructured text documents. They help to "tease out"
non-obvious connections among different sets of documents and provide insights
into how topics form larger themes. We demonstrate the efficacy and
practicality of these approaches through two case studies: 1) NSF grants for
basic research spanning a 14 year period and 2) the entire English portion of
Wikipedia.
Comment: 9 pages; 2014 IEEE International Conference on Big Data (IEEE BigData 2014)
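As a rough illustration of the idea of a topic similarity network, the sketch below links topics whose word distributions are similar. It is not the authors' method: the cosine measure, the `threshold` value, the function names, and the toy distributions are all assumptions chosen for brevity.

```python
import math


def cosine(p, q):
    """Cosine similarity between two equal-length weight vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm_p = math.sqrt(sum(a * a for a in p))
    norm_q = math.sqrt(sum(b * b for b in q))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def build_topic_network(topic_word, threshold=0.5):
    """Return edges (i, j, similarity) for topic pairs at or above threshold.

    topic_word: list of per-topic word distributions, e.g. rows of an LDA
    topic-word matrix. The threshold is an assumed, tunable cut-off.
    """
    edges = []
    for i in range(len(topic_word)):
        for j in range(i + 1, len(topic_word)):
            sim = cosine(topic_word[i], topic_word[j])
            if sim >= threshold:
                edges.append((i, j, sim))
    return edges


# Hypothetical toy distributions over a four-word vocabulary.
topics = [
    [0.7, 0.2, 0.1, 0.0],
    [0.6, 0.3, 0.1, 0.0],
    [0.0, 0.1, 0.2, 0.7],
]
edges = build_topic_network(topics, threshold=0.5)
```

Here only topics 0 and 1 end up linked, since they share most of their probability mass on the same words. The all-pairs loop is quadratic in the number of topics, so a large model would likely need a cheaper neighbour search.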
Variation in the organization and subunit composition of the mammalian pyruvate dehydrogenase complex E2/E3BP core assembly
Crucial to glucose homoeostasis in humans, the hPDC (human pyruvate dehydrogenase complex) is a massive molecular machine comprising multiple copies of three distinct enzymes (E1–E3) and an accessory subunit, E3BP (E3-binding protein). Its icosahedral E2/E3BP 60-meric ‘core’ provides the central structural and mechanistic framework ensuring favourable E1 and E3 positioning and enzyme co-operativity. Current core models indicate either a 48E2+12E3BP or a 40E2+20E3BP subunit composition. In the present study, we demonstrate clear differences in subunit content and organization between the recombinant hPDC core (rhPDC; 40E2+20E3BP), generated under defined conditions where E3BP is produced in excess, and its native bovine (48E2+12E3BP) counterpart. The results of the present study provide a rational basis for resolving apparent differences between previous models, both obtained using rhE2/E3BP core assemblies where no account was taken of relative E2 and E3BP expression levels. Mathematical modelling predicts that an ‘average’ 48E2+12E3BP core arrangement allows maximum flexibility in assembly, while providing the appropriate balance of bound E1 and E3 enzymes for optimal catalytic efficiency and regulatory fine-tuning. We also show that the rhE2/E3BP and bovine E2/E3BP cores bind E3s with a 2:1 stoichiometry, and propose that mammalian PDC comprises a heterogeneous population of assemblies incorporating a network of E3 (and possibly E1) cross-bridges above the core surface. This work was partly supported by EPSRC (under grants GR/R99393/01 and EP/C015452/1).
Epidemic Information Diffusion: A Simple Solution to Support Community-based Recommendations in P2P Overlays
Epidemic protocols have proved to be very efficient solutions for supporting
dynamic and complex information diffusion in highly distributed computing
infrastructures, like P2P environments. They are useful building blocks for
constructing and maintaining virtual network topologies, in the form of overlay
networks, as well as for supporting pervasive diffusion of information when it
is injected into the network. This paper proposes a simple architecture that
exploits the features of epidemic approaches to foster a collaborative
percolation of information between computing nodes belonging to the network,
aimed at building a system that groups similar users and spreads useful
information among them.
Comment: 8 pages, 2 figures
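To make the notion of epidemic diffusion concrete, here is a minimal push-gossip simulation. It is a generic sketch, not the architecture proposed in the paper: it assumes every node can contact any other node uniformly at random, and the `fanout`, `seed`, and `max_rounds` parameters are hypothetical. The community-grouping step described in the abstract is not modelled.

```python
import random


def push_gossip(num_nodes, fanout=2, seed=0, max_rounds=100):
    """Simulate push-style epidemic dissemination from node 0.

    In each round, every informed node pushes the item to `fanout` peers
    drawn uniformly at random (an assumed, idealised peer-sampling service).
    Returns (rounds_run, number_of_informed_nodes).
    """
    rng = random.Random(seed)
    informed = {0}
    rounds = 0
    while len(informed) < num_nodes and rounds < max_rounds:
        newly_informed = set()
        for _node in informed:
            # A node may occasionally pick itself; that push is simply wasted.
            for peer in rng.sample(range(num_nodes), fanout):
                newly_informed.add(peer)
        informed |= newly_informed
        rounds += 1
    return rounds, len(informed)


rounds, reached = push_gossip(num_nodes=50)
```

The number of informed nodes grows roughly geometrically in the early rounds, which is why such protocols are commonly described as reaching all nodes in a number of rounds logarithmic in the network size.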
Corporate Social Responsibility and the Environment: A Theoretical Perspective
We survey the growing theoretical literature on the motives for and welfare effects of corporate greening. We show how both market and political forces are making environmental CSR profitable, and we also discuss morally-motivated or altruistic CSR. Welfare effects of CSR are subtle and situation-contingent, and there is no guarantee that CSR enhances social welfare. We identify numerous areas in which additional theoretical work is needed.
Keywords: corporate social responsibility, environment, self-regulation, preemption, private politics
MINTmap: fast and exhaustive profiling of nuclear and mitochondrial tRNA fragments from short RNA-seq data.
Transfer RNA fragments (tRFs) are an established class of constitutive regulatory molecules that arise from precursor and mature tRNAs. RNA deep sequencing (RNA-seq) has greatly facilitated the study of tRFs. However, the repeat nature of the tRNA templates and the idiosyncrasies of tRNA sequences necessitate the development and use of methodologies that differ markedly from those used to analyze RNA-seq data when studying microRNAs (miRNAs) or messenger RNAs (mRNAs). Here we present MINTmap (for MItochondrial and Nuclear TRF mapping), a method and a software package that was developed specifically for the quick, deterministic and exhaustive identification of tRFs in short RNA-seq datasets. In addition to identifying them, MINTmap is able to unambiguously calculate and report both raw and normalized abundances for the discovered tRFs. Furthermore, to ensure specificity, MINTmap identifies the subset of discovered tRFs that could originate outside of tRNA space and flags them as candidate false positives. Our comparative analysis shows that MINTmap exhibits superior sensitivity and specificity to other available methods while also being exceptionally fast. The MINTmap codes are available through https://github.com/TJU-CMC-Org/MINTmap/ under an open source GNU GPL v3.0 license.
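The reporting of raw and normalized abundances can be pictured with the toy sketch below. It is not MINTmap's code or interface: the exact-match lookup, the reads-per-million normalization over all input reads, the `profile_trfs` name, and the sequences are assumptions made for illustration only.

```python
from collections import Counter


def profile_trfs(reads, trf_table):
    """Count reads that exactly match a known tRF sequence.

    reads: list of read sequences (strings).
    trf_table: hypothetical lookup mapping each candidate tRF sequence to
        True if it is found only in tRNA space, False otherwise.
    Returns a dict: sequence -> raw count, reads per million (RPM), and a
    flag marking candidate false positives.
    """
    raw_counts = Counter(read for read in reads if read in trf_table)
    total_reads = len(reads)  # assumed denominator for normalization
    return {
        seq: {
            "raw": count,
            "rpm": count * 1_000_000 / total_reads,
            "candidate_false_positive": not trf_table[seq],
        }
        for seq, count in raw_counts.items()
    }


# Hypothetical toy data, far shorter than real reads.
toy_reads = ["AAAC", "AAAC", "GGGT", "TTTT"]
toy_table = {"AAAC": True, "GGGT": False}
profile = profile_trfs(toy_reads, toy_table)
```

Because each read is resolved by a dictionary lookup rather than an alignment, the result is deterministic and the cost is linear in the number of reads, which is one plausible way to obtain the speed the abstract describes.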
The Cinderella Complex: Word Embeddings Reveal Gender Stereotypes in Movies and Books
Our analysis of thousands of movies and books reveals how these cultural
products weave stereotypical gender roles into morality tales and perpetuate
gender inequality through storytelling. Using word embedding techniques, we
reveal the constructed emotional dependency of female characters on male
characters in stories.