1,165 research outputs found

    Deep Learning Relevance: Creating Relevant Information (as Opposed to Retrieving it)

    Full text link
    What if Information Retrieval (IR) systems did not just retrieve relevant information that is stored in their indices, but could also "understand" it and synthesise it into a single document? We present a preliminary study that makes a first step towards answering this question. Given a query, we train a Recurrent Neural Network (RNN) on existing relevant information to that query. We then use the RNN to "deep learn" a single, synthetic, and we assume, relevant document for that query. We design a crowdsourcing experiment to assess how relevant the "deep learned" document is, compared to existing relevant documents. Users are shown a query and four wordclouds (of three existing relevant documents and our deep learned synthetic document). The synthetic document is ranked on average most relevant of all.Comment: Neu-IR '16 SIGIR Workshop on Neural Information Retrieval, July 21, 2016, Pisa, Ital

    Entropy and Graph Based Modelling of Document Coherence using Discourse Entities: An Application

    Full text link
    We present two novel models of document coherence and their application to information retrieval (IR). Both models approximate document coherence using discourse entities, e.g. the subject or object of a sentence. Our first model views text as a Markov process generating sequences of discourse entities (entity n-grams); we use the entropy of these entity n-grams to approximate the rate at which new information appears in text, reasoning that as more new words appear, the topic increasingly drifts and text coherence decreases. Our second model extends the work of Guinaudeau & Strube [28] that represents text as a graph of discourse entities, linked by different relations, such as their distance or adjacency in text. We use several graph topology metrics to approximate different aspects of the discourse flow that can indicate coherence, such as the average clustering or betweenness of discourse entities in text. Experiments with several instantiations of these models show that: (i) our models perform on a par with two other well-known models of text coherence even without any parameter tuning, and (ii) reranking retrieval results according to their coherence scores gives notable performance gains, confirming a relation between document coherence and relevance. This work contributes two novel models of document coherence, the application of which to IR complements recent work in the integration of document cohesiveness or comprehensibility to ranking [5, 56]

    Near-optimal adjacency labeling scheme for power-law graphs

    Get PDF
    An adjacency labeling scheme is a method that assigns labels to the vertices of a graph such that adjacency between vertices can be inferred directly from the assigned label, without using a centralized data structure. We devise adjacency labeling schemes for the family of power-law graphs. This family that has been used to model many types of networks, e.g. the Internet AS-level graph. Furthermore, we prove an almost matching lower bound for this family. We also provide an asymptotically near- optimal labeling scheme for sparse graphs. Finally, we validate the efficiency of our labeling scheme by an experimental evaluation using both synthetic data and real-world networks of up to hundreds of thousands of vertices

    Near Optimal Adjacency Labeling Schemes for Power-Law Graphs

    Get PDF
    An adjacency labeling scheme labels the n nodes of a graph with bit strings in a way that allows, given the labels of two nodes, to determine adjacency based only on those bit strings. Though many graph families have been meticulously studied for this problem, a non-trivial labeling scheme for the important family of power-law graphs has yet to be obtained. This family is particularly useful for social and web networks as their underlying graphs are typically modelled as power-law graphs. Using simple strategies and a careful selection of a parameter, we show upper bounds for such labeling schemes of ~O(sqrt^{alpha}(n)) for power law graphs with coefficient alpha;, as well as nearly matching lower bounds. We also show two relaxations that allow for a label of logarithmic size, and extend the upper-bound technique to produce an improved distance labeling scheme for power-law graphs

    Dereplication Guided Discovery of Secondary Metabolites of Mixed Biosynthetic Origin from<i> Aspergillus aculeatus</i>

    Get PDF
    Investigation of the chemical profile of the industrially important black filamentous fungus Aspergillus aculeatus by UHPLC-DAD-HRMS and subsequent dereplication has led to the discovery of several novel compounds. Isolation and extensive 1D and 2D NMR spectroscopic analyses allowed for structural elucidation of a dioxomorpholine, a unique okaramine, an aflavinine and three novel structures of mixed biosynthetic origin, which we have named aculenes A–C. Moreover, known analogues of calbistrins, okaramines and secalonic acids were detected. All novel compounds were tested for antifungal activity against Candida albicans, however all showed only weak or no activity. Aspergillus aculeatus IBT 21030 was additionally shown to be capable of producing sclerotia. Examination of the sclerotia revealed a highly regulated production of metabolites in these morphological structures

    Pubertal development in healthy children is mirrored by DNA methylation patterns in peripheral blood

    Get PDF
    Puberty marks numerous physiological processes which are initiated by central activation of the hypothalamic–pituitary–gonadal axis, followed by development of secondary sexual characteristics. To a large extent, pubertal timing is heritable, but current knowledge of genetic polymorphisms only explains few months in the large inter-individual variation in the timing of puberty. We have analysed longitudinal genome-wide changes in DNA methylation in peripheral blood samples (n = 102) obtained from 51 healthy children before and after pubertal onset. We show that changes in single methylation sites are tightly associated with physiological pubertal transition and altered reproductive hormone levels. These methylation sites cluster in and around genes enriched for biological functions related to pubertal development. Importantly, we identified that methylation of the genomic region containing the promoter of TRIP6 was co-ordinately regulated as a function of pubertal development. In accordance, immunohistochemistry identified TRIP6 in adult, but not pre-pubertal, testicular Leydig cells and circulating TRIP6 levels doubled during puberty. Using elastic net prediction models, methylation patterns predicted pubertal development more accurately than chronological age. We demonstrate for the first time that pubertal attainment of secondary sexual characteristics is mirrored by changes in DNA methylation patterns in peripheral blood. Thus, modulations of the epigenome seem involved in regulation of the individual pubertal timing
    • …
    corecore