146 research outputs found

    Probabilistic approaches for modeling text structure and their application to text-to-text generation

    Get PDF
    Since the early days of generation research, it has been acknowledged that modeling the global structure of a document is crucial for producing coherent, readable output. However, traditional knowledge-intensive approaches have been of limited utility in addressing this problem since they cannot be effectively scaled to operate in domain-independent, large-scale applications. Due to this difficulty, existing text-to-text generation systems rarely rely on such structural information when producing an output text. Consequently, texts generated by these methods do not match the quality of those written by humans – they are often fraught with severe coherence violations and disfluencies. In this chapter, I will present probabilistic models of document structure that can be effectively learned from raw document collections. This feature distinguishes these new models from traditional knowledge intensive approaches used in symbolic concept-to-text generation. Our results demonstrate that these probabilistic models can be directly applied to content organization, and suggest that these models can prove useful in an even broader range of text-to-text applications than we have considered here.National Science Foundation (U.S.) (CAREER grant IIS- 0448168)Microsoft Research. New Faculty Fellowshi

    A Sex-Specific Metabolite Identified in a Marine Invertebrate Utilizing Phosphorus-31 Nuclear Magnetic Resonance

    Get PDF
    Hormone level differences are generally accepted as the primary cause for sexual dimorphism in animal and human development. Levels of low molecular weight metabolites also differ between men and women in circulating amino acids, lipids and carbohydrates and within brain tissue. While investigating the metabolism of blue crab tissues using Phosphorus-31 Nuclear Magnetic Resonance, we discovered that only the male blue crab (Callinectes sapidus) contained a phosphorus compound with a chemical shift well separated from the expected phosphate compounds. Spectra obtained from male gills were readily differentiated from female gill spectra. Analysis from six years of data from male and female crabs documented that the sex-specificity of this metabolite was normal for this species. Microscopic analysis of male and female gills found no differences in their gill anatomy or the presence of parasites or bacteria that might produce this phosphorus compound. Analysis of a rare gynandromorph blue crab (laterally, half male and half female) proved that this sex-specificity was an intrinsic biochemical process and was not caused by any variations in the diet or habitat of male versus female crabs. The existence of a sex-specific metabolite is a previously unrecognized, but potentially significant biochemical phenomenon. An entire enzyme system has been synthesized and activated only in one sex. Unless blue crabs are a unique species, sex-specific metabolites are likely to be present in other animals. Would the presence or absence of a sex-specific metabolite affect an animal's development, anatomy and biochemistry

    Linking genes to literature: text mining, information extraction, and retrieval applications for biology

    Get PDF
    Efficient access to information contained in online scientific literature collections is essential for life science research, playing a crucial role from the initial stage of experiment planning to the final interpretation and communication of the results. The biological literature also constitutes the main information source for manual literature curation used by expert-curated databases. Following the increasing popularity of web-based applications for analyzing biological data, new text-mining and information extraction strategies are being implemented. These systems exploit existing regularities in natural language to extract biologically relevant information from electronic texts automatically. The aim of the BioCreative challenge is to promote the development of such tools and to provide insight into their performance. This review presents a general introduction to the main characteristics and applications of currently available text-mining systems for life sciences in terms of the following: the type of biological information demands being addressed; the level of information granularity of both user queries and results; and the features and methods commonly exploited by these applications. The current trend in biomedical text mining points toward an increasing diversification in terms of application types and techniques, together with integration of domain-specific resources such as ontologies. Additional descriptions of some of the systems discussed here are available on the internet

    Role of forested land for natural flood management in the UK: A review

    Get PDF

    Architectural Paint Research and the Archaeology of Buildings

    Get PDF
    Architectural Paint Research (APR) is the archaeological study of interior and exterior applied decoration. Over time, applied layers of paint and other decorative finishes build-up on the surface of a built structure, encapsulating microscopic deposits of material evidence. This evidence can be used to inform the phase dating of a structure, or illuminate the historic function of a space. It can challenge preconceived ideas of how specific areas were decorated, and track the changes in aesthetics over time. It can identify when architects’ ideologies have been balanced by practical considerations. It can provide an insight into the intangible and ephemeral atmosphere that decoration gives to a room. Finally, it can examine the dirt trapped between layers of decoration and thus categorize the physical environmental conditions that surrounded a building at varying points in its history. Although used in the commercial heritage and conservation sectors, Architectural Paint Research is almost completely unknown to building archaeologists. This article aims to introduce APR to a new audience, and argues that is an invaluable tool in the archaeological interpretation of buildings

    Microplanning with Communicative Intentions: The SPUD System

    Get PDF
    The process of microplanning in Natural Language Generation (NLG) encompasses a range of problems in which a generator must bridge underlying domain-specific representations and general linguistic representations. These problems include constructing linguistic referring expressions to identify domain objects, selecting lexical items to express domain concepts, and using complex linguistic constructions to concisely convey related domain facts. In this paper, we argue that such problems are best solved through a uniform, comprehensive, declarative process. In our approach, the generator directly explores a search space for utterances described by a linguistic grammar. At each stage of search, the generator uses a model of interpretation, which characterizes the potential links between the utterance and the domain and context, to assess its progress in conveying domain-specific representations. We further address the challenges for implementation and knowledge representation in this approach. We show how to implement this approach effectively by using the lexicalized tree-adjoining grammar formalism (LTAG) to connect structure to meaning and using modal logic programming to connect meaning to context. We articulate a detailed methodology for designing grammatical and conceptua
    • 

    corecore