86,138 research outputs found

    Using genetic algorithms to create meaningful poetic text

    Get PDF
    Work carried out when all authors were at the University of Edinburgh.Peer reviewedPostprin

    A Generic Alerting Service for Digital Libraries

    Get PDF
    Users of modern digital libraries (DLs) can keep themselves up-to-date by searching and browsing their favorite collections, or more conveniently by resorting to an alerting service. The alerting service notifies its clients about new or changed documents. Proprietary and mediating alerting services fail to fluidly integrate information from differing collections. This paper analyses the conceptual requirements of this much-sought after service for digital libraries. We demonstrate that the differing concepts of digital libraries and its underlying technical design has extensive influence (a) the expectations, needs and interests of users regarding an alerting service, and (b) on the technical possibilities of the implementation of the service. Our findings will show that the range of issues surrounding alerting services for digital libraries, their design and use is greater than one may anticipate. We also show that, conversely, the requirements for an alerting service have considerable impact on the concepts of DL design. Our findings should be of interest for librarians as well as system designers. We highlight and discuss the far-reaching implications for the design of, and interaction with, libraries. This paper discusses the lessons learned from building such a distributed alerting service. We present our prototype implementation as a proof-of-concept for an alerting service for open DL software

    Adapting Sequence to Sequence models for Text Normalization in Social Media

    Full text link
    Social media offer an abundant source of valuable raw data, however informal writing can quickly become a bottleneck for many natural language processing (NLP) tasks. Off-the-shelf tools are usually trained on formal text and cannot explicitly handle noise found in short online posts. Moreover, the variety of frequently occurring linguistic variations presents several challenges, even for humans who might not be able to comprehend the meaning of such posts, especially when they contain slang and abbreviations. Text Normalization aims to transform online user-generated text to a canonical form. Current text normalization systems rely on string or phonetic similarity and classification models that work on a local fashion. We argue that processing contextual information is crucial for this task and introduce a social media text normalization hybrid word-character attention-based encoder-decoder model that can serve as a pre-processing step for NLP applications to adapt to noisy text in social media. Our character-based component is trained on synthetic adversarial examples that are designed to capture errors commonly found in online user-generated text. Experiments show that our model surpasses neural architectures designed for text normalization and achieves comparable performance with state-of-the-art related work.Comment: Accepted at the 13th International AAAI Conference on Web and Social Media (ICWSM 2019

    Vertebrate DNA in Fecal Samples from Bonobos and Gorillas: Evidence for Meat Consumption or Artefact?

    Get PDF
    Background: Deciphering the behavioral repertoire of great apes is a challenge for several reasons. First, due to their elusive behavior in dense forest environments, great ape populations are often difficult to observe. Second, members of the genus Pan are known to display a great variety in their behavioral repertoire; thus, observations from one population are not necessarily representative for other populations. For example, bonobos (Pan paniscus) are generally believed to consume almost no vertebrate prey. However, recent observations show that at least some bonobo populations may consume vertebrate prey more commonly than previously believed. We investigated the extent of their meat consumption using PCR amplification of vertebrate mitochondrial DNA (mtDNA) segments from DNA extracted from bonobo feces. As a control we also attempted PCR amplifications from gorilla feces, a species assumed to be strictly herbivorous. Principal Findings: We found evidence for consumption of a variety of mammalian species in about 16% of the samples investigated. Moreover, 40% of the positive DNA amplifications originated from arboreal monkeys. However, we also found duiker and monkey mtDNA in the gorilla feces, albeit in somewhat lower percentages. Notably, the DNA sequences isolated from the two ape species fit best to the species living in the respective regions. This result suggests that the sequences are of regional origin and do not represent laboratory contaminants. Conclusions: Our results allow at least three possible and mutually not exclusive conclusions. First, all results may represent contamination of the feces by vertebrate DNA from the local environment. Thus, studies investigating a species' diet from feces DNA may be unreliable due to the low copy number of DNA originating from diet items. Second, there is some inherent difference between the bonobo and gorilla feces, with only the later ones being contaminated. Third, similar to bonobos, for which the consumption of monkeys has only recently been documented, the gorilla population investigated (for which very little observational data are as yet available) may occasionally consume small vertebrates. Although the last explanation is speculative, it should not be discarded a-priori given that observational studies continue to unravel new behaviors in great ape species
    • ā€¦
    corecore