    Establishing Nomological Networks for Behavioral Science: a Natural Language Processing Based Approach

    As the accumulated research base of the behavioral sciences has grown, the amount of actual knowledge discovery has not kept pace, as evidenced by an increasing number of disconnected theories and the related problem of construct proliferation. Integrating the social and behavioral sciences across research areas, or even across disciplines, in a meaningful way is therefore imperative. Despite the information systems (IS) discipline’s leadership in creating nomological networks and inter-nomological networks for research integration, a quantitative approach to automatically establishing nomological networks from large-scale data is missing. Based on the design science paradigm, we therefore propose a novel natural language processing based approach that brings together these two previous research endeavors. We used a dataset consisting of all the relevant behavioral studies from two top journals in the IS and psychology fields to evaluate our approach in comparison to human decisions. Finally, the limitations and possible extensions of our approach are critically discussed.

    Chemical entity extraction using CRF and an ensemble of extractors

    Citation Handling: Processing Citation Texts in Scientific Documents

    Citation sentences (sentences that cite other papers) play a key role in the summarization of scientific articles. However, a citation-based summarization system that depends on generic natural language processing components, such as parsers or sentence compressors, will perform poorly if those components cannot handle citations correctly. In this thesis, I examine the effect of citation handling on parsing, sentence compression, and multi-document summarization. Two types of citations occur in citation sentences: constituent citations and parenthetical citations. I propose an automatic citation classifier based on training data created through Mechanical Turk tasks. I demonstrate that using type-specific citation handling as a pre-processing step improves the performance of a state-of-the-art generic parser, in both the quality of the parse trees and running time. Extrinsic evaluations demonstrate that improving the performance of a parser on citation sentences in turn improves the performance of a sentence compressor, Trimmer (Zajic et al., 2007), and a multi-document summarization system, MASCS, according to several summarization measures.