272,957 research outputs found

    Context aware guidance for multimedia authoring: harmonizing domain and discourse knowledge

    Get PDF
    This paper presents an approach to assist authors during the authoring of multimedia presentations. We extend existing authoring support by integrating processes of topic identification, content collection and discourse structure building in a single environment. This integration allows identification of the context of the authoring process. Our approach combines this process context awareness with explicit domain and discourse knowledge to steer system suggestions. We evaluate our approach with an experimental system prototyp

    Demonstratives, referent identification and topicality in Wambon and some other Papuan languages

    Get PDF
    Abstract In Papuan languages like Wambon and Urim demonstrative forms are used both in contexts of referent identification, e.g. as demonstrative operators in noun phrases, and in topicality contexts, e.g. as topic markers with adverbial clauses and phrases, recapitulative clauses, new topic NPs and given topic NPs. Using notions from the Functional Grammar framework (Dik, 1989), I present a non-unified account of the demonstrative forms: helping the addressee to identify referents by giving deictic hints like ‘close to speaker’ and orienting the addressee about the topical cohesion of the discourse are two separate functional domains in language. This ‘two-domain’ hypothesis, which views the demonstrative forms as having two synchronically unrelated functions, explains the fact that in Wambon and Urim the demonstratives show important differences in form and behaviour depending on whether they are used for referent identification or for expressing topicality distinctions. The ‘two-domain’ hypothesis explains such formal differences but cannot explain the formal similarities between topic markers and demonstrative operators in Papuan languages like Wambon and Urim. To explain these formal similarities I suggest a diachronic development: in several Papuan languages topic markers developed from demonstrative operators. In the relatively well-documented Awyu-family of Papuan languages this process can be traced: in Wambon, the resumptive demonstrative pronoun- eve integrated in the preceding NP as a topic marker in stative clauses with a very transparant dichotomous topic-comment structure. In Korowai, also of the Awyu-family, the clitic -efè, function as a demonstrative operator and functions solely as a topic marker

    Tentative Reference Acts? ‘Recognitional Demonstratives’ as Means of Suggesting Mutual Knowledge – or Overriding a Lack of It

    Get PDF
    In an explorative study on German oral corpus data we investigate recognitional use of proximal demonstratives as a means of explicit speaker-hearer interaction shaping the discourse structure. We show that recognitionals mark tentative reference acts in that speakers suggest - or pretend - mutual knowledge of the referent, at the same time appealing to the hearers to accept the reference. Hearers may tacitly or explicitly accept the referential act or deny it asking for clarification, in the latter case making speakers change the intended local discourse topic. On these grounds we argue against a differentiation between recognitional and indefinite demonstratives, subsuming both as kinds of recognitional use under ‘pretended’ cognitive proximity

    Topic Independent Identification of Agreement and Disagreement in Social Media Dialogue

    Full text link
    Research on the structure of dialogue has been hampered for years because large dialogue corpora have not been available. This has impacted the dialogue research community's ability to develop better theories, as well as good off the shelf tools for dialogue processing. Happily, an increasing amount of information and opinion exchange occur in natural dialogue in online forums, where people share their opinions about a vast range of topics. In particular we are interested in rejection in dialogue, also called disagreement and denial, where the size of available dialogue corpora, for the first time, offers an opportunity to empirically test theoretical accounts of the expression and inference of rejection in dialogue. In this paper, we test whether topic-independent features motivated by theoretical predictions can be used to recognize rejection in online forums in a topic independent way. Our results show that our theoretically motivated features achieve 66% accuracy, an improvement over a unigram baseline of an absolute 6%.Comment: @inproceedings{Misra2013TopicII, title={Topic Independent Identification of Agreement and Disagreement in Social Media Dialogue}, author={Amita Misra and Marilyn A. Walker}, booktitle={SIGDIAL Conference}, year={2013}

    How did the discussion go: Discourse act classification in social media conversations

    Full text link
    We propose a novel attention based hierarchical LSTM model to classify discourse act sequences in social media conversations, aimed at mining data from online discussion using textual meanings beyond sentence level. The very uniqueness of the task is the complete categorization of possible pragmatic roles in informal textual discussions, contrary to extraction of question-answers, stance detection or sarcasm identification which are very much role specific tasks. Early attempt was made on a Reddit discussion dataset. We train our model on the same data, and present test results on two different datasets, one from Reddit and one from Facebook. Our proposed model outperformed the previous one in terms of domain independence; without using platform-dependent structural features, our hierarchical LSTM with word relevance attention mechanism achieved F1-scores of 71\% and 66\% respectively to predict discourse roles of comments in Reddit and Facebook discussions. Efficiency of recurrent and convolutional architectures in order to learn discursive representation on the same task has been presented and analyzed, with different word and comment embedding schemes. Our attention mechanism enables us to inquire into relevance ordering of text segments according to their roles in discourse. We present a human annotator experiment to unveil important observations about modeling and data annotation. Equipped with our text-based discourse identification model, we inquire into how heterogeneous non-textual features like location, time, leaning of information etc. play their roles in charaterizing online discussions on Facebook

    Parsing Argumentation Structures in Persuasive Essays

    Full text link
    In this article, we present a novel approach for parsing argumentation structures. We identify argument components using sequence labeling at the token level and apply a new joint model for detecting argumentation structures. The proposed model globally optimizes argument component types and argumentative relations using integer linear programming. We show that our model considerably improves the performance of base classifiers and significantly outperforms challenging heuristic baselines. Moreover, we introduce a novel corpus of persuasive essays annotated with argumentation structures. We show that our annotation scheme and annotation guidelines successfully guide human annotators to substantial agreement. This corpus and the annotation guidelines are freely available for ensuring reproducibility and to encourage future research in computational argumentation.Comment: Under review in Computational Linguistics. First submission: 26 October 2015. Revised submission: 15 July 201

    Follow-up question handling in the IMIX and Ritel systems: A comparative study

    Get PDF
    One of the basic topics of question answering (QA) dialogue systems is how follow-up questions should be interpreted by a QA system. In this paper, we shall discuss our experience with the IMIX and Ritel systems, for both of which a follow-up question handling scheme has been developed, and corpora have been collected. These two systems are each other's opposites in many respects: IMIX is multimodal, non-factoid, black-box QA, while Ritel is speech, factoid, keyword-based QA. Nevertheless, we will show that they are quite comparable, and that it is fruitful to examine the similarities and differences. We shall look at how the systems are composed, and how real, non-expert, users interact with the systems. We shall also provide comparisons with systems from the literature where possible, and indicate where open issues lie and in what areas existing systems may be improved. We conclude that most systems have a common architecture with a set of common subtasks, in particular detecting follow-up questions and finding referents for them. We characterise these tasks using the typical techniques used for performing them, and data from our corpora. We also identify a special type of follow-up question, the discourse question, which is asked when the user is trying to understand an answer, and propose some basic methods for handling it

    A Topic-Agnostic Approach for Identifying Fake News Pages

    Full text link
    Fake news and misinformation have been increasingly used to manipulate popular opinion and influence political processes. To better understand fake news, how they are propagated, and how to counter their effect, it is necessary to first identify them. Recently, approaches have been proposed to automatically classify articles as fake based on their content. An important challenge for these approaches comes from the dynamic nature of news: as new political events are covered, topics and discourse constantly change and thus, a classifier trained using content from articles published at a given time is likely to become ineffective in the future. To address this challenge, we propose a topic-agnostic (TAG) classification strategy that uses linguistic and web-markup features to identify fake news pages. We report experimental results using multiple data sets which show that our approach attains high accuracy in the identification of fake news, even as topics evolve over time.Comment: Accepted for publication in the Companion Proceedings of the 2019 World Wide Web Conference (WWW'19 Companion). Presented in the 2019 International Workshop on Misinformation, Computational Fact-Checking and Credible Web (MisinfoWorkshop2019). 6 page

    Communication difficulties following right hemisphere stroke : applying evidence to clinical management

    Get PDF
    Following reports in the 1960s that language may be affected by right hemisphere (RH) lesions, many limitations to effective communication in the right hemisphere damaged (RHD) population have been described and evidenced. However, stereotypical portrayals and descriptions of carefully selected cases may be misleading as to the extent of communication deficits. In many of the parameters in which RHD patients are presented as typically impaired, e.g. discourse skills, a less severe picture may emerge where data from the non-brain damaged (NBD) population are considered, with age and education variables controlled. Subsequent to RHD, some people show deficit on some communication measures, but many of these communication behaviours are also present in some NBD adults. Thus diagnosis of deficit must be made with reference both to the healthy peer population and the individual's pre-lesion behaviour. The authors' right RH stroke research programme includes studies of incidence of communication deficit, comparisons of RHD and NBD groups in various spoken discourse and comprehension tasks, comparison of RHD groups of different ages, detailed analysis of topic within discourse in RHD and NBD groups, family members' views of communication behaviour following RHD, and the natural course of communication change during the first year after RH stroke. The findings from several studies are summarised and used as the basis for management recommendations, which may guide future outcome research. There is an urgent need for the evaluation of communication management programmes, to determine whether therapists may with confidence offer an effective intervention service to those people whose communication skills are affected by RHD
    corecore