221 research outputs found

    Investigating Topic Modelling for Therapy Dialogue Analysis

    Get PDF
    Previous research shows that aspects of doctor-patient communication in therapy can predict patient symptoms, satisfaction and future adherence to treatment (a significant problem with conditions such as schizophrenia). However, automatic prediction has so far shown success only when based on low-level lexical features, and it is unclear how well these can generalise to new data, or whether their effectiveness is due to their capturing aspects of style, structure or content. Here, we examine the use of topic as a higher-level measure of content, more likely to generalise and to have more explanatory power. Investigations show that while topics predict some important factors such as patient satisfaction and ratings of therapy quality, they lack the full predictive power of lower-level features. For some factors, unsupervised methods produce models comparable to manual annotation.

    Linguistic Indicators of Severity and Progress in Online Text-based Therapy for Depression

    Get PDF
    Mental illnesses such as depression andanxiety are highly prevalent, and therapyis increasingly being offered online. Thisnew setting is a departure from face-to-face therapy, and offers both a challengeand an opportunity – it is not yet knownwhat features or approaches are likely tolead to successful outcomes in such a dif-ferent medium, but online text-based ther-apy provides large amounts of data for lin-guistic analysis. We present an initial in-vestigation into the application of compu-tational linguistic techniques, such as topicand sentiment modelling, to online ther-apy for depression and anxiety. We findthat important measures such as symptomseverity can be predicted with compara-ble accuracy to face-to-face data, usinggeneral features such as discussion topicand sentiment; however, measures of pa-tient progress are captured only by finer-grained lexical features, suggesting thataspects of style or dialogue structure mayalso be important

    Alzheimer's Dementia Recognition Using Acoustic, Lexical, Disfluency and Speech Pause Features Robust to Noisy Inputs

    Get PDF
    INTERSPEECH 2021. arXiv admin note: substantial text overlap with arXiv:2106.09668INTERSPEECH 2021. arXiv admin note: substantial text overlap with arXiv:2106.09668INTERSPEECH 2021. arXiv admin note: substantial text overlap with arXiv:2106.09668We present two multimodal fusion-based deep learning models that consume ASR transcribed speech and acoustic data simultaneously to classify whether a speaker in a structured diagnostic task has Alzheimer's Disease and to what degree, evaluating the ADReSSo challenge 2021 data. Our best model, a BiLSTM with highway layers using words, word probabilities, disfluency features, pause information, and a variety of acoustic features, achieves an accuracy of 84% and RSME error prediction of 4.26 on MMSE cognitive scores. While predicting cognitive decline is more challenging, our models show improvement using the multimodal approach and word probabilities, disfluency and pause information over word-only models. We show considerable gains for AD classification using multimodal fusion and gating, which can effectively deal with noisy inputs from acoustic features and ASR hypotheses

    Not All Comments are Equal: Insights into Comment Moderation from a Topic-Aware Model

    Get PDF
    Moderation of reader comments is a significant problem for online news platforms. Here, we experiment with models for automatic moderation, using a dataset of comments from a popular Croatian newspaper. Our analysis shows that while comments that violate the moderation rules mostly share common linguistic and thematic features, their content varies across the different sections of the newspaper. We therefore make our models topic-aware, incorporating semantic features from a topic model into the classification decision. Our results show that topic information improves the performance of the model, increases its confidence in correct outputs, and helps us understand the model's outputs

    The Use of English Colour Terms in Big Data

    Get PDF
    This study explores the use of English colour names in large datasets from informal Twitter messages and the well-structured corpus of Google Books. Because colour names in text have no directly associated chromatic stimuli, the corresponding colour categories of colour words was assessed from responses in an online colour naming experiment. A comparison of the frequency in the three datasets revealed that the mapping of colour names to perceptually uniform colour spaces does not reflect natural language colour distributions

    Incremental Composition in Distributional Semantics

    Get PDF
    Despite the incremental nature of Dynamic Syntax (DS), the semantic grounding of it remains that of predicate logic, itself grounded in set theory, so is poorly suited to expressing the rampantly context-relative nature of word meaning, and related phenomena such as incremental judgements of similarity needed for the modelling of disambiguation. Here, we show how DS can be assigned a compositional distributional semantics which enables such judgements and makes it possible to incrementally disambiguate language constructs using vector space semantics. Building on a proposal in our previous work, we implement and evaluate our model on real data, showing that it outperforms a commonly used additive baseline. In conclusion, we argue that these results set the ground for an account of the non-determinism of lexical content, in which the nature of word meaning is its dependence on surrounding context for its construal

    Creative Language Generation in a Society of Engagement and Reflection

    Get PDF
    Conference proceeding from ICCC'20 International Conference on Computational Creativity. Many existing models of narrative and language generation use rigid sequences of steps which are cognitively implausible and limit creativity. Iterative models based on Sharples' cycle of engagement and reflection improve on this by incorporating self-evaluation but still have a rigid arrangement of parts. This paper outlines how a multi-agent approach could be used to break apart the cycle into a more fluid society of engagement and reflection, whose constituent agents interact with one another to produce a text. Our approach is to work in a simple domain in order to focus on the underlying processes, and to avoid the Eliza effect during evaluation

    Incremental Composition in Distributional Semantics

    Get PDF
    Despite the incremental nature of Dynamic Syntax (DS), the semantic grounding of it remains that of predicate logic, itself grounded in set theory, so is poorly suited to expressing the rampantly context-relative nature of word meaning, and related phenomena such as incremental judgements of similarity needed for the modelling of disambiguation. Here, we show how DS can be assigned a compositional distributional semantics which enables such judgements and makes it possible to incrementally disambiguate language constructs using vector space semantics. Building on a proposal in our previous work, we implement and evaluate our model on real data, showing that it outperforms a commonly used additive baseline. In conclusion, we argue that these results set the ground for an account of the non-determinism of lexical content, in which the nature of word meaning is its dependence on surrounding context for its construal

    Single pulse and profile variability study of PSR J1022+1001

    Get PDF
    Millisecond pulsars (MSPs) are known as highly stable celestial clocks. Nevertheless, recent studies have revealed the unstable nature of their integrated pulse profiles, which may limit the achievable pulsar timing precision. In this paper, we present a case study on the pulse profile variability of PSR J1022+1001. We have detected approximately 14,000 sub-pulses (components of single pulses) in 35-hr long observations, mostly located at the trailing component of the integrated profile. Their flux densities and fractional polarisation suggest that they represent the bright end of the energy distribution in ordinary emission mode and are not giant pulses. The occurrence of sub-pulses from the leading and trailing components of the integrated profile is shown to be correlated. For sub-pulses from the latter, a preferred pulse width of approximately 0.25 ms has been found. Using simultaneous observations from the Effelsberg 100-m telescope and the Westerbork Synthesis Radio Telescope, we have found that the integrated profile varies on a timescale of a few tens of minutes. We show that improper polarisation calibration and diffractive scintillation cannot be the sole reason for the observed instability. In addition, we demonstrate that timing residuals generated from averages of the detected sub-pulses are dominated by phase jitter, and place an upper limit of ~700 ns for jitter noise based on continuous 1-min integrations.Comment: 13 pages, 20 figures, 3 tables, accepted for publication in MNRA
    • …
    corecore