2,218 research outputs found

    Using Linguistic Features to Estimate Suicide Probability of Chinese Microblog Users

    Full text link
    If people with high risk of suicide can be identified through social media like microblog, it is possible to implement an active intervention system to save their lives. Based on this motivation, the current study administered the Suicide Probability Scale(SPS) to 1041 weibo users at Sina Weibo, which is a leading microblog service provider in China. Two NLP (Natural Language Processing) methods, the Chinese edition of Linguistic Inquiry and Word Count (LIWC) lexicon and Latent Dirichlet Allocation (LDA), are used to extract linguistic features from the Sina Weibo data. We trained predicting models by machine learning algorithm based on these two types of features, to estimate suicide probability based on linguistic features. The experiment results indicate that LDA can find topics that relate to suicide probability, and improve the performance of prediction. Our study adds value in prediction of suicidal probability of social network users with their behaviors

    Mental distress detection and triage in forum posts: the LT3 CLPsych 2016 shared task system

    Get PDF
    This paper describes the contribution of LT3 for the CLPsych 2016 Shared Task on automatic triage of mental health forum posts. Our systems use multiclass Support Vector Machines (SVM), cascaded binary SVMs and ensembles with a rich feature set. The best systems obtain macro-averaged F-scores of 40% on the full task and 80% on the green versus alarming distinction. Multiclass SVMs with all features score best in terms of F-score, whereas feature filtering with bi-normal separation and classifier ensembling are found to improve recall of alarming posts

    Detecting suicidality on Twitter

    Get PDF
    Twitter is increasingly investigated as a means of detecting mental health status, including depression and suicidality, in the population. However, validated and reliable methods are not yet fully established. This study aimed to examine whether the level of concern for a suicide-related post on Twitter could be determined based solely on the content of the post, as judged by human coders and then replicated by machine learning. From 18th February 2014 to 23rd April 2014, Twitter was monitored for a series of suicide-related phrases and terms using the public Application Program Interface (API). Matching tweets were stored in a data annotation tool developed by the Commonwealth Scientific and Industrial Research Organisation (CSIRO). During this time, 14,701 suicide-related tweets were collected: 14% were randomly (n = 2000) selected and divided into two equal sets (Set A and B) for coding by human researchers. Overall, 14% of suicide-related tweets were classified as ‘strongly concerning’, with the majority coded as ‘possibly concerning’ (56%) and the remainder (29%) considered ‘safe to ignore’. The overall agreement rate among the human coders was 76% (average κ = 0.55). Machine learning processes were subsequently applied to assess whether a ‘strongly concerning’ tweet could be identified automatically. The computer classifier correctly identified 80% of ‘strongly concerning’ tweets and showed increasing gains in accuracy; however, future improvements are necessary as a plateau was not reached as the amount of data increased. The current study demonstrated that it is possible to distinguish the level of concern among suicide-related tweets, using both human coders and an automatic machine classifier. Importantly, the machine classifier replicated the accuracy of the human coders. The findings confirmed that Twitter is used by individuals to express suicidality and that such posts evoked a level of concern that warranted further investigation. However, the predictive power for actual suicidal behaviour is not yet known and the findings do not directly identify targets for intervention.This project was supported in part by funding from the NSW Mental Health Commission and the NHMRC John Cade Fellowship 1056964. PJB and ALC are supported by the NHMRC Early Career Fellowships 1035262 and 1013199

    Assessment Of The Prevalence Of Suicide Among Young Adults Using Machine Learning

    Get PDF
    Due to the high rate of suicide all over the world resulting in about 800,000 people dying by suicide each year. The instances where suicide victims constantly publish suicide messages deliberately to express their feelings on social media, there is need to address suicide issues, and how suicide can be prevented. Therefore, as a solution to this, there is need to create a model that classifies these users" social media posts and identify users with suicidal ideations, so as to prevent future suicide cases by getting the identified users the necessary help needed. The study adopted a binary classification of a suicide-related tweet with respect to age 15 up till 29 years, on a document-level basis. A machine learning approach was employed to solve the problem of tweet classification and predictions. The dataset was generated from a Twitter API. It was observed that suicidal issues are rampant among the young adult, which need urgent attention. The paper recommended that timely intervention should be provided so as to reduce suicidal victims and preserve the future of young adults

    Detecting Suicide Risk From Wristworn Activity Tracker Data Using Machine Learning Approaches

    Get PDF
    Suicide is a prevalent cause of death worldwide and depression is a primary concern of many suicidal acts. It is possible that an individual during depression never has any suicidal thoughts at all. On the other hand, some individuals in stable condition with no apparent symptoms of depression feel urges to commit suicide (suicidal ideation). Many such individuals never let anyone know what they are feeling or planning. Suicidal ideation considered an important precursor to suicidal acts. Detecting the suicide risk in individuals with mood disorders is a major challenge. The current clinical practice to assess suicide risk in these vulnerable individuals based on structured or semi-structured psychiatric interviews is inadequate as many of the suicidal behaviors often occur unpredictably especially during apparent clinical remission. Furthermore, some of these individuals are unable or unwilling to share their experiences with clinicians. An objective feature that can continuously monitor risk of suicidal thoughts would be advantageous in such situations. Our research focused on finding objective features in activity data for detecting suicidal ideation in a sample of individuals diagnosed with Bipolar I, Bipolar II, or Unipolar who were currently in a euthymic state. Euthymic state is considered a non-depressed and reasonably positive mood state, but individuals in this state may still have suicidal ideation. Hence, our work explores detecting risk of suicidal thoughts in euthymic individuals in a group of mood disorder subjects using machine-learning approaches. Statistically significant differences were observed between activity features of euthymic and depressed individuals. A strong negative correlation was observed between activity feature vulnerability index with self-rated suicidal ideation. This study demonstrates that we can use machine learning techniques to detect risk of suicide in euthymic individuals from activity data. The main advantage of using activity data is that it would be cost effective, since many people commonly use activity trackers

    Detecting Suicidality in Arabic Tweets Using Machine Learning and Deep Learning Techniques

    Full text link
    Social media platforms have revolutionized traditional communication techniques by enabling people globally to connect instantaneously, openly, and frequently. People use social media to share personal stories and express their opinion. Negative emotions such as thoughts of death, self-harm, and hardship are commonly expressed on social media, particularly among younger generations. As a result, using social media to detect suicidal thoughts will help provide proper intervention that will ultimately deter others from self-harm and committing suicide and stop the spread of suicidal ideation on social media. To investigate the ability to detect suicidal thoughts in Arabic tweets automatically, we developed a novel Arabic suicidal tweets dataset, examined several machine learning models, including Na\"ive Bayes, Support Vector Machine, K-Nearest Neighbor, Random Forest, and XGBoost, trained on word frequency and word embedding features, and investigated the ability of pre-trained deep learning models, AraBert, AraELECTRA, and AraGPT2, to identify suicidal thoughts in Arabic tweets. The results indicate that SVM and RF models trained on character n-gram features provided the best performance in the machine learning models, with 86% accuracy and an F1 score of 79%. The results of the deep learning models show that AraBert model outperforms other machine and deep learning models, achieving an accuracy of 91\% and an F1-score of 88%, which significantly improves the detection of suicidal ideation in the Arabic tweets dataset. To the best of our knowledge, this is the first study to develop an Arabic suicidality detection dataset from Twitter and to use deep-learning approaches in detecting suicidality in Arabic posts

    Online suicide prevention through optimised text classification

    Get PDF
    Online communication platforms are increasingly used to express suicidal thoughts. There is considerable interest in monitoring such messages, both for population-wide and individual prevention purposes, and to inform suicide research and policy. Online information overload prohibits manual detection, which is why keyword search methods are typically used. However, these are imprecise and unable to handle implicit references or linguistic noise. As an alternative, this study investigates supervised text classification to model and detect suicidality in Dutch-language forum posts. Genetic algorithms were used to optimise models through feature selection and hyperparameter optimisation. A variety of features was found to be informative, including token and character ngram bags-of-words, presence of salient suicide-related terms and features based on LSA topic models and polarity lexicons. The results indicate that text classification is a viable and promising strategy for detecting suicide-related and alarming messages, with F-scores comparable to human annotators (93% for relevant messages, 70% for severe messages). Both types of messages can be detected with high precision and minimal noise, even on large high-skew corpora. This suggests that they would be fit for use in a real-world prevention setting
    • …
    corecore