Using Linguistic Features to Estimate Suicide Probability of Chinese Microblog Users
If people at high risk of suicide can be identified through social media
such as microblogs, it becomes possible to implement an active intervention
system to save their lives. With this motivation, the current study
administered the Suicide Probability Scale (SPS) to 1041 users of Sina Weibo,
a leading microblog service provider in China. Two NLP (Natural Language
Processing) methods, the Chinese edition of the Linguistic Inquiry and Word
Count (LIWC) lexicon and Latent Dirichlet Allocation (LDA), are used to extract
linguistic features from the Sina Weibo data. We trained prediction models
with machine learning algorithms on these two types of features to estimate
suicide probability. The experimental results indicate that LDA can find
topics that relate to suicide probability and improve prediction performance.
Our study adds value to the prediction of the suicide probability of social
network users from their behaviors.
Assessing Suicide Risk and Emotional Distress in Chinese Social Media: A Text Mining and Machine Learning Study
Suicide Communication on Social Media and Its Psychological Mechanisms: An Examination of Chinese Microblog Users
Background: This study aims to examine the characteristics of people who talk about suicide on Chinese microblogs (referred to as Weibo suicide communication (WSC)), and the psychological antecedents of such behaviors. Methods: An online survey was conducted on Weibo users. Differences in psychological and social demographic characteristics between those who exhibited WSC and those who did not were examined. Three theoretical models were proposed to explain the psychological mechanisms of WSC, and their fit was examined by Structural Equation Modeling (SEM). Results: 12.03% of our respondents exhibited WSC in the past 12 months. The WSC group was significantly younger and less educated, preferred using blogs and online forums for expressing themselves, and reported significantly greater suicide ideation, negative affectivity, and vulnerable personality compared to non-WSC users. SEM examinations found that Weibo users with higher negative affectivity and/or suicidal ideation, who were also using blogs and forums more, exhibited a significantly higher possibility of WSC. Conclusion: Weibo users who are at greater suicide risk are more likely to talk about suicide on Weibo. WSC is a sign of negative affectivity or suicide ideation, and should be responded to with emotional support and suicide prevention services.
Social media mental health analysis framework through applied computational approaches
Studies have shown that mental illness burdens not only public health and productivity but also established market economies throughout the world. However, mental disorders are difficult to diagnose and monitor through traditional methods, which rely heavily on interviews, questionnaires and surveys, resulting in high under-diagnosis and under-treatment rates. The use of online social media, such as Facebook and Twitter, is now a common part of people’s everyday life. The continuous and real-time user-generated content often reflects feelings, opinions, social status and behaviours of individuals, creating an unprecedented wealth of person-specific information. With advances in data science, social media has been increasingly employed in population health monitoring and, more recently, in mental health applications to understand mental disorders as well as to develop online screening and intervention tools. However, existing research efforts are still in their infancy, primarily aimed at highlighting the potential of employing social media in mental health research. The majority of work is developed on ad hoc datasets and lacks a systematic research pipeline. [Continues.]
Active Learning With Complementary Sampling for Instructing Class-Biased Multi-Label Text Emotion Classification
High-quality corpora have been very scarce in text emotion research. Existing corpora with multi-label emotion annotations have been either too small or too class-biased to properly support supervised emotion learning. In this paper, we propose a novel active learning method for efficiently guiding human annotation toward a less-biased and high-quality multi-label emotion corpus. Specifically, to compensate annotation for the minority-class examples, we propose a complementary sampling strategy based on unlabeled resources by measuring a probabilistic distance between the expected emotion label distribution in a temporary corpus and a uniform distribution. Qualitative evaluations are also given to the unlabeled examples, in which we evaluate the model uncertainties for multi-label emotion predictions, their syntactic representativeness for the other unlabeled examples, and their diverseness to the labeled examples, for a high-quality sampling. Through active learning, a supervised emotion classifier is progressively improved by learning from these new examples. Experimental results suggest that by following these sampling strategies we can develop a corpus of high-quality examples with significantly relieved bias for emotion classes. Compared to learning procedures based on traditional active learning algorithms, our learning procedure shows the most efficient learning curve and yields the best multi-label emotion predictions.
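The complementary sampling idea in this abstract can be illustrated with a minimal sketch. The abstract says only that "a probabilistic distance" to a uniform distribution is measured; the use of KL divergence here, and the names kl_to_uniform and pick_complementary, are assumptions made for illustration and are not taken from the paper.

```python
import math

def kl_to_uniform(counts):
    """KL(p || u), where p is the normalized label distribution and u is uniform.

    KL divergence is an assumed stand-in for the unspecified
    "probabilistic distance" mentioned in the abstract.
    """
    total = sum(counts)
    k = len(counts)
    # p_i * log(p_i / (1/k)) == p_i * log(p_i * k); zero-mass labels contribute 0.
    return sum((c / total) * math.log((c / total) * k) for c in counts if c > 0)

def pick_complementary(corpus_counts, candidates):
    """Return the index of the candidate that brings the corpus closest to uniform.

    corpus_counts: per-label counts in the temporary corpus.
    candidates: one list of predicted label probabilities per unlabeled example
                (hypothetical classifier outputs).
    """
    best_idx, best_div = None, float("inf")
    for i, probs in enumerate(candidates):
        # Expected label counts if this example were added to the corpus.
        merged = [c + p for c, p in zip(corpus_counts, probs)]
        div = kl_to_uniform(merged)
        if div < best_div:
            best_idx, best_div = i, div
    return best_idx

# A corpus skewed toward the first label; three unlabeled candidates.
corpus = [90, 8, 2]
candidates = [
    [0.90, 0.05, 0.05],  # mostly the majority label
    [0.10, 0.10, 0.80],  # mostly the rarest label
    [0.30, 0.60, 0.10],  # mostly the middle label
]
chosen = pick_complementary(corpus, candidates)
```

Under these assumptions the sketch favors the candidate expected to carry the rarest label. The abstract's additional criteria (model uncertainty, representativeness, diverseness) are not modeled here.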
- …