5,212 research outputs found

    Using of Natural Language Processing Techniques in Suicide Research

    Get PDF
    It is estimated that each year many people, most of whom are teenagers and young adults die by suicide worldwide. Suicide receives special attention with many countries developing national strategies for prevention. Since, more medical information is available in text, Preventing the growing trend of suicide in communities requires analyzing various textual resources, such as patient records, information on the web or questionnaires. For this purpose, this study systematically reviews recent studies related to the use of natural language processing techniques in the area of peopleโ€™s health who have completed suicide or are at risk. After electronically searching for the PubMed and ScienceDirect databases and studying articles by two reviewers, 21 articles matched the inclusion criteria. This study revealed that, if a suitable data set is available, natural language processing techniques are well suited for various types of suicide related research

    Cross-cultural mood perception in pop songs and its alignment with mood detection algorithms

    Get PDF
    Do people from different cultural backgrounds perceive the mood in music the same way? How closely do human ratings across different cultures approximate automatic mood detection algorithms that are often trained on corpora of predominantly Western popular music? Analyzing 166 participants responses from Brazil, South Korea, and the US, we examined the similarity between the ratings of nine categories of perceived moods in music and estimated their alignment with four popular mood detection algorithms. We created a dataset of 360 recent pop songs drawn from major music charts of the countries and constructed semantically identical mood descriptors across English, Korean, and Portuguese languages. Multiple participants from the three countries rated their familiarity, preference, and perceived moods for a given song. Ratings were highly similar within and across cultures for basic mood attributes such as sad, cheerful, and energetic. However, we found significant cross-cultural differences for more complex characteristics such as dreamy and love. To our surprise, the results of mood detection algorithms were uniformly correlated across human ratings from all three countries and did not show a detectable bias towards any particular culture. Our study thus suggests that the mood detection algorithms can be considered as an objective measure at least within the popular music context

    Sentiment Polarity Classification of Comments on Korean News Articles Using Feature Reweighting

    Get PDF
    ์ผ๋ฐ˜์ ์œผ๋กœ ์ธํ„ฐ๋„ท ์‹ ๋ฌธ ๊ธฐ์‚ฌ์— ๋Œ€ํ•œ ๋Œ“๊ธ€์€ ๊ทธ ์‹ ๋ฌธ ๊ธฐ์‚ฌ์— ๋Œ€ํ•œ ์ฃผ๊ด€์ ์ธ ๊ฐ์ •์ด๋‚˜ ์˜๊ฒฌ์„ ํฌํ•จํ•˜๊ณ  ์žˆ๋‹ค. ๋”ฐ๋ผ์„œ ์ด๋Ÿฐ ์‹ ๋ฌธ ๊ธฐ์‚ฌ์˜ ๋Œ“๊ธ€์— ๋Œ€ํ•œ ๊ฐ์ •์„ ์ธ์‹ํ•˜๊ณ  ๋ถ„๋ฅ˜ํ•˜๋Š” ๋ฐ์—๋Š” ๊ทธ ์‹ ๋ฌธ ๊ธฐ์‚ฌ์˜ ์›๋ฌธ ๋‚ด์šฉ์ด ์ค‘์š”ํ•œ ์˜ํ–ฅ์„ ๋ฏธ์นœ๋‹ค. ์ด๋Ÿฐ ์ ์— ์ฐฉ์•ˆํ•˜์—ฌ ๋ณธ ๋…ผ๋ฌธ์€ ๊ธฐ์‚ฌ์˜ ์›๋ฌธ ๋‚ด์šฉ๊ณผ ๊ฐ์ • ์‚ฌ์ „์„ ์ด์šฉํ•˜๋Š” ๊ฐ€์ค‘์น˜ ์กฐ์ • ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•˜๊ณ , ์ œ์•ˆ๋œ ๊ฐ€์ค‘์น˜ ์กฐ์ • ๋ฐฉ๋ฒ•์„ ์ด์šฉํ•ด์„œ ํ•œ๊ตญ์–ด ์‹ ๋ฌธ ๊ธฐ์‚ฌ์˜ ๋Œ“๊ธ€์— ๋Œ€ํ•œ ๊ฐ์ • ์ด์ง„ ๋ถ„๋ฅ˜ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. ๊ฐ€์ค‘์น˜ ์กฐ์ • ๋ฐฉ๋ฒ•์—๋Š” ๋‹ค์–‘ํ•œ ์ž์งˆ ์ง‘ํ•ฉ์ด ์‚ฌ์šฉ๋˜๋Š”๋ฐ ๊ทธ๊ฒƒ์€ ๋Œ“๊ธ€์— ํฌํ•จ๋œ ๊ฐ์ • ๋‹จ์–ด, ๊ทธ๋ฆฌ๊ณ  ๊ฐ์ • ์‚ฌ์ „๊ณผ ๋‰ด์Šค ๊ธฐ์‚ฌ์˜ ๋ณธ๋ฌธ์— ๊ด€๋ จ๋œ ์ž์งˆ๋“ค, ๋งˆ์ง€๋ง‰์œผ๋กœ ๋‰ด์Šค ๊ธฐ์‚ฌ์˜ ์นดํ…Œ๊ณ ๋ฆฌ ์ •๋ณด๊ฐ€ ํฌํ•จ๋˜์–ด ์žˆ๋‹ค. ์—ฌ๊ธฐ์„œ ๋งํ•˜๋Š” ๊ฐ์ • ์‚ฌ์ „์€ ํ•œ๊ตญ์–ด ๊ฐ์ • ์‚ฌ์ „์„ ์˜๋ฏธํ•˜๋ฉฐ ์•„์ง ๊ณต๊ฐœ๋œ ๊ฒƒ์ด ์—†๊ธฐ ๋•Œ๋ฌธ์—, ๊ธฐ์กด์— ์žˆ๋Š” ์˜์–ด ๊ฐ์ • ์‚ฌ์ „์„ ์ด์šฉํ•˜์—ฌ ๊ตฌ์ถ•ํ•˜์˜€๋‹ค. ๋ณธ ๋…ผ๋ฌธ์—์„œ ์ œ์•ˆ๋œ ๊ฐ์ • ์ด์ง„ ๋ถ„๋ฅ˜๋Š” ๊ธฐ๊ณ„ ํ•™์Šต์„ ์ด์šฉํ•œ๋‹ค. ์ผ๋ฐ˜์ ์œผ๋กœ ๊ธฐ๊ณ„ ํ•™์Šต์„ ์œ„ํ•ด์„œ๋Š” ํ•™์Šต ๋ง๋ญ‰์น˜๊ฐ€ ํ•„์š”ํ•œ๋ฐ ํŠน๋ณ„ํžˆ ๊ฐ์ • ๋ถ„๋ฅ˜ ๋ฌธ์ œ์—์„œ๋Š” ๊ธ์ • ํ˜น์€ ๋ถ€์ • ๊ฐ์ • ํƒœ๊ทธ๊ฐ€ ๋ถ€์ฐฉ๋œ ๋ง๋ญ‰์น˜๊ฐ€ ํ•„์š”ํ•˜๋‹ค. ์ด ๋ง๋ญ‰์น˜์˜ ๊ฒฝ์šฐ๋„, ๊ณต๊ฐœ๋œ ํ•œ๊ตญ์–ด ๊ฐ์ • ๋ง๋ญ‰์น˜๊ฐ€ ์•„์ง ์—†๊ธฐ ๋•Œ๋ฌธ์— ๋ง๋ญ‰์น˜๋ฅผ ์ง์ ‘ ๊ตฌ์ถ•ํ•˜์˜€๋‹ค. ์‚ฌ์šฉ๋œ ๊ธฐ๊ณ„ ํ•™์Šต ๋ฐฉ๋ฒ•์œผ๋กœ๋Š” Na&iumlve Bayes, k-NN, SVM์ด ์žˆ๊ณ , ์ž์งˆ ์„ ํƒ ๋ฐฉ๋ฒ•์œผ๋กœ๋Š” Document Frequency, ฯ‡^2 statistic, Information Gain์ด ์žˆ๋‹ค. ๊ทธ ๊ฒฐ๊ณผ, ๋Œ“๊ธ€ ์•ˆ์— ํฌํ•จ๋œ ๊ฐ์ • ๋‹จ์–ด์™€ ๊ทธ ๋Œ“๊ธ€์— ๋Œ€ํ•œ ๊ธฐ์‚ฌ ๋ณธ๋ฌธ์ด ๊ฐ์ • ๋ถ„๋ฅ˜์— ๋งค์šฐ ํšจ๊ณผ์ ์ธ ์ž์งˆ์ž„์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค.Chapter 1 Introduction 1 Chapter 2 Related Works 4 2.1 Sentiment Classification 4 2.2 Feature Weighting in Vector Space Model 5 2.3 Feature Extraction and Selection 7 2.4 Classifiers 10 2.5 Accuracy Measures 14 Chapter 3 Feature Reweighting 16 3.1 Feature extraction in Korean 16 3.2 Feature Reweighting Methods 17 3.3 Examples of Feature Reweighting Methods 18 Chapter 4 Sentiment Polarity Classification System 21 4.1 Model Generation 21 4.2 Sentiment Polarity Classification 23 Chapter 5 Data Preparation 25 5.1 Korean Sentiment Corpus 25 5.2 Korean Sentiment Lexicon 27 Chapter 6 Experiments 29 6.1 Experimental Environment 29 6.2 Experimental Results 30 Chapter 7 Conclusions and Future Works 38 Bibliography 40 Acknowledgments 4

    A Survey on Awesome Korean NLP Datasets

    Full text link
    English based datasets are commonly available from Kaggle, GitHub, or recently published papers. Although benchmark tests with English datasets are sufficient to show off the performances of new models and methods, still a researcher need to train and validate the models on Korean based datasets to produce a technology or product, suitable for Korean processing. This paper introduces 15 popular Korean based NLP datasets with summarized details such as volume, license, repositories, and other research results inspired by the datasets. Also, I provide high-resolution instructions with sample or statistics of datasets. The main characteristics of datasets are presented on a single table to provide a rapid summarization of datasets for researchers.Comment: 11 pages, 1 horizontal page for large tabl
    • โ€ฆ
    corecore