19 research outputs found

    An Illustrative Application of Topic Modeling Method to a Farmers Diary

    No full text
    ์ตœ๊ทผ ๋“ค์–ด ๊ฐ์ข… ๋ฌธํ—Œ์ž๋ฃŒ๋“ค์˜ ๋””์ง€ํ„ธํ™”๊ฐ€ ๊ธ‰์†ํžˆ ์ง„ํ–‰๋˜๊ณ  ์žˆ์œผ๋ฉฐ ์ผ์ƒ์ƒํ™œ์‚ฌ ์ž๋ฃŒ๋กœ์„œ์˜ ์˜์˜๊ฐ€ ์ƒˆ๋กญ๊ฒŒ ๋ถ€๊ฐ๋˜์–ด์˜จ ์ผ๊ธฐ์ž๋ฃŒ ์—ญ์‹œ ์˜ˆ์™ธ๋Š” ์•„๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋””์ง€ํ„ธํ™”๋œ ํ…์ŠคํŠธ์ž๋ฃŒ๋“ค์€ ๊ทธ ๋ฐฉ๋Œ€ํ•œ ๊ทœ๋ชจ๋กœ ์ธํ•˜์—ฌ ์ „ํ†ต์ ์ธ ํ…์ŠคํŠธ๋ถ„์„๋ฐฉ๋ฒ•์œผ๋กœ๋Š” ์†Œํ™”ํ•ด๋‚ด๊ธฐ์— ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ํ•ด๋‹น ๋ถ„์•ผ์— ๋Œ€ํ•œ ๋ณ„๋‹ค๋ฅธ ์‚ฌ์ „์  ์ „๋ฌธ์ง€์‹์ด ์—†์ด๋„ ๋ฐฉ๋Œ€ํ•œ ๋””์ง€ํ„ธ ํ…์ŠคํŠธ์ž๋ฃŒ๋กœ๋ถ€ํ„ฐ ์†Œ์ˆ˜์˜ ์˜๋ฏธ ์žˆ๋Š” ํ† ํ”ฝ์„ ์ถ”์ถœํ•ด์ฃผ๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์•Œ๋ ค์ง„ ํ† ํ”ฝ๋ชจ๋ธ๋ง ๊ธฐ๋ฒ•์˜ ํŠน์ง•๊ณผ ์ด๋ก ์  ์ „์ œ์— ๋Œ€ํ•ด ์‚ดํŽด๋ณด๊ณ , ์ด๋ฅผ ๋†๋ฏผ์ผ๊ธฐ ๋ถ„์„์— ์˜ˆ์‹œ์ ์œผ๋กœ ์ ์šฉํ•ด๋ณด์•˜๋‹ค. ํ† ํ”ฝ๋ชจ๋ธ๋ง ๊ธฐ๋ฒ•์„ ์ ์šฉํ•˜์—ฌ ์•„ํฌ์ผ๊ธฐ์—์„œ ์ถ”์ถœ๋œ ํ† ํ”ฝ๋“ค์€ ํ•ด์„๊ฐ€๋Šฅ์„ฑ์ด๋‚˜ ์™ธ์  ํƒ€๋‹น๋„ ์ธก๋ฉด์—์„œ ์œ ์˜๋ฏธํ•œ ๊ฒƒ์œผ๋กœ ๋“œ๋Ÿฌ๋‚ฌ๋‹ค. ์ „ํ†ต์  ํ…์ŠคํŠธ๋ถ„์„๋ฐฉ๋ฒ•์— ์˜ํ•œ ์—ฐ๊ตฌ๊ฒฐ๊ณผ์™€์˜ ๋น„๊ต์—์„œ๋„ ๋Œ€์ฒด๋กœ ์ผ๋งฅ์ƒํ†ตํ•˜๋Š” ๊ฒƒ์œผ๋กœ ๋‚˜ํƒ€๋‚ฌ์œผ๋ฉฐ, ๋” ๋‚˜์•„๊ฐ€ ๊ธฐ์กด์—ฐ๊ตฌ์—์„œ๋Š” ๊ฐ„๊ณผํ•˜์˜€๋˜ ์ƒˆ๋กœ์šด ํ† ํ”ฝ์„ ๋ฐœ๊ฒฌํ•ด๋‚ผ ์ˆ˜๋„ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค. ์ด๋Ÿฐ ์—ฐ๊ตฌ๊ฒฐ๊ณผ์— ๊ธฐ๋ฐ˜ํ•˜์—ฌ ํ–ฅํ›„ ์ผ๊ธฐ์ž๋ฃŒ ์—ฐ๊ตฌ์— ํ† ํ”ฝ๋ชจ๋ธ๋ง ๊ธฐ๋ฒ•์ด ๋ณธ๊ฒฉ์ ์œผ๋กœ ํ™œ์šฉ๋˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๊ฒ€ํ† ํ•ด์•ผ ํ•  ๋ถ€๋ถ„์ด ๋ฌด์—‡์ธ์ง€ ํ† ํ”ฝ๋ชจ๋ธ๋ง์˜ ์ฃผ์š” ํŠน์ง•์œผ๋กœ ์•Œ๋ ค์ง„ 1) ์—ฐ๊ตฌ ๋ถ„์•ผ์— ๋Œ€ํ•œ ์‚ฌ์ „์  ์ง€์‹์„ ์š”๊ตฌํ•˜์ง€ ์•Š๋Š” ์ , 2) ๋ฉ€๋ฆฌ์„œ ์ฝ๊ธฐ, 3) ์–ดํœ˜์ž๋ฃจ ๊ฐ€์ •๊ณผ ๊ด€๊ณ„์  ์˜๋ฏธ ์ „์ œ๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ๋…ผ์˜ํ•ด ๋ณด์•˜๋‹ค.Rapid digitization of text documents, including personal diaries, raised a new puzzle: how can researchers analyze large quantities of textual data efficiently and effectively? The author presents topic modeling as a promising solution to these challenges. The most distinctive feature of topic models is that they provide an automated procedure for coding the content of a corpus of texts into a set of substantively meaningful categories called topics. The author discussed the theoretical presumptions of the topic modeling technique. The author illustrated the strength of topic modeling methods as a means of analyzing large text corpora by applying them to a farmers diary (Appo diary). Topics extracted by topic modeling method are significant in terms of interpretability and external validity. Most of the results of topic modeling coincide with the results of traditional content analysis. In addition, topic modeling extracted a new topic, which the traditional content analysis had overlooked. Based on this findings, the author discussed the demands and limitations of the methods focusing on three major characteristics of topic modeling methods: Bag of words assumption, no need of a priori coding list (prior domain expertise), and distant reading
    corecore