44 research outputs found

    Webbasierte linguistische Forschung: Mรถglichkeiten und Begrenzungen beim Umgang mit Massendaten

    Get PDF
    Over the past ten to fifteen years, web-based methods of sociological research have emerged alongside classical methods such as interviews, observations and experiments, and linguistic research is increasingly relying upon them as well. This paper provides an overview of three web-based approaches, i.e. online surveys, crowd-sourcing and web-based corpus analyses. Examples from specific projects serve to reflect upon these methods, address their potential and limitations, and make a critical appraisal. Internet-based empirical research produces vast and highly diverse quantities of (speaker-based or textual) data, presenting linguistic research with new opportunities and challenges. New procedures are required to make effective use of these resources

    Translation crowdsourcing: creating a multilingual corpus of online educational content

    Get PDF
    The present work describes a multilingual corpus of online content in the educational domain, i.e. Massive Open Online Course material, ranging from course forum text to subtitles of online video lectures, that has been developed via large-scale crowdsourcing. The English source text is manually translated into 11 European and BRIC languages using the CrowdFlower platform. During the process several challenges arose which mainly involved the in-domain text genre, the large text volume, the idiosyncrasies of each target language, the limitations of the crowdsourcing platform, as well as the quality assurance and workflow issues of the crowdsourcing process. The corpus constitutes a product of the EU-funded TraMOOC project and is utilised in the project in order to train, tune and test machine translation engines

    Extracting semantic entities and events from sports tweets

    Get PDF
    Large volumes of user-generated content on practically every major issue and event are being created on the microblogging site Twitter. This content can be combined and processed to detect events, entities and popular moods to feed various knowledge-intensive practical applications. On the downside, these content items are very noisy and highly informal, making it difficult to extract sense out of the stream. In this paper, we exploit various approaches to detect the named entities and significant micro-events from usersโ€™ tweets during a live sports event. Here we describe how combining linguistic features with background knowledge and the use of Twitter-specific features can achieve high, precise detection results (f-measure = 87%) in different datasets. A study was conducted on tweets from cricket matches in the ICC World Cup in order to augment the event-related non-textual media with collective intelligence

    ์ˆ˜์ง‘ ๊ฒฐ๊ณผ์˜ ํ‘œํ˜„ ๋‹ค์–‘์„ฑ ํ–ฅ์ƒ์„ ์ค‘์‹ฌ์œผ๋กœ

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ)--์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› :์œตํ•ฉ๊ณผํ•™๊ธฐ์ˆ ๋Œ€ํ•™์› ์œตํ•ฉ๊ณผํ•™๋ถ€(๋””์ง€ํ„ธ์ •๋ณด์œตํ•ฉ์ „๊ณต),2019. 8. ์ด์ค‘์‹.The conversational agent is a system that receives the natural language from the user and understand the intent for performing the function. With the advancement of speech recognition technology and the development platform of IT companies, service development using a conversational agent is becoming popular. To develop such a conversational agent, a large amount of training data is required. Currently, conversational agents provide a way for users to interact as if they were human beings. Accordingly, the conversational agent needs to understand the users intent and Understanding intent is learned through various and large amount of training data. However, collecting training data for the development of conversational agents is a very difficult task because of the diversity of expressions and the limitations of collections methods in natural language. Diversity of expressions means having different structures with the same meaning, collecting training data should take characteristic into consideration. Although some methods of collecting are proposed, problems such as time, cost, and accessibility are raised. With the recent development of artificial intelligence, crowdsourcing has developed and the possibility of solving these problems can be seen. Crowdsourcing has the advantage of solving problems that are difficult for a computer to solve from people and collecting data to a large number of people at low cost. In practice, the possibility of using crowdsourcing in relation to the training data acquisition is raised. However, although quality of crowdsourcing is influenced greatly by the task design method and diversity of training data is important, understanding of task design method is insufficient. Therefore, this paper focuses on improving the diversity of expression, examines the effect of task design elements on training data collection, and then suggests a design method that can collect training data effectively. For this purpose, this paper selects three design elements(task amount, bonus compensation method, social proof based explanation method) to explore the effect of task design elements and conducts 3 experiments of three design elements. The paraphrasing task that possibility of training data acquisition is proven was used, 1473 data were collected from MTurk using $73.65. The collected data were analyzed with four indicators(semantic equivalence, diversity, error rate, and execution time). As a result of analysis, it was difficult to get data with the same meaning as the amount of task increased. In terms of bonus compensation method, the efficiency of collection increased when offering bonus compensation. Finally, in terms of the social proof- based explanations, there is a trade-off relationship between diversity and efficiency. Individual differences in collecting among participants and pressure on collecting results were discussed, and an integrated task design method was suggested. This paper has academic significance in that it studies the possibility of improving the quality of collecting, mainly focusing on the study of the possibility of collecting training data. In addition, it has significance in terms of timeliness and usefulness in trying to solve the problem that is actually experienced in the industrial field. Finally, there is significance in terms of convergence in that it combines social psychology theory, HCI and engineering.๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ๋Š” ์‚ฌ์šฉ์ž๋กœ๋ถ€ํ„ฐ ์ž์—ฐ์–ด๋ฅผ ์ž…๋ ฅ ๋ฐ›์•„ ์ธํ…ํŠธ๋ฅผ ํŒŒ์•…ํ•˜๊ณ  ๊ธฐ๋Šฅ์„ ์ˆ˜ํ–‰ํ•˜๋Š” ์‹œ์Šคํ…œ์ด๋‹ค. ์Œ์„ฑ ์ธ์‹ ๊ธฐ์ˆ ์˜ ๊ณ ๋„ํ™”์™€ ๊ฑฐ๋Œ€ IT ๊ธฐ์—…๋“ค์„ ์ค‘์‹ฌ์œผ๋กœ ๊ฐœ๋ฐœ ํ”Œ๋žซํผ์„ ์ œ๊ณตํ•จ์— ๋”ฐ๋ผ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ๋ฅผ ์ด์šฉํ•œ ์„œ๋น„์Šค ๊ฐœ๋ฐœ์ด ๋ณดํŽธํ™”๋˜๊ณ  ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ๋ฅผ ๊ฐœ๋ฐœํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ๋‹ค์–‘ํ•˜๊ณ  ๋งŽ์€ ์–‘์˜ ํ•™์Šต๋ฐ์ดํ„ฐ๊ฐ€ ํ•„์š”ํ•˜๋‹ค. ํ˜„์žฌ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ๋Š” ์‚ฌ์šฉ์ž์—๊ฒŒ ์‚ฌ๋žŒ์ฒ˜๋Ÿผ ๋Œ€ํ™”ํ•˜๋Š” ์ƒํ˜ธ์ž‘์šฉ ๋ฐฉ์‹์„ ์ œ๊ณตํ•œ๋‹ค. ์ด์— ๋”ฐ๋ผ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ๋Š” ์‚ฌ์šฉ์ž์˜ ๋Œ€ํ™” ์ธํ…ํŠธ๋ฅผ ํŒŒ์•…ํ•ด์•ผ ํ•˜๋ฉฐ, ์ธํ…ํŠธ ํŒŒ์•…์€ ๋‹ค์–‘ํ•˜๊ณ  ๋งŽ์€ ์–‘์˜ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ํ•™์Šต๋˜๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ํ•˜์ง€๋งŒ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ ๊ฐœ๋ฐœ์„ ์œ„ํ•œ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•˜๋Š” ๊ฒƒ์€ ์ž์—ฐ์–ด์˜ ํ‘œํ˜„ ๋‹ค์–‘์„ฑ๊ณผ ์ˆ˜์ง‘ ๋ฐฉ๋ฒ•์˜ ํ•œ๊ณ„๋กœ ์ธํ•ด ๋งค์šฐ ์–ด๋ ค์šด ์ž‘์—…์ด๋‹ค. ์ž์—ฐ์–ด์˜ ํ‘œํ˜„ ๋‹ค์–‘์„ฑ์€ ๊ฐ™์€ ์˜๋ฏธ๋ฅผ ๊ฐ€์ง€๋ฉด์„œ ๋‹ค๋ฅธ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์งˆ ์ˆ˜ ์žˆ์Œ์„ ๋œปํ•˜๋ฉฐ, ํ•™์Šต๋ฐ์ดํ„ฐ ์ˆ˜์ง‘์€ ์ด๋Ÿฌํ•œ ํŠน์„ฑ์ด ๊ณ ๋ ค๋˜์–ด์•ผ ํ•œ๋‹ค. ์ˆ˜์ง‘ํ•  ์ˆ˜ ์žˆ๋Š” ๋ฐฉ๋ฒ•๋“ค์ด ์ผ๋ถ€ ์ œ์•ˆ๋˜๊ธด ํ•˜์˜€์œผ๋‚˜ ์‹œ๊ฐ„, ๋น„์šฉ, ์ ‘๊ทผ์„ฑ ๋“ฑ์˜ ๋ฌธ์ œ๊ฐ€ ์ œ๊ธฐ๋˜๊ณ  ์žˆ๋‹ค. ์ตœ๊ทผ ์ธ๊ณต์ง€๋Šฅ ๊ฐœ๋ฐœ์ด ํ™œ์„ฑํ™”๋จ์— ๋”ฐ๋ผ ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ ๋ถ„์•ผ๊ฐ€ ๋ฐœ์ „ํ•˜๋ฉด์„œ ์ด๋Ÿฌํ•œ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•  ๊ฐ€๋Šฅ์„ฑ์„ ์—ฟ๋ณผ ์ˆ˜ ์žˆ๊ฒŒ ๋˜์—ˆ๋‹ค. ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ์€ ์ปดํ“จํ„ฐ๊ฐ€ ํ•ด๊ฒฐํ•˜๊ธฐ ์–ด๋ ค์šด ๋ฌธ์ œ๋ฅผ ์‚ฌ๋žŒ๋“ค๋กœ๋ถ€ํ„ฐ ํ’€๋ฉฐ, ์ ์€ ๋น„์šฉ์œผ๋กœ ๋‹ค์ˆ˜์˜ ์‚ฌ๋žŒ๋“ค์—๊ฒŒ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•  ์ˆ˜ ์žˆ๋Š” ์žฅ์ ์ด ์žˆ๋‹ค. ์‹ค์ œ, ํ•™์Šต๋ฐ์ดํ„ฐ ์ˆ˜์ง‘๊ณผ ๊ด€๋ จํ•˜์—ฌ ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ์˜ ํ™œ์šฉ ๊ฐ€๋Šฅ์„ฑ์ด ์ œ๊ธฐ๋˜๊ณ  ์žˆ๋‹ค. ํ•˜์ง€๋งŒ ํƒœ์Šคํฌ ๋””์ž์ธ ๋ฐฉ์‹์— ๋”ฐ๋ผ ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ์˜ ์ˆ˜์ง‘๊ฒฐ๊ณผ๊ฐ€ ๋งŽ์€ ์˜ํ–ฅ์„ ๋ฐ›๊ณ , ํ•™์Šต๋ฐ์ดํ„ฐ์˜ ๋‹ค์–‘์„ฑ์ด ์ค‘์š”ํ•จ์—๋„ ๋ถˆ๊ตฌํ•˜๊ณ  ํƒœ์Šคํฌ ๋””์ž์ธ ๋ฐฉ์‹์— ๋Œ€ํ•œ ์ดํ•ด๊ฐ€ ๋ถ€์กฑํ•œ ์ƒํ™ฉ์ด๋‹ค. ๋”ฐ๋ผ์„œ ๋ณธ ์—ฐ๊ตฌ๋Š” ํ‘œํ˜„์˜ ๋‹ค์–‘์„ฑ ํ–ฅ์ƒ์— ์ดˆ์ ์„ ๋งž์ถฐ ํƒœ์Šคํฌ ๋””์ž์ธ ์š”์†Œ๊ฐ€ ํ•™์Šต๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๊ฒฐ๊ณผ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ์•Œ์•„๋ณด๊ณ , ํšจ๊ณผ์ ์œผ๋กœ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•  ์ˆ˜ ์žˆ๋Š” ๋””์ž์ธ ๋ฐฉ์•ˆ์„ ์ œ์–ธํ•˜๊ณ ์ž ํ•œ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ ๊ธฐ๋ฐ˜์˜ ํ•™์Šต๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๊ฒฐ๊ณผ์— ์˜ํ–ฅ์„ ์ฃผ๋Š” ํƒœ์Šคํฌ ๋””์ž์ธ ์š”์†Œ๋“ค์„ ์„ ์ •ํ•˜์—ฌ ์ด์— ๋”ฐ๋ฅธ ์˜ํ–ฅ์„ ์•Œ์•„๋ณด๊ณ ์ž ์ผ๋ จ์˜ 3๊ฐ€์ง€ ์‹คํ—˜(ํƒœ์Šคํฌ ์–‘, ๋ณด๋„ˆ์Šค ๋ณด์ƒ ๋ฐฉ์‹, Social Proof ๊ธฐ๋ฐ˜ ์„ค๋ช… ๋ฐฉ์‹)์„ ์ง„ํ–‰ํ–ˆ๋‹ค. ์ˆ˜์ง‘๊ฐ€๋Šฅ์„ฑ์ด ๊ฒ€์ฆ๋œ ํŒจ๋Ÿฌํ”„๋ ˆ์ด์ง• ํƒœ์Šคํฌ๋ฅผ ์‚ฌ์šฉํ•˜์˜€์œผ๋ฉฐ, MTurk์„ ํ†ตํ•ด 480๋ช…์˜ ์ฐธ๊ฐ€์ž๋กœ๋ถ€ํ„ฐ 73.65๋‹ฌ๋Ÿฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ 1473๊ฐœ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•˜์˜€๋‹ค. ์ˆ˜์ง‘ํ•œ ๋ฐ์ดํ„ฐ๋Š” 4๊ฐ€์ง€ ์ง€ํ‘œ(์˜๋ฏธ์  ๋™๋“ฑ์„ฑ, ๋‹ค์–‘์„ฑ, ์—๋Ÿฌ ๋น„์œจ, ์ˆ˜ํ–‰ ์‹œ๊ฐ„)๋กœ ๋ถ„์„๋˜์—ˆ๋‹ค. ๋ถ„์„ ๊ฒฐ๊ณผ, ํƒœ์Šคํฌ ์–‘์ด ๋Š˜์–ด๋‚ ์ˆ˜๋ก ๊ฐ™์€ ์˜๋ฏธ๋ฅผ ๊ฐ–๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ์–ป๊ธฐ ์–ด๋ ค์› ๋‹ค. ๋ณด๋„ˆ์Šค ๋ณด์ƒ ๋ฐฉ์‹ ์ธก๋ฉด์—์„œ๋Š”, ๋ณด๋„ˆ์Šค ๋ณด์ƒ ๋ฐฉ์‹์„ ์ œ๊ณตํ•  ๋•Œ ์ˆ˜์ง‘์˜ ํšจ์œจ์„ฑ์ด ๋†’์•„์กŒ๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ Social Proof ๊ธฐ๋ฐ˜ ์„ค๋ช… ๋ฐฉ์‹ ์ธก๋ฉด์—์„œ๋Š” ๋‹ค์–‘์„ฑ๊ณผ ํšจ์œจ์„ฑ ์‚ฌ์ด์˜ ํŠธ๋ ˆ์ด๋“œ ์˜คํ”„(Trade- off) ๊ด€๊ณ„๊ฐ€ ๋‚˜ํƒ€๋‚ฌ๋‹ค. ์ตœ์ข…์ ์œผ๋กœ ์ฐธ๊ฐ€์ž ๊ฐ„ ์ˆ˜์ง‘์˜ ๊ฐœ์ธ์ฐจ, ์ˆ˜์ง‘ ๊ฒฐ๊ณผ์— ๋Œ€ํ•œ ์••๋ฐ•์— ๋Œ€ํ•ด ๋…ผ์˜ํ•˜๊ณ , ์‹คํ—˜ ๊ฒฐ๊ณผ๋ฅผ ์ข…ํ•ฉํ•˜์—ฌ ํ†ตํ•ฉ์ ์ธ ํƒœ์Šคํฌ ๋””์ž์ธ ๋ฐฉ์‹์„ ์ œ์–ธํ•˜์˜€๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋Š” ํ•™์Šต๋ฐ์ดํ„ฐ์˜ ์ˆ˜์ง‘ ๊ฐ€๋Šฅ์„ฑ์„ ๋ฐํžˆ๋Š” ์—ฐ๊ตฌ๊ฐ€ ์ฃผ๋ฅผ ์ด๋ฃจ๋Š” ๊ฐ€์šด๋ฐ, ์ˆ˜์ง‘ ๊ฒฐ๊ณผ๋ฅผ ํ–ฅ์ƒ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋Š” ๋ฐฉ์•ˆ์„ ์—ฐ๊ตฌํ•œ๋‹ค๋Š” ์ ์—์„œ ํ•™์ˆ ์  ์˜์˜๋ฅผ ๊ฐ–๋Š”๋‹ค. ๋˜ํ•œ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ์˜ ๊ฐœ๋ฐœ์ด ๋ณดํŽธํ™”๋˜๋Š” ์‹œ์ ์—, ์‚ฐ์—… ๋ถ„์•ผ์—์„œ ์‹ค์ œ ๊ฒช๊ณ  ์žˆ๋Š” ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ ์ž ํ•œ๋‹ค๋Š” ์ ์—์„œ ์‹œ์˜์„ฑ๊ณผ ์œ ์šฉ์„ฑ ์ธก๋ฉด์˜ ์˜์˜๋ฅผ ๊ฐ–๋Š”๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ ์‚ฌํšŒ์‹ฌ๋ฆฌํ•™ ์ด๋ก , HCI, ๊ณตํ•™ ๋ถ„์•ผ๋ฅผ ์ ‘๋ชฉํ•œ๋‹ค๋Š” ์ ์—์„œ ์œตํ•ฉ์  ์˜์˜๋ฅผ ๊ฐ–๋Š”๋‹ค.์ œ 1์žฅ ์„œ๋ก  1 ์ œ 1์ ˆ ์—ฐ๊ตฌ์˜ ๋ฐฐ๊ฒฝ 1 ์ œ 2์ ˆ ๋…ผ๋ฌธ์˜ ๊ตฌ์„ฑ 7 ์ œ 2์žฅ ์ด๋ก ์  ๋ฐฐ๊ฒฝ 8 ์ œ 1์ ˆ ๋Œ€ํ™”ํ˜• ์—์ด์ „ํŠธ์˜ ์ธํ…ํŠธ ํŒŒ์•… 8 ์ œ 2์ ˆ ์ž์—ฐ์–ด ํ•™์Šต๋ฐ์ดํ„ฐ ๊ด€๋ จ ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ ํ™œ์šฉ ์—ฐ๊ตฌ 10 ์ œ 3์ ˆ ํฌ๋ผ์šฐ๋“œ์†Œ์‹ฑ ์ˆ˜์ง‘ ๊ฒฐ๊ณผ์™€ ๊ด€๋ จ๋œ ํƒœ์Šคํฌ ๋””์ž์ธ ์š”์ธ 12 ์ œ 4์ ˆ Social Proof ํšจ๊ณผ 16 ์ œ 3์žฅ ์—ฐ๊ตฌ ๋ฌธ์ œ 18 ์ œ 4์žฅ ์—ฐ๊ตฌ ๋ฐฉ๋ฒ• 21 ์ œ 1์ ˆ ํƒœ์Šคํฌ ๋ฐ ์‹คํ—˜์ ˆ์ฐจ 22 ์ œ 2์ ˆ ์‹คํ—˜๋ฌผ 23 ์ œ 3์ ˆ ์ธก์ • ์ง€ํ‘œ ๋ฐ ๋ถ„์„๋ฐฉ๋ฒ• 27 ์ œ 5์žฅ ์—ฐ๊ตฌ ๊ฒฐ๊ณผ 33 ์ œ 1์ ˆ ํƒœ์Šคํฌ ์–‘์— ๋”ฐ๋ฅธ ์ˆ˜์ง‘ ๊ฒฐ๊ณผ 33 ์ œ 2์ ˆ ๋ณด๋„ˆ์Šค ๋ณด์ƒ ๋ฐฉ์‹์— ๋”ฐ๋ฅธ ์ˆ˜์ง‘ ๊ฒฐ๊ณผ 39 ์ œ 3์ ˆ Social Proof ๊ธฐ๋ฐ˜ ์„ค๋ช… ๋ฐฉ์‹์— ๋”ฐ๋ฅธ ์ˆ˜์ง‘ ๊ฒฐ๊ณผ 46 ์ œ 6์žฅ ๋””์ž์ธ ์ œ์–ธ 55 ์ œ 7์žฅ ๊ฒฐ๋ก  58 ์ œ 1์ ˆ ์—ฐ๊ตฌ ๊ฒฐ๊ณผ์˜ ์š”์•ฝ 58 ์ œ 2์ ˆ ์—ฐ๊ตฌ์˜ ํ•œ๊ณ„ 59 ์ œ 3์ ˆ ์—ฐ๊ตฌ์˜ ์˜์˜ 60 ์ฐธ๊ณ ๋ฌธํ—Œ 61 Abstract 69Maste

    Towards Informing an Intuitive Mission Planning Interface for Autonomous Multi-Asset Teams via Image Descriptions

    Get PDF
    Establishing a basis for certification of autonomous systems using trust and trustworthiness is the focus of Autonomy Teaming and TRAjectories for Complex Trusted Operational Reliability (ATTRACTOR). The Human-Machine Interface (HMI) team is working to capture and utilize the multitude of ways in which humans are already comfortable communicating mission goals and translate that into an intuitive mission planning interface. Several input/output modalities (speech/audio, typing/text, touch, and gesture) are being considered and investigated in the context human-machine teaming for the ATTRACTOR design reference mission (DRM) of Search and Rescue or (more generally) intelligence, surveillance, and reconnaissance (ISR). The first of these investigations, the Human Informed Natural-language GANs Evaluation (HINGE) data collection effort, is aimed at building an image description database to train a Generative Adversarial Network (GAN). In addition to building an image description database, the HMI team was interested if, and how, modality (spoken vs. written) affects different aspects of the image description given. The results will be analyzed to better inform the designing of an interface for mission planning

    Brittany Bernal - Sensorimotor Adaptation of Speech Through a Virtually Shortened Vocal Tract

    Get PDF
    The broad objective of this line of research is to understand how auditory feedback manipulations may be used to elicit involuntary changes in speech articulation. We examine speech sensorimotor adaptation to supplement the development of speech rehabilitation applications that benefit from this learning phenomenon. By manipulating the acoustics of oneโ€™s auditory feedback, it is possible to elicit involuntary changes in speech articulation. We seek to understand how virtually manipulating participantsโ€™ perception of vowel space affects their speech movements by assessing acoustic variables such as formant frequency changes. Participants speak through a digital audio processing device that virtually alters the perceived size of their vocal tract. It is hypothesized that this modification to auditory feedback will facilitate adaptive changes in motor behavior as indicated by acoustic changes resulting from speech articulation. This study will determine how modifying the perception of vocal tract size affects articulatory behavior, indicated by changes in formant frequencies and changes in vowel space area. This work will also determine if and how the size of the virtual vowel space affects the magnitude and direction of sensorimotor adaptation for speech. The ultimate aim is to determine how important it is for the virtual vowel space to mimic the talkerโ€™s real vowel space, and whether or not perturbing the size of the perceived vowel space may facilitate or impede involuntary adaptive learning for speech. Sensorimotor Adaptation of Speech Through a Virtually Shortened Vocal Tract by Brittany Bernal is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.https://epublications.marquette.edu/mcnair_2014/1009/thumbnail.jp

    Developing and validating a methodology for crowdsourcing L2 speech ratings in Amazon Mechanical Turk

    Get PDF
    Researchers have increasingly turned to Amazon Mechanical Turk (AMT) to crowdsource speech data, predominantly in English. Although AMT and similar platforms are well positioned to enhance the state of the art in L2 research, it is unclear if crowdsourced L2 speech ratings are reliable, particularly in languages other than English. The present study describes the development and deployment of an AMT task to crowdsource comprehensibility, fluency, and accentedness ratings for L2 Spanish speech samples. Fifty-four AMT workers who were native Spanish speakers from 11 countries participated in the ratings. Intraclass correlation coefficients were used to estimate group-level interrater reliability, and Rasch analyses were undertaken to examine individual differences in rater severity and fit. Excellent reliability was observed for the comprehensibility and fluency ratings, but indices were slightly lower for accentedness, leading to recommendations to improve the task for future data collection
    corecore