Search CORE

2,485 research outputs found

Applications of Text Analysis Tools for Spoken Response Grading

Author: Crossley Scott
McNamara Danielle
Publication venue: Michigan State University Center for Language Education and Research
Publication date: 01/06/2013
Field of study

ScholarSpace at University of Hawai'i at Manoa

Shallow Analysis Based Assessment of Syntactic Complexity for Automated Speech Scoring

Author: Beckman Institute
Huichao Xue
Su-Youn Yoon
Suma Bhat
Publication venue
Publication date: 24/04/2020
Field of study

Abstract Designing measures that capture various aspects of language ability is a central task in the design of systems for automatic scoring of spontaneous speech. In this study, we address a key aspect of language proficiency assessment -syntactic complexity. We propose a novel measure of syntactic complexity for spontaneous speech that shows optimum empirical performance on real world data in multiple ways. First, it is both robust and reliable, producing automatic scores that agree well with human rating compared to the stateof-the-art. Second, the measure makes sense theoretically, both from algorithmic and native language acquisition points of view

CiteSeerX

Using Ontology-Based Approaches to Representing Speech Transcripts for Automated Speech Scoring

Author: Chen Miao
Publication venue: SURFACE at Syracuse University
Publication date: 01/08/2013
Field of study

Text representation is a process of transforming text into some formats that computer systems can use for subsequent information-related tasks such as text classification. Representing text faces two main challenges: meaningfulness of representation and unknown terms. Research has shown evidence that these challenges can be resolved by using the rich semantics in ontologies. This study aims to address these challenges by using ontology-based representation and unknown term reasoning approaches in the context of content scoring of speech, which is a less explored area compared to some common ones such as categorizing text corpus (e.g. 20 newsgroups and Reuters). From the perspective of language assessment, the increasing amount of language learners taking second language tests makes automatic scoring an attractive alternative to human scoring for delivering rapid and objective scores of written and spoken test responses. This study focuses on the speaking section of second language tests and investigates ontology-based approaches to speech scoring. Most previous automated speech scoring systems for spontaneous responses of test takers assess speech by primarily using acoustic features such as fluency and pronunciation, while text features are less involved and exploited. As content is an integral part of speech, the study is motivated by the lack of rich text features in speech scoring and is designed to examine the effects of different text features on scoring performance. A central question to the study is how speech transcript content can be represented in an appropriate means for speech scoring. Previously used approaches from essay and speech scoring systems include bag-of-words and latent semantic analysis representations, which are adopted as baselines in this study; the experimental approaches are ontology-based, which can help improving meaningfulness of representation units and estimating importance of unknown terms. Two general domain ontologies, WordNet and Wikipedia, are used respectively for ontology-based representations. In addition to comparison between representation approaches, the author analyzes which parameter option leads to the best performance within a particular representation. The experimental results show that on average, ontology-based representations slightly enhances speech scoring performance on all measurements when combined with the bag-of-words representation; reasoning of unknown terms can increase performance on one measurement (cos.w4) but decrease others. Due to the small data size, the significance test (t-test) shows that the enhancement of ontology-based representations is inconclusive. The contributions of the study include: 1) it examines the effects of different representation approaches on speech scoring tasks; 2) it enhances the understanding of the mechanisms of representation approaches and their parameter options via in-depth analysis; 3) the representation methodology and framework can be applied to other tasks such as automatic essay scoring

CiteSeerX

Syracuse University Research Facility and Collaborative Environment

New and not so new methods for assessing oral communication

Author: Li Zhi
Ockey Gary J.
Publication venue: 'Universitat Jaume I'
Publication date: 01/01/2015
Field of study

The assessment of oral communication has continued to evolve over the past few decades. The construct being assessed has broadened to include interactional competence, and technology has played a role in the types of tasks that are currently popular. In this paper, we discuss the factors that affect the process of oral communication assessment, current conceptualizations of the construct to be assessed, and five tasks that are used to assess this construct. These tasks include oral proficiency interviews, paired/group oral discussion tasks, simulated tasks, integrated oral communication tasks, and elicited imitation tasks. We evaluate these tasks based on current conceptualizations of the construct of oral communication, and conclude that they do not assess a broad construct of oral communication equally. Based on our evaluation, we advise test developers to consider the aspects of oral communication that they aim to include or exclude in their assessment when they select one of these task types

Digital Repository @ Iowa State University (ISU)

Repositori Institucional de la Universitat Jaume I

DIALNET

To What Extent is Collocation Knowledge Associated with Oral Proficiency? A Corpus-Based Approach to Word Association

Author: Clenton J
Eguchi M
Kyle K
Saito K
Uchihara T
Publication venue: 'Academy of Traumatology'
Publication date: 01/06/2022
Field of study

This study examined the relationship between second language (L2) learners’ collocation knowledge and oral proficiency. A new approach to measuring collocation was adopted by eliciting responses through a word association task and using corpus-based measures (absolute frequency count, t-score, MI score) to analyze the degree to which stimulus words and responses were collocated. Oral proficiency was measured using human judgements and objective measures of fluency (articulation rate, silent pause ratio, filled pause ratio) and lexical richness (diversity, frequency, range). Forty Japanese university students completed a word association task and a spontaneous speaking task (picture narrative). Results indicated that speakers who used more low-frequency collocations in the word association task (i.e., lower collocation frequency scores) spoke faster with fewer silent pauses and were perceived to be more fluent. Speakers who provided more strongly associated collocations (as measured by MI) used more sophisticated lexical items and were perceived to be lexically proficient. Collocation knowledge remained as a unique predictor after the influence of learners’ vocabulary size (i.e., knowledge of single-word items) was considered. These findings support the key role that collocation plays in oral proficiency and provide important insights into understanding L2 speech development from the perspective of phraseological competence

UCL Discovery

Factors Affecting Grammatical and Lexical Complexity of Long-Term L2 Speakers’ Oral Proficiency

There remains considerable disagreement about which factors drive second language (L2) ultimate attainment. Age of onset (AO) appears to be a robust factor, lending support to theories of maturational constraints on L2 acquisition. The present study is an investigation of factors that influence grammatical and lexical complexity at the stage of L2 ultimate attainment. Grammatical and lexical complexity were assessed in 102 spontaneous oral interviews. Interviewees' AOs ranged from 7 to 17 years old. Multifactorial analyses yielded consistently significant effects of gender and level of education for grammatical and lexical complexity. Additionally, native language use at work was a significant predictor for lexical complexity; conversely, AO did not emerge as a significant factor. We conclude that grammatical and lexical complexity at the stage of L2 ultimate attainment is the result of a complex interplay of variables that are general to language learning and performance rather than L2 specific

University of Essex Research Repository

Crossref

Proceedings - University of Groningen

University of Groningen

Kölner UniversitätsPublikationsServer

ARTS repository - University of Groningen

MAnnheim DOCument Server

Dissertations of the University of Groningen

Foneettinen sujuvuus suomessa toisena kielenä: Lukiolaisten spontaanin puheen akustinen analyysi

Author: Koivusalo Liisa
Publication venue: Helsingfors universitet
Publication date: 01/01/2022
Field of study

Speaking fluently is an important goal for second language (L2) learners. In L2 research, fluency is often studied by measuring temporal features in speech. These features include speed (rate of speech), breakdown (use of silent and filled pauses), and repair (self-corrections and repetitions) phenomena. Fluent speakers generally have a higher rate of speech and fewer hesitations and interruptions than beginner language learners. In this thesis, phonetic fluency of high school students’ L2 Finnish speech is studied in relation to human ratings of fluency and overall proficiency. The topic is essential for the development of automated assessment of L2 speech, as phonetic fluency measures can be used for predicting a speaker’s fluency and proficiency level automatically. Although the effect of different fluency measures on perceived fluency level has been widely studied during the last decades, research on phonetic fluency in Finnish as L2 is still limited. Phonetic fluency in high school students’ speech in L2 Finnish has not been studied before. The speech samples and ratings used in this thesis are a part of a larger dataset collected in the DigiTala research project. The analyzed data contained spontaneous speech samples in L2 Finnish from 53 high school students of different language backgrounds. All samples were assessed by expert raters for fluency and overall proficiency. The speech samples were annotated by marking intervals containing silent pauses, filled pauses, corrections and repetitions, and individual words. Several phonetic fluency measures were calculated for each sample from the durations of the annotated intervals. The contribution of phonetic fluency measures to human ratings of fluency and proficiency was studied using simple and multiple linear regression models. Speech rate was found to be the strongest predictor for both fluency and proficiency ratings in simple linear regression. Articulation rate, portion of long silent pauses, mean duration of long silent pauses, mean duration of breaks between utterances, and rate of short silent pauses per minute were also statistically significant predictors of both fluency and proficiency ratings. Multiple linear regression models improved the simple models for both fluency and proficiency: for fluency, a model with a combination of articulation rate and the portion of long silent pauses performed the best, and for proficiency, a model with a combination of speech rate and mean duration of short silent pauses. Perceived fluency level is often affected by a combination of different phonetic fluency measures, and it seems that human raters ground their assessments on this combination, although some phonetic fluency measures might be more important on their own than others. The findings of this thesis expand previous knowledge on phonetic fluency in L2 Finnish and can benefit both language learners and teachers, as well as developers of automatic assessment of L2 speech.Sujuvaa puhetaitoa pidetään tärkeänä tavoitteena toisen kielen (L2) oppimisessa. L2-puheen tutkimuksissa sujuvuutta tutkitaan usein puheesta mitattavilla temporaalisilla piirteillä, joita ovat esimerkiksi puheen nopeus, tauot, korjaukset ja toistot. Nopea, vähän epäröintiä ja keskeytyksiä sisältävä puhe mielletään usein sujuvaksi, ja toisen kielen oppimisen alkuvaiheessa puhe on epäsujuvampaa. Tässä tutkielmassa tutkitaan lukiolaisten L2-suomen foneettista sujuvuutta puheesta mitattavien foneettisten sujuvuuspiirteiden sekä sujuvuus- ja taitotasoarvioiden avulla. Tutkimusaihe liittyy myös puheen automaattisen arvioinnin kehittämiseen, sillä kielenoppijan sujuvuus- ja taitotasoa voidaan ennustaa automaattisesti foneettisten sujuvuuspiirteiden avulla. Vaikka sujuvuuspiirteiden ja arviointien välistä yhteyttä on tutkittu melko paljon viime vuosikymmeninä, L2-suomen foneettiseen sujuvuuteen liittyviä tutkimuksia on yhä vähän. Lukiolaisten L2-suomen foneettista sujuvuutta ei ole aiemmin tutkittu. Tutkielmassa käytetty puhe- ja arviointiaineisto on osa suurempaa aineistoa, joka on kerätty DigiTala-tutkimusprojektissa. Analysoitu aineisto sisälsi 53 spontaania puhenäytettä lukiolaisilta, jotka puhuvat suomea toisena kielenä. Lisäksi jokaisen puhenäytteen sujuvuus ja yleinen taitotaso oli arvioitu. Puhenäytteisiin annotoitiin hiljaiset ja täytetyt tauot, korjaukset ja toistot sekä yksittäiset sanat. Annotoitujen intervallien kestoista laskettiin useita foneettisia sujuvuuspiirteitä jokaiselle puhenäytteelle. Foneettisten sujuvuuspiirteiden vaikutusta ihmisarvioihin tutkittiin lineaaristen regressiomallien avulla. Puhenopeus ennusti yhden selittävän muuttujan malleissa sekä sujuvuus- että taitotasoarvioita parhaiten. Tämän lisäksi artikulaationopeus, pitkien hiljaisten taukojen osuus, pitkien hiljaisten taukojen keskimääräinen kesto, yhtenäisten puhejaksojen välisten keskeytysten keskimääräinen kesto ja lyhyiden hiljaisten taukojen suhteellinen lukumäärä olivat tilastollisesti merkitseviä ennustajia yhden selittävän muuttujan malleissa. Useamman selittävän muuttujan mallit paransivat aiempien mallien selitysvoimaa sekä sujuvuus- että taitotasoarvioissa: artikulaationopeuden ja pitkien hiljaisten taukojen osuuden yhdistelmä ennusti sujuvuusarvioita parhaiten, ja puhenopeuden ja lyhyiden hiljaisten taukojen keskimääräisen keston yhdistelmä taitotasoarvioita. Puheen havaittuun sujuvuuteen vaikuttaa usein yhdistelmä erilaisia sujuvuuspiirteitä, vaikka yksittäisten piirteiden vaikutukset voivat olla keskenään erilaisia. Tutkielman tulokset lisäävät tietoa L2-suomen foneettisesta sujuvuudesta, ja ne ovat tarpeellisia niin kielenoppijoille, -opettajille kuin puheen automaattisten arviointityökalujen kehittäjille

Helsingin yliopiston digitaalinen arkisto

Moving between languages: Turkish returnees from Germany

Author: Daller Michael Helmut
Treffers-Daller Jeanine
Publication venue: Frank & Timme
Publication date: 01/01/2014
Field of study

Central Archive at the University of Reading

The relationship between task difficulty and second language fluency in French:a mixed-methods approach

Author: Bachman
Bosker
Brindley
Cohen
Coughlan
de Bot
De Jong
De Jong
Derwing
Derwing
Dörnyei
Ejzenberg
Ericsson
Ericsson
Fillmore
Foster
Freed
Freed
Gilabert
Goldman-Eisler
Grosjean
Ishikawa
Iwashita
Jackson
Kormos
Kormos
Kormos
Kormos
Kormos
Lennon
Lennon
Levelt
Levelt
Levelt
Mori
Norris
Préfontaine
Préfontaine
Riggenbach
Robinson
Robinson
Robinson
Robinson
Robinson
Robinson
Robinson
Rubin
Révész
Samuda
Segalowitz
Skehan
Skehan
Skehan
Skehan
Skehan
Tavakoli
Tavakoli
Towell
Towell
Publication venue: 'Wiley'
Publication date: 01/01/2015
Field of study

While there exists a considerable body of literature on task-based difficulty and second language (L2) fluency in English as a second language (ESL), there has been little investigation with French learners. This mixed-methods study examines learner appraisals of task difficulty and their relationship to automated utterance fluency measures in French under three different task conditions. Participants were 40 adult learners of French at varying levels of proficiency studying in a university immersion context in Québec. Appraisal of task difficulty was assessed quantitatively by participants’ self-reports in response to a five-item questionnaire and qualitatively by retrospective interviews. Utterance fluency was operationalized by four temporal variables and measured by Praat, a speech analysis software program. Across tasks, the quantitative results indicate that appraisals of lexical retrieval difficulty and fluency difficulty were most strongly related to perceived overall task difficulty. The qualitative analysis shows how L2 speakers evaluated the difficulty of each task as well as the features that either contributed to or limited their L2 fluency. Students’ fluency in performing the three tasks was found to differ for articulation rate and average pause time, but not for pause frequency or phonation-time ratio

Crossref

Lancaster E-Prints