3,411 research outputs found

    Investigating the effect of auxiliary objectives for the automated grading of learner english speech transcriptions

    Get PDF
    We address the task of automatically grading the language proficiency of spontaneous speech based on textual features from automatic speech recognition transcripts. Motivated by recent advances in multi-task learning, we develop neural networks trained in a multi-task fashion that learn to predict the proficiency level of non-native English speakers by taking advantage of inductive transfer between the main task (grading) and auxiliary prediction tasks: morpho-syntactic labeling, language modeling, and native language identification (L1). We encode the transcriptions with both bi-directional recurrent neural networks and with bi-directional representations from transformers, compare against a feature-rich baseline, and analyse performance at different proficiency levels and with transcriptions of varying error rates. Our best performance comes from a transformer encoder with L1 prediction as an auxiliary task. We discuss areas for improvement and potential applications for text-only speech scoring.Cambridge Assessmen

    Strangeness production in jets from p+p \sqrt{s} = 200 GeV collisions

    Full text link
    Measurements of strangeness production in jets help illuminate the QCD mechanisms in fragmentation. Furthermore, they provide a crucial baseline for heavy-ion studies where modifications in jet chemistry have recently been predicted. We present new results on strange particle production in jets from p+p \sqrt{s} = 200 GeV collisions measured by the STAR experiment. The momentum distributions of the \Lambda, \bar{\Lambda} and K0Short particles are obtained using various jet finding algorithms, and then compared to various models. Strange particle ratios in jets are obtained and compared to values obtained from the inclusive spectra. Finally, we show jets tagged with leading strange baryons and mesons, in order to investigate whether gluon or quark jets can be isolated in this way.Comment: 5 pages, 4 figures, Winter Workshop on Nuclear Dynamics 2010, Jamaic

    Marketing of giftwares in the United States.

    Full text link
    Thesis ()--Boston Universit

    On saturation of charged hadron production in pp collisions at LHC

    Full text link
    First results on charged hadron transverse momentum spectra in pp collisions obtained by the CMS Collaboration at LHC were analyzed in z-scaling approach. The first LHC data confirm z-scaling. The saturation regime of the scaling function psi(z) observed in pp and antp-pp interactions at lower energy sqrt s = 19-1960 GeV is verified. The saturation of psi(z) for charged hadrons is found down to z=0.05 at the highest energy sqrt s = 2360 GeV reached till now at colliders. A microscopic scenario of hadron production is discussed in connection with search for new signatures of phase transitions in hadron matter. Constituent energy loss and its dependencies on the transverse momentum of charged hadrons and collision energy are estimated. The beam energy scan at LHC in the saturation region is suggested.Comment: LaTeX, 6 pages, 6 figure

    Impact of ASR performance on free speaking language assessment

    Get PDF
    In free speaking tests candidates respond in spontaneous speech to prompts. This form of test allows the spoken language proficiency of a non-native speaker of English to be assessed more fully than read aloud tests. As the candidate's responses are unscripted, transcription by automatic speech recognition (ASR) is essential for automated assessment. ASR will never be 100% accurate so any assessment system must seek to minimise and mitigate ASR errors. This paper considers the impact of ASR errors on the performance of free speaking test auto-marking systems. Firstly rich linguistically related features, based on part-of-speech tags from statistical parse trees, are investigated for assessment. Then, the impact of ASR errors on how well the system can detect whether a learner's answer is relevant to the question asked is evaluated. Finally, the impact that these errors may have on the ability of the system to provide detailed feedback to the learner is analysed. In particular, pronunciation and grammatical errors are considered as these are important in helping a learner to make progress. As feedback resulting from an ASR error would be highly confusing, an approach to mitigate this problem using confidence scores is also analysed

    Is soft physics entropy driven?

    Full text link
    The soft physics, pT < 2 GeV/c, observables at both RHIC and the SPS have now been mapped out in quite specific detail. From these results there is mounting evidence that this regime is primarily driven by the multiplicity per unit rapidity, dNch/deta. This suggests that the entropy of the system alone is the underlying driving force for many of the global observables measured in heavy-ion collisions. That this is the case and there is an apparent independence on collision energy is surprising. I present the evidence for this multiplicity scaling and use it to make some extremely naive predictions for the soft sector results at the LHC.Comment: Proceedings of Hot Quarks 2006. 8 figures, 6 page

    CAMsterdam at SemEval-2019 task 6: Neural and graph-based feature extraction for the identification of offensive tweets

    Get PDF
    We describe the CAMsterdam team entry to the SemEval-2019 Shared Task 6 on offen-sive language identification in Twitter data.Our proposed model learns to extract tex-tual features using a multi-layer recurrent net-work, and then performs text classification us-ing gradient-boosted decision trees (GBDT). A self-attention architecture enables the model to focus on the most relevant areas in the text.We additionally learn globally optimised em-beddings for hashtags using node2vec, which are given as additional tweet features to the GBDT classifier.Our best model obtains78.79% macro F1-score on detecting offensive language (subtask A), 66.32% on categorising offence types (targeted/untargeted; subtask B),and 55.36% on identifying the target of of-fence (subtask C)
    corecore