1,814 research outputs found
Treebanks gone bad: generating a treebank of ungrammatical English
This paper describes how a treebank of ungrammatical
sentences can be created from a treebank of well-formed sentences. The treebank creation procedure involves the automatic introduction of frequently occurring grammatical errors into the sentences in an existing treebank, and the minimal transformation of the analyses in the treebank so
that they describe the newly created ill-formed sentences.
Such a treebank can be used to test how well a parser is able to ignore grammatical errors in texts (as people can), and can be used to induce a grammar capable of analysing such sentences. This paper also demonstrates the first of these uses
"cba to check the spelling" investigating parser performance on discussion forum posts
We evaluate the Berkeley parser on text from an online discussion forum. We evaluate the parser output with and without gold tokens and spellings (using Sparseval and Parseval), and we compile a list of problematic phenomena
for this domain. The Parseval f-score for a small development set is 77.56. This increases to 80.27 when we apply a set of simple transformations to the input sentences and to the Wall Street Journal (WSJ) training sections
The Impact of International and NESB Students on Measured Learning and Standards in Australian Higher Education
Do international students and/or students from non-English language speaking backgrounds (NESB students) perform worse than other students in Australian undergraduate classrooms? What happens to other students' marks when these students are added to classrooms? I provide new empirical evidence on these questions using very recent administrative panel data from the business faculties of two Australian Technology Network universities. Results show that both international students and NESB students perform significantly worse than other students, even controlling for selection into courses. Both effects are large and do not disappear after the first semester, but non-English speaking background predicts substantially more of a reduction in marks than international student status. Adding international NESB students to a tutorial leads to a reduction in the marks of English-speaking students in that tutorial, whereas the marks of all students benefit from the addition of domestic NESB students to tutorials.Finally, evidence of an upward buoying effect on marks is found from adding international NESB students to courses, which is likely due to the presence of grading on a curve at the course level, but this effect is only felt by international NESB students themselves. Logic suggests that this rise is unlikely to be due to a true learning effect, implying that on average, international NESB students' already low marks are inflated in courses with large fractions of such students.higher education; Australia; peer effects; international students; NESB
Similarity rules! Exploring methods for ad-hoc rule detection
We examine the role of similarity in ad hoc rule detection and show how previous methods can be made more corpus independent and more generally applicable. Specifically, we show that the similarity of a rule to others in the grammar is a crucial factor in determining the reliability of a rule, providing information unavailable in frequency. We also include a way to score rules which are not in the training data, thereby providing a platform for grammar generalization
GenERRate: generating errors for use in grammatical error detection
This paper explores the issue of automatically generated ungrammatical data and its use in error detection, with a focus on the task of classifying a sentence as grammatical or ungrammatical. We present an error generation tool called GenERRate and show how GenERRate can be used to improve the performance of a classifier on learner data. We describe
initial attempts to replicate Cambridge Learner Corpus errors using GenERRate
The Multitasking of Household Production
The standard household production model does not incorporate multitasking, although time-diary data reveal that individuals regularly multi-task. We formulate a model where time spent in child care can be sole-tasked or multitasked with other household production activities. This model implies associations between household productivity factors and both child outcomes and parental time use. We then use data from the Longitudinal Study of Australian Children and the Australian Time Use Surveys to examine the empirical validity of these implications. Consistent with our model's predictions, household productivity factors are associated both with child outcomes and parental time use.
Tobit or OLS? An Empirical Evaluation Under Different Diary Window Lengths
Time use researchers frequently debate whether it is more appropriate to fit censored regression (Tobit) models using maximum likelihood estimation or linear models using ordinary least squares (OLS) to explain individuals’ allocations of time to different activities as recorded in time-diary data. One side argues that estimation of Tobit models addresses the significant censoring (i.e., large numbers of zeros) typically found in time-diary data and that OLS estimation leads to biased and inconsistent estimates. The opposing side argues that optimization occurs over a longer period than that covered by the typical time diary, and thus that reported zeros represent measurement error rather than true non-participation in the activity, in which case OLS is preferred. We use the Australian Time Use Surveys, which record information for two consecutive diary days, to estimate censored and linear versions of a parental child care model for both 24-hour and 48-hour windows of observation in order to determine the empirical consequences of estimation technique and diary length. We find a moderate amount of measurement error when we use the 24-hour window compared to the 48-hour window, but a large number of zeros in the shorter window remain zeroes when we double the window length. Most of the qualitative conclusions we draw are similar for the two windows of observation and the two estimation methods, although there are some slight differences in the magnitudes and statistical significance of the estimates. Importantly, Tobit estimates appear to be more sensitive to window length than OLS estimates.Tobit; OLS; time-diary data
Treebank Embedding Vectors for Out-of-domain Dependency Parsing
A recent advance in monolingual dependency parsing is the idea of a treebank
embedding vector, which allows all treebanks for a particular language to be
used as training data while at the same time allowing the model to prefer
training data from one treebank over others and to select the preferred
treebank at test time. We build on this idea by 1) introducing a method to
predict a treebank vector for sentences that do not come from a treebank used
in training, and 2) exploring what happens when we move away from predefined
treebank embedding vectors during test time and instead devise tailored
interpolations. We show that 1) there are interpolated vectors that are
superior to the predefined ones, and 2) treebank vectors can be predicted with
sufficient accuracy, for nine out of ten test languages, to match the
performance of an oracle approach that knows the most suitable predefined
treebank embedding for the test set.Comment: Camera ready for ACL 202
Using parse features for preposition selection and error detection
We evaluate the effect of adding parse features to a leading model of preposition usage. Results show a significant improvement in the preposition selection task on
native speaker text and a modest increment in precision and recall in an ESL error detection task. Analysis of the parser output indicates that it is robust enough in the face
of noisy non-native writing to extract useful information
- …