Search CORE

4,028 research outputs found

A Winnow-Based Approach to Context-Sensitive Spelling Correction

Author: Golding Andrew R.
Roth Dan
Publication venue
Publication date: 31/10/1998
Field of study

A large class of machine-learning problems in natural language require the characterization of linguistic context. Two characteristic properties of such problems are that their feature space is of very high dimensionality, and their target concepts refer to only a small subset of the features in the space. Under such conditions, multiplicative weight-update algorithms such as Winnow have been shown to have exceptionally good theoretical properties. We present an algorithm combining variants of Winnow and weighted-majority voting, and apply it to a problem in the aforementioned class: context-sensitive spelling correction. This is the task of fixing spelling errors that happen to result in valid words, such as substituting "to" for "too", "casual" for "causal", etc. We evaluate our algorithm, WinSpell, by comparing it against BaySpell, a statistics-based method representing the state of the art for this task. We find: (1) When run with a full (unpruned) set of features, WinSpell achieves accuracies significantly higher than BaySpell was able to achieve in either the pruned or unpruned condition; (2) When compared with other systems in the literature, WinSpell exhibits the highest performance; (3) The primary reason that WinSpell outperforms BaySpell is that WinSpell learns a better linear separator; (4) When run on a test set drawn from a different corpus than the training set was drawn from, WinSpell is better able than BaySpell to adapt, using a strategy we will present that combines supervised learning on the training set with unsupervised learning on the (noisy) test set.Comment: To appear in Machine Learning, Special Issue on Natural Language Learning, 1999. 25 page

arXiv.org e-Print Archive

CiteSeerX

Applying Winnow to Context-Sensitive Spelling Correction

Author: Golding Andrew R.
Roth Dan
Publication venue
Publication date: 01/01/1996
Field of study

Multiplicative weight-updating algorithms such as Winnow have been studied extensively in the COLT literature, but only recently have people started to use them in applications. In this paper, we apply a Winnow-based algorithm to a task in natural language: context-sensitive spelling correction. This is the task of fixing spelling errors that happen to result in valid words, such as substituting {\it to\/} for {\it too}, {\it casual\/} for {\it causal}, and so on. Previous approaches to this problem have been statistics-based; we compare Winnow to one of the more successful such approaches, which uses Bayesian classifiers. We find that: (1)~When the standard (heavily-pruned) set of features is used to describe problem instances, Winnow performs comparably to the Bayesian method; (2)~When the full (unpruned) set of features is used, Winnow is able to exploit the new features and convincingly outperform Bayes; and (3)~When a test set is encountered that is dissimilar to the training set, Winnow is better than Bayes at adapting to the unfamiliar test set, using a strategy we will present for combining learning on the training set with unsupervised learning on the (noisy) test set.Comment: 9 page

arXiv.org e-Print Archive

CiteSeerX

The repeatability of self-reported exposure after miscarriage

Author: Farrow A
Farrow SC
Golding J
Little R
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/1996
Field of study

BACKGROUND: The Avon Longitudinal Study of Pregnancy and Childhood is a prospective study of women who were resident in Avon and who were expected to deliver a baby between April 1991 and December 1992. METHODS: The study provided an opportunity to test the repeatability of responses from 220 women who experienced a miscarriage and who reported exposure to occupational substances and common household products and appliances in two questionnaires. The first questionnaire was completed in the early part of the pregnancy and the second after the miscarriage. Women were asked to score their frequency of exposure on a five-point scale from 'daily' to 'never'. Their responses were analysed to assess the degree of agreement between replies to identical questions in the two questionnaires using the kappa statistic. A new frequency variable was created which compared the replies for the two questionnaires; this was analysed for all exposures by cross-tabulation with possible explanatory variables (age of mother, social class, history of miscarriage and the time lag between questionnaires). RESULTS: In general there was good agreement in the reported exposures to 48 substances and products. The results showed a small and consistent pattern of reporting exposures less frequently in the second questionnaire, i.e. after miscarriage. This was not explained by the analysis of possible confounding variables. Given the literature, the authors had expected to find a shift in the opposite direction. CONCLUSION: The study reinforces the need to be cautious when using the results from single surveys of retrospective self-reported exposure

Crossref

Brunel University Research Archive

Explore Bristol Research

Does Openness to Trade Make Countries More Vulnerable to Sudden Stops, or Less? Using Gravity to Establish Causality

Author: Adams Bret R.
Golding Sarah E.
Rao Raj R.
Valerie Kristoffer
Publication venue
Publication date: 01/01/2010
Field of study

Openness to trade is one factor that has been identified as determining whether a country is prone to sudden stops in capital inflows, crashes in currencies, or severe recessions. Some believe that openness raises vulnerability to foreign shocks, while others believe that it makes adjustment to crises less painful. Several authors have offered empirical evidence that having a large tradable sector reduces the contraction necessary to adjust to a given cut-off in funding. This would help explain lower vulnerability to crises in Asia than in Latin America. Such studies may, however, be subject to the problem that trade is endogenous. Using the gravity instrument for trade openness, which is constructed from geographical determinants of bilateral trade, this paper finds that openness indeed makes countries less vulnerable, both to severe sudden stops and currency crashes, and that the relationship is even stronger when correcting for the endogeneity of trade.

Directory of Open Access Journals

PubMed Central

Research Papers in Economics

VCU Scholars Compass

A Bayesian hybrid method for context-sensitive spelling correction

Author: Golding Andrew R.
Publication venue
Publication date: 01/01/1995
Field of study

Two classes of methods have been shown to be useful for resolving lexical ambiguity. The first relies on the presence of particular words within some distance of the ambiguous target word; the second uses the pattern of words and part-of-speech tags around the target word. These methods have complementary coverage: the former captures the lexical ``atmosphere'' (discourse topic, tense, etc.), while the latter captures local syntax. Yarowsky has exploited this complementarity by combining the two methods using decision lists. The idea is to pool the evidence provided by the component methods, and to then solve a target problem by applying the single strongest piece of evidence, whatever type it happens to be. This paper takes Yarowsky's work as a starting point, applying decision lists to the problem of context-sensitive spelling correction. Decision lists are found, by and large, to outperform either component method. However, it is found that further improvements can be obtained by taking into account not just the single strongest piece of evidence, but ALL the available evidence. A new hybrid method, based on Bayesian classifiers, is presented for doing this, and its performance improvements are demonstrated.Comment: 15 page

arXiv.org e-Print Archive

CiteSeerX

Assessment worlds colliding? Negotiating between discourses of assessment on an online open course

Author: Cameron R.
Colley H.
Golding B.
Jonathan Hughes
Jordan S.
Laura Hills
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2016
Field of study

Using the badged open course, Taking your first steps into Higher Education, this case study examines how assessment on online open courses draws on concepts of assessment used within formal and informal learning. Our experience was that assessment used within open courses, such as massive open online courses, is primarily determined by the requirements of quality assurance processes to award a digital badge or statement of participation as well as what is technologically possible. However, this disregards much recent work in universities that use assessment in support of learning. We suggest that designers of online open courses should pay greater attention to the relationship of assessment and learning to improve participant course completion

Crossref

Open Research Online

Universality in escape from a modulated potential well

Author: Dykman M. I.
Golding B.
Kruse J. R.
McCann L. I.
Ryvkine D.
Publication venue: 'AIP Publishing'
Publication date: 29/04/2002
Field of study

We show that the rate of activated escape

W

from a periodically modulated potential displays scaling behavior versus modulation amplitude

A

. For adiabatic modulation of an optically trapped Brownian particle, measurements yield

\ln W\propto (A_{\rm c} - A)^{\mu}

with

\mu = 1.5

. The theory gives

\mu=3/2

in the adiabatic limit and predicts a crossover to

\mu=2

scaling as

A

approaches the bifurcation point where the metastable state disappears.Comment: 4 pages, 3 figure

arXiv.org e-Print Archive

Crossref

Holidaying with the family pet: No dogs allowed!

Author: Anderson W. P.
Bauman A. E.
Frohlich R.
Golding C.
Neil Carr
Rogers J
Scott Cohen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/05/2009
Field of study

This paper assesses the extent to which dog owners located in Brisbane, Australia, wish to holiday with their pets, and whether there is a gap between this desire and reality. The paper also examines the extent to which this demand is being catered for by the tourism accommodation sector. The need for this study reflects the increasingly significant role dogs are playing in the lives of humans, and the scale of the dog-owning population. The results suggest that, although there is a strong desire among dog owners to take holidays with their pets, the actualisation of this desire is comparatively low. A significant obstacle to the realisation of this desire appears to be a dearth of pet-friendly accommodation. This has implications for the ability of the tourism industry to benefit from this potentially lucrative market, that is, the dog-owning population

Crossref

Bournemouth University Research Online

Surrey Research Insight

Application of biocomposite edible coatings based on pea starch and guar gum on quality, storability and shelf life of ‘Valencia’ oranges

Author: Chockchaisawasdee Suwimol
Golding John B.
Morques José R.
Pristijono Penta
Saberi Bahareh
Scarlett Christopher J.
Stathopoulos Costas E.
Publication venue
Publication date: 01/03/2018
Field of study

Novel edible composite coatings based on pea starch and guar gum (PSGG), PSGG blended with lipid mixture containing the hydrophobic compounds shellac and oleic acid (PSGG-Sh), and a layer-by-layer (LBL) approach (PSGG as an internal layer and shellac as an external layer), were investigated and compared with a commercial wax (CW) and uncoated fruit on postharvest quality of ‘Valencia’ oranges held for up to four weeks at 20 °C and 5 °C with an additional storage for 7 d at 20 °C. The incorporation of lipid compounds into the PSGG coatings (PSGG-Sh) generally resulted in the best performance in reducing fruit respiration rate, ethylene production, weight and firmness loss, peel pitting, and fruit decay rate of the coated oranges. Fruit coated with PSGG-Sh and a single layer PSGG coatings generally resulted in higher scores for overall flavor and freshness after four weeks at 5 °C followed by one week at 20 °C than uncoated fruit, as assessed by a sensory panel. Although the LBL coating reduced weight loss and respiration rate with improved firmness retention to a greater extent than the single layer PSGG coating, the bilayer coating also resulted in higher levels of ethanol causing increased perception of off-flavors. Overall results suggested that PSGG-based edible coatings could be a beneficial substitute to common commercial waxes for maintaining quality and storability, as well as extending shelf life of citrus fruit and potentially other fresh horticultural produce

Abertay Research Portal

Crossref

Fifty years of spellchecking

Author: Blair CR
Brooks G
Carlson AJ
Cucerzan S
Damerau FJ
Damerau FJ
Golding AR
Golding AR
Leech G
Levenshtein VI
McIlroy MD
Mihov S
Mitton R
Mitton R
Mitton R
Mitton R
Morris R
Oflazer K
Pedler J
Peterson JL
Peterson JL
Pollock JL
Roger Mitton
Savary A
Sterling CM
Veronis J
Wagner RA
Wing AM
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2010
Field of study

A short history of spellchecking from the late 1950s to the present day, describing its development through dictionary lookup, affix stripping, correction, confusion sets, and edit distance to the use of gigantic databases

Crossref

Birkbeck Institutional Research Online