The Impact of Word, Multiple Word, and Sentence Input on Virtual Keyboard Decoding Performance
Entering text on non-desktop computing devices is often done via an onscreen virtual keyboard. Input on such keyboards normally consists of a sequence of noisy tap events that specify some amount of text, most commonly a single word. But is single word-at-a-time entry the best choice? This paper compares user performance and recognition accuracy of word-at-a-time, phrase-at-a-time, and sentence-at-a-time text entry on a smartwatch keyboard. We evaluate the impact of differing amounts of input in both text copy and free composition tasks. We found that providing input of an entire sentence significantly improved entry rates from 26 wpm to 32 wpm while keeping character error rates below 4%. In offline experiments with more processing power and memory, sentence input was recognized with a much lower 2.0% error rate. Our findings suggest virtual keyboards can enhance performance by encouraging users to provide more input per recognition event. This work was supported by Google Faculty awards (K.V. and P.O.K.).
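
To make the decoding setting concrete, here is a minimal sketch of scoring candidate words for a sequence of noisy taps, assuming a Gaussian tap model around key centers and a unigram word prior. This is not the paper's decoder; the key coordinates, word prior, and noise parameter are hypothetical placeholders.

    # Minimal sketch (not the paper's decoder): rank candidate words for a
    # sequence of noisy taps, assuming Gaussian tap noise around key centers
    # and a unigram word prior. All values below are illustrative placeholders.
    import math

    KEY_CENTERS = {                      # hypothetical (x, y) centers for a few keys
        'c': (3.0, 2.0), 'a': (0.5, 1.0), 't': (4.5, 0.0),
        'r': (3.5, 0.0), 'o': (8.5, 0.0),
    }
    WORD_PRIOR = {'cat': 0.6, 'car': 0.3, 'cot': 0.1}   # toy unigram prior
    SIGMA = 0.7                                          # assumed tap noise (in key widths)

    def log_tap_likelihood(word, taps):
        """Sum of log N(tap | key center, sigma^2 I) over the word's letters."""
        total = 0.0
        for ch, (tx, ty) in zip(word, taps):
            cx, cy = KEY_CENTERS[ch]
            d2 = (tx - cx) ** 2 + (ty - cy) ** 2
            total += -d2 / (2 * SIGMA ** 2)              # constant terms dropped
        return total

    def decode(taps):
        """Return candidate words ranked by prior times tap likelihood."""
        scored = {w: math.log(WORD_PRIOR[w]) + log_tap_likelihood(w, taps)
                  for w in WORD_PRIOR if len(w) == len(taps)}
        return sorted(scored, key=scored.get, reverse=True)

    # Three noisy taps roughly over 'c', 'a', 't':
    print(decode([(3.2, 1.8), (0.7, 1.2), (4.3, 0.2)]))  # ['cat', 'car', 'cot']

Sentence-at-a-time entry extends this idea by scoring whole tap sequences against a sentence-level language model instead of a single-word prior, which is what gives the recognizer more context per recognition event.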
Statistical Language Modelling
Grammar-based natural language processing has reached a level where it can 'understand' language to a limited degree in restricted domains. For example, it is possible to parse textual material very accurately and assign semantic relations to parts of sentences. An alternative approach originates from the work of Shannon over half a century ago [41], [42]. This approach assigns probabilities to linguistic events, where mathematical models are used to represent statistical knowledge. Once the models are built, we decide which event is more likely than the others according to their probabilities. Although statistical methods currently use a very impoverished representation of speech and language (typically finite state), it is possible to train the underlying models from large amounts of data. Importantly, such statistical approaches often produce useful results. Statistical approaches seem especially well suited to spoken language, which is often spontaneous or conversational and not readily amenable to standard grammar-based approaches.
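
As a toy illustration of the statistical approach described above (estimate event probabilities from data, then choose the most probable event), the following sketch builds a bigram word model with add-one smoothing over a placeholder corpus. It is only a minimal example, not a realistic language model.

    # Toy bigram language model: estimate P(next word | previous word) from data
    # with add-one smoothing, then pick the most probable continuation.
    # The training text is a placeholder.
    from collections import Counter

    corpus = "the cat sat on the mat the cat ran".split()
    bigrams = Counter(zip(corpus, corpus[1:]))
    unigrams = Counter(corpus)
    vocab = set(corpus)

    def p_next(word, context):
        """P(word | context) with add-one (Laplace) smoothing."""
        return (bigrams[(context, word)] + 1) / (unigrams[context] + len(vocab))

    def most_likely_next(context):
        """Choose the event (next word) with the highest estimated probability."""
        return max(vocab, key=lambda w: p_next(w, context))

    print(most_likely_next("the"))   # 'cat' (seen twice after 'the' in the toy corpus)

Real systems use far larger corpora and richer models, but the workflow is the same: train probabilities from data, then decide between competing events by comparing those probabilities.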
Are Morphosyntactic Taggers Suitable to Improve Automatic Transcription?
The aim of our paper is to study the potential of part-of-speech (POS) tagging to improve speech recognition. We first evaluate the proportion of misrecognized words that can be corrected using POS information; the analysis of a short extract of French radio broadcast news shows that an absolute decrease of the word error rate by 1.1% can be expected. We also demonstrate quantitatively that traditional POS taggers remain reliable when applied to spoken corpora, including automatic transcriptions. This new result enables us to effectively use POS tag knowledge to improve the quality of transcriptions in a postprocessing stage, especially by correcting agreement errors.
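
As a rough sketch of the postprocessing idea, the following example reranks ASR N-best hypotheses so that POS tag sequences with bad agreement are penalized. The lexicon, tag-transition scores, and hypotheses are invented for illustration and are not taken from the paper, which relies on a trained POS tagger over French transcriptions.

    # Minimal sketch (not the paper's system): rerank ASR N-best hypotheses with
    # part-of-speech knowledge in a postprocessing stage. The toy lexicon,
    # tag-transition scores, and hypotheses below are hypothetical.
    LEXICON = {                     # word -> POS tag (toy, unambiguous)
        "les": "DET_PL", "le": "DET_SG",
        "maisons": "NOUN_PL", "maison": "NOUN_SG",
    }
    TAG_BIGRAM_SCORE = {            # higher = more grammatical tag transition
        ("DET_PL", "NOUN_PL"): 2.0, ("DET_SG", "NOUN_SG"): 2.0,
        ("DET_PL", "NOUN_SG"): -2.0, ("DET_SG", "NOUN_PL"): -2.0,
    }

    def pos_score(words):
        """Score a hypothesis by summing tag-transition scores of its POS sequence."""
        tags = [LEXICON[w] for w in words]
        return sum(TAG_BIGRAM_SCORE.get(pair, 0.0) for pair in zip(tags, tags[1:]))

    def rerank(nbest):
        """nbest: list of (acoustic_score, words); add POS score and resort."""
        return sorted(nbest, key=lambda h: h[0] + pos_score(h[1]), reverse=True)

    nbest = [(-1.0, ["le", "maisons"]),    # better acoustic score, agreement error
             (-1.2, ["les", "maisons"])]   # correct agreement
    print(rerank(nbest)[0][1])             # ['les', 'maisons']

The key point is that the tagger only needs to stay reliable on noisy automatic transcriptions for this kind of correction to pay off, which is what the paper verifies quantitatively.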
A Rational Model of Word Skipping in Reading: Ideal Integration of Visual and Linguistic Information
During reading, readers intentionally do not fixate a word when highly confident in its identity. In a rational model of reading, word skipping decisions should be complex functions of the particular word, the linguistic context, and the visual information available. In contrast, a simple heuristic of reading only predicts additive effects of word and context features. Here we test these predictions by implementing a rational model with Bayesian inference, and predicting human skipping with the entropy of this model's posterior distribution. Results showed a significant effect of the entropy in predicting skipping above a strong baseline model including word and context features. This pattern held for entropy measures from rational models with a frequency prior but not from ones with a 5-gram prior. These results suggest complex interactions between visual input and linguistic knowledge, as predicted by the rational model of reading, and a dominant role of frequency in making skipping decisions.
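
A minimal sketch of the rational-model computation described above: combine a word-frequency prior with a noisy per-letter likelihood to obtain a posterior over word identities, and use that posterior's entropy as the skipping signal. The lexicon, prior, and confusion probabilities below are toy placeholders, not the paper's model.

    # Toy Bayesian word identification: posterior over word identities given a
    # noisy preview, with entropy as a (low-)confidence signal for skipping.
    import math

    LEXICON = {"house": 0.7, "horse": 0.2, "mouse": 0.1}   # toy frequency prior
    P_CORRECT = 0.9                                         # assumed per-letter accuracy

    def letter_likelihood(observed, true):
        """Probability of perceiving `observed` given the true letter (toy model)."""
        return P_CORRECT if observed == true else (1 - P_CORRECT) / 25

    def posterior(observed_word):
        """P(word | noisy letters), proportional to prior(word) * product of letter likelihoods."""
        scores = {}
        for word, prior in LEXICON.items():
            lik = 1.0
            for o, t in zip(observed_word, word):
                lik *= letter_likelihood(o, t)
            scores[word] = prior * lik
        z = sum(scores.values())
        return {w: s / z for w, s in scores.items()}

    def entropy(dist):
        return -sum(p * math.log2(p) for p in dist.values() if p > 0)

    print(entropy(posterior("house")))   # clear preview: low entropy, skip the word
    print(entropy(posterior("hovse")))   # degraded preview: higher entropy, fixate instead

In this framing, skipping depends jointly on the prior and on how informative the visual input is, rather than on an additive combination of word and context features.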