Embedded Large-Scale Handwritten Chinese Character Recognition
As handwriting input becomes more prevalent, the large symbol inventory
required to support Chinese handwriting recognition poses unique challenges.
This paper describes how the Apple deep learning recognition system can
accurately handle up to 30,000 Chinese characters while running in real-time
across a range of mobile devices. To achieve acceptable accuracy, we paid
particular attention to data collection conditions, representativeness of
writing styles, and training regimen. We found that, with proper care, even
larger inventories are within reach. Our experiments show that accuracy only
degrades slowly as the inventory increases, as long as we use training data of
sufficient quality and in sufficient quantity.
Comment: 5 pages, 7 figures
Statistical language model adaptation: review and perspectives
Speech recognition performance is severely affected when the lexical, syntactic, or semantic characteristics of the discourse in the training and recognition tasks differ. The aim of language model adaptation is to exploit specific, albeit limited, knowledge about the recognition task to compensate for this mismatch. More generally, an adaptive language model seeks to maintain an adequate representation of the current task domain under changing conditions involving potential variations in vocabulary, syntax, content, and style. This paper presents an overview of the major approaches proposed to address this issue, and offers some perspectives regarding their comparative merits and associated tradeoffs. © 2003 Elsevier B.V. All rights reserved.
Reverse transfer learning: Can word embeddings trained for different NLP tasks improve neural language models?
status: published