Search CORE

3 research outputs found

A Gaussian probability accelerator for SPHINX 3

Author: Mathew Binu K.
Publication venue: University of Utah
Publication date: 22/07/2003
Field of study

technical reportAccurate real-time speech recognition is not currently possible in the mobile embedded space where the need for natural voice interfaces is clearly important. The continuous nature of speech recognition coupled with an inherently large working set creates significant cache interference with other processes. Hence real-time recognition is problematic even on high-performance general-purpose platforms. This paper provides a detailed analysis of CMU?s latest speech recognizer (Sphinx 3.2), identifies three distinct processing phases, and quantifies the architectural requirements for each phase. Several optimizations are then described which expose parallelism and drastically reduce the bandwidth and power requirements for real-time recognition. A special-purpose accelerator for the dominant Gaussian probability phase is developed for a 0.25 CMOS process which is then analyzed and compared with Sphinx?s measured energy and performance on a 0.13 2.4 GHz Pentium4 system. The results show an improvement in power consumption by a factor of 29 at equivalent processing throughput. However after normalizing for process, the specialpurpose approach has twice the throughput, and consumes 104 times less energy than the general-purpose accelerator. The energy-delay product is a better comparison metric due to the inherent design trade-offs between energy consumption and performance. The energydelay product of the special-purpose approach is 196 times better than the Pentium4. These results provide strong evidence that real-time large vocabulary speech recognition can be done within a power budget commensurate with embedded processing using today?s technology

The University of Utah: J. Willard Marriott Digital Library

Architectural optimizations for low-power, real-time speech recognition

Author: Rajeev Krishna
Scott Mahlke
Todd Austin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2004
Field of study

Crossref

A hardware accelerator for speech recognition algorithms

Author: Bisiani D.A.
Chandy K.M.
Constraint-Directed Search M.S.
J.
Kavaler Rober
Noll R.A.
Padua David A.
R. Bisiani
Reddy B T
Reddy H.
S.
Sakoc Hiroaki
T. S. Anantharaman
Uht Augustus K.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref