It is known that problems arise in long sessions of voice\ud tape-recording for off-line data entry to computers via speech\ud recognition systems, as a result of operator fatigue or loss of\ud attention. In this study the task of reading vehicle licence\ud plates aloud for one hour was simulated in laboratory conditions,\ud each speaker undergoing one recording session with feedback on\ud recognition accuracy and one without feedback. No significant\ud difference in recognition success rates between the two\ud conditions was discovered. The audio tape-recordings of the\ud sessions were analysed acoustically for fatigue-induced changes\ud both in long-term prosodic characteristics (including fundamental\ud frequency, intensity, spectral balance and rate) and in segmental\ud characteristics such as frequency of occurrence of different\ud sound types and segmental durations. Although a number of\ud intra-speaker di-fferences were detected in various measures, no\ud consistent tendencies were found in all speakers. It is concluded\ud that the choice of speakers and conditions resulted in\ud insufficient fatigue to produce clear-cut effects; however, it is\ud felt that the more sophisticated techniques developed during the\ud project on the basis of the automatic segmentation of continuous\ud speech is capable of more revealing analysis than the relatively\ud crude techniques used previously. It is not felt likely that this\ud would result in automatic techniques for improving speech\ud recognition accuracy, but it could form the basis for more\ud effective operator training and assessment of new applications\ud and techniques
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.