The role of duration, amplitude and fundamental frequency of syllabic vocalic nuclei is investigated for marking prosodic stress in spontaneous American English discourse. Local maxima of di#erent evidence variables, implemented as combinations of the three basic parameters # duration, amplitude and pitch #, are supposed to be related with prosodic stress. As reference, two di#erent subsets from the OGI English stories database were manually marked in terms of prosodic stress bytwo di#erent trained linguists. The ROC curves, built on the training examples, show that both transcribers grant a major role to the amplitude and duration rather than to the pitch of the vocalic nuclei. More complex evidence variables, involving a product of the three basic parameters, allow around 80# primary stressed and 77# unstressed syllables to be correctly recognized in the test #les of both transcribers' datasets. The agreementbetween the two transcribers on a set of common #les supplies only slightly..
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.