Search CORE

3 research outputs found

Using spatial audio cues from speech excitation for meeting speech segmentation

Author: Burnett I.
Cheng Eva
Ritz Christian
Publication venue: 'Sociological Research Online'
Publication date: 01/01/2006
Field of study

Multiparty meetings generally involve stationary participants. Participant location information can thus be used to segment the recorded meeting speech into each speaker\u27s \u27turn\u27 for meeting \u27browsing\u27. To represent speaker location information from speech, previous research showed that the most reliable time delay estimates are extracted from the Hubert envelope of the linear prediction residual signal. The authors\u27 past work has proposed the use of spatial audio cues to represent speaker location information. This paper proposes extracting spatial audio cues from the Hubert envelope of the speech residual for indicating changing speaker location for meeting speech segmentation. Experiments conducted on recordings of a real acoustic environment show that spatial cues from the Hubert envelope are more consistent across frequency subbands and can clearly distinguish between spatially distributed speakers, compared to spatial cues estimated from the recorded speech or residual signal

Research Online

Varying microphone patterns for meeting speech segmentation using spatial audio cues

Author: Burnett I.
Cheng Eva
Ritz Christian
Publication venue: 'Sociological Research Online'
Publication date: 01/01/2006
Field of study

Meetings, common to many business environments, generally involve stationary participants. Thus, participant location information can be used to segment meeting speech recordings into each speaker’s ‘turn’. The authors’ previous work proposed the use of spatial audio cues to represent the speaker locations. This paper studies the validity of using spatial audio cues for meeting speech segmentation by investigating the effect of varying microphone pattern on the spatial cues. Experiments conducted on recordings of a real acoustic environment indicate that the relationship between speaker location and spatial audio cues strongly depends on the microphone pattern

Crossref

Research Online

Using spatial cues for meeting speech segmentation

Author: Burnett I
Cheng E
Lukasiak J
Stirling D
Publication venue: IEEE (USA)
Publication date: 01/01/2005
Field of study

Not availabl

RMIT Research Repository

Research Online