Vocal caricatures reveal signatures of speaker identity

Abstract

What are the features that impersonators select to elicit a speaker’s identity? We built a voice database of public figures (targets) and imitations produced by professional impersonators. Each impersonator produced one imitation based on their memory of the target (caricature) and another after listening to the target audio (replica). A set of naive participants then judged the identity and similarity of pairs of voices. Identity was better evoked by the caricatures, whereas replicas were perceived to be closer to the targets in terms of voice similarity. We used these data to map the relevant acoustic dimensions for each task. Our results indicate that speaker identity is mainly associated with vocal tract features, while perception of voice similarity is related to vocal fold parameters. We therefore show how acoustic caricatures emphasize identity features at the cost of losing similarity, which allows an analogy to be drawn with caricatures in the visual space.

Affiliation: López, Sabrina. Dynamical Systems Lab, IFIBA-Physics dept, University of Buenos Aires, Pabellón 1, Ciudad Universitaria, CABA 1428EGA, Argentina
Affiliation: Riera, Pablo. Acoustics and Sound Perception Lab, Universidad of Quilmes, Roque Saénz Peña 352, Bernal, Buenos Aires B1876BXD, Argentina
Affiliation: Assaneo, María Florencia. Dynamical Systems Lab, IFIBA-Physics dept, University of Buenos Aires, Pabellón 1, Ciudad Universitaria, CABA 1428EGA, Argentina
Affiliation: Eguía, Manuel. Acoustics and Sound Perception Lab, Universidad of Quilmes, Roque Saénz Peña 352, Bernal, Buenos Aires B1876BXD, Argentina
