RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis
Environmental sound synthesis is a technique for generating a natural
environmental sound. Conventional work on environmental sound synthesis using
sound event labels cannot finely control properties of synthesized sounds such as
pitch and timbre. We consider that onomatopoeic words can be used for
environmental sound synthesis. Onomatopoeic words are effective for describing
the features of sounds. We believe that using onomatopoeic words will enable us
to control the fine time-frequency structure of synthesized sounds. However,
there is no dataset available for environmental sound synthesis using
onomatopoeic words. In this paper, we thus present RWCP-SSD-Onomatopoeia, a
dataset consisting of 155,568 onomatopoeic words paired with audio samples for
environmental sound synthesis. We also collected self-reported confidence
scores and others-reported acceptance scores of onomatopoeic words, to help us
investigate the difficulty in the transcription and selection of a suitable
word for environmental sound synthesis.
Comment: Submitted to DCASE2020 workshop