4 research outputs found

    Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams

    Full text link
    Many previous audio-visual voice-related works focus on speech, ignoring the singing voice in the growing number of musical video streams on the Internet. Voice activity detection is a necessary step for processing such diverse musical video data. This paper attempts to detect the speech and singing voices of target performers in musical video streams using audio-visual information. To integrate the audio and visual modalities, a multi-branch network is proposed to learn audio and image representations, which are fused by attention based on semantic similarity so that the probability of anchor vocalization shapes the acoustic representations. Experiments show that the proposed audio-visual multi-branch network far outperforms the audio-only model in challenging acoustic environments, indicating that cross-modal information fusion based on semantic correlation is sensible and successful. Comment: Accepted by INTERSPEECH 2021
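
    To make the fusion mechanism concrete, below is a minimal PyTorch sketch of gating acoustic features by an attention weight derived from audio-visual semantic similarity. It illustrates the general technique only, not the authors' published architecture: the linear branch encoders, feature dimensions, and the three-class (silence/speech/singing) frame-level output are all assumptions.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class CrossModalFusionVAD(nn.Module):
        """Two-branch sketch: gate acoustic features by audio-visual similarity."""

        def __init__(self, audio_dim=64, image_dim=512, embed_dim=128, num_classes=3):
            super().__init__()
            # Hypothetical branch encoders; a real system would use CNN/RNN extractors.
            self.audio_branch = nn.Sequential(nn.Linear(audio_dim, embed_dim), nn.ReLU())
            self.image_branch = nn.Sequential(nn.Linear(image_dim, embed_dim), nn.ReLU())
            # Assumed three frame-level classes: silence / speech / singing.
            self.classifier = nn.Linear(embed_dim, num_classes)

        def forward(self, audio_feats, image_feats):
            # audio_feats: (batch, time, audio_dim); image_feats: (batch, time, image_dim)
            a = self.audio_branch(audio_feats)
            v = self.image_branch(image_feats)
            # Attention weight from cross-modal semantic similarity: close to 1 when
            # both modalities suggest the anchor performer is vocalizing.
            sim = F.cosine_similarity(a, v, dim=-1)          # (batch, time)
            gate = torch.sigmoid(sim).unsqueeze(-1)          # (batch, time, 1)
            fused = gate * a                                 # shaped acoustic features
            return self.classifier(fused)                    # per-frame logits

    # Usage with random tensors standing in for log-mel and face-image features.
    model = CrossModalFusionVAD()
    logits = model(torch.randn(2, 100, 64), torch.randn(2, 100, 512))
    print(logits.shape)  # torch.Size([2, 100, 3])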

    Concept and key technology analysis of deep-sea walking-swimming robot

    No full text
    Deep-sea robots are very useful in deep-sea engineering. Based on a comparison and analysis of current deep-sea robots, this paper proposes a novel concept for a deep-sea walking-swimming robot, the purpose of which is to swim extensively in the sea and walk stably on the seafloor. The overall proposal, specifications and characteristics of the deep-sea walking-swimming robot are introduced. After an analysis of its environmental and functional characteristics, key techniques are presented: regulation of the robot's walking/swimming attitude, cooperative anti-turbulence control of multiple legs and joints in ocean currents, path planning for low energy consumption, dynamic sealing of deep-sea joints, and integration and optimization of the overall design. These show that it is quite different from traditional underwater and multi-legged robots. Finally, the research progress on the above-mentioned techniques is also presented.
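
    Of the key techniques listed, path planning for low energy consumption is the most directly algorithmic. One standard way to frame it is as a lowest-cost search over a grid whose cell costs model energy expenditure, as in the sketch below. The grid map, per-cell energy costs, and 4-connected walking moves are illustrative assumptions; the paper's actual planner is not described here.

    import heapq

    def min_energy_path(energy, start, goal):
        """Dijkstra over a 2D grid; energy[r][c] is the assumed cost to enter a cell."""
        rows, cols = len(energy), len(energy[0])
        dist = {start: 0}
        prev = {}
        heap = [(0, start)]
        while heap:
            d, (r, c) = heapq.heappop(heap)
            if (r, c) == goal:
                break
            if d > dist[(r, c)]:
                continue  # stale queue entry
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):  # 4-connected moves
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols:
                    nd = d + energy[nr][nc]
                    if nd < dist.get((nr, nc), float("inf")):
                        dist[(nr, nc)] = nd
                        prev[(nr, nc)] = (r, c)
                        heapq.heappush(heap, (nd, (nr, nc)))
        # Reconstruct the lowest-energy route from goal back to start.
        path, node = [], goal
        while node != start:
            path.append(node)
            node = prev[node]
        path.append(start)
        return path[::-1], dist[goal]

    # Example: larger values model strong currents or soft sediment to avoid.
    grid = [[1, 3, 1],
            [1, 9, 1],
            [1, 1, 1]]
    print(min_energy_path(grid, (0, 0), (2, 2)))  # route around the high-cost cell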
