Search CORE

173,708 research outputs found

Henri Temianka (Miscellaneous Items)

Author: Pacific Audio-Visual Enterprises
Publication venue: Chapman University Digital Commons
Publication date: 01/01/1980
Field of study

This collection contains material pertaining to the life, career, and activities of Henri Temianka, violin virtuoso, conductor, music teacher, and author. Materials include correspondence, concert programs and flyers, music scores, photographs, and books.https://digitalcommons.chapman.edu/temianka_ephemera/1176/thumbnail.jp

Chapman University Digital Commons

A-V Clearing House No. I

Author: ACRL Audio-Visual Committee
Publication venue: Association of College and Research Libraries. American Library Association
Publication date: 01/07/1955
Field of study

published or submitted for publicatio

Illinois Digital Environment for Access to Learning and Scholarship Repository

Audio-Visual Materials

Author: Nolan John L.
Publication venue: Graduate School of Library and Information Science. University of Illinois at Urbana-Champaign
Publication date: 01/01/1961
Field of study

published or submitted for publicatio

Illinois Digital Environment for Access to Learning and Scholarship Repository

Audio-visual speech recognition with background music using single-channel source separation

Author: Erdogan Hakan
Erdoğan Hakan
Grais Emad Mounir
Topkaya İbrahim Saygın
Topkaya Ibrahim Saygin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

In this paper, we consider audio-visual speech recognition with background music. The proposed algorithm is an integration of audio-visual speech recognition and single channel source separation (SCSS). We apply the proposed algorithm to recognize spoken speech that is mixed with music signals. First, the SCSS algorithm based on nonnegative matrix factorization (NMF) and spectral masks is used to separate the audio speech signal from the background music in magnitude spectral domain. After speech audio is separated from music, regular audio-visual speech recognition (AVSR) is employed using multi-stream hidden Markov models. Employing two approaches together, we try to improve recognition accuracy by both processing the audio signal with SCSS and supporting the recognition task with visual information. Experimental results show that combining audio-visual speech recognition with source separation gives remarkable improvements in the accuracy of the speech recognition system

CiteSeerX

Crossref

University of Surrey

Sabanci University Research Database

Surrey Research Insight

Self-Supervised Audio-Visual Co-Segmentation

Author: Gan Chuang
McDermott Josh
Rouditchenko Andrew
Torralba Antonio
Zhao Hang
Publication venue
Publication date: 18/04/2019
Field of study

Segmenting objects in images and separating sound sources in audio are challenging tasks, in part because traditional approaches require large amounts of labeled data. In this paper we develop a neural network model for visual object segmentation and sound source separation that learns from natural videos through self-supervision. The model is an extension of recently proposed work that maps image pixels to sounds. Here, we introduce a learning approach to disentangle concepts in the neural networks, and assign semantic categories to network feature channels to enable independent image segmentation and sound source separation after audio-visual training on videos. Our evaluations show that the disentangled model outperforms several baselines in semantic segmentation and sound source separation.Comment: Accepted to ICASSP 201

arXiv.org e-Print Archive

Crossref

DSpace@MIT

Software Defined Media: Virtualization of Audio-Visual Services

Author: Esaki Hiroshi
Ikeda Masahiro
Kasuya Takashi
Niwa Kenta
Ogawa Keiko
Saito Shoichiro
Sone Takuro
Sunahara Hideki
Tsukada Manabu
Publication venue
Publication date: 23/02/2017
Field of study

Internet-native audio-visual services are witnessing rapid development. Among these services, object-based audio-visual services are gaining importance. In 2014, we established the Software Defined Media (SDM) consortium to target new research areas and markets involving object-based digital media and Internet-by-design audio-visual environments. In this paper, we introduce the SDM architecture that virtualizes networked audio-visual services along with the development of smart buildings and smart cities using Internet of Things (IoT) devices and smart building facilities. Moreover, we design the SDM architecture as a layered architecture to promote the development of innovative applications on the basis of rapid advancements in software-defined networking (SDN). Then, we implement a prototype system based on the architecture, present the system at an exhibition, and provide it as an SDM API to application developers at hackathons. Various types of applications are developed using the API at these events. An evaluation of SDM API access shows that the prototype SDM platform effectively provides 3D audio reproducibility and interactiveness for SDM applications.Comment: IEEE International Conference on Communications (ICC2017), Paris, France, 21-25 May 201

arXiv.org e-Print Archive

Crossref