Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews
In automatic speech recognition, often little training data is available for
specific challenging tasks, but training of state-of-the-art automatic speech
recognition systems requires large amounts of annotated speech. To address this
issue, we propose a two-staged approach to acoustic modeling that combines
noise and reverberation data augmentation with transfer learning to robustly
address challenges such as difficult acoustic recording conditions, spontaneous
speech, and speech of elderly people. We evaluate our approach using the
example of German oral history interviews, where a relative average reduction
of the word error rate by 19.3% is achieved.
Comment: Accepted for IEEE International Conference on Multimedia and Expo
(ICME), Shanghai, China, July 201
Deliverable D7.7 Dissemination and Standardisation Report v3
This deliverable presents the LinkedTV dissemination and standardisation report for the project period of months 31 to 42 (April 2014 to March 2015).
Deliverable D9.3 Final Project Report
This document comprises the final report of LinkedTV. It includes a publishable summary, a plan for use and dissemination of foreground, and a report covering the wider societal implications of the project in the form of a questionnaire.
Deliverable D1.4 Visual, text and audio information analysis for hypervideo, final release
Following extensive evaluation of the technologies included in the first release of the WP1 multimedia analysis tools, using content from the LinkedTV scenarios and participation in international benchmarking activities, concrete decisions were made regarding the appropriateness and importance of each individual method or combination of methods. Combined with an updated list of information needs for each scenario, these decisions led to a new set of analysis requirements to be addressed by the final release of WP1 analysis techniques. To this end, coordinated efforts in three directions, namely (a) improving the accuracy and time efficiency of a number of methods, (b) developing new technologies, and (c) defining synergies between methods to obtain new types of information via multimodal processing, resulted in the final set of multimedia analysis methods for video hyperlinking. Moreover, the developed analysis modules have been integrated into a web-based infrastructure, allowing the fully automatic linking of the multitude of WP1 technologies and the overall LinkedTV platform.