5 research outputs found
Recommended from our members
The Challenge of Spoken Language Systems: Research Directions for the Nineties
A spoken language system combines speech recognition, natural language processing and human interface technology. It functions by recognizing the person's words, interpreting the sequence of words to obtain a meaning in terms of the application, and providing an appropriate response back to the user. Potential applications of spoken language systems range from simple tasks, such as retrieving information from an existing database (traffic reports, airline schedules), to interactive problem solving tasks involving complex planning and reasoning (travel planning, traffic routing), to support for multilingual interactions. We examine eight key areas in which basic research is needed to produce spoken language systems: (1) robust speech recognition; (2) automatic training and adaptation; (3) spontaneous speech; (4) dialogue models; (5) natural language response generation; (6) speech synthesis and speech generation; (7) multilingual systems; and (8) interactive multimodal systems. In each area, we identify key research challenges, the infrastructure needed to support research, and the expected benefits. We conclude by reviewing the need for multidisciplinary research, for development of shared corpora and related resources, for computational support and far rapid communication among researchers. The successful development of this technology will increase accessibility of computers to a wide range of users, will facilitate multinational communication and trade, and will create new research specialties and jobs in this rapidly expanding area
Recommended from our members
The Challenge of Spoken Language Systems: Research Directions for the Nineties
A spoken language system combines speech recognition, natural language processing and human interface technology. It functions by recognizing the person's words, interpreting the sequence of words to obtain a meaning in terms of the application, and providing an appropriate response back to the user. Potential applications of spoken language systems range from simple tasks, such as retrieving information from an existing database (traffic reports, airline schedules), to interactive problem solving tasks involving complex planning and reasoning (travel planning, traffic routing), to support for multilingual interactions. We examine eight key areas in which basic research is needed to produce spoken language systems: (1) robust speech recognition; (2) automatic training and adaptation; (3) spontaneous speech; (4) dialogue models; (5) natural language response generation; (6) speech synthesis and speech generation; (7) multilingual systems; and (8) interactive multimodal systems. In each area, we identify key research challenges, the infrastructure needed to support research, and the expected benefits. We conclude by reviewing the need for multidisciplinary research, for development of shared corpora and related resources, for computational support and far rapid communication among researchers. The successful development of this technology will increase accessibility of computers to a wide range of users, will facilitate multinational communication and trade, and will create new research specialties and jobs in this rapidly expanding area
Robust Multichannel Microphone Beamforming
In this thesis, a method for the design and implementation of a spatially robust multichannel microphone beamforming system is presented.
A set of spatial correlation functions are derived for 2D and 3D far-field/near-field scenarios based on von Mises(-Fisher), Gaussian, and uniform source location distributions. These correlation functions are used to design spatially robust beamformers and blocking beamformers (nullformers) designed to enhance or suppress a known source, where the target source location is not perfectly known due to either an incorrect location estimate or movement of the target while the beamformers are active.
The spatially robust beam/null-formers form signal and interferer plus noise references which can be further processed via a blind source separation algorithm to remove mutual components - removing the interference and sensor noise from the signal path and vice versa. The noise reduction performance of the combined beamforming and blind source separation system approaches that of a perfect information MVDR beamformer under reverberant conditions.
It is demonstrated that the proposed algorithm can be implemented on low-power hardware with good performance on hardware similar to current mobile platforms using a four-element microphone array
2D and 3D visualization of acoustic waves by optical feedback interferometry
The visualization of physical phenomena is one of the challenges that researchers are trying to overcome by designing and implementing different sensors that provide information close to realitythrough changes in one of the parameters they measure. Historically, the visualization of variations in physical phenomena has allowed for a better understanding of the problem being studied and has changed our perception of the world and ourselves forever. Over the last 300 years, in particular, many methods have been developed to visualize sound through a visual representation. In the field of acoustics, scientists have attempted to develop a visual representation of sound waves using transducers detecting two fundamental components of sound: sound pressure and particle velocity. In other words, the measurement of kinetic energy and potential, whose quantities provide information on the physical phenomenon of acoustic propagation. In this summary, we briefly present the work of the thesis entitled "2D and 3D Visualizations of Acoustic Waves by Optical Feedback Interferometry" in which a new visualization tool for acoustic phenomena was developed. This system is based on an optical sensor said reinjection in a laser diode and allows to reconstruct in 2D and 3D the image of a propagating acoustic wave. The manuscript is divided into 3 chapters: • a first chapter presents the known methods for the visualization of the acoustic phenomena and presents the context of the research carried out, • a second chapter, allows to detail the principle of measurement and its application to the realization of a two-dimensional image of the acoustic wave • finally, in the last chapter, we demonstrate how a tomographic method can be used to create a three-dimensional image