Voice-processing technologies--their application in telecommunications.

Author: J. G. Wilpon
Publication venue: Proceedings of the National Academy of Sciences

Telephony-based email application

Author: Darta Perdana
Publication venue: RIT Scholar Works
Publication date: 01/01/1999

The aim of this project is to give the user access to telephony services: placing outgoing calls, answering calls, playing an announcement to callers, recording callers' messages, playing back recorded messages, and finally establishing a dial-up connection to send a recorded message to an e-mail address. To do so, the application implements and combines functions from several Windows application programming interfaces (APIs):
1. Telephony Application Programming Interface (TAPI), version 1.4
2. Media Control Interface (MCI), through Visual Basic 6's Multimedia control
3. Remote Access Service (RAS) API
4. Messaging Application Programming Interface (MAPI), through Visual Basic 6's MAPI control
5. Windows Sockets, through Visual Basic 6's Winsock control

RIT Scholar Works

A FRAMEWORK FOR INTELLIGENT VOICE-ENABLED E-EDUCATION SYSTEMS

Author: Azeta A. A.
Publication date: 01/03/2012

Covenant University Repository

Real-Time Reconfigurable Adaptive Speech Recognition Command and Control Apparatus and Method

Authors: Haynes Dena S.; Salazar George A.; Sommers Marc J.

An adaptive speech recognition and control system and method is discussed for controlling various mechanisms and systems in response to spoken instructions, in which spoken commands direct the system to the appropriate memory nodes and to the memory templates corresponding to the voiced command. Spoken commands from any of a group of operators for which the system is trained may be identified, and voice templates are updated as required in response to changes over time in the pronunciation and voice characteristics of any of those operators. Provisions are made both for near-real-time retraining of the system on individual terms that are determined not to be positively identified, and for an overall system training and updating process in which recognition of each command and vocabulary term is checked and the memory templates are retrained, if necessary, for the respective commands or vocabulary terms with respect to the operator currently using the system. In one embodiment, the system includes input circuitry connected to a microphone, with signal processing and control sections that sense the level of vocabulary recognition over a given period and, if recognition performance falls below a given level, process audio-derived signals to enhance the recognition performance of the system.

NASA Technical Reports Server

The audio-graphical interface to a personal integrated telecommunications system

Author: Arons Barry Michael
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1984

Thesis (M.S.V.S.)--Massachusetts Institute of Technology, Dept. of Architecture, 1984. Includes bibliographical references (leaves 80-88). The telephone is proposed as an environment for exploring conversational computer systems. A personal communications system is developed which supports multi-modal access to multi-media mail. It is a testbed for developing novel methods of interactive information retrieval that are as intuitive and useful as the spoken word. A personalized telecommunications management system that handles both voice and electronic mail messages through a unified user interface is described. Incoming voice messages are gathered via a conversational answering machine. Known callers are identified with a speech recognition unit so they can receive personal outgoing recordings. The system's owner accesses messages over the telephone by voice using natural language queries, or with the telephone keypad. Electronic mail messages and system status are transmitted by a text-to-speech synthesizer. Local access is provided by a touch sensitive screen and color raster display. Text and digitized voice messages are randomly accessible through graphical ideograms. A Rolodex-style directory permits dialing-by-name and the creation of outgoing recordings for individuals or mailing lists. Note: A 3/4 inch color U-matic video cassette accompanies this thesis; it is five minutes in length and has an English narrative. By Barry Michael Arons. M.S.V.S.

DSpace@MIT

Information Outlook, February 1999

Author: Special Libraries Association
Publication venue: SJSU ScholarWorks
Publication date: 01/02/1997

Volume 3, Issue 2

SJSU ScholarWorks

Automatic speech recognition: from study to practice

Author: Sara Sharifzadeh
Publication date: 01/01/2010

Today, automatic speech recognition (ASR) is widely used in areas such as robotics, multimedia, and medical and industrial applications. Although much research has been carried out in this field over the past decades, there is still a lot of room for further work. To start working in this area, a thorough knowledge of ASR systems, including their weak points and problems, is essential. Practical experience also helps consolidate the theoretical knowledge. With this in mind, this master's thesis first reviews the principal structure of standard HMM-based ASR systems from a technical point of view, covering feature extraction, acoustic modeling, language modeling, and decoding. Then the most significant challenges in ASR systems are discussed. These challenges concern characteristics of the internal components or external factors that affect the performance of ASR systems. Furthermore, a Spanish-language recognizer has been implemented using the HTK toolkit. Finally, two open research lines, drawn from a study of different sources in the field of ASR, are suggested for future work.

Loughborough University Institutional Repository

Using VXML to construct a speech browser for a public-domain SpeechWeb

Author: Su Li
Publication venue: University of Windsor Leddy Library
Publication date: 01/01/2006

Despite the fact that interpreters for the voice-application markup language VXML have been available for around five years, there is very little evidence of the emergence of a public-domain SpeechWeb. This is in contrast to the huge growth of the conventional web only a few years after the introduction of HTML. One reason for this is that architectures for distributed speech applications are not conducive to public involvement in the creation and deployment of speech applications. In previous research, a new architecture for a public-domain SpeechWeb has been proposed. However, a non-proprietary speech browser is needed for this new architecture. In this thesis, it is shown that through a novel use of VXML, a viable public-domain SpeechWeb browser can be built as a single VXML page. This thesis is proven through the development and implementation of a single VXML page SpeechWeb browser. Paper copy at Leddy Library: Theses & Major Papers - Basement, West Bldg. / Call Number: Thesis2005 .S8. Source: Masters Abstracts International, Volume: 45-01, page: 0366. Thesis (M.Sc.)--University of Windsor (Canada), 2006

Scholarship at UWindsor