9,991 research outputs found

    Multi-session group scenarios for speech interface design

    Get PDF
    When developing adaptive speech-based multilingual interaction systems, we need representative data on the user's behaviour. In this paper we focus on a data collection method pertaining to adaptation in the user's interaction with the system. We describe a multi-session group scenario for Wizard of Oz studies with two novel features: firstly, instead of doing solo sessions with a static mailbox, our test users communicated with each other in a group of six, and secondly, the communication took place over several sessions in a period of five to eight days. The paper discusses our data collection studies using the method, concentrating on the usefulness of the method in terms of naturalness of the interaction and long-term developments

    The development and implementation of a single-line intelligent digital telephone answering unit on a personal computer

    Get PDF
    ThesisCommercial telephone answering machines are limited to some extent by one or more of the following factors: • limited facilities • difficult to upgrade • nonstandard telephone interfacing • expensive • lack of user-friendliness • lack of dialogue and intelligence The purpose of this study is to design an intelligent digital telephone system which will overcome as many of the above-mentioned problems as possible. The following features are proposed and will be discussed: The use of a commonly available, but powerful, personal computer processor and memory instead of the elementary and rigid processor and magnetic tape storage units of the commercial telephone answering machine . This allows the quick storage and retrieval of digitized messages, each with its individual name, time and date stamp. Using the personal computer's hardware and not duplicating the processor and memory units allows a more cost-effective system upgrade. Upgrades mainly consist of software changes and minor hardware changes. This means that an upgrade does not implicate a total hardware redesign. Standards as prescribed by the local switching network standards and the Department of Post and Telecommunications, apply to this design and are applicable for licensing of the product. It is evident that the cost of this project and design is kept minimal by not duplicating expensive components like the microprocessor and the memory units, although these are used in the design. In this respect upgrades are software orientated to further limit the costs. The personal computer is equipped with a display which allows the user to make easy selections in order to execute the required instructions or to obtain information by using the help functions. This real-time help function eliminates the need for a user manual. Dialogue between user and personal computer over the telephone network offers a simple method of delivering information without the need for any extra equipment such as modems, keyboards or display units. The software used on the personal computer is designed in such a way that the system is intelligent and capable of decision making. Communication from the public telephone network is possible by using the telephone keypad and Dual Tone Multifrequency (DTMF) signalling

    Alcohol Language Corpus

    Get PDF
    The Alcohol Language Corpus (ALC) is the first publicly available speech corpus comprising intoxicated and sober speech of 162 female and male German speakers. Recordings are done in the automotive environment to allow for the development of automatic alcohol detection and to ensure a consistent acoustic environment for the alcoholized and the sober recording. The recorded speech covers a variety of contents and speech styles. Breath and blood alcohol concentration measurements are provided for all speakers. A transcription according to SpeechDat/Verbmobil standards and disfluency tagging as well as an automatic phonetic segmentation are part of the corpus. An Emu version of ALC allows easy access to basic speech parameters as well as the us of R for statistical analysis of selected parts of ALC. ALC is available without restriction for scientific or commercial use at the Bavarian Archive for Speech Signals

    Voice Command Controller

    Get PDF
    Signal processing technology has been strongly developed and it has attracted interest from scientists and engineers around the world from the last decade. Speech synthesis and speech recognition are particular topic in the field that have been widely used and developed in many different area such as business, controlling, education and entertainment. The project\u27s main objective is to study and develop an application program with the Speech SDK through design and implementation of Tele-Control system based on the commercial product of National Semiconductor: Carrier-Current Transceiver (LM 1893) and Speech development kit (Speech SDK4.0) from Microsoft Corporation. The project is suitable to be used in restricted areas where space, wiring, decoration and signal interference are issues of concerned. Speech SDK is an interesting and useful tool in helping develop a Voice application programs. In this project, the user can use voice command interact with the control program to control a remote device. In conjunction with hardware modification, extra function can be added to the program such as controlling camera, video capture and position control buttons on the environment map, the project will be suitable for security purposes

    Telephony-based email application

    Get PDF
    The aim of this project is to provide user to gain access of telephony services such as placing an outgoing calls, answering calls, playing back an announcement to the callers, recording caller\u27s messages, playing back recorded messages and finally establishing a dial up connection to send the recorded message to an e-mail address. To do that, the application will be implementing and combining some of Windows Application Programming Interface (API) functions as follows : 1. Telephony Programming Interface (TAPI) ver. 1.4. 2. Multimedia Control Interface (MCI Visual Basic 6\u27s Multimedia control). 3. Remote Access Service (RAS API) 4. Messaging Application Programming Interface (MAPI Visual Basic 6\u27s MAPI control). 5. Windows Sockets (Visual Basic 6\u27s WinSocks control)

    The audio-graphical interface to a personal integrated telecommunications system

    Get PDF
    Thesis (M.S.V.S.)--Massachusetts Institute of Technology, Dept. of Architecture, 1984.Includes bibliographical references (leaves 80-88).The telephone is proposed as an environment for exploring conversational computer systems. A personal communications system is developed which supports multi-modal access to multi-media mail. It is a testbed for developing novel methods of interactive information retrieval that are as intuitive and useful as the spoken word. A personalized telecommunications management system that handles both voice and electronic mail mess.ages through a unified user interface is described. Incoming voice messages are gathered via a conversational answering machine. Known callers are identified with a speech recognition unit so they can receive personal outgoing recordings. The system's owner accesses messages over the telephone by voice using natural language queries, or with the telephone keypad. Electronic mail messages and system status are transmitted by a text-to-speech synthesizer. Local access is provided by a touch sensitive screen and color raster display. Text and digitized voice messages are randomly accessible through graphical ideograms. A Rolodex-style directory permits dialing-by-name and the creation of outgoing recordings for individuals or mailing lists. Note: A 3/4 inch color U-matic video cassette accompanies this thesis, it is five minutes in length, and has an English narrative.by Barry Michael Arons.M.S.V.S
    • …
    corecore