Search CORE

9,991 research outputs found

Recommended from our members

Towards a tool for the subjective assessment of speech system interfaces (SASSI)

Author: Graham R
Hone KS
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/09/2000
Field of study

Applications of speech recognition are now widespread, but user-centred evaluation methods are necessary to ensure their success. Objective evaluation techniques are fairly well established, but previous subjective techniques have been unstructured and unproven. This paper reports on the first stage of the development of a questionnaire measure for the Subjective Assessment of Speech System Interfaces (SASSI). The aim of the research programme is to produce a valid, reliable and sensitive measure of users' subjective experiences with speech recognition systems. Such a technique could make an important contribution to theory and practice in the design and evaluation of speech recognition systems according to best human factors practice. A prototype questionnaire was designed, based on established measures for evaluating the usability of other kinds of user interface, and on a review of the research literature into speech system design. This consisted of 50 statements with which respondents rated their level of agreement. The questionnaire was given to users of four different speech applications, and Exploratory Factor Analysis of 214 completed questionnaires was conducted. This suggested the presence of six main factors in users' perceptions of speech systems: System Response Accuracy, Likeability, Cognitive Demand, Annoyance, Habitability and Speed. The six factors have face validity, and a reasonable level of statistical reliability. The findings form a userful theoretical and practical basis for the subjective evaluation of any speech recognition interface. However, further work is recommended, to establish the validity and sensitivity of the approach, before a final tool can be produced which warrants general use

Brunel University Research Archive

Design of a smartphone with a Digital Signal Processor

Author: Lecluse Joep
Publication venue
Publication date: 01/01/1996
Field of study

Repository TU/e

Pure OAI Repository

Multi-session group scenarios for speech interface design

Author: Cheadle Maria
Gambäck Björn
Hansen Preben
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2003
Field of study

When developing adaptive speech-based multilingual interaction systems, we need representative data on the user's behaviour. In this paper we focus on a data collection method pertaining to adaptation in the user's interaction with the system. We describe a multi-session group scenario for Wizard of Oz studies with two novel features: firstly, instead of doing solo sessions with a static mailbox, our test users communicated with each other in a group of six, and secondly, the communication took place over several sessions in a period of five to eight days. The paper discusses our data collection studies using the method, concentrating on the usefulness of the method in terms of naturalness of the interaction and long-term developments

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

The development and implementation of a single-line intelligent digital telephone answering unit on a personal computer

Author: du Plessis. Marchand Charl
Publication venue: Bloemfontein: Central University of Technology, Free State
Publication date: 01/01/1998
Field of study

ThesisCommercial telephone answering machines are limited to some extent by one or more of the following factors: • limited facilities • difficult to upgrade • nonstandard telephone interfacing • expensive • lack of user-friendliness • lack of dialogue and intelligence The purpose of this study is to design an intelligent digital telephone system which will overcome as many of the above-mentioned problems as possible. The following features are proposed and will be discussed: The use of a commonly available, but powerful, personal computer processor and memory instead of the elementary and rigid processor and magnetic tape storage units of the commercial telephone answering machine . This allows the quick storage and retrieval of digitized messages, each with its individual name, time and date stamp. Using the personal computer's hardware and not duplicating the processor and memory units allows a more cost-effective system upgrade. Upgrades mainly consist of software changes and minor hardware changes. This means that an upgrade does not implicate a total hardware redesign. Standards as prescribed by the local switching network standards and the Department of Post and Telecommunications, apply to this design and are applicable for licensing of the product. It is evident that the cost of this project and design is kept minimal by not duplicating expensive components like the microprocessor and the memory units, although these are used in the design. In this respect upgrades are software orientated to further limit the costs. The personal computer is equipped with a display which allows the user to make easy selections in order to execute the required instructions or to obtain information by using the help functions. This real-time help function eliminates the need for a user manual. Dialogue between user and personal computer over the telephone network offers a simple method of delivering information without the need for any extra equipment such as modems, keyboards or display units. The software used on the personal computer is designed in such a way that the system is intelligent and capable of decision making. Communication from the public telephone network is possible by using the telephone keypad and Dual Tone Multifrequency (DTMF) signalling

Central University Of Technology Free State - LibraryCUT, South Africa

Speech recognition for smart homes

Author: McLoughlin Ian Vince
Sharifzadeh Hamid Reza
Publication venue: 'IntechOpen'
Publication date: 01/11/2008
Field of study

IntechOpen

Crossref

Kent Academic Repository

The Production of Speech Corpora

Author: Baumann Angela
Draxler Christoph
Ellbogen Tania
Schiel Florian
Steffen Alexander
Publication venue
Publication date: 21/03/2012
Field of study

Open Access LMU

Alcohol Language Corpus

Author: Barfüßer Sabine
Heinrich Christian
Schiel Florian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

The Alcohol Language Corpus (ALC) is the first publicly available speech corpus comprising intoxicated and sober speech of 162 female and male German speakers. Recordings are done in the automotive environment to allow for the development of automatic alcohol detection and to ensure a consistent acoustic environment for the alcoholized and the sober recording. The recorded speech covers a variety of contents and speech styles. Breath and blood alcohol concentration measurements are provided for all speakers. A transcription according to SpeechDat/Verbmobil standards and disfluency tagging as well as an automatic phonetic segmentation are part of the corpus. An Emu version of ALC allows easy access to basic speech parameters as well as the us of R for statistical analysis of selected parts of ALC. ALC is available without restriction for scientific or commercial use at the Bavarian Archive for Speech Signals

CiteSeerX

Open Access LMU

Voice Command Controller

Author: Nguyen Hoang Nghia
Publication venue: Edith Cowan University, Research Online, Perth, Western Australia
Publication date: 01/01/2000
Field of study

Signal processing technology has been strongly developed and it has attracted interest from scientists and engineers around the world from the last decade. Speech synthesis and speech recognition are particular topic in the field that have been widely used and developed in many different area such as business, controlling, education and entertainment. The project\u27s main objective is to study and develop an application program with the Speech SDK through design and implementation of Tele-Control system based on the commercial product of National Semiconductor: Carrier-Current Transceiver (LM 1893) and Speech development kit (Speech SDK4.0) from Microsoft Corporation. The project is suitable to be used in restricted areas where space, wiring, decoration and signal interference are issues of concerned. Speech SDK is an interesting and useful tool in helping develop a Voice application programs. In this project, the user can use voice command interact with the control program to control a remote device. In conjunction with hardware modification, extra function can be added to the program such as controlling camera, video capture and position control buttons on the environment map, the project will be suitable for security purposes

Research Online @ ECU

Telephony-based email application

Author: Darta Perdana
Publication venue: RIT Scholar Works
Publication date: 01/01/1999
Field of study

The aim of this project is to provide user to gain access of telephony services such as placing an outgoing calls, answering calls, playing back an announcement to the callers, recording caller\u27s messages, playing back recorded messages and finally establishing a dial up connection to send the recorded message to an e-mail address. To do that, the application will be implementing and combining some of Windows Application Programming Interface (API) functions as follows : 1. Telephony Programming Interface (TAPI) ver. 1.4. 2. Multimedia Control Interface (MCI Visual Basic 6\u27s Multimedia control). 3. Remote Access Service (RAS API) 4. Messaging Application Programming Interface (MAPI Visual Basic 6\u27s MAPI control). 5. Windows Sockets (Visual Basic 6\u27s WinSocks control)

RIT Scholar Works

The audio-graphical interface to a personal integrated telecommunications system

Author: Arons Barry Michael
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1984
Field of study

Thesis (M.S.V.S.)--Massachusetts Institute of Technology, Dept. of Architecture, 1984.Includes bibliographical references (leaves 80-88).The telephone is proposed as an environment for exploring conversational computer systems. A personal communications system is developed which supports multi-modal access to multi-media mail. It is a testbed for developing novel methods of interactive information retrieval that are as intuitive and useful as the spoken word. A personalized telecommunications management system that handles both voice and electronic mail mess.ages through a unified user interface is described. Incoming voice messages are gathered via a conversational answering machine. Known callers are identified with a speech recognition unit so they can receive personal outgoing recordings. The system's owner accesses messages over the telephone by voice using natural language queries, or with the telephone keypad. Electronic mail messages and system status are transmitted by a text-to-speech synthesizer. Local access is provided by a touch sensitive screen and color raster display. Text and digitized voice messages are randomly accessible through graphical ideograms. A Rolodex-style directory permits dialing-by-name and the creation of outgoing recordings for individuals or mailing lists. Note: A 3/4 inch color U-matic video cassette accompanies this thesis, it is five minutes in length, and has an English narrative.by Barry Michael Arons.M.S.V.S

DSpace@MIT