Search CORE

14,609 research outputs found

The voice activity detection (VAD) recorder and VAD network recorder : a thesis presented in partial fulfilment of the requirements for the degree of Master of Science in Computer Science at Massey University

Author: Liu Feng
Publication venue: 'Massey University'
Publication date: 01/01/2001
Field of study

The project is to provide a feasibility study for the AudioGraph tool, focusing on two application areas: the VAD (voice activity detector) recorder and the VAD network recorder. The first one achieves a low bit-rate speech recording on the fly, using a GSM compression coder with a simple VAD algorithm; and the second one provides two-way speech over IP, fulfilling echo cancellation with a simplex channel. The latter is required for implementing a synchronous AudioGraph. In the first chapter we introduce the background of this project, specifically, the VoIP technology, the AudioGraph tool, and the VAD algorithms. We also discuss the problems set for this project. The second chapter presents all the relevant techniques in detail, including sound representation, speech-coding schemes, sound file formats, PowerPlant and Macintosh programming issues, and the simple VAD algorithm we have developed. The third chapter discusses the implementation issues, including the systems' objective, architecture, the problems encountered and solutions used. The fourth chapter illustrates the results of the two applications. The user documentations for the applications are given, and after that, we analyse the parameters based on the results. We also present the default settings of the parameters, which could be used in the AudioGraph system. The last chapter provides conclusions and future work

Massey Research Online

Recommended from our members

Mobile Audiovisual Terminal: System Design and Subjective Testing in DECT and UMTS networks

Author: Cosmas J
Gill D
Pearmain A
Publication venue: IEEE*
Publication date: 01/07/2000
Field of study

It is anticipated that there will shortly be a requirement for multimedia terminals that operate via mobile communications systems. This paper presents a functional specification for such a terminal operating at 32 kb/s in a digital European cordless telecommunications (DECT) and universal mobile telecommunications system (UMTS) radio network. A terminal has been built, based on a PC with digital signal processor (DSP) boards for audio and video coding and decoding. Speech coding is by a phonetically driven code-excited linear prediction (CELP) speech coder and video coding by a block-oriented hybrid discrete cosine transform (DCT) coder. Separate channel coding is provided for the audio and video data. The paper describes the techniques used for audio and video coding, channel coding, and synchronization. Methods of subjective testing in a DECT network and in a UMTS network are also described. These consisted of subjective tests of first impressions of the mobile audio–visual terminal (MAVT) quality, interactive tests, and the completion of an exit questionnaire. The test results showed that the quality of the audio was sufficiently good for comprehension and the video was sufficiently good for following and repeating simple mechanical tasks. However, the quality of the MAVT was not good enough for general use where high-quality audio and video was needed, especially when transmission was in a noisy radio environment

Brunel University Research Archive

A Utility-based QoS Model for Emerging Multimedia Applications

Author: Garcia Francisco
Mauthe Andreas
Mu Mu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008
Field of study

Existing network QoS models do not sufficiently reflect the challenges faced by high-throughput, always-on, inelastic multimedia applications. In this paper, a utility-based QoS model is proposed as a user layer extension to existing communication QoS models to better assess the requirements of multimedia applications and manage the QoS provisioning of multimedia flows. Network impairment utility functions are derived from user experiments and combined to application utility functions to evaluate the application quality. Simulation is used to demonstrate the validity of the proposed QoS model

CiteSeerX

Crossref

Lancaster E-Prints

Near-Instantaneously Adaptive HSDPA-Style OFDM Versus MC-CDMA Transceivers for WIFI, WIMAX, and Next-Generation Cellular Systems

Author: Choi B.J.
Hanzo Lajos
Publication venue
Publication date: 01/12/2007
Field of study

Burts-by-burst (BbB) adaptive high-speed downlink packet access (HSDPA) style multicarrier systems are reviewed, identifying their most critical design aspects. These systems exhibit numerous attractive features, rendering them eminently eligible for employment in next-generation wireless systems. It is argued that BbB-adaptive or symbol-by-symbol adaptive orthogonal frequency division multiplex (OFDM) modems counteract the near instantaneous channel quality variations and hence attain an increased throughput or robustness in comparison to their fixed-mode counterparts. Although they act quite differently, various diversity techniques, such as Rake receivers and space-time block coding (STBC) are also capable of mitigating the channel quality variations in their effort to reduce the bit error ratio (BER), provided that the individual antenna elements experience independent fading. By contrast, in the presence of correlated fading imposed by shadowing or time-variant multiuser interference, the benefits of space-time coding erode and it is unrealistic to expect that a fixed-mode space-time coded system remains capable of maintaining a near-constant BER

Southampton (e-Prints Soton)

On the evaluation of the conversational speech quality in telecommunications

Author: Barriac Vincent
Faucon Gérard
Gautier-Turbin Valérie
Guéguin Marie
Le Bouquin Jeannès Régine
Publication venue: 'Hindawi Limited'
Publication date: 11/03/2008
Field of study

International audienceIn this paper we propose an objective method to assess speech quality in the conversational context by taking into account the talking and listening speech qualities and the impact of delay. This approach is applied to the results of four subjective tests on the effects of echo, delay, packet loss and noise. The dataset is divided into training and validation sets. For the training set, a multiple linear regression is applied to determine a relationship between conversational, talking and listening speech qualities and the delay value. The multiple linear regression leads to an accurate estimation of the conversational scores with high correlation and low error between subjective and estimated scores, both on the training and validation sets. In addition, a validation is performed on the data of a subjective test found in the literature which confirms the reliability of the regression. The relationship is then applied to an objective level by replacing talking and listening subjective scores with talking and listening objective scores provided by existing objective models, fed by speech signals recorded during the subjective tests. The conversational model achieves high perfor- mance as revealed by comparison with the test results and with the existing standard methodology “E-model”, presented in the ITU-T (International Telecommunication Union) Recommendation G.107

Springer - Publisher Connector

Directory of Open Access Journals

HAL-Rennes 1

The MobyDick Project: A Mobile Heterogeneous All-IP Architecture

Author: Aguiar Rui L.
Einsiedler H.
Gozdecki J.
Jonas K.
Jähnert Jürgen
Liebsch Marco
Moreno Novella José Ignacio
Pacyna P.
Papir Z.
Schmitz R.
Soto Ignacio
Publication venue: FEEACSE Press
Publication date: 01/01/2001
Field of study

Proceedings of Advanced Technologies, Applications and Market Strategies for 3G (ATAMS 2001). Cracow, Poland: 17-20 June, 2001.This paper presents the current stage of an IP-based architecture for heterogeneous environments, covering UMTS-like W-CDMA wireless access technology, wireless and wired LANs, that is being developed under the aegis of the IST Moby Dick project. This architecture treats all transmission capabilities as basic physical and data-link layers, and attempts to replace all higher-level tasks by IP-based strategies. The proposed architecture incorporates aspects of mobile-IPv6, fast handover, AAA-control, and Quality of Service. The architecture allows for an optimised control on the radio link layer resources. The Moby dick architecture is currently under refinement for implementation on field trials. The services planned for trials are data transfer and voice-over-IP.Publicad

CiteSeerX

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

pub H-BRS - Publikationsserver der Hochschule Bonn-Rhein-Sieg

To train or not to train?

Author: De Turck Filip
Staelens Nicolas
Van den Broeck W
Publication venue
Publication date: 01/01/2013
Field of study

Ghent University Academic Bibliography

A 10-Point Agenda for Comprehensive Telecom Reform

Author: Adam D. Thierer
Publication venue: 'Cato Institute'
Publication date: 05/05/2001
Field of study

Changing committee chairmanships in Congress and a leadership shakeup at the Federal Communications Commission have once again opened a window of opportunity for comprehensive telecommunications policy reform. While new faces are taking over within Congress and at the FCC, however, old issues continue to dominate the telecom policy landscape. This is largely due to the fact that, when Congress last attempted to address these matters five years ago by passing the historic Telecommunications Act of 1996, legislators intentionally avoided providing clear deregulatory objectives for the FCC and instead delegated broad and remarkably ambiguous authority to the agency. That left the most important deregulatory decisions to the FCC, and, not surprisingly, the agency did a very poor job of following through with a serious liberalization agenda. The Telecom Act, with its backward-looking focus on correcting the market problems of a bygone era, has been a failure. Instead of thoroughly clearing out the regulatory deadwood of the past, legislators and regulators have engaged in an effort to rework regulatory paradigms that where outmoded decades ago. In short, it was an analog act for an increasingly digital world. The new leadership in Congress and the FCC should adopt a fresh approach based on deregulation and free markets

IssueLab

Perceptual techniques in audio quality assessment

Author: Rix Antony W.
Publication venue: The University of Edinburgh
Publication date: 01/01/2003
Field of study

Edinburgh Research Archive

Particle Filter Design Using Importance Sampling for Acoustic Source Localisation and Tracking in Reverberant Environments

Author: Lehmann Eric A
Williamson Robert C
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2006
Field of study

Sequential Monte Carlo methods have been recently proposed to deal with the problem of acoustic source localisation and tracking using an array of microphones. Previous implementations make use of the basic bootstrap particle filter, whereas a more general approach involves the concept of importance sampling. In this paper, we develop a new particle filter for acoustic source localisation using importance sampling, and compare its tracking ability with that of a bootstrap algorithm proposed previously in the literature. Experimental results obtained with simulated reverberant samples and real audio recordings demonstrate that the new algorithm is more suitable for practical applications due to its reinitialisation capabilities, despite showing a slightly lower average tracking accuracy. A real-time implementation of the algorithm also shows that the proposed particle filter can reliably track a person talking in real reverberant rooms.This paper was performed while Eric A. Lehmann was working with National ICT Australia. National ICT Australia is funded by the Australian Government’s Department of Communications, Information Technology, and the Arts, the Australian Research Council, through Backing Australia’s Ability, and the ICT Centre of Excellence programs

Springer - Publisher Connector

Directory of Open Access Journals

The Australian National University