Search CORE

66 research outputs found

A study and experiment plan for digital mobile communication via satellite

Author: Craighill E. J.
Evans R. G.
Jones J. J.
Tom N. N.
Vincze A. D.
Publication venue
Publication date
Field of study

The viability of mobile communications is examined within the context of a frequency division multiple access, single channel per carrier satellite system emphasizing digital techniques to serve a large population of users. The intent is to provide the mobile users with a grade of service consistant with the requirements for remote, rural (perhaps emergency) voice communications, but which approaches toll quality speech. A traffic model is derived on which to base the determination of the required maximum number of satellite channels to provide the anticipated level of service. Various voice digitalization and digital modulation schemes are reviewed along with a general link analysis of the mobile system. Demand assignment multiple access considerations and analysis tradeoffs are presented. Finally, a completed configuration is described

NASA Technical Reports Server

Time and frequency domain algorithms for speech coding

Author: Francis S.C. Yeoh (7201616)
Publication venue
Publication date: 01/01/1983
Field of study

The promise of digital hardware economies (due to recent advances in VLSI technology), has focussed much attention on more complex and sophisticated speech coding algorithms which offer improved quality at relatively low bit rates. This thesis describes the results (obtained from computer simulations) of research into various efficient (time and frequency domain) speech encoders operating at a transmission bit rate of 16 Kbps. In the time domain, Adaptive Differential Pulse Code Modulation (ADPCM) systems employing both forward and backward adaptive prediction were examined. A number of algorithms were proposed and evaluated, including several variants of the Stochastic Approximation Predictor (SAP). A Backward Block Adaptive (BBA) predictor was also developed and found to outperform the conventional stochastic methods, even though its complexity in terms of signal processing requirements is lower. A simplified Adaptive Predictive Coder (APC) employing a single tap pitch predictor considered next provided a slight improvement in performance over ADPCM, but with rather greater complexity. The ultimate test of any speech coding system is the perceptual performance of the received speech. Recent research has indicated that this may be enhanced by suitable control of the noise spectrum according to the theory of auditory masking. Various noise shaping ADPCM configurations were examined, and it was demonstrated that a proposed pre-/post-filtering arrangement which exploits advantageously the predictor-quantizer interaction, leads to the best subjective performance in both forward and backward prediction systems. Adaptive quantization is instrumental to the performance of ADPCM systems. Both the forward adaptive quantizer (AQF) and the backward oneword memory adaptation (AQJ) were examined. In addition, a novel method of decreasing quantization noise in ADPCM-AQJ coders, which involves the application of correction to the decoded speech samples, provided reduced output noise across the spectrum, with considerable high frequency noise suppression. More powerful (and inevitably more complex) frequency domain speech coders such as the Adaptive Transform Coder (ATC) and the Sub-band Coder (SBC) offer good quality speech at 16 Kbps. To reduce complexity and coding delay, whilst retaining the advantage of sub-band coding, a novel transform based split-band coder (TSBC) was developed and found to compare closely in performance with the SBC. To prevent the heavy side information requirement associated with a large number of bands in split-band coding schemes from impairing coding accuracy, without forgoing the efficiency provided by adaptive bit allocation, a method employing AQJs to code the sub-band signals together with vector quantization of the bit allocation patterns was also proposed. Finally, 'pipeline' methods of bit allocation and step size estimation (using the Fast Fourier Transform (FFT) on the input signal) were examined. Such methods, although less accurate, are nevertheless useful in limiting coding delay associated with SRC schemes employing Quadrature Mirror Filters (QMF)

Loughborough University Institutional Repository

Advanced signal processing techniques for pitch synchronous sinusoidal speech coders

Author: Edwards Richard
Publication venue
Publication date: 01/01/2007
Field of study

Recent trends in commercial and consumer demand have led to the increasing use of multimedia applications in mobile and Internet telephony. Although audio, video and data communications are becoming more prevalent, a major application is and will remain the transmission of speech. Speech coding techniques suited to these new trends must be developed, not only to provide high quality speech communication but also to minimise the required bandwidth for speech, so as to maximise that available for the new audio, video and data services. The majority of current speech coders employed in mobile and Internet applications employ a Code Excited Linear Prediction (CELP) model. These coders attempt to reproduce the input speech signal and can produce high quality synthetic speech at bit rates above 8 kbps. Sinusoidal speech coders tend to dominate at rates below 6 kbps but due to limitations in the sinusoidal speech coding model, their synthetic speech quality cannot be significantly improved even if their bit rate is increased. Recent developments have seen the emergence and application of Pitch Synchronous (PS) speech coding techniques to these coders in order to remove the limitations of the sinusoidal speech coding model. The aim of the research presented in this thesis is to investigate and eliminate the factors that limit the quality of the synthetic speech produced by PS sinusoidal coders. In order to achieve this innovative signal processing techniques have been developed. New parameter analysis and quantisation techniques have been produced which overcome many of the problems associated with applying PS techniques to sinusoidal coders. In sinusoidal based coders, two of the most important elements are the correct formulation of pitch and voicing values from the' input speech. The techniques introduced here have greatly improved these calculations resulting in a higher quality PS sinusoidal speech coder than was previously available. A new quantisation method which is able to reduce the distortion from quantising speech spectral information has also been developed. When these new techniques are utilised they effectively raise the synthetic speech quality of sinusoidal coders to a level comparable to that produced by CELP based schemes, making PS sinusoidal coders a promising alternative at low to medium bit rates.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

OpenGrey Repository

Delta modulation techniques for low bit-rate digital speech encoding

Author: Irvine James Moir
Publication venue: Department of Electrical Engineering
Publication date: 01/01/1985
Field of study

Includes bibliography.Two new hybrid companding delta modulators for speech encoding are presented here. These modulators differ from the Hybrid Companding Delta Modulator (HCDM) proposed by Un et al in that the two new encoders employ Song Voice Adaptation as the basis of the instantaneous compandor, rather than Constant Factor adaptation. A detailed analysis of the performance, both objective and subjective, of these hybrid codecs has been carried out. Results show that overall the two codecs developed as part of this project are better than the HCDM codec. In addition the new codecs offer simpler implementation in digital hardware than the HCDM. A Computer Aided Test (CAT) system has been developed to simplify the design and test processes for speech codecs

Cape Town University OpenUCT

Residual-excited linear predictive (RELP) vocoder system with TMS320C6711 DSK and vowel characterization

Author: Taguchi Akihiro
Publication venue: 'University of Saskatchewan Library'
Publication date
Field of study

The area of speech recognition by machine is one of the most popular and complicated subjects in the current multimedia field. Linear predictive coding (LPC) is a useful technique for voice coding in speech analysis and synthesis. The first objective of this research was to establish a prototype of the residual-excited linear predictive (RELP) vocoder system in a real-time environment. Although its transmission rate is higher, the quality of synthesized speech of the RELP vocoder is superior to that of other vocoders. As well, it is rather simple and robust to implement. The RELP vocoder uses residual signals as excitation rather than periodic pulse or white noise. The RELP vocoder was implemented with Texas Instruments TMS320C6711 DSP starter kit (DSK) using C. Identifying vowel sounds is an important element in recognizing speech contents. The second objective of research was to explore a method of characterizing vowels by means of parameters extracted by the RELP vocoder, which was not known to have been used in speech recognition, previously. Five English vowels were chosen for the experimental sample. Utterances of individual vowel sounds and of the vowel sounds in one-syllable-words were recorded and saved as WAVE files. A large sample of 20-ms vowel segments was obtained from these utterances. The presented method utilized 20 samples of a segment's frequency response, taken equally in logarithmic scale, as a LPC frequency response vector. The average of each vowel's vectors was calculated. The Euclidian distances between the average vectors of the five vowels and an unknown vector were compared to classify the unknown vector into a certain vowel group. The results indicate that, when a vowel is uttered alone, the distance to its average vector is smaller than to the other vowels' average vectors. By examining a given vowel frequency response against all known vowels' average vectors, individually, one can determine to which vowel group the given vowel belongs. When a vowel is uttered with consonants, however, variances and covariances increase. In some cases, distinct differences may not be recognized among the distances to a vowel's own average vector and the distances to the other vowels' average vectors. Overall, the results of vowel characterization did indicate an ability of the RELP vocoder to identify and classify single vowel sounds

eCommons@USASK

University of Saskatchewan Research Archive

Channel Fading Statistics For Real-Time Data Transmission In Emergency Call Systems And Unmanned Aerial Systems

Author: Li Yunrui
Publication venue: DigitalCommons@WayneState
Publication date: 01/01/2018
Field of study

The Third Generation Partnership Project (3GPP) selected an in-band modem to transmit emergency data over cellular voice channel for the European Union emergency call (eCall) system. However, the road test results presented by the Harmonized eCall European Pilot project showed that the success rate of data delivery was only 71%, indicating that there is significant potential to improve its performance. In this dissertation, a testbed is designed for the eCall system that satisfies the 3GPP TS 26.267/268/269 standards. A method is proposed to measure the power of the received signal that passes through the in-band channel. Experiments are performed with the in-vehicle system testbed in a laboratory or a car travelling in city, suburb, country- side, or freeway. Fading statistics of the received signal after power control are found and discussed, together with cumulative distribution function (CDF), level crossing rate (LCR), and average fade duration (AFD). It is found that with probability less than or equal to 0.1%, fading and attenuation can vary from -19 dB for the continuous wave (CW) signal at 500 Hz to -9.5 dB for the CW signal at 2000 Hz. This dissertation recommends moving the CW signals at 500 Hz and 800 Hz for detection and synchronization in the 3GPP standard to 1500 Hz and 2000 Hz, respectively. This will give 9.5 dB improvement in detection and synchronization. The fading results are used to calculate the bit error rate (BER) performance for the eCall in-band modem. Synchronization detection probability are obtained by transmitting the synchronization preamble through various adaptive multi-rate vocoders and an additive white Gaussian noise channel. The testbed and proposed method are also used to measure the power of signals received by an unmanned aerial systems (UAS) and by the receiver in the operation center, respectively. Field experiments are carried out by flying the UAS above different locations. Statistics, including CDF, LCR, and AFD, are calculated for the six test-sites. The results of the fading statistics, synchronization detection probability, and BER can be directly applied to design real-time communication systems, including detection, delay estimation, modulation and coding

Digital Commons@Wayne State University

Author: Allen Jonathan
Kleppner Daniel
Publication venue: Research Laboratory of Electronics (RLE) at the Massachusetts Institute of Technology (MIT)
Publication date
Field of study