1,439 research outputs found

    De lof der rede

    Get PDF

    The development of speech coding and the first standard coder for public mobile telephony

    Get PDF
    This thesis describes in its core chapter (Chapter 4) the original algorithmic and design features of the ??rst coder for public mobile telephony, the GSM full-rate speech coder, as standardized in 1988. It has never been described in so much detail as presented here. The coder is put in a historical perspective by two preceding chapters on the history of speech production models and the development of speech coding techniques until the mid 1980s, respectively. In the epilogue a brief review is given of later developments in speech coding. The introductory Chapter 1 starts with some preliminaries. It is de- ??ned what speech coding is and the reader is introduced to speech coding standards and the standardization institutes which set them. Then, the attributes of a speech coder playing a role in standardization are explained. Subsequently, several applications of speech coders - including mobile telephony - will be discussed and the state of the art in speech coding will be illustrated on the basis of some worldwide recognized standards. Chapter 2 starts with a summary of the features of speech signals and their source, the human speech organ. Then, historical models of speech production which form the basis of di??erent kinds of modern speech coders are discussed. Starting with a review of ancient mechanical models, we will arrive at the electrical source-??lter model of the 1930s. Subsequently, the acoustic-tube models as they arose in the 1950s and 1960s are discussed. Finally the 1970s are reviewed which brought the discrete-time ??lter model on the basis of linear prediction. In a unique way the logical sequencing of these models is exposed, and the links are discussed. Whereas the historical models are discussed in a narrative style, the acoustic tube models and the linear prediction tech nique as applied to speech, are subject to more mathematical analysis in order to create a sound basis for the treatise of Chapter 4. This trend continues in Chapter 3, whenever instrumental in completing that basis. In Chapter 3 the reader is taken by the hand on a guided tour through time during which successive speech coding methods pass in review. In an original way special attention is paid to the evolutionary aspect. Speci??cally, for each newly proposed method it is discussed what it added to the known techniques of the time. After presenting the relevant predecessors starting with Pulse Code Modulation (PCM) and the early vocoders of the 1930s, we will arrive at Residual-Excited Linear Predictive (RELP) coders, Analysis-by-Synthesis systems and Regular- Pulse Excitation in 1984. The latter forms the basis of the GSM full-rate coder. In Chapter 4, which constitutes the core of this thesis, explicit forms of Multi-Pulse Excited (MPE) and Regular-Pulse Excited (RPE) analysis-by-synthesis coding systems are developed. Starting from current pulse-amplitude computation methods in 1984, which included solving sets of equations (typically of order 10-16) two hundred times a second, several explicit-form designs are considered by which solving sets of equations in real time is avoided. Then, the design of a speci??c explicitform RPE coder and an associated eÆcient architecture are described. The explicit forms and the resulting architectural features have never been published in so much detail as presented here. Implementation of such a codec enabled real-time operation on a state-of-the-art singlechip digital signal processor of the time. This coder, at a bit rate of 13 kbit/s, has been selected as the Full-Rate GSM standard in 1988. Its performance is recapitulated. Chapter 5 is an epilogue brie y reviewing the major developments in speech coding technology after 1988. Many speech coding standards have been set, for mobile telephony as well as for other applications, since then. The chapter is concluded by an outlook

    Acoustic Correlates of Word Stress as A Cue to Accent Strength

    Get PDF
    Due to the clear interference of their mother tongue prosody, many Czech learners produce their English with a conspicuous foreign accent. The goal of the present study is to investigate the acoustic cues that differentiate stressed and unstressed syllabic nuclei and identify individual details concerning their contribution to the specific sound of Czech English. Speech production of sixteen female non-professional Czech and British speakers was analysed with the sounds segmented on a word and phone level and with both canonical and actual stress positions manually marked. Prior to analyses the strength of the foreign accent was assessed in a perception test. Subsequently, stressed and unstressed vowels were measured with respect to their duration, amplitude, fundamental frequency and spectral slope. Our results show that, in general, Czech speakers use much less acoustic marking of stress than the British subjects. The difference is most prominent in the domains of fundamental frequency and amplitude. The Czech speakers also deviate from the canonical placement of stress, shifting it frequently to the first syllable. On the other hand, they seem to approximate the needed durational difference quite successfully. These outcomes support the concept of language interference since they correspond with the existing linguistic knowledge about Czech and English word stress. The study adds specific details concerning the extent of this interference in four acoustic dimensions

    Hartle-Hawking state is a maximum of entanglement entropy

    Get PDF
    It is shown that the Hartle-Hawking state of a scalar field is a maximum of entanglement entropy in the space of pure quantum states satisfying the condition that backreaction is finite. In other words, the Hartle-Hawking state is a curved-space analogue of the EPR state, which is also a maximum of entanglement entropy.Comment: Latex, 4 pages, Some comments are added on the "small backreaction condition

    Spectral Characteristics of Schwa in Czech Accented English

    Get PDF
    The English central mid lax vowel (i.e., schwa) often contributes considerably to the sound differences between native and non-native speech. Many foreign speakers of English fail to reduce certain underlying vowels to schwa, which, on the suprasegmental level of description, affects the perceived rhythm of their speech. However, the problem of capturing quantitatively the differences between native and non-native schwa poses difficulties that, to this day, have been tackled only partially. We offer a technique of measurement in the acoustic domain that has not been probed properly as yet: the distribution of acoustic energy in the vowel spectrum. Our results show that spectral slope features measured in weak vowels discriminate between Czech and British speakers of English quite reliably. Moreover, the measurements of formant bandwidths turned out to be useful for the same task, albeit less direc
    corecore