Search CORE

17 research outputs found

Data compression for satellite images

Author: Chen P. H.
Wintz P. A.
Publication venue
Publication date
Field of study

An efficient data compression system is presented for satellite pictures and two grey level pictures derived from satellite pictures. The compression techniques take advantages of the correlation between adjacent picture elements. Several source coding methods are investigated. Double delta coding is presented and shown to be the most efficient. Both predictive differential quantizing technique and double delta coding can be significantly improved by applying a background skipping technique. An extension code is constructed. This code requires very little storage space and operates efficiently. Simulation results are presented for various coding schemes and source codes

NASA Technical Reports Server

Implementation issues in source coding

Author: Chen Yun-Chung
Hadenfeldt A. C.
Sayood Khalid
Publication venue
Publication date
Field of study

An edge preserving image coding scheme which can be operated in both a lossy and a lossless manner was developed. The technique is an extension of the lossless encoding algorithm developed for the Mars observer spectral data. It can also be viewed as a modification of the DPCM algorithm. A packet video simulator was also developed from an existing modified packet network simulator. The coding scheme for this system is a modification of the mixture block coding (MBC) scheme described in the last report. Coding algorithms for packet video were also investigated

NASA Technical Reports Server

Optimum quantization

Author: Bruce J. D.
Publication venue
Publication date
Field of study

Design of optimum quantizers for quantizer-input message signal and quantizer-input message signal contaminated by nois

NASA Technical Reports Server

Predictive Encoding of Digitized TV Pictures

Author: Sharma Dhiraj Kumar
Publication venue
Publication date: 01/01/1975
Field of study

In this report, we consider the problem of efficient intra-frame encoding of digitized TV pictures using Differential Pulse Code Modulation (DPCM) type encoders. Our goal is to reduce the average number of bits used to encode a pixel while subjective quality of the picture still remains acceptable. To this end, we propose some new predictors and compare them with the existing ones using the error criteria "essential maximum" and "variance" of the prediction errors. The two criteria have also been compared with respect to the subjective quality of the final encoded pictures. We propose to design the quantizers using "mean n-th power" distortion criterion. Effects of "n" on bit rate and subjective quality of the encoded picture have been studied as it varies from 1 to 6. In this way, we achieve reductions up to 6 to 1 in the bit rate with little degradation in the picture quality. We also consider the use of second and third order entropies of the encoded pictures to reduce the bit rate. Next, we adapt the encoding procedure to local structural variations in the picture by first segmenting it in three areas of varying detail and then using the concepts developed for non-adaptive encoding to encode the three areas differently. The resulting encoder yields better picture quality than before and can be used to encode a broad class of pictures.</p

Caltech Theses and Dissertations

Shuttle/TDRSS Ku-band downlink study

Author: Meyer R.
Publication venue
Publication date
Field of study

Assessing the adequacy of the baseline signal design approach, developing performance specifications for the return link hardware, and performing detailed design and parameter optimization tasks was accomplished by completing five specific study tasks. The results of these tasks show that the basic signal structure design is sound and that the goals can be met. Constraints placed on return link hardware by this structure allow reasonable specifications to be written so that no extreme technical risk areas in equipment design are foreseen. A third channel can be added to the PM mode without seriously degrading the other services. The feasibility of using only a PM mode was shown to exist, however, this will require use of some digital TV transmission techniques. Each task and its results are summarized

NASA Technical Reports Server

Differential encoding techniques applied to speech signals

Author: Constantinos S. Xydeas (7200770)
Publication venue
Publication date: 01/01/1978
Field of study

The increasing use of digital communication systems has produced a continuous search for efficient methods of speech encoding. This thesis describes investigations of novel differential encoding systems. Initially Linear First Order DPCM systems employing a simple delayed encoding algorithm are examined. The systems detect an overload condition in the encoder, and through a simple algorithm reduce the overload noise at the expense of some increase in the quantization (granular) noise. The signal-to-noise ratio (snr) performance of such d codec has 1 to 2 dB's advantage compared to the First Order Linear DPCM system. In order to obtain a large improvement in snr the high correlation between successive pitch periods as well as the correlation between successive samples in the voiced speech waveform is exploited. A system called "Pitch Synchronous First Order DPCM" (PSFOD) has been developed. Here the difference Sequence formed between the samples of the input sequence in the current pitch period and the samples of the stored decoded sequence from the previous pitch period are encoded. This difference sequence has a smaller dynamic range than the original input speech sequence enabling a quantizer with better resolution to be used for the same transmission bit rate. The snr is increased by 6 dB compared with the peak snr of a First Order DPCM codea. A development of the PSFOD system called a Pitch Synchronous Differential Predictive Encoding system (PSDPE) is next investigated. The principle of its operation is to predict the next sample in the voiced-speech waveform, and form the prediction error which is then subtracted from the corresponding decoded prediction error in the previous pitch period. The difference is then encoded and transmitted. The improvement in snr is approximately 8 dB compared to an ADPCM codea, when the PSDPE system uses an adaptive PCM encoder. The snr of the system increases further when the efficiency of the predictors used improve. However, the performance of a predictor in any differential system is closely related to the quantizer used. The better the quantization the more information is available to the predictor and the better the prediction of the incoming speech samples. This leads automatically to the investigation in techniques of efficient quantization. A novel adaptive quantization technique called Dynamic Ratio quantizer (DRQ) is then considered and its theory presented. The quantizer uses an adaptive non-linear element which transforms the input samples of any amplitude to samples within a defined amplitude range. A fixed uniform quantizer quantizes the transformed signal. The snr for this quantizer is almost constant over a range of input power limited in practice by the dynamia range of the adaptive non-linear element, and it is 2 to 3 dB's better than the snr of a One Word Memory adaptive quantizer. Digital computer simulation techniques have been used widely in the above investigations and provide the necessary experimental flexibility. Their use is described in the text

Loughborough University Institutional Repository

Optimum quantization.

Author
Publication venue: MIT Research Laboratory of Electronics
Publication date: 01/01/1965
Field of study

Bibliography: p.63-65

DSpace@MIT

Transform coding techniques and their application in JPEG scheme.

Author
Publication venue: Department of Cultural and Religious Studies, The Chinese University of Hong Kong
Publication date: 01/01/1991
Field of study

by Chun-tat See.Thesis (M.Phil.)--Chinese University of Hong Kong, 1991.Includes bibliographical references.ACKNOWLEDGEMENTS --- p.iABSTRACT --- p.iiNOTATIONS --- p.ivTABLE OF CONTENTS --- p.viChapter 1. --- INTRODUCTION --- p.1-1Chapter 1.1 --- Introduction --- p.1-1Chapter 1.2 --- A Basic Transform Coding System --- p.1-2Chapter 1.3 --- Thesis Organization --- p.1-5Chapter 2. --- DYADIC MATRICES AND THEIR APPLICATION --- p.2-1Chapter 2.1 --- Introduction --- p.2-1Chapter 2.2 --- Theory of Dyadic Matrix --- p.2-2Chapter 2.2.1 --- Basic Definitions --- p.2-3Chapter 2.2.2 --- Maximum Size of Dyadic Matrix --- p.2-8Chapter 2.3 --- Application of Dyadic Matrix in Generating Orthogonal Transforms --- p.2-13Chapter 2.3.1 --- Transform Performance Criteria --- p.2-14Chapter 2.3.2 --- "[T1] = [P]Diag([DM2(4)],[A(4)])[Q]" --- p.2-19Chapter 2.3.3 --- "[T2] = [P]Diag([DM2(4)],[DM2(4)])[Q]" --- p.2-21Chapter 2.4 --- Discussion and Conclusion --- p.2-26Chapter 3. --- LOW SEQUENCY COEFFICIENT TRUNCATION (LSCT) CODING SCHEME --- p.3-1Chapter 3.1 --- Introduction --- p.3-1Chapter 3.2 --- DC Coefficient Estimation Schemes --- p.3-2Chapter 3.2.1 --- Element Estimation --- p.3-2Chapter 3.2.2 --- Row Estimation --- p.3-4Chapter 3.2.3 --- Plane Estimation --- p.3-7Chapter 3.3 --- LSCT Coding Scheme 1 and Results --- p.3-11Chapter 3.4 --- LSCT Coding Scheme 2 and Results --- p.3-17Chapter 3.5 --- Discussions and Conclusions --- p.3-21Chapter 4. --- VARIABLE BLOCK SIZE (VBS) CODING SCHEME --- p.4-1Chapter 4.1 --- Introduction --- p.4-1Chapter 4.2 --- Chen's VBS Coding Scheme and Its Limitation --- p.4-3Chapter 4.3 --- VBS Coding Scheme With Block Size Determined Using Edge Discriminator --- p.4-6Chapter 4.4 --- Simulation Results --- p.4-8Chapter 4.5 --- Discussions and Conclusions --- p.4-12Chapter 5. --- ENHANCEMENT OF JPEG INTERNATIONAL STANDARD --- p.5-1Chapter 5.1 --- Introduction --- p.5-1Chapter 5.2 --- The Basic JPEG International Standard --- p.5-2Chapter 5.2.1 --- Level Shift and Discrete Cosine Transform --- p.5-4Chapter 5.2.2 --- Uniform Quantization --- p.5-5Chapter 5.2.3 --- Coefficient Coding --- p.5-7Chapter 5.3 --- Efficient DC Coefficients Encoding --- p.5-8Chapter 5.3.1 --- The Minimum Edge Difference (MED) Predictor --- p.5-8Chapter 5.3.2 --- Simulation Results --- p.5-9Chapter 5.3.3 --- Pixel Domain Predictors --- p.5-13Chapter 5.3.4 --- Discussion and Conclusion --- p.5-15Chapter 5.4 --- JPEG Scheme Using Variable Block Size Technique --- p.5-15Chapter 5.4.1 --- Scheme 1 --- p.5-16Chapter 5.4.2 --- Scheme 2 --- p.5-25Chapter 5.4.3 --- Scheme 3 --- p.5-27Chapter 5.4.4 --- Scheme 4 --- p.5-29Chapter 5.4.5 --- Scheme 5 --- p.5-32Chapter 5.4.6 --- Discussions and Conclusions --- p.5-32Chapter 5.5 --- Conclusions --- p.5-33Chapter 6. --- CONCLUSIONS --- p.6-1Chapter 6.1 --- Summary of Research Work --- p.6-1Chapter 6.2 --- Contributions of Work --- p.6-2Chapter 6.3 --- Suggestions for Further Research --- p.6-3Chapter 7. --- REFERENCES --- p.7-1RESULT

CUHK Digital Repository

Channel state testing in information decoding.

Author: Yudkin Howard Louis
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1965
Field of study

Massachusetts Institute of Technology. Dept. of Electrical Engineering. Thesis. 1965. Ph.D.Ph.D

DSpace@MIT

Evaluation of glottal characteristics for speaker identification.

Author: Elder Andrew George
Publication venue: University of Canterbury. Electrical and Electronic Engineering
Publication date: 01/01/1992
Field of study

Based on the assumption that the physical characteristics of people's vocal apparatus cause their voices to have distinctive characteristics, this thesis reports on investigations into the use of the long-term average glottal response for speaker identification. The long-term average glottal response is a new feature that is obtained by overlaying successive vocal tract responses within an utterance. The way in which the long-term average glottal response varies with accent and gender is examined using a population of 352 American English speakers from eight different accent regions. Descriptors are defined that characterize the shape of the long-term average glottal response. Factor analysis of the descriptors of the long-term average glottal responses shows that the most important factor contains significant contributions from descriptors comprised of the coefficients of cubics fitted to the long-term average glottal response. Discriminant analysis demonstrates that the long-term average glottal response is potentially useful for classifying speakers according to their gender, but is not useful for distinguishing American accents. The identification accuracy of the long-term average glottal response is compared with that obtained from vocal tract features. Identification experiments are performed using a speaker database containing utterances from twenty speakers of the digits zero to nine. Vocal tract features, which consist of cepstral coefficients, partial correlation coefficients and linear prediction coefficients, are shown to be more accurate than the long-term average glottal response. Despite analysis of the training data indicating that the long-term average glottal response was uncorrelated with the vocal tract features, various feature combinations gave insignificant improvements in identification accuracy. The effect of noise and distortion on speaker identification is examined for each of the features. It is found that the identification performance of the long-term average glottal response is insensitive to noise compared with cepstral coefficients, partial correlation coefficients and the long-term average spectrum, but that it is highly sensitive to variations in the phase response of the speech transmission channel. Before reporting on the identification experiments, the thesis introduces speech production, speech models and background to the various features used in the experiments. Investigations into the long-term average glottal response demonstrate that it approximates the glottal pulse convolved with the long-term average impulse response, and this relationship is verified using synthetic speech. Furthermore, the spectrum of the long-term average glottal response extracted from pre-emphasized speech is shown to be similar to the long-term average spectrum of pre-emphasized speech, but computationally much simpler

UC Research Repository