33,678 research outputs found
Time and frequency domain algorithms for speech coding
The promise of digital hardware economies (due to recent advances in
VLSI technology), has focussed much attention on more complex and sophisticated
speech coding algorithms which offer improved quality at relatively
low bit rates.
This thesis describes the results (obtained from computer simulations)
of research into various efficient (time and frequency domain) speech
encoders operating at a transmission bit rate of 16 Kbps.
In the time domain, Adaptive Differential Pulse Code Modulation (ADPCM)
systems employing both forward and backward adaptive prediction were
examined. A number of algorithms were proposed and evaluated, including
several variants of the Stochastic Approximation Predictor (SAP). A
Backward Block Adaptive (BBA) predictor was also developed and found to
outperform the conventional stochastic methods, even though its complexity
in terms of signal processing requirements is lower. A simplified
Adaptive Predictive Coder (APC) employing a single tap pitch predictor
considered next provided a slight improvement in performance over ADPCM,
but with rather greater complexity.
The ultimate test of any speech coding system is the perceptual performance
of the received speech. Recent research has indicated that this
may be enhanced by suitable control of the noise spectrum according to
the theory of auditory masking. Various noise shaping ADPCM
configurations were examined, and it was demonstrated that a proposed
pre-/post-filtering arrangement which exploits advantageously the
predictor-quantizer interaction, leads to the best subjective
performance in both forward and backward prediction systems.
Adaptive quantization is instrumental to the performance of ADPCM systems.
Both the forward adaptive quantizer (AQF) and the backward oneword
memory adaptation (AQJ) were examined. In addition, a novel method
of decreasing quantization noise in ADPCM-AQJ coders, which involves the
application of correction to the decoded speech samples, provided
reduced output noise across the spectrum, with considerable high frequency
noise suppression.
More powerful (and inevitably more complex) frequency domain speech
coders such as the Adaptive Transform Coder (ATC) and the Sub-band Coder
(SBC) offer good quality speech at 16 Kbps. To reduce complexity and
coding delay, whilst retaining the advantage of sub-band coding, a novel
transform based split-band coder (TSBC) was developed and found to compare
closely in performance with the SBC.
To prevent the heavy side information requirement associated with a
large number of bands in split-band coding schemes from impairing coding
accuracy, without forgoing the efficiency provided by adaptive bit
allocation, a method employing AQJs to code the sub-band signals together
with vector quantization of the bit allocation patterns was also
proposed.
Finally, 'pipeline' methods of bit allocation and step size estimation
(using the Fast Fourier Transform (FFT) on the input signal) were examined.
Such methods, although less accurate, are nevertheless useful in
limiting coding delay associated with SRC schemes employing Quadrature
Mirror Filters (QMF)
Coding overcomplete representations of audio using the MCLT
We propose a system for audio coding using the modulated complex
lapped transform (MCLT). In general, it is difficult to encode signals using
overcomplete representations without avoiding a penalty in rate-distortion
performance. We show that the penalty can be significantly reduced for
MCLT-based representations, without the need for iterative methods of
sparsity reduction. We achieve that via a magnitude-phase polar quantization
and the use of magnitude and phase prediction. Compared to systems based
on quantization of orthogonal representations such as the modulated lapped
transform (MLT), the new system allows for reduced warbling artifacts and
more precise computation of frequency-domain auditory masking functions
Graph Spectral Image Processing
Recent advent of graph signal processing (GSP) has spurred intensive studies
of signals that live naturally on irregular data kernels described by graphs
(e.g., social networks, wireless sensor networks). Though a digital image
contains pixels that reside on a regularly sampled 2D grid, if one can design
an appropriate underlying graph connecting pixels with weights that reflect the
image structure, then one can interpret the image (or image patch) as a signal
on a graph, and apply GSP tools for processing and analysis of the signal in
graph spectral domain. In this article, we overview recent graph spectral
techniques in GSP specifically for image / video processing. The topics covered
include image compression, image restoration, image filtering and image
segmentation
Mechanical and durability performance of lightweight concrete brick with palm oil fuel ash (POFA)
Lightweight building materials such as precast roof and wall panel has been widely used in the construction industries. This is because lightweight materials could benefits the economy and society in terms of manufacturing, transportation and handling cost. One of the most preferable lightweight material is Expanded Polystyrene (EPS). EPS consist of 98% of air and 2% of polystyrene. Therefore, EPS is very low in density which could contribute in the reduction of building materials mass. Abundance of studies has shown that EPS has significantly contribute to the reduction of brick density. EPS has been used as the aggregates replacement in concrete. However, the existing of EPS in the concrete has reduce the strength performance of the concrete. Due to this, researchers have extend their research in improvising the EPS concrete and brick strength with the addition of pozzolanic materials such as fly ash, rice husk ask, silica fume and etc [1-4]. The ability of these pozzolanic materials in enhancing the strength of brick or concrete has been proven..
Numerical simulation analysis on water jet pressure distribution at various nozzle aperture
The low velocity water jet is required by small scale Unmanned Underwater Vehicle (UUV) to control its position, either to remain statics in its position or to perform a slow and steady locomotion. However, the water jet performance is influenced by the size of nozzle aperture. By studying the pressure distribution around the nozzle area, the water jet velocity could be determined and characterized. In this studies, the ejection pressure was fixed at 23.37 Pa according to the constant actuation. Studies were conducted using ANSYS Fluent software. The results show that the water jet velocity and dynamic pressure are higher for larger nozzle aperture size at constant pressure. The total pressure and dynamic pressure had the lowest pressure drop at certain nozzle aperture size but became constant when the nozzle size was wider. This finding is useful in designing the UUV that powered by contractile water jet thruster
- …