21 research outputs found
On Enhancement and Quality Assessment of Audio and Video in Communication Systems
The use of audio and video communication has increased exponentially over the
last decade and has gone from speech over GSM to HD resolution video conference
between continents on mobile devices. As the use becomes more widespread the
interest in delivering high quality media increases even on devices with
limited resources. This includes both development and enhancement of the
communication chain but also the topic of objective measurements of the
perceived quality. The focus of this thesis work has been to perform
enhancement within speech encoding and video decoding, to measure influence
factors of audio and video performance, and to build methods to predict the
perceived video quality.
The audio enhancement part of this thesis addresses the well known problem in
the GSM system with an interfering signal generated by the switching nature of
TDMA cellular telephony. Two different solutions are given to suppress such
interference internally in the mobile handset. The first method involves the
use of subtractive noise cancellation employing correlators, the second uses a
structure of IIR notch filters. Both solutions use control algorithms based on
the state of the communication between the mobile handset and the base station.
The video enhancement part presents two post-filters. These two filters are
designed to improve visual quality of highly compressed video streams from
standard, block-based video codecs by combating both blocking and ringing
artifacts. The second post-filter also performs sharpening.
The third part addresses the problem of measuring audio and video delay as well
as skewness between these, also known as synchronization. This method is a
black box technique which enables it to be applied on any audiovisual
application, proprietary as well as open standards, and can be run on any
platform and over any network connectivity.
The last part addresses no-reference (NR) bitstream video quality prediction
using features extracted from the coded video stream. Several methods have been
used and evaluated: Multiple Linear Regression (MLR), Artificial Neural Network
(ANN), and Least Square Support Vector Machines (LS-SVM), showing high
correlation with both MOS and objective video assessment methods as PSNR and
PEVQ. The impact from temporal, spatial and quantization variations on
perceptual video quality has also been addressed, together with the trade off
between these, and for this purpose a set of locally conducted subjective
experiments were performed
On the Enhancement of Audio and Video in Mobile Equipment
Use of mobile equipment has increased exponentially over the last decade. As
use becomes more widespread so too does the demand for new functionalities. The
limited memory and computational power of many mobile devices has proven to be
a challenge resulting in many innovative solutions and a number of new
standards. Despite this, there is often a requirement for additional
enhancement to improve quality.
The focus of this thesis work has been to perform enhancement within two
different areas; audio or speech encoding and video encoding/decoding. The
audio enhancement section of this thesis addresses the well known problem in
the GSM system with an interfering signal generated by the switching nature of
TDMA cellular telephony. Two different solutions are given to suppress such
interference internally in the mobile handset. The first method involves the
use of subtractive noise cancellation employing correlators, the second uses a
structure of IIR noth filters. Both solutions use control algorithms based on
the state of the communication between the mobile handset and the base station.
The video section of this thesis presents two post-filters and one pre-filter.
The two post-filters are designed to improve visual quality of highly
compressed video streams from standard, block-based video codecs by combating
both blocking and ringing artifacts. The second post-filter also performs
sharpening. The pre-filter is designed to increase the coding efficiency of a
standard block based video codec. By introducing a pre-processing algorithm
before the encoder, the amount of camera disturbance and the complexity of the
sequence can be decreased, thereby increasing coding efficiency
A robust method for estimating synchronization and delay of audio and video for communication services
One of the main contributions to the quality of experience in streaming services or in two-way communication of audio and video applications is synchronization. This has been shown in several studies and experiments but methods to measure synchronization are less frequent, especially for situations without internal access to the application and independent of platform and device. In this paper we present a method for measuring synchronization skewness as well as delay for audio and video. The solution incorporates audio and video reference streams, where audio and video frames are marked with frame numbers which are decoded on the receiver side to enable calculation of synchronization and delay. The method has been verified in a two-way communication application in a transparent network with and without inserting known delays, as well as in a network with 5 and 10 % packet loss levels. The method can be used for both streaming and two-way communication services, both with and without access to the internal structures, and enables measurements of applications running on e.g. smartphones, tablets, and laptops under various conditions
Adaptive De-Blocking De-Ringing Post Filter
In this paper an adaptive filter for reducing blocking and ringing artifacts is
presented. The solution is designed with consideration of Mobile Equipment with
limited computational power and memory. Also, the solution is computationally
scalable if there is limited CPU resources in different user cases
A New Low Complex Reference Free Video Quality Predictor
In many applications and environments for mobile
communication there is a need for reference free perceptual quality
measurements. In this paper a method for prediction of a number of quality
metrics is proposed, where the input to the prediction is readily available
parameters at the receiver side of a communications channel. Since the
parameters are extracted from the coded video bit stream the model can be used
in user scenarios where it is normally difficult to estimate the quality due to
the reference not being available, as in streaming video and mobile TV
applications. The predictor turns out to give good
results for both the PSNR and the PEVQ metrics
En ny prediktor för videokvalitet baserad på avkodarens parameterextrahering
In the mobile communication area there is a demand for reference free perceptual quality measurements in video applications. In addition low complexity measurements are required. This paper proposes a method for prediction of a number of well known quality metrics, where the inputs to the predictors are readily available parameters at the decoder side of the communication channel. After an investigation of the dependencies between these parameters and between each parameter and the quality metrics, a set of parameters is chosen for the predictor. This predictor shows good results, especially for the PSN
Notch Filtering of Humming GSM Mobile Telephone Noise
A common problem in the world's most widespread cellular telephone system, the
GSM system, is the interfering signal generated in TDMA cellular telephony. The
infamous "bumblebee" is generated by the switching nature of TDMA cellular
telephony, the radio circuits are switched on and off at a rate of
approximately 217 Hz (GSM).
This paper describes a study of two solutions for eliminating the humming noise
with IIR notch filters. The simpler one is suitable for any exterior
equipment. This method still suffers from a small residual of the noise,
resulting from the IDLE slots of the sending mobile. The more advanced IIR
structure for use within the mobile also eliminates this residual