Search CORE

17 research outputs found

Improving the robustness of CELP-like speech decoders using late-arrival packets information : application to G.729 standard in VoIP

Author: Khadra Ali
Publication venue: 'Universite de Sherbrooke'
Publication date: 01/01/2003
Field of study

L'utilisation de la voix sur Internet est une nouvelle tendance dans Ie secteur des télécommunications et de la réseautique. La paquetisation des données et de la voix est réalisée en utilisant Ie protocole Internet (IP). Plusieurs codecs existent pour convertir la voix codée en paquets. La voix codée est paquetisée et transmise sur Internet. À la réception, certains paquets sont soit perdus, endommages ou arrivent en retard. Ceci est cause par des contraintes telles que Ie délai («jitter»), la congestion et les erreurs de réseau. Ces contraintes dégradent la qualité de la voix. Puisque la transmission de la voix est en temps réel, Ie récepteur ne peut pas demander la retransmission de paquets perdus ou endommages car ceci va causer plus de délai. Au lieu de cela, des méthodes de récupération des paquets perdus (« concealment ») s'appliquent soit à l'émetteur soit au récepteur pour remplacer les paquets perdus ou endommages. Ce projet vise à implémenter une méthode innovatrice pour améliorer Ie temps de convergence suite a la perte de paquets au récepteur d'une application de Voix sur IP. La méthode a déjà été intégrée dans un codeur large-bande (AMR-WB) et a significativement amélioré la qualité de la voix en présence de <<jitter » dans Ie temps d'arrivée des trames au décodeur. Dans ce projet, la même méthode sera intégrée dans un codeur a bande étroite (ITU-T G.729) qui est largement utilise dans les applications de voix sur IP. Le codeur ITU-T G.729 défini des standards pour coder et décoder la voix a 8 kb/s en utilisant 1'algorithme CS-CELP (Conjugate Stmcture Algebraic Code-Excited Linear Prediction).Abstract: Voice over Internet applications is the new trend in telecommunications and networking industry today. Packetizing data/voice is done using the Internet protocol (IP). Various codecs exist to convert the raw voice data into packets. The coded and packetized speech is transmitted over the Internet. At the receiving end some packets are either lost, damaged or arrive late. This is due to constraints such as network delay (fitter), network congestion and network errors. These constraints degrade the quality of speech. Since voice transmission is in real-time, the receiver can not request the retransmission of lost or damaged packets as this will cause more delay. Instead, concealment methods are applied either at the transmitter side (coder-based) or at the receiver side (decoder-based) to replace these lost or late-arrival packets. This work attempts to implement a novel method for improving the recovery time of concealed speech The method has already been integrated in a wideband speech coder (AMR-WB) and significantly improved the quality of speech in the presence of jitter in the arrival time of speech frames at the decoder. In this work, the same method will be integrated in a narrowband speech coder (ITU-T G.729) that is widely used in VoIP applications. The ITUT G.729 coder defines the standards for coding and decoding speech at 8 kb/s using Conjugate Structure Algebraic Code-Excited Linear Prediction (CS-CELP) Algorithm

Savoirs UdeS

Perceptual techniques in audio quality assessment

Author: Rix Antony W.
Publication venue: The University of Edinburgh
Publication date: 01/01/2003
Field of study

Edinburgh Research Archive

Secure VoIP Performance Measurement

Author: Amna Saad (7170101)
Publication venue
Publication date: 01/01/2013
Field of study

This project presents a mechanism for instrumentation of secure VoIP calls. The experiments were run under different network conditions and security systems. VoIP services such as Google Talk, Express Talk and Skype were under test. The project allowed analysis of the voice quality of the VoIP services based on the Mean Opinion Score (MOS) values generated by Perceptual valuation of Speech Quality (PESQ). The quality of the audio streams produced were subjected to end-to-end delay, jitter, packet loss and extra processing in the networking hardware and end devices due to Internetworking Layer security or Transport Layer security implementations. The MOS values were mapped to Perceptual Evaluation of Speech Quality for wideband (PESQ-WB) scores. From these PESQ-WB scores, the graphs of the mean of 10 runs and box and whisker plots for each parameter were drawn. Analysis on the graphs was performed in order to deduce the quality of each VoIP service. The E-model was used to predict the network readiness and Common vulnerability Scoring System (CVSS) was used to predict the network vulnerabilities. The project also provided the mechanism to measure the throughput for each test case. The overall performance of each VoIP service was determined by PESQ-WB scores, CVSS scores and the throughput. The experiment demonstrated the relationship among VoIP performance, VoIP security and VoIP service type. The experiment also suggested that, when compared to an unsecure IPIP tunnel, Internetworking Layer security like IPSec ESP or Transport Layer security like OpenVPN TLS would improve a VoIP security by reducing the vulnerabilities of the media part of the VoIP signal. Morever, adding a security layer has little impact on the VoIP voice quality

Loughborough University Institutional Repository

Quality of media traffic over Lossy internet protocol networks: Measurement and improvement.

Author: Al-Akhras Mousa Tawfiq
Publication venue: Software Technology Research Laboratory
Publication date: 01/01/2007
Field of study

Voice over Internet Protocol (VoIP) is an active area of research in the world of communication. The high revenue made by the telecommunication companies is a motivation to develop solutions that transmit voice over other media rather than the traditional, circuit switching network. However, while IP networks can carry data traffic very well due to their besteffort nature, they are not designed to carry real-time applications such as voice. As such several degradations can happen to the speech signal before it reaches its destination. Therefore, it is important for legal, commercial, and technical reasons to measure the quality of VoIP applications accurately and non-intrusively. Several methods were proposed to measure the speech quality: some of these methods are subjective, others are intrusive-based while others are non-intrusive. One of the non-intrusive methods for measuring the speech quality is the E-model standardised by the International Telecommunication Union-Telecommunication Standardisation Sector (ITU-T). Although the E-model is a non-intrusive method for measuring the speech quality, but it depends on the time-consuming, expensive and hard to conduct subjective tests to calibrate its parameters, consequently it is applicable to a limited number of conditions and speech coders. Also, it is less accurate than the intrusive methods such as Perceptual Evaluation of Speech Quality (PESQ) because it does not consider the contents of the received signal. In this thesis an approach to extend the E-model based on PESQ is proposed. Using this method the E-model can be extended to new network conditions and applied to new speech coders without the need for the subjective tests. The modified E-model calibrated using PESQ is compared with the E-model calibrated using i ii subjective tests to prove its effectiveness. During the above extension the relation between quality estimation using the E-model and PESQ is investigated and a correction formula is proposed to correct the deviation in speech quality estimation. Another extension to the E-model to improve its accuracy in comparison with the PESQ looks into the content of the degraded signal and classifies packet loss into either Voiced or Unvoiced based on the received surrounding packets. The accuracy of the proposed method is evaluated by comparing the estimation of the new method that takes packet class into consideration with the measurement provided by PESQ as a more accurate, intrusive method for measuring the speech quality. The above two extensions for quality estimation of the E-model are combined to offer a method for estimating the quality of VoIP applications accurately, nonintrusively without the need for the time-consuming, expensive, and hard to conduct subjective tests. Finally, the applicability of the E-model or the modified E-model in measuring the quality of services in Service Oriented Computing (SOC) is illustrated

De Montfort University Open Research Archive

OpenGrey Repository

Design and implementation of an on-line demonstrator for a video telephony system over heterogeneus networks

Author: Pinar Loriente Gerardo
Publication venue
Publication date: 01/01/2013
Field of study

In recent times, Next Generation Mobile Networks (NGMN) enable user's mobility, not needing to be in a fixed place anymore. For this service to be successful, seamless transitions between the different technologies become essential, in order to make possible the always best connected goal. This brings an additional problematic, which is the impairments resulting from a handover between two networks. In order to succesfully plan and continue the development of always on services and mobility management, the approach must be based on user's perception of phenomena such us packet loss, the so-called Quality of Experience (QoE). This is the context in which Mobisense was born, intending a better understanding of NGMN transmission phenomena and resulting quality. Hence, Mobisense project is focused on the evaluation of the quality of service from user's point of view and on the seamless switch provision between video codecs when connections are transferred between two networks. On that purpose, a NGMN test environment was developed for real-time multimedia services. In this environment, specific network conditions can be associated with user's assessments within a realistic model. Mobisense project creates the foundations for the employment of advance prediction methods for real-time mobility management, to make decisions depending on the characteristics measured in the network and the predictions of quality resulting from them. The present degree final project gathers the work carried out to develop an extension for Mobisense testbed, in order to deploy it in a real environment of network technologies, as well as the integration with Quality of Service (QoS) algorithms. Therefore, the aim of this project consists on the development of the software required for the creation of a video thelephony system over NGMN, taking Mobisense and MultiRAT testbeds as starting point. Mobisense brings adaptation in the application layer and user's perception, and MultiRAT provides QoS adaptation and new wireless technologies. Both testbeds combined to explore more QoE aspects in wireless networks of tomorrow.En los últimos tiempos, las NGMN posibilitan la movilidad del usuario, sin ser ya necesaria su permanencia en un lugar fijo. Para el éxito de este servicio se hacen indispensables transiciones continuas entre las diferentes tecnologías, de manera que sea posible el objetivo de "siempre la mejor conexión". Esto lleva consigo una problemática adicional, que son las de ciencias resultantes del handover entre dos redes. Para planificar y continuar satisfactoriamente el desarrollo de servicios always on y la gestión de la movilidad, el enfoque debe ser en base a la percepción del usuario de fenómenos tales como la pérdida de paquetes, la llamada QoE. En este contexto nació el proyecto Mobisense, buscando una mejor comprensión del fenómeno de transmisión NGMN y la calidad resultante. El proyecto Mobisense se centra, por tanto, en la evaluación de la calidad de servicio desde el punto de vista del usuario y en la provisión de cambios continuos entre codecs de vídeo al transmitirse conexiones entre dos redes. Para tal propósito, un entorno de pruebas NGMN fue desarrollado para servicios multimedia en tiempo real. En este entorno, pueden asociarse determinadas condiciones en la red con valoraciones de calidad por parte del usuario en un modelo realista. El proyecto Mobisense sienta las bases para el empleo de métodos avanzados de predicción para la gestión de movilidad en tiempo real, para tomar decisiones dependientes de las características de la red medidas y de las predicciones de calidad derivadas a partir de éstas. El presente proyecto de fin de carrera recoge el trabajo realizado para desarrollar una extensión del testbed Mobisense, de cara a desplegarlo en un entorno real de tecnologías de red, así como la integración de algoritmos de QoS. Por tanto, el objeto de este proyecto consiste en el desarrollo software requerido para la creación de un sistema de videotelefonía sobre NGMN, tomando los testbeds Mobisense y MultiRAT como punto de partida. Mobisense aporta adaptación en la capa de aplicación y la percepción de usuario, y MultiRAT proporciona adaptación QoS y nuevas tecnologías de red. Ambos testbeds se combinan para la exploración más amplia de los aspectos de QoE en las redes inalámbricas del mañana.Ingeniería de Telecomunicació

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

Enhanced transport protocols for real time and streaming applications on wireless links

Author: Sarwar Golam
Publication venue: UNSW, Sydney
Publication date: 01/01/2013
Field of study

Real time communications have, in the last decade, become a highly relevant component of Internet applications and services, with both interactive communications and streamed content being used in developed and developing countries alike. Due to the proliferation of mobile devices, wireless media is becoming the means of transmitting a large part of this increasingly important real time communications traffic. Wireless has also become an important technology in developing countries, with satellite communications being increasingly deployed for traffic backhaul and ubiquitous connection to the Internet. A number of issues need to be addressed in order to have an acceptable service quality for real time communications in wireless environments. In addition to this, the availability of multiple wireless interfaces on mobile devices presents an opportunity to improve and further exacerbates the issues already present on single wireless links. Therefore in this thesis, we consider improvements to transport protocols for real time communications and streaming services to address these problems and we provide the following contributions. To deal with wireless link issues of errors and delay, we propose two enhancements. First, an improvement technique for Datagram Congestion Control Protocol CCID4 for long delay wireless (e.g. satellite) links, demonstrating significant performance improvements for Voice over IP applications. To deal with link errors, we have proposed, implemented and evaluated an erasure coding based packet error correction approach for Concurrent Multipath Transfer extension of Stream Control Transport Protocol data transport over multiple wireless paths. We have identified packet reordering as a major cause of performance degradation in both single and multi-path transport protocols for real time communications and media streaming. We have proposed a dynamically resizable buffer based solution to mitigate this problem within the DCCP protocol. For improving the performance of multi-path transport protocols over dissimilar network paths, we have proposed a delay aware packet scheduling scheme, which significantly improves the performance of multimedia and bulk data transfer with CMT-SCTP in heterogeneous multi-path network scenarios. Finally, we have developed a tool for online streaming video quality evaluation experiments, comprising a real-time cross-layer video streaming technique implemented within an open-source H.264 video encoder tool called x264

UNSWorks

Contribution to quality of user experience provision over wireless networks

Author: Sánchez Iborra Ramón Jesús
Publication venue: 'Universidad Politecnica de Cartagena'
Publication date: 01/01/2015
Field of study

The widespread expansion of wireless networks has brought new attractive possibilities to end users. In addition to the mobility capabilities provided by unwired devices, it is worth remarking the easy configuration process that a user has to follow to gain connectivity through a wireless network. Furthermore, the increasing bandwidth provided by the IEEE 802.11 family has made possible accessing to high-demanding services such as multimedia communications. Multimedia traffic has unique characteristics that make it greatly vulnerable against network impairments, such as packet losses, delay, or jitter. Voice over IP (VoIP) communications, video-conference, video-streaming, etc., are examples of these high-demanding services that need to meet very strict requirements in order to be served with acceptable levels of quality. Accomplishing these tough requirements will become extremely important during the next years, taking into account that consumer video traffic will be the predominant traffic in the Internet during the next years. In wired systems, these requirements are achieved by using Quality of Service (QoS) techniques, such as Differentiated Services (DiffServ), traffic engineering, etc. However, employing these methodologies in wireless networks is not that simple as many other factors impact on the quality of the provided service, e.g., fading, interferences, etc. Focusing on the IEEE 802.11g standard, which is the most extended technology for Wireless Local Area Networks (WLANs), it defines two different architecture schemes. On one hand, the infrastructure mode consists of a central point, which manages the network, assuming network controlling tasks such as IP assignment, routing, accessing security, etc. The rest of the nodes composing the network act as hosts, i.e., they send and receive traffic through the central point. On the other hand, the IEEE 802.11 ad-hoc configuration mode is less extended than the infrastructure one. Under this scheme, there is not a central point in the network, but all the nodes composing the network assume both host and router roles, which permits the quick deployment of a network without a pre-existent infrastructure. This type of networks, so called Mobile Ad-hoc NETworks (MANETs), presents interesting characteristics for situations when the fast deployment of a communication system is needed, e.g., tactics networks, disaster events, or temporary networks. The benefits provided by MANETs are varied, including high mobility possibilities provided to the nodes, network coverage extension, or network reliability avoiding single points of failure. The dynamic nature of these networks makes the nodes to react to topology changes as fast as possible. Moreover, as aforementioned, the transmission of multimedia traffic entails real-time constraints, necessary to provide these services with acceptable levels of quality. For those reasons, efficient routing protocols are needed, capable of providing enough reliability to the network and with the minimum impact to the quality of the service flowing through the nodes. Regarding quality measurements, the current trend is estimating what the end user actually perceives when consuming the service. This paradigm is called Quality of user Experience (QoE) and differs from the traditional Quality of Service (QoS) approach in the human perspective given to quality estimations. In order to measure the subjective opinion that a user has about a given service, different approaches can be taken. The most accurate methodology is performing subjective tests in which a panel of human testers rates the quality of the service under evaluation. This approach returns a quality score, so-called Mean Opinion Score (MOS), for the considered service in a scale 1 - 5. This methodology presents several drawbacks such as its high expenses and the impossibility of performing tests at real time. For those reasons, several mathematical models have been presented in order to provide an estimation of the QoE (MOS) reached by different multimedia services In this thesis, the focus is on evaluating and understanding the multimedia-content transmission-process in wireless networks from a QoE perspective. To this end, firstly, the QoE paradigm is explored aiming at understanding how to evaluate the quality of a given multimedia service. Then, the influence of the impairments introduced by the wireless transmission channel on the multimedia communications is analyzed. Besides, the functioning of different WLAN schemes in order to test their suitability to support highly demanding traffic such as the multimedia transmission is evaluated. Finally, as the main contribution of this thesis, new mechanisms or strategies to improve the quality of multimedia services distributed over IEEE 802.11 networks are presented. Concretely, the distribution of multimedia services over ad-hoc networks is deeply studied. Thus, a novel opportunistic routing protocol, so-called JOKER (auto-adJustable Opportunistic acK/timEr-based Routing) is presented. This proposal permits better support to multimedia services while reducing the energy consumption in comparison with the standard ad-hoc routing protocols.Universidad Politécnica de CartagenaPrograma Oficial de Doctorado en Tecnologías de la Información y Comunicacione

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Digital de la Universidad Politécnica de Cartagena

Recommended from our members

Time-frequency analysis based on split spectrum applied to audio and ultrasonic signals

Author: Pedram Rad Seyed Kamran
Publication venue: Brunel University London
Publication date: 01/01/2018
Field of study

This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University LondonSignal processing is a large subject with applications integral to a number of technological fields such as communication, audio, Voice over IP (VoIP), pattern recognition, sonar, radar, ultrasound and medical imaging. Techniques exist for the analysis, modelling, extraction, recognition and synthesis of signals of interest. The focus of this thesis is signal processing for acoustics (both sonic and ultrasonic). In the applications examined, signals of interest are usually incomplete, distorted and/or noisy. Therefore, reconstructing the signal, noise reduction and removal of any distortion/interference are the main goals of the signal processing techniques presented. The primary aim is to study and develop an advanced time-frequency signal processing technique for acoustic applications to enhance the quality of the signals. In the first part of the thesis, a technique is presented that models and maintains the correlation between temporal and spectral parameters of audio signals. A novel Packet Loss Concealment (PLC) method is developed with applications to VoIP, audio broadcasting, and streaming. The problem of modelling the time-varying frequency spectrum in the context of PLC is addressed, and a novel solution is proposed for tracking and using the temporal motion of spectral flow to reconstruct the signal. The proposed method utilises a Time-Frequency Motion (TFM) matrix representation of the audio signal, where each frequency is tagged with a motion vector estimate that is assessed by cross-correlation of the movement of spectral energy within sub-bands across time frames. The missing packets are estimated using extrapolation or interpolation algorithms using a TFM matrix and then inverse transformed to the time-domain for reconstruction of the signal. The proposed method is compared with conventional approaches using objective Performance Evaluation of Speech Quality (PESQ), and subjective Mean Opinion Scores (MOS) in a range of packet loss from 5% to 20%. The evaluation results demonstrate that the proposed algorithm substantially improves performance by an average of 2.85% and 5.9% in terms of PESQ and MOS respectively. In the second part of the thesis, the proposed method is extended and modified to address challenges of excessive coherent noise arising from ultrasonic signals gathered during Guided Wave Testing (GWT). It is an advanced Non-destructive testing technique which is used over several branches of industry to inspect large structures for defects where the structural integrity is of concern. In such systems, signal interpretation can often be challenging due to the multi-modal and dispersive propagation of Ultrasonic Guided Waves (UGWs). The multi-modal and dispersive nature of the received signals hampers the ability to detect defects in a given structure. The Split-Spectrum Processing (SSP) method with application for such signal has been studied and reviewed quantitatively to measure the enhancement in terms of Signal-to-Noise Ratio (SNR) and spatial resolution. In this thesis, the influence of SSP filter bank parameters on these signals is studied and optimised to improve SNR and spatial resolution considerably. The proposed method is compared analytically and experimentally with conventional approaches. The proposed SSP algorithm substantially improves SNR by an average of 30dB. The conclusions reached in this thesis will contribute to the progression of the GWT technique through considerable improvement in defect detection capability.Centre for Electronic Systems Research (CESR) of Brunel University London, The National Structural Integrity Research Centre (NSIRC) and TWI Ltd

Brunel University Research Archive

Planning and dynamic spectrum management in heterogeneous mobile networks with QoE optimization

Author: Robalo Daniel Luís Silveira
Publication venue
Publication date: 01/08/2014
Field of study

The radio and network planning and optimisation are continuous processes that do not end after the network has been launched. To achieve the best trade-offs, especially between quality and costs, operators make use of several coverage and capacity enhancement methods. The research from this thesis proposes methods such as the implementation of cell zooming and Relay Stations (RSs) with dynamic sleep modes and Carrier Aggregation (CA) for coverage and capacity enhancements. Initially, a survey is presented on ubiquitous mesh networks implementation scenarios and an updated characterization of requirements for services and applications is proposed. The performance targets for the key parameters, delay, delay variation, information loss and throughput have been addressed for all types of services. Furthermore, with the increased competition, mobile operator’s success does not only depend on how good the offered Quality of Service (QoS) is, but also if it meets the end user’s expectations, i.e., Quality of Experience (QoE). In this context, a model for the mapping between QoS parameters and QoE has been proposed for multimedia traffic. The planning and optimization of fixed Worldwide Interoperability for Microwave Access (WiMAX) networks with RSs in conjunction with cell zooming has been addressed. The challenging case of a propagation measurement-based scenario in the hilly region of Covilhã has been considered. A cost/revenue function has been developed by taking into account the cost of building and maintaining the infrastructure with the use of RSs. This part of the work also investigates the energy efficiency and economic implications of the use of power saving modes for RSs in conjunction with cell zooming. Assuming that the RSs can be switched-off or zoomed out to zero in periods when the trafﬁc exchange is low, such as nights and weekends, it has been shown that energy consumption may be reduced whereas cellular coverage and capacity, as well as economic performance may be improved. An integrated Common Radio Resource Management (iCRRM) entity is proposed that implements inter-band CA by performing scheduling between two Long Term Evolution – Advanced (LTE-A) Component Carriers (CCs). Considering the bandwidths available in Portugal, the 800 MHz and 2.6 GHz CCs have been considered whilst mobile video traffic is addressed. Through extensive simulations it has been found that the proposed multi-band schedulers overcome the capacity of LTE systems without CA. Result shown a clear improvement of the QoS, QoE and economic trade-off with CA

UBibliorum repositorio digital da ubi

Sparsity in Linear Predictive Coding of Speech

Author: Giacobello Daniele
Publication venue: Multimedia Information and Signal Processing, Institute of Electronic Systems, Aalborg University
Publication date: 01/01/2010
Field of study

nrpages: 197status: publishe

Lirias

VBN