Search CORE

154 research outputs found

Improving the robustness of CELP-like speech decoders using late-arrival packets information : application to G.729 standard in VoIP

Author: Khadra Ali
Publication venue: 'Universite de Sherbrooke'
Publication date: 01/01/2003
Field of study

L'utilisation de la voix sur Internet est une nouvelle tendance dans Ie secteur des télécommunications et de la réseautique. La paquetisation des données et de la voix est réalisée en utilisant Ie protocole Internet (IP). Plusieurs codecs existent pour convertir la voix codée en paquets. La voix codée est paquetisée et transmise sur Internet. À la réception, certains paquets sont soit perdus, endommages ou arrivent en retard. Ceci est cause par des contraintes telles que Ie délai («jitter»), la congestion et les erreurs de réseau. Ces contraintes dégradent la qualité de la voix. Puisque la transmission de la voix est en temps réel, Ie récepteur ne peut pas demander la retransmission de paquets perdus ou endommages car ceci va causer plus de délai. Au lieu de cela, des méthodes de récupération des paquets perdus (« concealment ») s'appliquent soit à l'émetteur soit au récepteur pour remplacer les paquets perdus ou endommages. Ce projet vise à implémenter une méthode innovatrice pour améliorer Ie temps de convergence suite a la perte de paquets au récepteur d'une application de Voix sur IP. La méthode a déjà été intégrée dans un codeur large-bande (AMR-WB) et a significativement amélioré la qualité de la voix en présence de <<jitter » dans Ie temps d'arrivée des trames au décodeur. Dans ce projet, la même méthode sera intégrée dans un codeur a bande étroite (ITU-T G.729) qui est largement utilise dans les applications de voix sur IP. Le codeur ITU-T G.729 défini des standards pour coder et décoder la voix a 8 kb/s en utilisant 1'algorithme CS-CELP (Conjugate Stmcture Algebraic Code-Excited Linear Prediction).Abstract: Voice over Internet applications is the new trend in telecommunications and networking industry today. Packetizing data/voice is done using the Internet protocol (IP). Various codecs exist to convert the raw voice data into packets. The coded and packetized speech is transmitted over the Internet. At the receiving end some packets are either lost, damaged or arrive late. This is due to constraints such as network delay (fitter), network congestion and network errors. These constraints degrade the quality of speech. Since voice transmission is in real-time, the receiver can not request the retransmission of lost or damaged packets as this will cause more delay. Instead, concealment methods are applied either at the transmitter side (coder-based) or at the receiver side (decoder-based) to replace these lost or late-arrival packets. This work attempts to implement a novel method for improving the recovery time of concealed speech The method has already been integrated in a wideband speech coder (AMR-WB) and significantly improved the quality of speech in the presence of jitter in the arrival time of speech frames at the decoder. In this work, the same method will be integrated in a narrowband speech coder (ITU-T G.729) that is widely used in VoIP applications. The ITUT G.729 coder defines the standards for coding and decoding speech at 8 kb/s using Conjugate Structure Algebraic Code-Excited Linear Prediction (CS-CELP) Algorithm

Savoirs UdeS

Bilateral Waveform Similarity Overlap-and-Add Based Packet Loss Concealment for Voice over IP

Author: J.F. Yeh
M.D. Kuo
P.C. Lin
Z.H. Hsu
Publication venue: Elsevier
Publication date: 01/08/2013
Field of study

This paper invested a bilateral waveform similarity overlap-and-add algorithm for voice packet lost. Since Packet lost will cause the semantic misunderstanding, it has become one of the most essential problems in speech communication. This investment is based on waveform similarity measure using overlap-and-Add algorithm and provides the bilateral information to enhance the speech signal reconstruction. Traditionally, it has been improved that waveform similarity overlap-and-add (WSOLA) technique is an effective algorithm to deal with packet loss concealment (PLC) for real-time time communication. WSOLA algorithm is widely applied to deal with the length adaptation and packet loss concealment of speech signal. Time scale modification of audio signal is one of the most essential research topics in data communication, especially in voice of IP (VoIP). Herein, the proposed the bilateral WSOLA (BWSOLA) that is derived from WSOLA. Instead of only exploitation one direction speech data, the proposed method will reconstruct the lost voice data according to the preceding and cascading data. The related algorithms have been developed to achieve the optimal reconstructing estimation. The experimental results show that the quality of the reconstructed speech signal of the bilateral WSOLA is much better compared to the standard WSOLA and GWSOLA on different packet loss rate and length using the metrics PESQ and MOS. The significant improvement is obtained by bilateral information and proposed method. The proposed bilateral waveform similarity overlap-and-add (BWSOLA) outperforms the traditional approaches especially in the long duration data loss

Directory of Open Access Journals

Improved voice quality with the combination of transport layer & audio codec for wireless devices

Author: Jannati Binti Roslin Raihan
Khalifa Othman O.
Shah Newaj Bhuiyan Sharif
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/06/2019
Field of study

Improving voice quality over wireless communication becomes a demanding feature for social media apps like facebook, whatsapp and other communication channels. Voice-over-internet protocol (VoIP) helps us to make quick telephone calls over the internet. It includes various mechanism which are signaling, controlling and transport layer. Over wireless links, packet loss and high transmission delay damage voice quality. Here VoIP quality will be measured by three main elements which are signaling protocol, audio codec and transport layer. To improve the overall voice quality, we need to combine these three elements properly to get the best score. Otherwise perceptual speech quality will not be the right tool to measure the voice quality. Here we will use Mean Opinion Score (MOS) for calculated jitter values and end to end delay. At the end, best combination of audio codec & signaling protocol produced the quality speech

Experiences of VoIP traffic monitoring in a commercial ISP

Author: Birke Robert Rene' Maria
Mellia Marco
Petracca M.
Rossi D.
Publication venue: Wiley
Publication date: 01/01/2010
Field of study

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Objective Measurement of Speech Quality in VoIP over Wireless LAN during Handoff

Author: Gambhir Nidhi Marwaha
Publication venue: SJSU ScholarWorks
Publication date: 01/01/2009
Field of study

Quality of Service is a very important factor to determine the quality of a VoIP call. Different subjective and objective models exist for evaluating the speech quality in VoIP. E-model is one of the objective methods of measuring the speech quality; it considers various factors like packet loss, delay and codec impairments. The calculations of Emodel are not very accurate in case of handovers – when a VoIP call moves from one wireless LAN to another. This project conducted experimental evaluation of performance of E-model during handovers and proposes a new approach to accurately calculate the speech quality of VoIP during handovers. A detailed description of the experimental setup and the comparison of the new approach with E-model is presented in this report

SJSU ScholarWorks

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

Author: Aironi Carlo
Cornell Samuele
Serafini Luca
Squartini Stefano
Publication venue
Publication date: 28/07/2023
Field of study

Packet loss is a major cause of voice quality degradation in VoIP transmissions with serious impact on intelligibility and user experience. This paper describes a system based on a generative adversarial approach, which aims to repair the lost fragments during the transmission of audio streams. Inspired by the powerful image-to-image translation capability of Generative Adversarial Networks (GANs), we propose bin2bin, an improved pix2pix framework to achieve the translation task from magnitude spectrograms of audio frames with lost packets, to noncorrupted speech spectrograms. In order to better maintain the structural information after spectrogram translation, this paper introduces the combination of two STFT-based loss functions, mixed with the traditional GAN objective. Furthermore, we employ a modified PatchGAN structure as discriminator and we lower the concealment time by a proper initialization of the phase reconstruction algorithm. Experimental results show that the proposed method has obvious advantages when compared with the current state-of-the-art methods, as it can better handle both high packet loss rates and large gaps.Comment: Accepted at EUSIPCO - 31st European Signal Processing Conference, 202

arXiv.org e-Print Archive

Audio Inpainting

Author: Adler A
Elad M
Emiya V
Gribonval R
Jafari MG
Plumbley MD
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/03/2012
Field of study

(c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. Published version: IEEE Transactions on Audio, Speech and Language Processing 20(3): 922-932, Mar 2012. DOI: 10.1090/TASL.2011.2168211

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

Surrey Research Insight

Hal-Diderot

HAL-Rennes 1

VoIP Packet Delay Techniques: A Survey

Author: R.Shankar
Publication venue: Global Journals Inc. (US)
Publication date: 29/03/2014
Field of study

The continuous development in the field of communication have paved the way for Voice over Internet Protocol (VoIP). VoIP is a group of hardware and software that facilitates people to utilize the Internet as the transmission medium for telephone calls by transmitting voice data in packets using IP instead of using conventional circuit transmissions of the Public Switched Telephone Network (PSTN). At present, VoIP is becoming an important tool for quick communication across the world. There are several Internet telephony applications existing at present. The major disadvantage in VoIP is that the packet delay. In VoIP, the terminology jitter is used to refer the type of packet delay where the delay has a huge setback in the quality of the voice conversation. Several packet delay techniques were proposed in recent years. Some of the important packet delay techniques are discussed in the literature. This survey would definitely help the researchers to carry out their research for providing better communication in VoIP without any delay