185 research outputs found
User-Oriented QoS in Packet Video Delivery
We focus on packet video delivery, with an emphasis on the quality of service perceived by the end-user. A video signal passes through several subsystems, such as the source coder, the network and the decoder. Each of these can impair the information, either by data loss or by introducing delay. We describe how each of the subsystems can be tuned to optimize the quality of the delivered signal, for a given available bit rate in the network. The assessment of end-user quality is not trivial. We present recent research results, which rely on a model of the human visual system
Improving the robustness of CELP-like speech decoders using late-arrival packets information : application to G.729 standard in VoIP
L'utilisation de la voix sur Internet est une nouvelle tendance dans Ie secteur des télécommunications et de la réseautique. La paquetisation des données et de la voix est réalisée en utilisant Ie protocole Internet (IP). Plusieurs codecs existent pour convertir la voix codée en paquets. La voix codée est paquetisée et transmise sur Internet. À la réception, certains paquets sont soit perdus, endommages ou arrivent en retard. Ceci est cause par des contraintes telles que Ie délai («jitter»), la congestion et les erreurs de réseau. Ces contraintes dégradent la qualité de la voix. Puisque la transmission de la voix est en temps réel, Ie récepteur ne peut pas demander la retransmission de paquets perdus ou endommages car ceci va causer plus de délai. Au lieu de cela, des méthodes de récupération des paquets perdus (« concealment ») s'appliquent soit à l'émetteur soit au récepteur pour remplacer les paquets perdus ou endommages. Ce projet vise à implémenter une méthode innovatrice pour améliorer Ie temps de convergence suite a la perte de paquets au récepteur d'une application de Voix sur IP. La méthode a déjà été intégrée dans un codeur large-bande (AMR-WB) et a significativement amélioré la qualité de la voix en présence de <<jitter » dans Ie temps d'arrivée des trames au décodeur. Dans ce projet, la même méthode sera intégrée dans un codeur a bande étroite (ITU-T G.729) qui est largement utilise dans les applications de voix sur IP. Le codeur ITU-T G.729 défini des standards pour coder et décoder la voix a 8 kb/s en utilisant 1'algorithme CS-CELP (Conjugate Stmcture Algebraic Code-Excited Linear Prediction).Abstract: Voice over Internet applications is the new trend in telecommunications and networking industry today. Packetizing data/voice is done using the Internet protocol (IP). Various codecs exist to convert the raw voice data into packets. The coded and packetized speech is transmitted over the Internet. At the receiving end some packets are either lost, damaged or arrive late. This is due to constraints such as network delay (fitter), network congestion and network errors. These constraints degrade the quality of speech. Since voice transmission is in real-time, the receiver can not request the retransmission of lost or damaged packets as this will cause more delay. Instead, concealment methods are applied either at the transmitter side (coder-based) or at the receiver side (decoder-based) to replace these lost or late-arrival packets. This work attempts to implement a novel method for improving the recovery time of concealed speech The method has already been integrated in a wideband speech coder (AMR-WB) and significantly improved the quality of speech in the presence of jitter in the arrival time of speech frames at the decoder. In this work, the same method will be integrated in a narrowband speech coder (ITU-T G.729) that is widely used in VoIP applications. The ITUT G.729 coder defines the standards for coding and decoding speech at 8 kb/s using Conjugate Structure Algebraic Code-Excited Linear Prediction (CS-CELP) Algorithm
Flow control of real-time unicast multimedia applications in best-effort networks
One of the fastest growing segments of Internet applications are real-time mul-
timedia applications, like Voice over Internet Protocol (VoIP). Real-time multimedia
applications use the User Datagram Protocol (UDP) as the transport protocol because
of the inherent conservative nature of the congestion avoidance schemes of Transmis-
sion Control Protocol (TCP). The e®ects of uncontrolled °ows on the Internet have
not yet been felt because UDP tra±c frequently constitutes only » 20% of the total
Internet tra±c. It is pertinent that real-time multimedia applications become better
citizens of the Internet, while at the same time deliver acceptable Quality of Service
(QoS).
Traditionally, packet losses and the increase in the end-to-end delay experienced
by some of the packets characterizes congestion in the network. These two signals
have been used to develop most known °ow control schemes. The current research
considers the °ow accumulation in the network as the signal for use in °ow control.
The most signi¯cant contribution of the current research is to propose novel end-
to-end °ow control schemes for unicast real-time multimedia °ows transmitting over
best-e®ort networks. These control schemes are based on predictive control of the
accumulation signal. The end-to-end control schemes available in the literature are
based on reactive control that do not take into account the feedback delay existing
between the sender and the receiver nor the forward delay in the °ow dynamics. The performance of the proposed control schemes has been evaluated using the
ns-2 simulation environment. The research concludes that active control of hard real-
time °ows delivers the same or somewhat better QoS as High Bit Rate (HBR, no
control), but with a lower average bit rate. Consequently, it helps reduce bandwidth
use of controlled real-time °ows by anywhere between 31:43% to 43:96%. Proposed
reactive control schemes deliver good QoS. However, they do not scale up as well as
the predictive control schemes. Proposed predictive control schemes are e®ective in
delivering good quality QoS while using up less bandwidth than even the reactive con-
trol schemes. They scale up well as more real-time multimedia °ows start employing
them
Analyzing Voice And Video Call Service Performance Over A Local Area Network
Tez (Yüksek Lisans) -- İstanbul Teknik Üniversitesi, Fen Bilimleri Enstitüsü, 2010Thesis (M.Sc.) -- İstanbul Technical University, Institute of Science and Technology, 2010Bu çalışmada, VOIP teknolojisinden ve bu teknolojiyi kablolu ve kablosuz ortamda gerçeklemenin en önemli darboğazları anlatılacaktır. Ayrıca H.323, SIP (Session Initiation Protocol), Megaco ve MGCP gibi yaygın olarak kullanılan ses iletim protokolleri ve H.261, H.263 ve H.264 gibi görüntü iletim protokollerinden bahsedilmiştir. Ses kodek seçimi ve VOIP servis kalitesine etki eden faktörleri anlatılmaktadır. Bu tezde, ses, görüntü ve veri iletişimini aynı anda bünyesinde barındıran gerçek şebekeler simüle edilecektir. Kullanıcılara rastlantısal olarak ses, görüntü ve FTP gibi birtakım uygulamalar atanmıştır. Ayrıca önerilen kablolu şebekeye, kablosuz bir şebeke ilave edilerek sonuçlar incelenecektir. Optimal servis kalitesini sağlamak için seçilen uygun kuyruklama mekanizmaları ve kodek seçimlerini içeren senaryolar incelenecek ve OPNET ile elde edilmiş simülasyon sonuçları tartışılacaktır.In this study, we present a detailed description of the VoIP and also the most common challenges of implementing voice communication into wireline or wireless networks are discussed. Common voice protocols, such as H.323, Session Initiation Protocol (SIP), Megaco, MGCP and video protocols such as H.261, H.263, H.264 are described as well. CODEC selection and factors affecting VoIP Quality of Service are analyzed. We simulate a real network which includes both voice, video and data communication simultaneously. Workstations are randomly assigned to different applications, such as voice, video and FTP. We will also implement a wireless network to our proposed system. The scenarios including selecting appropriate queuing scheme and codec selection are presented and the simulation results with OPNET are drawn.Yüksek LisansM.Sc
Scalable Speech Coding for IP Networks
The emergence of Voice over Internet Protocol (VoIP) has posed new challenges to the development of speech codecs. The key issue of transporting real-time voice packet over IP networks is the lack of guarantee for reasonable speech quality due to packet delay or loss.
Most of the widely used narrowband codecs depend on the Code Excited Linear Prediction (CELP) coding technique. The CELP technique utilizes the long-term prediction across the frame boundaries and therefore causes error propagation in the case of packet loss and need to transmit redundant information in order to mitigate the problem. The internet Low Bit-rate Codec (iLBC) employs the frame-independent coding and therefore inherently possesses high robustness to packet loss. However, the original iLBC lacks in some of the key features of speech codecs for IP networks: Rate flexibility, Scalability, and Wideband support.
This dissertation presents novel scalable narrowband and wideband speech codecs for IP networks using the frame independent coding scheme based on the iLBC. The rate flexibility is added to the iLBC by employing the discrete cosine transform (DCT) and iii the scalable algebraic vector quantization (AVQ) and by allocating different number of bits to the AVQ. The bit-rate scalability is obtained by adding the enhancement layer to the core layer of the multi-rate iLBC. The enhancement layer encodes the weighted iLBC coding error in the modified DCT (MDCT) domain. The proposed wideband codec employs the bandwidth extension technique to extend the capabilities of existing narrowband codecs to provide wideband coding functionality. The wavelet transform is also used to further enhance the performance of the proposed codec.
The performance evaluation results show that the proposed codec provides high robustness to packet loss and achieves equivalent or higher speech quality than state-of-the-art codecs under the clean channel condition
Multimedia streaming over wireless channels
The improvements in mobile communication systems have accelerated the development of new multimedia streaming techniques to increase the quality of streaming data over time varying wireless channels. In order to increase multimedia quality, error control schemes are indispensable due to time-varying and erroneous nature of the channel. However, relatively low channel capacity of wireless channels, and dependency structure in multimedia limit the eectiveness of existing error control schemes and require more sophisticated techniques to provide quality improvement on the streaming data. In this thesis, we propose sender driven multimedia streaming algorithms that incorporate error control schemes of FEC, ARQ, and packet scheduling by considering media and channel parameters such as packet importance, packet dependencies, decoding deadlines, channel state information, and channel capacity. Initially, we have proposed a multi-rate distortion optimization framework so as to jointly optimize FEC rate and packet selection by minimizing end-to-end distortion to satisfy a specified Quality of Service under channel capacity constraint. Minimization of end-to-end distortion causes computational complexity in the rate distortion optimization framework due to dependency in encoded multimedia. Therefore, we propose multimedia streaming algorithms that select packet and FEC rate with reduced computational complexity and high quality as compared with multi-rate distortion optimization framework. Additionally, protocol stack of a UMTS cellular network system with W-CDMA air interface is presented in order to clarify the relation between proposed multimedia streaming algorithms and UMTS system that is used in simulations. Finally, proposed algorithms are simulated and results demonstrate that proposed algorithms improve multimedia quality significantly as compared to existing methods
Error resilient packet switched H.264 video telephony over third generation networks.
Real-time video communication over wireless networks is a challenging problem because
wireless channels suffer from fading, additive noise and interference, which translate
into packet loss and delay. Since modern video encoders deliver video packets with
decoding dependencies, packet loss and delay can significantly degrade the video quality
at the receiver. Many error resilience mechanisms have been proposed to combat packet
loss in wireless networks, but only a few were specifically designed for packet switched
video telephony over Third Generation (3G) networks.
The first part of the thesis presents an error resilience technique for packet switched
video telephony that combines application layer Forward Error Correction (FEC) with
rateless codes, Reference Picture Selection (RPS) and cross layer optimization. Rateless
codes have lower encoding and decoding computational complexity compared to traditional
error correcting codes. One can use them on complexity constrained hand-held
devices. Also, their redundancy does not need to be fixed in advance and any number of
encoded symbols can be generated on the fly. Reference picture selection is used to limit
the effect of spatio-temporal error propagation. Limiting the effect of spatio-temporal
error propagation results in better video quality. Cross layer optimization is used to
minimize the data loss at the application layer when data is lost at the data link layer.
Experimental results on a High Speed Packet Access (HSPA) network simulator for
H.264 compressed standard video sequences show that the proposed technique achieves
significant Peak Signal to Noise Ratio (PSNR) and Percentage Degraded Video Duration
(PDVD) improvements over a state of the art error resilience technique known as
Interactive Error Control (IEC), which is a combination of Error Tracking and feedback
based Reference Picture Selection. The improvement is obtained at a cost of higher
end-to-end delay.
The proposed technique is improved by making the FEC (Rateless code) redundancy
channel adaptive. Automatic Repeat Request (ARQ) is used to adjust the redundancy
of the Rateless codes according to the channel conditions. Experimental results show
that the channel adaptive scheme achieves significant PSNR and PDVD improvements
over the static scheme for a simulated Long Term Evolution (LTE) network.
In the third part of the thesis, the performance of the previous two schemes is
improved by making the transmitter predict when rateless decoding will fail. In this
case, reference picture selection is invoked early and transmission of encoded symbols
for that source block is aborted. Simulations for an LTE network show that this results
in video quality improvement and bandwidth savings.
In the last part of the thesis, the performance of the adaptive technique is improved
by exploiting the history of the wireless channel. In a Rayleigh fading wireless channel,
the RLC-PDU losses are correlated under certain conditions. This correlation is
exploited to adjust the redundancy of the Rateless code and results in higher Rateless
code decoding success rate and higher video quality. Simulations for an LTE network
show that the improvement was significant when the packet loss rate in the two wireless
links was 10%.
To facilitate the implementation of the proposed error resilience techniques in practical
scenarios, RTP/UDP/IP level packetization schemes are also proposed for each
error resilience technique.
Compared to existing work, the proposed error resilience techniques provide better
video quality. Also, more emphasis is given to implementation issues in 3G networks
Recommended from our members
Multimedia delivery in the future internet
The term “Networked Media” implies that all kinds of media including text, image, 3D graphics, audio
and video are produced, distributed, shared, managed and consumed on-line through various networks,
like the Internet, Fiber, WiFi, WiMAX, GPRS, 3G and so on, in a convergent manner [1]. This white
paper is the contribution of the Media Delivery Platform (MDP) cluster and aims to cover the Networked
challenges of the Networked Media in the transition to the Future of the Internet.
Internet has evolved and changed the way we work and live. End users of the Internet have been confronted
with a bewildering range of media, services and applications and of technological innovations concerning
media formats, wireless networks, terminal types and capabilities. And there is little evidence that the pace
of this innovation is slowing. Today, over one billion of users access the Internet on regular basis, more
than 100 million users have downloaded at least one (multi)media file and over 47 millions of them do so
regularly, searching in more than 160 Exabytes1 of content. In the near future these numbers are expected
to exponentially rise. It is expected that the Internet content will be increased by at least a factor of 6, rising
to more than 990 Exabytes before 2012, fuelled mainly by the users themselves. Moreover, it is envisaged
that in a near- to mid-term future, the Internet will provide the means to share and distribute (new)
multimedia content and services with superior quality and striking flexibility, in a trusted and personalized
way, improving citizens’ quality of life, working conditions, edutainment and safety.
In this evolving environment, new transport protocols, new multimedia encoding schemes, cross-layer inthe
network adaptation, machine-to-machine communication (including RFIDs), rich 3D content as well as
community networks and the use of peer-to-peer (P2P) overlays are expected to generate new models of
interaction and cooperation, and be able to support enhanced perceived quality-of-experience (PQoE) and
innovative applications “on the move”, like virtual collaboration environments, personalised services/
media, virtual sport groups, on-line gaming, edutainment. In this context, the interaction with content
combined with interactive/multimedia search capabilities across distributed repositories, opportunistic P2P
networks and the dynamic adaptation to the characteristics of diverse mobile terminals are expected to
contribute towards such a vision.
Based on work that has taken place in a number of EC co-funded projects, in Framework Program 6 (FP6)
and Framework Program 7 (FP7), a group of experts and technology visionaries have voluntarily
contributed in this white paper aiming to describe the status, the state-of-the art, the challenges and the way
ahead in the area of Content Aware media delivery platforms
- …