Search CORE

98 research outputs found

A real-time SNR scalable transcoder for MPEG-2 video streams

Author: Al-khrayshah M.
Publication venue
Publication date: 01/01/2009
Field of study

Repository TU/e

Pure OAI Repository

Low Cost Transcoder

Author: Luís Filipe Rodrigues Coelho
Publication venue
Publication date: 14/02/2020
Field of study

Repositório Aberto da Universidade do Porto

An Efficient Motion Estimation Method for H.264-Based Video Transcoding with Arbitrary Spatial Resolution Conversion

Author: Wang Jiao
Publication venue: 'University of Waterloo'
Publication date: 01/01/2007
Field of study

As wireless and wired network connectivity is rapidly expanding and the number of network users is steadily increasing, it has become more and more important to support universal access of multimedia content over the whole network. A big challenge, however, is the great diversity of network devices from full screen computers to small smart phones. This leads to research on transcoding, which involves in efficiently reformatting compressed data from its original high resolution to a desired spatial resolution supported by the displaying device. Particularly, there is a great momentum in the multimedia industry for H.264-based transcoding as H.264 has been widely employed as a mandatory player feature in applications ranging from television broadcast to video for mobile devices. While H.264 contains many new features for effective video coding with excellent rate distortion (RD) performance, a major issue for transcoding H.264 compressed video from one spatial resolution to another is the computational complexity. Specifically, it is the motion compensated prediction (MCP) part. MCP is the main contributor to the excellent RD performance of H.264 video compression, yet it is very time consuming. In general, a brute-force search is used to find the best motion vectors for MCP. In the scenario of transcoding, however, an immediate idea for improving the MCP efficiency for the re-encoding procedure is to utilize the motion vectors in the original compressed stream. Intuitively, motion in the high resolution scene is highly related to that in the down-scaled scene. In this thesis, we study homogeneous video transcoding from H.264 to H.264. Specifically, for the video transcoding with arbitrary spatial resolution conversion, we propose a motion vector estimation algorithm based on a multiple linear regression model, which systematically utilizes the motion information in the original scenes. We also propose a practical solution for efficiently determining a reference frame to take the advantage of the new feature of multiple references in H.264. The performance of the algorithm was assessed in an H.264 transcoder. Experimental results show that, as compared with a benchmark solution, the proposed method significantly reduces the transcoding complexity without degrading much the video quality

University of Waterloo's Institutional Repository

An ICAP-based content repurposing system for ubiquitous access to multimedia content.

Author
Publication venue
Publication date: 01/01/2003
Field of study

by Tam Wing-Lam.Thesis (M.Phil.)--Chinese University of Hong Kong, 2003.Includes bibliographical references (leaves 55-57).Abstracts in English and Chinese.Acknowledgments --- p.iAbstract --- p.ii哲學碩士論文摘要 --- p.iiiChapter Chapter 1 --- Introduction --- p.1Chapter 1.1 --- Research Background --- p.2Chapter 1.2 --- Contribution of the Thesis --- p.5Chapter 1.3 --- Organization of the Thesis --- p.6Chapter Chapter 2 --- Content Repurposing System Architecture --- p.7Chapter 2.1 --- Introduction to ICAP-based Content Repurposing System --- p.7Chapter 2.2 --- Generic Service-enabling Platform --- p.8Chapter 2.3 --- Rule Engine --- p.10Chapter 2.4 --- ICAP-enabled Application Server --- p.10Chapter 2.5 --- Store-and-forward Transcoding and Streamed Transcoding --- p.11Chapter Chapter 3 --- Transcoding Techniques --- p.18Chapter 3.1 --- Text Transcoding --- p.19Chapter 2.2 --- Image Transcoding --- p.20Chapter 3.3 --- Audio Transcoding --- p.23Chapter 3.4 --- Video Transcoding --- p.25Chapter Chapter 4 --- Adaptation Policy --- p.28Chapter 4.1 --- Delay Analysis of Content Repurposing System --- p.30Chapter 4.2 --- Store-and-forward Transcoding for Image Files --- p.31Chapter 4.2.1 --- Distribution of Input Web Images --- p.34Chapter 4.2.2 --- Transcoding Web images to WBMP --- p.34Chapter 4.2.3 --- Adaptation policy of Transformation to WBMP --- p.36Chapter 4.2.4 --- Adaptation policy of JPEG images --- p.36Chapter 4.3 --- Streamed Transcoding for Audio/Video Files --- p.39Chapter 4.3.1 --- Audio Transcoding --- p.41Chapter 4.3.2 --- Video Transcoding --- p.42Chapter 4.4 --- Case Study --- p.43Chapter 4.4.1 --- Weak Device with Insufficient Bandwidth --- p.43Chapter 4.4.2 --- Weak Device with Sufficient Bandwidth --- p.43Chapter 4.4.3 --- Strong Device with Insufficient Bandwidth --- p.44Chapter Chapter 5 --- Conclusion --- p.54Bibliography --- p.5

CUHK Digital Repository

Recommended from our members

Adaptive intra refresh for robust wireless multi-view video

Author: Lawan Sagir
Publication venue: Brunel University London
Publication date: 01/01/2016
Field of study

This thesis was submitted for the award of PhD and was awarded by Brunel University LondonMobile wireless communication technology is a fast developing field and every day new mobile communication techniques and means are becoming available. In this thesis multi-view video (MVV) is also refers to as 3D video. Thus, the 3D video signals through wireless communication are shaping telecommunication industry and academia. However, wireless channels are prone to high level of bit and burst errors that largely deteriorate the quality of service (QoS). Noise along the wireless transmission path can introduce distortion or make a compressed bitstream lose vital information. The error caused by noise progressively spread to subsequent frames and among multiple views due to prediction. This error may compel the receiver to pause momentarily and wait for the subsequent INTRA picture to continue decoding. The pausing of video stream affects the user's Quality of Experience (QoE). Thus, an error resilience strategy is needed to protect the compressed bitstream against transmission errors. This thesis focuses on error resilience Adaptive Intra Refresh (AIR) technique. The AIR method is developed to make the compressed 3D video more robust to channel errors. The process involves periodic injection of Intra-coded macroblocks in a cyclic pattern using H.264/AVC standard. The algorithm takes into account individual features in each macroblock and the feedback information sent by the decoder about the channel condition in order to generate an MVV-AIR map. MVV-AIR map generation regulates the order of packets arrival and identifies the motion activities in each macroblock. Based on the level of motion activity contained in each macroblock, the MVV-AIR map classifies frames as high or low motion macroblocks. A proxy MVV-AIR transcoder is used to validate the efficiency of the generated MVV-AIR map. The MVV-AIR transcoding algorithm uses spatial and views downscaling scheme to convert from MVV to single view. Various experimental results indicate that the proposed error resilient MVV-AIR transcoder technique effectively improves the quality of reconstructed 3D video in wireless networks. A comparison of MVV-AIR transcoder algorithm with some traditional error resilience techniques demonstrates that MVV-AIR algorithm performs better in an error prone channel. Results of simulation revealed significant improvements in both objective and subjective qualities. No additional computational complexity emanates from the scheme while the QoS and QoE requirements are still fully met.Tertiary Institution Trust Fund (TETFund) of Nigeri

Brunel University Research Archive

Video transcoding: an overview of various techniques and research issues

Author: I. Ahmad
Xiaohui Wei
Ya-Qin Zhang
Yu Sun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Toward a General Parametric Model for Assessing the Impact of Video Transcoding on Objective Video Quality

Author: Alsrehin Nawaf Omar N.
Publication venue: DigitalCommons@USU
Publication date: 01/05/2016
Field of study

Video transcoding can cause degradation to an original video. Currently, there is no general model that assesses the impact of video transcoding on video quality. Such a model could play a critical role in evaluating the quality of the transcoded video, and thereby optimizing delivery of video to end-users while meeting their expectations. The main contribution of this research is the development and substantiation of a general parametric model, called the Video Transcoding Objective-quality Model (VTOM), that provides an extensible video transcoding service selection mechanism, which takes into account both the format and characteristics of the original video and the desired output, i.e., viewing format with preferred quality of service. VTOM represents a mathematical function that uses a set of media-related parameters for the original video and desired output, including codec, bit rate, frame rate, and frame size to predict the quality of the transcoded video generated from a specific transcoding. VTOM includes four quality sub-models, each describing the impact of each of these parameters on objective video quality, as well as a weighted-product aggregation function that combines these quality sub-models with four additional error sub-models in a single function for assessing the overall video quality. I compared the predicted quality results generated from the VTOM with quality values generated from an existing objective-quality metric. These comparisons yielded results that showed good correlations, with low error values. VTOM helps the researchers and developers of video delivery systems and applications to calculate the degradation that video transcoding can cause on the fly, rather than evaluate it statistically using statistical methods that only consider the desired output. Because VTOM takes into account the quality of the input video, i.e., video format and characteristics, and the desired quality of the output video, it can be used for dynamic video transcoding service selection and composition. A number of quality metrics were examined and used in development of VTOM and its assessment. However, this research discovered that, to date, there are no suitable metrics in the literature for comparing two videos with different frame rates. Therefore, this dissertation defines a new metric, called Frame Rate Metric (FRM) as part of its contributions. FRM can use any frame-based quality metric for comparing frames from both videos. Finally, this research presents and adapts four Quality of Service (QoS)-aware video transcoding service selection algorithms. The experimental results showed that these four algorithms achieved good results in terms of time complexity, success ratio, and user satisfaction rate

DigitalCommons@USU

Metadata-driven multimedia access

Author: Askelof J.
Ebrahimi Touradj
Smith J.R.
Suzuki T.
van Beek P.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/09/2007
Field of study

With the growing ubiquity and mobility of multimedia-enabled devices, universal multimedia access (UMA) is emerging as one of the important components for the next generation of multimedia applications. The basic concept underlying UMA is universal or seamless access to multimedia content, by automatic selection and adaptation of content based on the user's environment. UMA promises an integration of these different perspectives into a new class of content adaptive applications that could allow users to access multimedia content without concern for specific coding formats, terminal capabilities, or network conditions. We discuss methods that support UMA and the tools provided by MPEG-7 to achieve this. We also discuss the inclusion of metadata in JPEG 2000 encoded images. We present these methods in the typical order that they may be used in an actual application. Therefore, we first discuss the (personalized) selection of desired content from all available content, followed by the organization of related variations of a single piece of content. Then, we discuss segmentation and summarization of audio video (AV) content, and finally, transcoding of AV content

Infoscience - École polytechnique fédérale de Lausanne

Efficient HEVC-based video adaptation using transcoding

Author: Pham Van Luong
Publication venue: Ghent University. Faculty of Engineering and Architecture
Publication date: 01/01/2017
Field of study

In a video transmission system, it is important to take into account the great diversity of the network/end-user constraints. On the one hand, video content is typically streamed over a network that is characterized by different bandwidth capacities. In many cases, the bandwidth is insufficient to transfer the video at its original quality. On the other hand, a single video is often played by multiple devices like PCs, laptops, and cell phones. Obviously, a single video would not satisfy their different constraints. These diversities of the network and devices capacity lead to the need for video adaptation techniques, e.g., a reduction of the bit rate or spatial resolution. Video transcoding, which modifies a property of the video without the change of the coding format, has been well-known as an efficient adaptation solution. However, this approach comes along with a high computational complexity, resulting in huge energy consumption in the network and possibly network latency. This presentation provides several optimization strategies for the transcoding process of HEVC (the latest High Efficiency Video Coding standard) video streams. First, the computational complexity of a bit rate transcoder (transrater) is reduced. We proposed several techniques to speed-up the encoder of a transrater, notably a machine-learning-based approach and a novel coding-mode evaluation strategy have been proposed. Moreover, the motion estimation process of the encoder has been optimized with the use of decision theory and the proposed fast search patterns. Second, the issues and challenges of a spatial transcoder have been solved by using machine-learning algorithms. Thanks to their great performance, the proposed techniques are expected to significantly help HEVC gain popularity in a wide range of modern multimedia applications

Ghent University Academic Bibliography