186 research outputs found

    An Analysis of VP8, a new video codec for the web

    Video is an increasingly ubiquitous part of our lives. Fast and efficient video codecs are necessary to satisfy the increasing demand for video on the web and mobile devices. However, open standards and patent grants are paramount to the adoption of video codecs across different platforms and browsers. Google On2 released VP8 in May 2010 to compete with H.264, the current standard of video codecs, complete with source code, specification and a perpetual patent grant. As the amount of video being created every day is growing rapidly, the decision of which codec to encode this video with is paramount; if a low quality codec or a restrictively licensed codec is used, the video recorded might be of little to no use. We sought to study VP8 and its quality versus its resource consumption compared to H.264, the most popular current video codec, so that readers may make an informed decision for themselves or for their organizations about whether to use H.264 or VP8, or something else entirely. We examined VP8 in detail, compared its theoretical complexity to H.264 and measured the efficiency of its current implementation. VP8 shares many facets of its design with H.264 and other Discrete Cosine Transform (DCT) based video codecs. However, VP8 is both simpler and less feature rich than H.264, which may allow for rapid hardware and software implementations. As it was designed for the Internet and newer mobile devices, it contains fewer legacy features, such as interlacing, than H.264 supports. To perform quality measurements, the open source VP8 implementation libvpx was used. This is the reference implementation. For H.264, the open source H.264 encoder x264 was used. This encoder has very high performance, and is often rated at the top of its field in efficiency. The JM reference encoder was used to establish a baseline quality for H.264. Our findings indicate that VP8 performs very well at low bitrates, at resolutions at and below CIF.
VP8 may be able to successfully displace H.264 Baseline in the mobile streaming video domain. It offers higher quality at a lower bitrate for low resolution images due to its high performing entropy coder and non-contiguous macroblock segmentation. At higher resolutions, VP8 still outperforms H.264 Baseline, but H.264 High profile leads. At HD resolution (720p and above), H.264 is significantly better than VP8 due to its superior motion estimation and adaptive coding. There is little significant difference between the intra-coding performance of H.264 and VP8. VP8's in-loop deblocking filter outperforms H.264's version. H.264's inter-coding, with full support for B frames and weighting, outperforms VP8's alternate reference scheme, although this may improve in the future. On average, VP8's feature set is less complex than H.264's equivalents, which, along with its open source implementation, may spur development in the future. These findings indicate that VP8 has strong fundamentals when compared with H.264, but that it lacks optimization and maturity. It will likely improve as engineers optimize VP8's reference implementation, or when a competing implementation is developed. We recommend several areas that the VP8 developers should focus on in the future.

    Acting rehearsal in collaborative multimodal mixed reality environments

    This paper presents the use of our multimodal mixed reality telecommunication system to support remote acting rehearsal. The rehearsals involved two actors, located in London and Barcelona, and a director in another location in London. This triadic audiovisual telecommunication was performed in a spatial and multimodal collaborative mixed reality environment based on the 'destination-visitor' paradigm, which we define and put into use. We detail our heterogeneous system architecture, which spans the three distributed and technologically asymmetric sites, and features a range of capture, display, and transmission technologies. The actors' and director's experiences of rehearsing a scene via the system are then discussed, exploring successes and failures of this heterogeneous form of telecollaboration. Overall, the common spatial frame of reference presented by the system to all parties was highly conducive to theatrical acting and directing, allowing blocking, gross gesture, and unambiguous instruction to be issued. The relative inexpressivity of the actors' embodiments was identified as the central limitation of the telecommunication, meaning that moments relying on performing and reacting to consequential facial expression and subtle gesture were less successful.

    A Methodology for Characterizing Real-Time Multimedia Quality of Service in Limited Bandwidth Network

    This paper presents how to characterize the quality of multimedia, consisting of audio and video, transmitted in real-time communication over the Internet with limited bandwidth. We developed a methodology for characterizing the multimedia Quality-of-Service (QoS) by measuring network parameters (i.e., bandwidth capacity, packet loss rate (PLR), and end-to-end delay) of a testbed network and simulating the audio-video delivery according to the measured network parameters. The analysis of network parameters aimed to describe the network characteristics. Multimedia QoS was characterized by conducting a simulation using data collected from the preceding network characterization. A simulation network model was built using OMNeT++ to represent real-time audio-video delivery, while background traffic was generated to represent real network conditions. Applying the methodology to a network testbed in a rural area of Indonesia, the simulation results showed that audio-video could be delivered with an acceptable level of user satisfaction.

    Videos in Context for Telecommunication and Spatial Browsing

    The research presented in this thesis explores the use of videos embedded in panoramic imagery to transmit spatial and temporal information describing remote environments and their dynamics. Virtual environments (VEs) through which users can explore remote locations are rapidly emerging as a popular medium of presence and remote collaboration. However, capturing visual representations of locations to be used in VEs is usually a tedious process that requires either manual modelling of environments or the employment of specific hardware. Capturing environment dynamics is not straightforward either, and it is usually performed through specific tracking hardware. Similarly, browsing large unstructured video collections with available tools is difficult, as the abundance of spatial and temporal information makes them hard to comprehend. On the spectrum between 3D VEs and 2D images, panoramas lie in between: they offer the accessibility of 2D images while preserving the surrounding representation of 3D virtual environments. For this reason, panoramas are an attractive basis for videoconferencing and browsing tools, as they can relate several videos temporally and spatially. This research explores methods to acquire, fuse, render and stream data coming from heterogeneous cameras, with the help of panoramic imagery. Three distinct but interrelated questions are addressed. First, the thesis considers how spatially localised video can be used to increase the spatial information transmitted during video mediated communication, and whether this improves quality of communication. Second, the research asks whether videos in panoramic context can be used to convey spatial and temporal information of a remote place and the dynamics within, and whether this improves users' performance in tasks that require spatio-temporal thinking. Finally, the thesis considers whether there is an impact of display type on reasoning about events within videos in panoramic context.
These research questions were investigated over three experiments, covering scenarios common to computer-supported cooperative work and video browsing. To support the investigation, two distinct video+context systems were developed. The first telecommunication experiment compared our videos in context interface with fully-panoramic video and conventional webcam video conferencing in an object placement scenario. The second experiment investigated the impact of videos in panoramic context on quality of spatio-temporal thinking during localization tasks. To support the experiment, a novel interface to video collections in panoramic context was developed and compared with common video-browsing tools. The final experimental study investigated the impact of display type on reasoning about events. The study explored three adaptations of our video-collection interface to three display types. The overall conclusion is that videos in panoramic context offer a valid solution to spatio-temporal exploration of remote locations. Our approach presents a richer visual representation in terms of space and time than standard tools, showing that providing panoramic contexts to video collections makes spatio-temporal tasks easier. To this end, videos in context are a suitable alternative to more difficult, and often expensive, solutions. These findings are beneficial to many applications, including teleconferencing, virtual tourism and remote assistance.

    Comparison of compression efficiency between HEVC/H.265 and VP9 based on subjective assessments

    The current increasing effort by broadcast providers to transmit UHD (Ultra High Definition) content is likely to increase demand for ultra high definition televisions (UHDTVs). To compress UHDTV content, several alternative encoding mechanisms exist. In addition to internationally recognized standards, open access proprietary options, such as the VP9 video encoding scheme, have recently appeared and are gaining popularity. One of the main goals of these encoders is to efficiently compress video sequences beyond HDTV resolution for various scenarios, such as broadcasting or internet streaming. In this paper, a broadcast scenario rate-distortion performance analysis and mutual comparison of one of the latest video coding standards, H.265/HEVC, with the recently released proprietary video coding scheme VP9 is presented. H.264/AVC, currently one of the most popular and widespread encoders, has also been included in the evaluation to serve as a comparison baseline. The comparison is performed by means of subjective evaluations showing actual differences between encoding algorithms in terms of perceived quality. The results indicate a dominance of the HEVC based encoding algorithm over the other alternatives when a wide range of bit-rates is considered, from very low to high bit-rates, corresponding to low quality up to transparent quality when compared to the original, uncompressed video. In addition, VP9 shows competitive results for synthetic content and bit-rates that correspond to operating points for transparent or close to transparent quality video.

    Video CODEC with adaptive frame rate control for intelligent transportation system applications

    Video cameras are an important type of device in Intelligent Transportation Systems (ITS). The camera images are practical, widely deployable and beneficial for traffic management and congestion control. The advent of image processing has established several applications based on ITS camera images, including vehicle detection, weather monitoring, smart work zones, etc. Unlike digital video entertainment applications, the camera images in ITS applications require high video image quality but usually not a high video frame rate. Traditional block-based video compression standards, which were developed primarily with the video entertainment industry in mind, are dependent on adaptive rate control algorithms to control the video quality and the video frame rate. Modern rate control algorithms range from simple frame skipping to complicated adaptive algorithms based on optimal rate-distortion theory. In this dissertation, I presented an innovative video frame rate control scheme based on adaptive frame dropping. Video transmission schemes were also discussed and a new strategy to reduce the video traffic on the network was presented. Experimental results in a variety of network scenarios showed that the proposed technique could improve video quality in both the temporal and spatial dimensions, as quantified by standard video metrics (up to 6 percent in PSNR, 5 percent in SSIM, and 10 percent in VQM compared to the original video). Another benefit of the proposed technique is that video traffic and network congestion are generally reduced. Both FPGA and embedded Linux implementations are considered for video encoder development.
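    For readers unfamiliar with the PSNR metric quoted in the abstract above, the following is a minimal sketch of the standard definition, PSNR = 10 log10(peak^2 / MSE). It is not taken from the dissertation: the function name `psnr`, the flat-list pixel representation and the 8-bit peak value of 255 are assumptions made for illustration.

```python
import math


def psnr(reference, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences.

    Illustrative helper (hypothetical name); `peak` assumes 8-bit samples.
    """
    if len(reference) != len(distorted) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel positions.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        # Identical signals: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)
```

    A higher value means the distorted frame is closer to the reference; how the dissertation aggregates PSNR across frames is not stated in the abstract.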

    Energy-aware adaptive solutions for multimedia delivery to wireless devices

    The functionality of smart mobile devices is improving rapidly but these devices are limited in terms of practical use because of battery-life. This situation cannot be remedied by simply installing batteries with higher capacities in the devices. There are strict limitations in the design of a smartphone, in terms of physical space, that prohibit this “quick-fix” from being possible. The solution instead lies with the creation of an intelligent, dynamic mechanism for utilizing the hardware components on a device in an energy-efficient manner, while also maintaining the Quality of Service (QoS) requirements of the applications running on the device. This thesis proposes the following Energy-aware Adaptive Solutions (EASE): 1. BaSe-AMy: the Battery and Stream-aware Adaptive Multimedia Delivery (BaSe-AMy) algorithm assesses battery-life, network characteristics, video-stream properties and device hardware information, in order to dynamically reduce the power consumption of the device while streaming video. The algorithm computes the most efficient strategy for altering the characteristics of the stream, the playback of the video, and the hardware utilization of the device, dynamically, while meeting application’s QoS requirements. 2. PowerHop: an algorithm which assesses network conditions, device power consumption, neighboring node devices and QoS requirements to decide whether to adapt the transmission power or the number of hops that a device uses for communication. PowerHop’s ability to dynamically reduce the transmission power of the device’s Wireless Network Interface Card (WNIC) provides scope for reducing the power consumption of the device. In this case shorter transmission distances with multiple hops can be utilized to maintain network range. 3. A comprehensive survey of adaptive energy optimizations in multimedia-centric wireless devices is also provided. Additional contributions: 1. 
A custom video comparison tool was developed to facilitate objective assessment of streamed videos. 2. A new solution for high-accuracy mobile power logging was designed and implemented

    Enabling energy-awareness for internet video

    Continuous improvements to the state of the art have made it easier to create, send and receive vast quantities of video over the Internet. Catalysed by these developments, video is now the largest and fastest growing type of traffic on modern IP networks. In 2015, video was responsible for 70% of all traffic on the Internet, with a compound annual growth rate of 27%. On the other hand, concerns about the growing energy consumption of ICT in general continue to rise. It is not surprising that there is a significant energy cost associated with these extensive video usage patterns. In this thesis, I examine the energy consumption of typical video configurations during decoding (playback) and encoding through empirical measurements on an experimental test-bed. I then make extrapolations to a global scale to show the opportunity for significant energy savings, achievable by simple modifications to these video configurations. Based on insights gained from these measurements, I propose a novel, energy-aware Quality of Experience (QoE) metric for digital video - the Energy - Video Quality Index (EnVI). Then, I present and evaluate vEQ-benchmark, a benchmarking and measurement tool for the purpose of generating EnVI scores. The tool enables fine-grained resource-usage analyses on video playback systems, and facilitates the creation of statistical models of power usage for these systems. I propose GreenDASH, an energy-aware extension of the existing Dynamic Adaptive Streaming over HTTP standard (DASH). GreenDASH incorporates relevant energy-usage and video quality information into the existing standard. It could enable dynamic, energy-aware adaptation for video in response to energy-usage and user ‘green’ preferences. I also evaluate the subjective perception of such energy-aware, adaptive video streaming by means of a user study featuring 36 participants.
I examine how video may be adapted to save energy without a significant impact on the Quality of Experience of these users. In summary, this thesis highlights the significant opportunities for energy savings if Internet users gain an awareness of their energy usage, and presents a technical discussion of how this can be achieved by straightforward extensions to the current state of the art.

    MECHANISM AND STAGES OF PACKAGING OF VP8, THE MAJOR TEGUMENT PROTEIN OF BOVINE HERPESVIRUS-1

    VP8 (pUL47), the major tegument protein of bovine herpesvirus -1 (BoHV-1), is crucial for viral replication and induction of host immune responses. VP8 (pUL47) translocation from the nucleus to the cytoplasm and subsequently to the Golgi results from its phosphorylation within the nucleus by pUS3. VP8 (pUL47) phosphorylation mutant contains a significantly lower amount of VP8 (pUL47) (~30%) than wild type virus. Outside the context of infection, VP8 (pUL47) is translocated to the cytoplasm if co-transfected with pUS3 encoding plasmid, but remains cytoplasmic and is not translocated to the Golgi. Based on these previous studies, we hypothesized that VP8 (pUL47) is partially packaged in the perinuclear region, and localisation of VP8 at the Golgi for final packaging involves another viral factor, presumably a glycoprotein. Mass spectrometry studies indicated presence of VP8 (pUL47), and another tegument protein, VP22 (pUL49), in the perinuclear and mature virus particles. Co-immunoprecipitation and confocal microscopy confirmed an interaction between VP8 (pUL47) and VP22 (pUL49) and their co-localisation in the perinuclear region, respectively. In cells infected with virus lacking the VP22 (pUL49)-encoding gene, VP8 (pUL47) was absent from the perinuclear space, and the amount of VP8 (pUL47) in the purified mature virus was reduced by approximately 33%. To identify the viral factor(s) responsible for the localisation of cytoplasmic VP8 (pUL47) at the Golgi, a screening of co-precipitating glycoproteins was performed, and glycoprotein M (gM) was observed to be an interaction partner of VP8 (pUL47) during infection, as well as outside the context of infection. VP8 (pUL47) and gM (pUL10) co-localised at the Golgi in infected cells, and gM (pUL10) was sufficient for localisation of VP8 (pUL47) at the Golgi outside the context of infection. 
In a recombinant virus lacking the gene encoding gM (ΔgM- BoHV-1), the localisation of VP8 (pUL47) at the Golgi was impeded, and was restored with the restoration of gM (pUL10). Analysis of purified mature virus from ΔgM- BoHV-1 infected cells indicated a reduction of approximately 65% in the amount of VP8 (pUL47). The results of this research add to the knowledge of the stages and proteins involved in the assembly of the tegument layer of BoHV-1, with a focus on the major tegument protein, VP8 (pUL47).

    CHARACTERISATION AND MOLECULAR TYPING OF CLINICAL AND ENVIRONMENTAL ISOLATES OF VIBRIO PARAHAEMOLYTICUS

    Vibrio parahaemolyticus is a natural inhabitant of coastal waters worldwide and is the leading cause of seafood-borne gastroenteritis. This study reports on the use of several molecular characterisation methods to screen clinical and environmental isolates of V. parahaemolyticus to assess whether such techniques can be used to distinguish pathogenic isolates reliably. In a total of 86 isolates, mainly of V. parahaemolyticus but also including V. cholerae, V. vulnificus and several other species, serotypes of the more virulent clonal group O3:K6 were identified, but otherwise there appeared to be no association between serotype and phenotype. The tdh and trh genes encoding haemolysins that are typically associated with virulent isolates were found in a significantly large number of isolates; however, poor concordance between haemolytic activity and the presence of the gene tdh was found. In an effort to establish more accurate relationships amongst clinical and environmental isolates of V. parahaemolyticus, four molecular typing systems were employed, namely pulsed-field gel electrophoresis (PFGE), intergenic transcribed spacer (ITS) analysis, tDNA intergenic length polymorphisms (tDNA-ILPs) and randomly amplified polymorphic DNA (RAPD). Typing patterns and clustering analysis using these methods differentiated V. parahaemolyticus from other marine species as well as at the subspecies level. PFGE with NotI was shown to be the most discriminative but suffered from not being universally applicable. Both ITS and tDNA-ILP methods were sufficiently discriminatory, with discrimination indices (DI) of between 0.568 and 0.724, depending on the primers employed. The discriminatory ability of RAPD was also affected by the primers used (DI = 0.959 - 0.965) but closely matched that of PFGE (DI = 0.976). Additionally, both RAPD methods were able to distinguish putative markers for the pandemic clonal group.
Typing systems appeared largely stable in duplicate and triplicate analyses with multiple primer pairs, with some obvious variability in the reproduction of faint amplicons. All methods except PFGE were simple to execute, but none of the methods could distinguish V. parahaemolyticus into obvious lineages based on the clinical or environmental source. With the recent implication of a type III secretion system (TTSS) in the pathogenicity of V. parahaemolyticus, a multiplex PCR system using PCR primers that spanned both TTSS1 and TTSS2 regions was developed. Dot-blot analysis confirmed TTSS2 genes in at least 30% of environmental isolates. Nucleotide sequence analysis revealed 100% sequence homology in three loci of TTSS2 putative structural genes. In comparison, a total of 34 single nucleotide polymorphisms (SNPs) were identified in three TTSS1 regions. In two of the regions, the SNPs were synonymous, whereas a non-synonymous substitution in the structural gene vcrD1 resulted in valine replacement with isoleucine. In addition, nucleotide deletions in TTSS1 with resultant frameshift mutations were identified. The finding that significant numbers of environmental isolates also possess TTSS2 genes is contrary to the currently held opinion that TTSS2 is only present in clinical isolates. It is hypothesised that the high incidence of V. parahaemolyticus infections may be related to active TTSS2 genes, whereas the high degree of polymorphism in TTSS1 suggests it may be inactive. The Centre for Environment, Fisheries and Aquaculture Science Weymouth Laboratories, Dorset, United Kingdom
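    The abstract above reports discrimination indices (DI) without giving a formula. A common choice in molecular typing is Simpson's index of diversity in the Hunter-Gaston form, DI = 1 - sum(n_j (n_j - 1)) / (N (N - 1)), where N is the number of isolates and n_j the number of isolates assigned to type j. Whether the study used exactly this form is an assumption; the sketch below only illustrates that formula, and the function name and input format are hypothetical.

```python
from collections import Counter


def discrimination_index(type_assignments):
    """Hunter-Gaston discriminatory index for a list of per-isolate type labels.

    Assumed formula (not confirmed by the abstract):
    DI = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)).
    """
    n_isolates = len(type_assignments)
    if n_isolates < 2:
        raise ValueError("at least two isolates are required")
    # Count how many isolates fall into each type.
    counts = Counter(type_assignments)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n_isolates * (n_isolates - 1))
```

    Under this definition, a DI close to 1 means that two isolates drawn at random are very likely to receive different types, which matches the way the abstract ranks PFGE and RAPD above ITS and tDNA-ILP.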