1,051 research outputs found

    Error resilience and concealment techniques for high-efficiency video coding

    Get PDF
    This thesis investigates the problem of robust coding and error concealment in High Efficiency Video Coding (HEVC). After a review of the current state of the art, a simulation study about error robustness, revealed that the HEVC has weak protection against network losses with significant impact on video quality degradation. Based on this evidence, the first contribution of this work is a new method to reduce the temporal dependencies between motion vectors, by improving the decoded video quality without compromising the compression efficiency. The second contribution of this thesis is a two-stage approach for reducing the mismatch of temporal predictions in case of video streams received with errors or lost data. At the encoding stage, the reference pictures are dynamically distributed based on a constrained Lagrangian rate-distortion optimization to reduce the number of predictions from a single reference. At the streaming stage, a prioritization algorithm, based on spatial dependencies, selects a reduced set of motion vectors to be transmitted, as side information, to reduce mismatched motion predictions at the decoder. The problem of error concealment-aware video coding is also investigated to enhance the overall error robustness. A new approach based on scalable coding and optimally error concealment selection is proposed, where the optimal error concealment modes are found by simulating transmission losses, followed by a saliency-weighted optimisation. Moreover, recovery residual information is encoded using a rate-controlled enhancement layer. Both are transmitted to the decoder to be used in case of data loss. Finally, an adaptive error resilience scheme is proposed to dynamically predict the video stream that achieves the highest decoded quality for a particular loss case. A neural network selects among the various video streams, encoded with different levels of compression efficiency and error protection, based on information from the video signal, the coded stream and the transmission network. Overall, the new robust video coding methods investigated in this thesis yield consistent quality gains in comparison with other existing methods and also the ones implemented in the HEVC reference software. Furthermore, the trade-off between coding efficiency and error robustness is also better in the proposed methods

    Seminario sullo Standard MPEG-4: utilizzo ed aspetti implementativi

    Get PDF
    Una delle tecnologie chiave che hanno permesso il grande sviluppo della televisione digitale è la compressione video. La tecnologia di codifica video nota come MPEG-2, sviluppata nei primi anni novanta, è diventata lo standard di trasmissione DTV (Digital TV) sia satellitare sia terrestre in quasi tutti i paesi del mondo. Da allora la velocità dei microprocessori e le capacità di memoria dei dispositivi hardware per la codifica e la decodifica sono migliorate significativamente rendendo possibile lo sviluppo e l’implementazione di algoritmi di codifica innovativi in grado di abbattere significativamente i limiti di compressione dello standard MPEG-2. Tali innovazioni, sfociate nel 2003 nello standard MPEG-4 AVC (Advanced Video Coding), non hanno permesso di mantenere la compatibilità all’indietro con l’MPEG-2, e questo ha inizialmente costituito un limite alla loro introduzione nei sistemi di trasmissione DTV. Tuttavia, negli ultimi anni la codifica MPEG-4 AVC si è diffusa rapidamente, è stata adottata dal progetto DVB, recentemente dall’ATSC, ed è lo standard di codifica nell’IPTV. L’obiettivo di questo seminario, che si articola in due giornate, è quello di presentare lo standard di codifica MPEG-4 AVC con particolare attenzione agli aspetti implementativi del livello di codifica video.2008-11-18Sardegna Ricerche, Edificio 2, Località Piscinamanna 09010 Pula (CA) - ItaliaSeminario sullo Standard MPEG-4: utilizzo ed aspetti implementativ

    Autonomous Recovery Of Reconfigurable Logic Devices Using Priority Escalation Of Slack

    Get PDF
    Field Programmable Gate Array (FPGA) devices offer a suitable platform for survivable hardware architectures in mission-critical systems. In this dissertation, active dynamic redundancy-based fault-handling techniques are proposed which exploit the dynamic partial reconfiguration capability of SRAM-based FPGAs. Self-adaptation is realized by employing reconfiguration in detection, diagnosis, and recovery phases. To extend these concepts to semiconductor aging and process variation in the deep submicron era, resilient adaptable processing systems are sought to maintain quality and throughput requirements despite the vulnerabilities of the underlying computational devices. A new approach to autonomous fault-handling which addresses these goals is developed using only a uniplex hardware arrangement. It operates by observing a health metric to achieve Fault Demotion using Recon- figurable Slack (FaDReS). Here an autonomous fault isolation scheme is employed which neither requires test vectors nor suspends the computational throughput, but instead observes the value of a health metric based on runtime input. The deterministic flow of the fault isolation scheme guarantees success in a bounded number of reconfigurations of the FPGA fabric. FaDReS is then extended to the Priority Using Resource Escalation (PURE) online redundancy scheme which considers fault-isolation latency and throughput trade-offs under a dynamic spare arrangement. While deep-submicron designs introduce new challenges, use of adaptive techniques are seen to provide several promising avenues for improving resilience. The scheme developed is demonstrated by hardware design of various signal processing circuits and their implementation on a Xilinx Virtex-4 FPGA device. These include a Discrete Cosine Transform (DCT) core, Motion Estimation (ME) engine, Finite Impulse Response (FIR) Filter, Support Vector Machine (SVM), and Advanced Encryption Standard (AES) blocks in addition to MCNC benchmark circuits. A iii significant reduction in power consumption is achieved ranging from 83% for low motion-activity scenes to 12.5% for high motion activity video scenes in a novel ME engine configuration. For a typical benchmark video sequence, PURE is shown to maintain a PSNR baseline near 32dB. The diagnosability, reconfiguration latency, and resource overhead of each approach is analyzed. Compared to previous alternatives, PURE maintains a PSNR within a difference of 4.02dB to 6.67dB from the fault-free baseline by escalating healthy resources to higher-priority signal processing functions. The results indicate the benefits of priority-aware resiliency over conventional redundancy approaches in terms of fault-recovery, power consumption, and resource-area requirements. Together, these provide a broad range of strategies to achieve autonomous recovery of reconfigurable logic devices under a variety of constraints, operating conditions, and optimization criteria

    High-Performance Accurate and Approximate Multipliers for FPGA-Based Hardware Accelerators

    Get PDF
    Multiplication is one of the widely used arithmetic operations in a variety of applications, such as image/video processing and machine learning. FPGA vendors provide high-performance multipliers in the form of DSP blocks. These multipliers are not only limited in number and have fixed locations on FPGAs but can also create additional routing delays and may prove inefficient for smaller bit-width multiplications. Therefore, FPGA vendors additionally provide optimized soft IP cores for multiplication. However, in this work, we advocate that these soft multiplier IP cores for FPGAs still need better designs to provide high-performance and resource efficiency. Toward this, we present generic area-optimized, low-latency accurate, and approximate softcore multiplier architectures, which exploit the underlying architectural features of FPGAs, i.e., lookup table (LUT) structures and fast-carry chains to reduce the overall critical path delay (CPD) and resource utilization of multipliers. Compared to Xilinx multiplier LogiCORE IP, our proposed unsigned and signed accurate architecture provides up to 25% and 53% reduction in LUT utilization, respectively, for different sizes of multipliers. Moreover, with our unsigned approximate multiplier architectures, a reduction of up to 51% in the CPD can be achieved with an insignificant loss in output accuracy when compared with the LogiCORE IP. For illustration, we have deployed the proposed multiplier architecture in accelerators used in image and video applications, and evaluated them for area and performance gains. Our library of accurate and approximate multipliers is opensource and available online at https://cfaed.tu-dresden.de/pd-downloads to fuel further research and development in this area, facilitate reproducible research, and thereby enabling a new research direction for the FPGA community

    Exposure to negative socio-emotional events induces sustained alteration of resting-state brain networks in older adults

    Get PDF
    Basic emotional functions seem well preserved in older adults. However, their reactivity to and recovery from socially negative events remain poorly characterized. To address this, we designed a ‘task–rest’ paradigm in which 182 participants from two independent experiments underwent functional magnetic resonance imaging while exposed to socio-emotional videos. Experiment 1 (N = 55) validated the task in young and older participants and unveiled age-dependent effects on brain activity and connectivity that predominated in resting periods after (rather than during) negative social scenes. Crucially, emotional elicitation potentiated subsequent resting-state connectivity between default mode network and amygdala exclusively in older adults. Experiment 2 replicated these results in a large older adult cohort (N = 127) and additionally showed that emotion-driven changes in posterior default mode network–amygdala connectivity were associated with anxiety, rumination and negative thoughts. These findings uncover the neural dynamics of empathy-related functions in older adults and help understand its relationship to poor social stress recovery

    Selected topics on distributed video coding

    Get PDF
    Distributed Video Coding (DVC) is a new paradigm for video compression based on the information theoretical results of Slepian and Wolf (SW), and Wyner and Ziv (WZ). While conventional coding has a rigid complexity allocation as most of the complex tasks are performed at the encoder side, DVC enables a flexible complexity allocation between the encoder and the decoder. The most novel and interesting case is low complexity encoding and complex decoding, which is the opposite of conventional coding. While the latter is suitable for applications where the cost of the decoder is more critical than the encoder's one, DVC opens the door for a new range of applications where low complexity encoding is required and the decoder's complexity is not critical. This is interesting with the deployment of small and battery-powered multimedia mobile devices all around in our daily life. Further, since DVC operates as a reversed-complexity scheme when compared to conventional coding, DVC also enables the interesting scenario of low complexity encoding and decoding between two ends by transcoding between DVC and conventional coding. More specifically, low complexity encoding is possible by DVC at one end. Then, the resulting stream is decoded and conventionally re-encoded to enable low complexity decoding at the other end. Multiview video is attractive for a wide range of applications such as free viewpoint television, which is a system that allows viewing the scene from a viewpoint chosen by the viewer. Moreover, multiview can be beneficial for monitoring purposes in video surveillance. The increased use of multiview video systems is mainly due to the improvements in video technology and the reduced cost of cameras. While a multiview conventional codec will try to exploit the correlation among the different cameras at the encoder side, DVC allows for separate encoding of correlated video sources. Therefore, DVC requires no communication between the cameras in a multiview scenario. This is an advantage since communication is time consuming (i.e. more delay) and requires complex networking. Another appealing feature of DVC is the fact that it is based on a statistical framework. Moreover, DVC behaves as a natural joint source-channel coding solution. This results in an improved error resilience performance when compared to conventional coding. Further, DVC-based scalable codecs do not require a deterministic knowledge of the lower layers. In other words, the enhancement layers are completely independent from the base layer codec. This is called the codec-independent scalability feature, which offers a high flexibility in the way the various layers are distributed in a network. This thesis addresses the following topics: First, the theoretical foundations of DVC as well as the practical DVC scheme used in this research are presented. The potential applications for DVC are also outlined. DVC-based schemes use conventional coding to compress parts of the data, while the rest is compressed in a distributed fashion. Thus, different conventional codecs are studied in this research as they are compared in terms of compression efficiency for a rich set of sequences. This includes fine tuning the compression parameters such that the best performance is achieved for each codec. Further, DVC tools for improved Side Information (SI) and Error Concealment (EC) are introduced for monoview DVC using a partially decoded frame. The improved SI results in a significant gain in reconstruction quality for video with high activity and motion. This is done by re-estimating the erroneous motion vectors using the partially decoded frame to improve the SI quality. The latter is then used to enhance the reconstruction of the finally decoded frame. Further, the introduced spatio-temporal EC improves the quality of decoded video in the case of erroneously received packets, outperforming both spatial and temporal EC. Moreover, it also outperforms error-concealed conventional coding in different modes. Then, multiview DVC is studied in terms of SI generation, which differentiates it from the monoview case. More specifically, different multiview prediction techniques for SI generation are described and compared in terms of prediction quality, complexity and compression efficiency. Further, a technique for iterative multiview SI is introduced, where the final SI is used in an enhanced reconstruction process. The iterative SI outperforms the other SI generation techniques, especially for high motion video content. Finally, fusion techniques of temporal and inter-view side informations are introduced as well, which improves the performance of multiview DVC over monoview coding. DVC is also used to enable scalability for image and video coding. Since DVC is based on a statistical framework, the base and enhancement layers are completely independent, which is an interesting property called codec-independent scalability. Moreover, the introduced DVC scalable schemes show a good robustness to errors as the quality of decoded video steadily decreases with error rate increase. On the other hand, conventional coding exhibits a cliff effect as the performance drops dramatically after a certain error rate value. Further, the issue of privacy protection is addressed for DVC by transform domain scrambling, which is used to alter regions of interest in video such that the scene is still understood and privacy is preserved as well. The proposed scrambling techniques are shown to provide a good level of security without impairing the performance of the DVC scheme when compared to the one without scrambling. This is particularly attractive for video surveillance scenarios, which is one of the most promising applications for DVC. Finally, a practical DVC demonstrator built during this research is described, where the main requirements as well as the observed limitations are presented. Furthermore, it is defined in a setup being as close as possible to a complete real application scenario. This shows that it is actually possible to implement a complete end-to-end practical DVC system relying only on realistic assumptions. Even though DVC is inferior in terms of compression efficiency to the state of the art conventional coding for the moment, strengths of DVC reside in its good error resilience properties and the codec-independent scalability feature. Therefore, DVC offers promising possibilities for video compression with transmission over error-prone environments requirement as it significantly outperforms conventional coding in this case

    Physiological underpinnings of healthy brain ageing

    Get PDF
    Changes in cerebral perfusion or metabolism can occur as a result of healthy ageing, and in conditions of impaired ageing such as mild cognitive impairment (MCI) or Alzheimer’s disease (AD). Overarchingly, this thesis aimed to explore physiological magnetic resonance imaging (MRI) measures to study both cerebral perfusion and metabolism in the healthy ageing brain. Specifically, arterial spin labelling (ASL) and functional magnetic resonance spectroscopy (1H-fMRS) were employed in the elucidation of healthy ageing. Investigation of cerebral functionality is clinically important, enabling understanding of healthy ageing and disease pathology beyond that provided by structural measures. Given the necessity for tightly-regulated tissue perfusion in the delivery of oxygen to the brain, assessment of brain perfusion can enable elucidation of related brain health. Firstly, this thesis focused on changes in brain perfusion within a cross-sectional retrospective cohort of healthy subjects. This study aimed to assess the utility of univariate and multivariate pattern analysis (MVPA) techniques, and determine whether spatial coefficient of variation (sCoV) measures – which provide a method for inferring spatial heterogeneity of blood flow from single post-label delay (PLD) ASL data – are more significantly associated with age than standard perfusion metrics (ml/100g/min values). The impact of data processing steps on quantification of perfusion was initially assessed. Particularly, the influence of partial volume effect (PVE) correction and how this affected quantification of cerebral perfusion was of interest. The relationship between measures of cerebral perfusion – in regions of interest, vascular territories, and grey matter – and age were assessed, before grey matter (GM) spatial covariance patterns were identified, with MVPA hypothesised to elucidate more subtle age-related change than univariate, voxel-wise methodology. The executive control network (ECN) was the only network exhibiting a significant decline in perfusion with age, after controlling for relevant covariates. Interestingly, whilst the PCA approach resulted in a pattern of both positive and negative associations with age across cerebral GM, the surviving clusters in voxel-wise approaches were deemed spurious. Five-fold cross validation of PCA findings was used to assess whether the resultant spatial covariance patterns were able to predict subject age. This prediction was successful, with related r2 values of between 0.5316 and 0.7297 (p < 0.001 for all), however validation of these findings in an unseen dataset is required. The utility of the sCoV metric was also compared with standard tissue perfusion values, finding that sCoV may be more closely associated with ageing than ml/100g/min in certain regions. Particularly, a significant increase in whole GM sCoV with age was notable, given the absence of significant changes in perfusion with age in the same region. Additionally, a MVPA approach was used to establish the complex unknown relationship between cerebral perfusion and the Montreal Cognitive Assessment (MoCA), before graph visualisation was used to further understand the regional relatedness of the spatial covariance pattern. PCA resulted in a model which provided a moderate explanation of the aforementioned relationship, but this may be improved by inclusion of additional covariates in subsequent work, such as those pertaining to genetic status, such as apolipoprotein E (APOE). This study also replicates an FDG PET cognitive resilience signature in an ASL cohort for the first time, with a trend towards declining perfusion with age found (p = .08). Lastly, as ageing is associated with metabolic failure in the brain, which is often investigated using methodology which employs ionising radiation, the final study was motivated to investigate possible metabolic markers of brain ageing which can be measured using MRI. Metabolic-functional coupling can be studied using functional stimulation, and functional magnetic resonance spectroscopy (fMRS) is perfectly poised to elucidate certain metabolic behaviour. Given the close relationship between glucose (Glc) – the key fuel for cerebral functionality – and lactate (Lac) metabolism, an optimised long echo time (TE) semi-localized by adiabatic selective refocusing (semi-LASER) sequence (TE=144ms) with optimised J-modulation selection at 7T was employed to assess the effects of age on the dynamic behaviour of Lac, and determine its absolute concentrations throughout the time course, whilst a visual stimulation paradigm was viewed. Successful quantification of metabolite concentrations – including Lac, tCr and tNAA – was achieved in both the young and old cohorts, and their Lac peaks clearly visually identifiable throughout the time course. A significant increase in Lac concentration was observed between rest and stimulation, but not stimulation and recovery, in the young cohort. No significant Lac time course changes were identified in the full old cohort. This thesis concluded by summarising and contextualising the key findings herein, and discussion of possible directions for further associated research. The findings of this thesis broaden the field of knowledge around healthy ageing, and therefore may contribute to subsequent translation efforts for both clinical diagnostics and treatment approaches
    • …
    corecore