57 research outputs found

    Novel source coding methods for optimising real time video codecs.

    Get PDF
    The quality of the decoded video is affected by errors occurring in the various layers of the protocol stack. In this thesis, disjoint errors occurring in different layers of the protocol stack are investigated with the primary objective of demonstrating the flexibility of the source coding layer. In the first part of the thesis, the errors occurring in the editing layer, due to the coexistence of different video standards in the broadcast market, are addressed. The problems investigated are ‘Field Reversal’ and ‘Mixed Pulldown’. Field Reversal is caused when the interlaced video fields are not shown in the same order as they were captured. This results in a shaky video display, as the fields are not displayed in chronological order. Additionally, Mixed Pulldown occurs when the video frame-rate is up-sampled and down-sampled, when digitised film material is being standardised to suit standard televisions. Novel image processing algorithms are proposed to solve these problems from the source coding layer. In the second part of the thesis, the errors occurring in the transmission layer due to data corruption are addressed. The usage of block level source error-resilient methods over bit level channel coding methods are investigated and improvements are suggested. The secondary objective of the thesis is to optimise the proposed algorithm’s architecture for real-time implementation, since the problems are of a commercial nature. The Field Reversal and Mixed Pulldown algorithms were tested in real time at MTV (Music Television) and are made available commercially through ‘Cerify’, a Linux-based media testing box manufactured by Tektronix Plc. The channel error-resilient algorithms were tested in a laboratory environment using Matlab and performance improvements are obtained

    Signal processing for improved MPEG-based communication systems

    Get PDF

    Perceptually Optimized Visualization on Autostereoscopic 3D Displays

    Get PDF
    The family of displays, which aims to visualize a 3D scene with realistic depth, are known as "3D displays". Due to technical limitations and design decisions, such displays create visible distortions, which are interpreted by the human vision as artefacts. In absence of visual reference (e.g. the original scene is not available for comparison) one can improve the perceived quality of the representations by making the distortions less visible. This thesis proposes a number of signal processing techniques for decreasing the visibility of artefacts on 3D displays. The visual perception of depth is discussed, and the properties (depth cues) of a scene which the brain uses for assessing an image in 3D are identified. Following the physiology of vision, a taxonomy of 3D artefacts is proposed. The taxonomy classifies the artefacts based on their origin and on the way they are interpreted by the human visual system. The principles of operation of the most popular types of 3D displays are explained. Based on the display operation principles, 3D displays are modelled as a signal processing channel. The model is used to explain the process of introducing distortions. It also allows one to identify which optical properties of a display are most relevant to the creation of artefacts. A set of optical properties for dual-view and multiview 3D displays are identified, and a methodology for measuring them is introduced. The measurement methodology allows one to derive the angular visibility and crosstalk of each display element without the need for precision measurement equipment. Based on the measurements, a methodology for creating a quality profile of 3D displays is proposed. The quality profile can be either simulated using the angular brightness function or directly measured from a series of photographs. A comparative study introducing the measurement results on the visual quality and position of the sweet-spots of eleven 3D displays of different types is presented. Knowing the sweet-spot position and the quality profile allows for easy comparison between 3D displays. The shape and size of the passband allows depth and textures of a 3D content to be optimized for a given 3D display. Based on knowledge of 3D artefact visibility and an understanding of distortions introduced by 3D displays, a number of signal processing techniques for artefact mitigation are created. A methodology for creating anti-aliasing filters for 3D displays is proposed. For multiview displays, the methodology is extended towards so-called passband optimization which addresses Moiré, fixed-pattern-noise and ghosting artefacts, which are characteristic for such displays. Additionally, design of tuneable anti-aliasing filters is presented, along with a framework which allows the user to select the so-called 3d sharpness parameter according to his or her preferences. Finally, a set of real-time algorithms for view-point-based optimization are presented. These algorithms require active user-tracking, which is implemented as a combination of face and eye-tracking. Once the observer position is known, the image on a stereoscopic display is optimised for the derived observation angle and distance. For multiview displays, the combination of precise light re-direction and less-precise face-tracking is used for extending the head parallax. For some user-tracking algorithms, implementation details are given, regarding execution of the algorithm on a mobile device or on desktop computer with graphical accelerator

    Energy efficient enabling technologies for semantic video processing on mobile devices

    Get PDF
    Semantic object-based processing will play an increasingly important role in future multimedia systems due to the ubiquity of digital multimedia capture/playback technologies and increasing storage capacity. Although the object based paradigm has many undeniable benefits, numerous technical challenges remain before the applications becomes pervasive, particularly on computational constrained mobile devices. A fundamental issue is the ill-posed problem of semantic object segmentation. Furthermore, on battery powered mobile computing devices, the additional algorithmic complexity of semantic object based processing compared to conventional video processing is highly undesirable both from a real-time operation and battery life perspective. This thesis attempts to tackle these issues by firstly constraining the solution space and focusing on the human face as a primary semantic concept of use to users of mobile devices. A novel face detection algorithm is proposed, which from the outset was designed to be amenable to be offloaded from the host microprocessor to dedicated hardware, thereby providing real-time performance and reducing power consumption. The algorithm uses an Artificial Neural Network (ANN), whose topology and weights are evolved via a genetic algorithm (GA). The computational burden of the ANN evaluation is offloaded to a dedicated hardware accelerator, which is capable of processing any evolved network topology. Efficient arithmetic circuitry, which leverages modified Booth recoding, column compressors and carry save adders, is adopted throughout the design. To tackle the increased computational costs associated with object tracking or object based shape encoding, a novel energy efficient binary motion estimation architecture is proposed. Energy is reduced in the proposed motion estimation architecture by minimising the redundant operations inherent in the binary data. Both architectures are shown to compare favourable with the relevant prior art

    Biomedical Engineering

    Get PDF
    Biomedical engineering is currently relatively wide scientific area which has been constantly bringing innovations with an objective to support and improve all areas of medicine such as therapy, diagnostics and rehabilitation. It holds a strong position also in natural and biological sciences. In the terms of application, biomedical engineering is present at almost all technical universities where some of them are targeted for the research and development in this area. The presented book brings chosen outputs and results of research and development tasks, often supported by important world or European framework programs or grant agencies. The knowledge and findings from the area of biomaterials, bioelectronics, bioinformatics, biomedical devices and tools or computer support in the processes of diagnostics and therapy are defined in a way that they bring both basic information to a reader and also specific outputs with a possible further use in research and development

    In-Soil Measuring of Sugar Beet Yield Using UWB Radar Sensor System

    Get PDF
    Yield mapping is a basic entity of the Precision Farming concept and provides crucial information about the success of cultivation. Several approaches to site-specific yield recording during the sugar beet harvest are known. Most of them are based on the weighing of sugar beets together with soil tare. Another real-time yield mapping approach with the option of plant population counting is based on estimating the mass of individual sugar beets on the basis of their maximal diameter. The main goal of the research was to develop and evaluate a yield recording procedure based on radar technology, which will provide non-invasive in-soil detection and identification of single sugar beets in order to enable the counting of individual sugar beets and determining of the single sugar beet root mass. Further goals were to enhance the radar technology for other applications in the agriculture, as a general goal, and to define applicability restrictions of practical utilisation of the system for the sugar beet and similar crops. The research activities have been divided into laboratory and field experiments. The results of the laboratory experiments have provided valuable information about the measuring system’s behaviour, which enabled the successful field measurements. The used method allowed the identification and detection of 90% to 96% of sugar beets under test in the various field conditions, with correlation coefficients between real sugar beet positions and detected positions of about 99%, and average positioning error from 1,1 to 3,6 cm. The correlation coefficients between single sugar beet root masses and recorded reflected energy amounts were for the majority of tests over 70%, and the best results have been on the level close to 90%. This project was a joint venture of the Institute for Agricultural Engineering from Bonn and the Technical University of Ilmenau.TeilflĂ€chenspezifische Ertragsmessung von ZuckerrĂŒben im Boden mittels UWB Radarsensorsystem Die Ertragskartierung ist ein wesentlicher Bestandteil des Konzeptes „Precision Farming“. Die Erntemasse von Kulturpflanzen ist fĂŒr den Landwirt eine elementare Information ĂŒber den Erfolg pflanzbaulicher Maßnahmen. Es sind mehrere Verfahren zur Ertragsermittlung von ZuckerrĂŒben wĂ€hrend der Ernte mit dem Bezug auf TeilflĂ€chen bekannt. Ein sensorischer Ansatz besteht in der PflanzenzĂ€hlung und Ermittlung der Masse der einzelnen ZuckerrĂŒben ĂŒber den maximalen Durchmesser. Das Hauptziel dieser Forschungsarbeiten war die Entwicklung und Bewertung eines berĂŒhrungslosen Ertragserfassungssystems fĂŒr ZuckerrĂŒben, das teilflĂ€chenbasiert eine ZĂ€hlung und Massebestimmung der EinzelrĂŒben ermöglicht. Die weiteren Ziele bestanden in der Weiterentwicklung der Radartechnologie fĂŒr andere Einsatzgebiete der Landwirtschaft und in der Bestimmung der Anwendbarkeitsgrenzen des Systems fĂŒr ZuckerrĂŒben und Ă€hnliche WurzelfrĂŒchte. Die ForschungsaktivitĂ€ten fanden im Labor und unter Feldbedingungen auf Versuchsparzellen eines typischen ZuckerrĂŒbenstandortes statt. Die Ergebnisse unter Laborbedingungen lieferten wertvolle Informationen, die erfolgreiche Feldmessungen ermöglicht haben. Die angewendete Methode hat in unterschiedlichen Messbedingungen eine 90% bis 96% erfolgreiche ZuckerrĂŒbenidentifikation ermöglicht, mit Korrelationskoeffizienten zwischen tatsĂ€chlichen und detektierten ZuckerrĂŒbenpositionen von um 99% und einem durchschnittlichen Positionierungsfehler von 1,1 bis 3,6 cm. Die Korrelationskoeffizienten zwischen der EinzelrĂŒbenmasse und der gemessenen reflektierten Energiemenge lagen im Bereich von ĂŒber 70% und die besten Ergebnisse erreichten Werte von 90%. Das Projekt wurde in der Zusammenarbeit des Instituts fĂŒr Landtechnik Bonn und des Instituts fĂŒr Kommunikations- und Messtechnik der Technischen UniversitĂ€t Ilmenau durchgefĂŒhrt

    Towards Computational Efficiency of Next Generation Multimedia Systems

    Get PDF
    To address throughput demands of complex applications (like Multimedia), a next-generation system designer needs to co-design and co-optimize the hardware and software layers. Hardware/software knobs must be tuned in synergy to increase the throughput efficiency. This thesis provides such algorithmic and architectural solutions, while considering the new technology challenges (power-cap and memory aging). The goal is to maximize the throughput efficiency, under timing- and hardware-constraints

    Images on the Move

    Get PDF
    In contemporary society, digital images have become increasingly mobile. They are networked, shared on social media, and circulated across small and portable screens. Accordingly, the discourses of spreadability and circulation have come to supersede the focus on production, indexicality, and manipulability, which had dominated early conceptions of digital photography and film. However, the mobility of images is neither technologically nor conceptually limited to the realm of the digital. The edited volume re-examines the historical, aesthetical, and theoretical relevance of image mobility. The contributors provide a materialist account of images on the move - ranging from wired photography to postcards to streaming media
    • 

    corecore