24 research outputs found
Perceptually Optimized Visualization on Autostereoscopic 3D Displays
The family of displays, which aims to visualize a 3D scene with realistic depth, are known as "3D displays". Due to technical limitations and design decisions, such displays create visible distortions, which are interpreted by the human vision as artefacts. In absence of visual reference (e.g. the original scene is not available for comparison) one can improve the perceived quality of the representations by making the distortions less visible. This thesis proposes a number of signal processing techniques for decreasing the visibility of artefacts on 3D displays.
The visual perception of depth is discussed, and the properties (depth cues) of a scene which the brain uses for assessing an image in 3D are identified. Following the physiology of vision, a taxonomy of 3D artefacts is proposed. The taxonomy classifies the artefacts based on their origin and on the way they are interpreted by the human visual system.
The principles of operation of the most popular types of 3D displays are explained. Based on the display operation principles, 3D displays are modelled as a signal processing channel. The model is used to explain the process of introducing distortions. It also allows one to identify which optical properties of a display are most relevant to the creation of artefacts. A set of optical properties for dual-view and multiview 3D displays are identified, and a methodology for measuring them is introduced. The measurement methodology allows one to derive the angular visibility and crosstalk of each display element without the need for precision measurement equipment. Based on the measurements, a methodology for creating a quality profile of 3D displays is proposed. The quality profile can be either simulated using the angular brightness function or directly measured from a series of photographs. A comparative study introducing the measurement results on the visual quality and position of the sweet-spots of eleven 3D displays of different types is presented. Knowing the sweet-spot position and the quality profile allows for easy comparison between 3D displays. The shape and size of the passband allows depth and textures of a 3D content to be optimized for a given 3D display.
Based on knowledge of 3D artefact visibility and an understanding of distortions introduced by 3D displays, a number of signal processing techniques for artefact mitigation are created. A methodology for creating anti-aliasing filters for 3D displays is proposed. For multiview displays, the methodology is extended towards so-called passband optimization which addresses Moiré, fixed-pattern-noise and ghosting artefacts, which are characteristic for such displays. Additionally, design of tuneable anti-aliasing filters is presented, along with a framework which allows the user to select the so-called 3d sharpness parameter according to his or her preferences. Finally, a set of real-time algorithms for view-point-based optimization are presented. These algorithms require active user-tracking, which is implemented as a combination of face and eye-tracking. Once the observer position is known, the image on a stereoscopic display is optimised for the derived observation angle and distance. For multiview displays, the combination of precise light re-direction and less-precise face-tracking is used for extending the head parallax. For some user-tracking algorithms, implementation details are given, regarding execution of the algorithm on a mobile device or on desktop computer with graphical accelerator
3D-DCT based perceptual quality assessment of stereo video
ABSTRACT In this paper, we present a novel stereoscopic video quality assessment method based on 3D-DCT transform. In our approach, similar blocks from left and right views of stereoscopic video frames are found by block-matching, grouped into 3D stack and then analyzed by 3D-DCT. Comparison between reference and distorted images are made in terms of MSE calculated within the 3D-DCT domain and modified to reflect the contrast sensitive function and luminance masking. We validate our quality assessment method using test videos annotated with results from subjective tests. The results show that the proposed algorithm outperforms current popular metrics over a wide range of distortion levels
Adapting Learned Image Codecs to Screen Content via Adjustable Transformations
As learned image codecs (LICs) become more prevalent, their low coding
efficiency for out-of-distribution data becomes a bottleneck for some
applications. To improve the performance of LICs for screen content (SC) images
without breaking backwards compatibility, we propose to introduce parameterized
and invertible linear transformations into the coding pipeline without changing
the underlying baseline codec's operation flow. We design two neural networks
to act as prefilters and postfilters in our setup to increase the coding
efficiency and help with the recovery from coding artifacts. Our end-to-end
trained solution achieves up to 10% bitrate savings on SC compression compared
to the baseline LICs while introducing only 1% extra parameters.Comment: 7 pages, 6 figures, 2 table
Laparoscopic or conventional abdominoperineal extirpation in low rectal cancer
INTRODUCTION: Laparoscopic abdominoperineal resection (LAPR) as a minimally invasive approach for the treatment of large rectal cancer is widely used. It has been proven to be technically feasible and safe with fewer complications and faster postoperative recovery than the open procedure. Our aim was to evaluate LAPR safety and feasibility as compared to the open procedure in large low rectal cancer.PATIENTS AND METHODS: A total of 34 low rectal cancer patients who underwent open APR (OAPR) were matched with 42 patients who underwent LAPR in a one-to-one fashion between 2011 and 2014 in the Division of General Surgery, Kaspela University Hospital of Plovdiv.RESULTS: Intraoperative parameters of LAPR were better than those of OAPR as followed: mean operation time (121.8±47.8 min versus 152.1±49.2 min), mean operative blood loss (82±30.0 mL versus 120±35.0 mL), mean total number of retrieved lymph nodes (12±1 versus 12±1.4), and percentage of surgical complications (12.3% versus 15.1%). Laparoscopically treated patients showed significantly shorter postoperative analgesia (2.1±0.7 days versus 3.7±0.6 days), earlier first flatus (36.3±7.9 hours versus 48.5±9.2 hours), shorter urinary drainage (3.8±3.4 days versus 5.8±1.3 days), and shorter hospital stay (6.2±1 days versus 8±2.0 days). Local recurrence rate during a three-year period (in 3 versus 4 patients) and metachronous liver metastasis (in 5 versus 6 patients) were less common after LAPR than after OAPR.CONCLUSION: The risks of APR-specific surgical complications such as perineal wound infection and parastomal hernia were comparable between the laparoscopic and open surgery groups. There were no significant differences regarding local recurrence and metachronous liver metastasis between these groups. Complication and locoregional recurrence rates in low large rectal cancer patients after laparoscopic and open were quite similar. Scr Sci Med 2017; 49(3): 22-2
Perceptually Optimized Visualization on Autostereoscopic 3D Displays
The family of displays, which aims to visualize a 3D scene with realistic depth, are known as "3D displays". Due to technical limitations and design decisions, such displays create visible distortions, which are interpreted by the human vision as artefacts. In absence of visual reference (e.g. the original scene is not available for comparison) one can improve the perceived quality of the representations by making the distortions less visible. This thesis proposes a number of signal processing techniques for decreasing the visibility of artefacts on 3D displays.
The visual perception of depth is discussed, and the properties (depth cues) of a scene which the brain uses for assessing an image in 3D are identified. Following the physiology of vision, a taxonomy of 3D artefacts is proposed. The taxonomy classifies the artefacts based on their origin and on the way they are interpreted by the human visual system.
The principles of operation of the most popular types of 3D displays are explained. Based on the display operation principles, 3D displays are modelled as a signal processing channel. The model is used to explain the process of introducing distortions. It also allows one to identify which optical properties of a display are most relevant to the creation of artefacts. A set of optical properties for dual-view and multiview 3D displays are identified, and a methodology for measuring them is introduced. The measurement methodology allows one to derive the angular visibility and crosstalk of each display element without the need for precision measurement equipment. Based on the measurements, a methodology for creating a quality profile of 3D displays is proposed. The quality profile can be either simulated using the angular brightness function or directly measured from a series of photographs. A comparative study introducing the measurement results on the visual quality and position of the sweet-spots of eleven 3D displays of different types is presented. Knowing the sweet-spot position and the quality profile allows for easy comparison between 3D displays. The shape and size of the passband allows depth and textures of a 3D content to be optimized for a given 3D display.
Based on knowledge of 3D artefact visibility and an understanding of distortions introduced by 3D displays, a number of signal processing techniques for artefact mitigation are created. A methodology for creating anti-aliasing filters for 3D displays is proposed. For multiview displays, the methodology is extended towards so-called passband optimization which addresses Moiré, fixed-pattern-noise and ghosting artefacts, which are characteristic for such displays. Additionally, design of tuneable anti-aliasing filters is presented, along with a framework which allows the user to select the so-called 3d sharpness parameter according to his or her preferences. Finally, a set of real-time algorithms for view-point-based optimization are presented. These algorithms require active user-tracking, which is implemented as a combination of face and eye-tracking. Once the observer position is known, the image on a stereoscopic display is optimised for the derived observation angle and distance. For multiview displays, the combination of precise light re-direction and less-precise face-tracking is used for extending the head parallax. For some user-tracking algorithms, implementation details are given, regarding execution of the algorithm on a mobile device or on desktop computer with graphical accelerator
Design of Tuneable Anti-Aliasing Filters for Multiview Displays
acceptedVersionPeer reviewe
Measuring and modeling per-element angular visibility in multi-view displays
acceptedVersionPeer reviewe
Visual-quality evaluation methodology for multiview displays
acceptedVersionPeer reviewe