Search CORE

13 research outputs found

Fully Deep Blind Image Quality Predictor

Author: Jongyoo Kim
Sanghoon Lee
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Implementation of an Omnidirectional Human Motion Capture System Using Multiple Kinect Sensors

Author: Inwoong LEE
Jongyoo KIM
Junghwan KIM
Sanghoon LEE
Publication venue: 'Institute of Electronics, Information and Communications Engineers (IEICE)'
Publication date: 01/01/2015
Field of study

Crossref

Blind Sharpness Prediction for Ultrahigh-Definition Video Based on Human Visual Resolution

Author: Haksub Kim
Jongyoo Kim
Sanghoon Lee
Taegeun Oh
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Quality Assessment of Perceptual Crosstalk on Two-View Auto-Stereoscopic Displays

Author: Alan Conrad Bovik
Jongyoo Kim
Sanghoon Lee
Taewan Kim
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Perceptual Crosstalk Prediction on Autostereoscopic 3D Display

Author: Jongyoo Kim
Sanghoon Lee
SeongYong Kim
Sungho Cho
Taewan Kim
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Video frame synthesis via plug-and-play deep locally temporal embedding

Author: Kim Jongyoo
Kim Woojae
Lee Sanghoon
Lin Weisi
Nguyen Anh-Duc
Publication venue
Publication date: 01/01/2019
Field of study

We propose a generative framework that tackles video frame interpolation. Conventionally, optical flow methods can solve the problem, but the perceptual quality depends on the accuracy of flow estimation. Nevertheless, a merit of traditional methods is that they have a remarkable generalization ability. Recently, deep convolutional neural networks (CNNs) have achieved good performance at the price of computation. However, to deploy a CNN, it is necessary to train it with a large-scale dataset beforehand, not to mention the process of fine tuning and adaptation afterwards. Also, despite the sharp motion results, their perceptual quality does not correlate well with their pixel-to-pixel difference metric performance due to various artifacts created by erroneous warping. In this paper, we take the advantages of both conventional and deep-learning models, and tackle the problem from a different perspective. The framework, which we call deep locally temporal embedding (DeepLTE), is powered by a deep CNN and can be used instantly like conventional models. DeepLTE fits an auto-encoding CNN to several consecutive frames and embeds some constraints on the latent representations so that new frames can be generated by interpolating new latent codes. Unlike the current deep learning paradigm which requires training on large datasets, DeepLTE works in a plug-and-play and unsupervised manner, and is able to generate an arbitrary number of frames from multiple given consecutive frames. We demonstrate that, without bells and whistles, DeepLTE outperforms existing state-of-the-art models in terms of the perceptual quality.Published versio

DR-NTU (Digital Repository of NTU)

Enhancement of Visual Comfort and Sense of Presence on Stereoscopic 3D Images

Author: Alan Conrad Bovik
Heeseok Oh
Jinwoo Kim
Jongyoo Kim
Sanghoon Lee
Taewan Kim
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Deep visual saliency on stereoscopic images

Author: Kim Haksub
Kim Jongyoo
Lee Sanghoon
Lin Weisi
Nguyen Anh-Duc
Oh Heeseok
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

Visual saliency on stereoscopic 3D (S3D) images has been shown to be heavily influenced by image quality. Hence, this dependency is an important factor in image quality prediction, image restoration and discomfort reduction, but it is still very difficult to predict such a nonlinear relation in images. In addition, most algorithms specialized in detecting visual saliency on pristine images may unsurprisingly fail when facing distorted images. In this paper, we investigate a deep learning scheme named Deep Visual Saliency (DeepVS) to achieve a more accurate and reliable saliency predictor even in the presence of distortions. Since visual saliency is influenced by low-level features (contrast, luminance, and depth information) from a psychophysical point of view, we propose seven low-level features derived from S3D image pairs and utilize them in the context of deep learning to detect visual attention adaptively to human perception. During analysis, it turns out that the low-level features play a role to extract distortion and saliency information. To construct saliency predictors, we weight and model the human visual saliency through two different network architectures, a regression and a fully convolutional neural networks. Our results from thorough experiments confirm that the predicted saliency maps are up to 70% correlated with human gaze patterns, which emphasize the need for the hand-crafted features as input to deep neural networks in S3D saliency detection

DR-NTU (Digital Repository of NTU)

Deep Visual Saliency on Stereoscopic Images

Author: Anh-Duc Nguyen
Haksub Kim
Heeseok Oh
Jongyoo Kim
Sanghoon Lee
Weisi Lin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref