New visual coding exploration in MPEG: Super-MultiView and free navigation in free viewpoint TV
ISO/IEC MPEG and ITU-T VCEG have recently jointly issued a new multiview video compression standard, called 3D-HEVC, which reaches unprecedented compression performance for linear, dense camera arrangements. To support future high-quality auto-stereoscopic 3D displays and Free Navigation virtual/augmented reality applications with sparse, arbitrarily arranged camera setups, innovative depth estimation and virtual view synthesis techniques with global optimizations over all camera views should be developed. Preliminary studies in response to the MPEG-FTV (Free viewpoint TV) Call for Evidence suggest these targets are within reach, with at least 6% bitrate gains over 3D-HEVC technology.
A bag of words description scheme for image quality assessment
Every day millions of images are obtained, processed, compressed, saved, transmitted and reproduced. All these operations can cause distortions that affect their quality. The quality of these images can be measured subjectively, but this has the disadvantage of requiring a considerable number of tests with individuals in order to produce a statistical analysis of an image's perceptual quality. Several objective metrics have been developed that try to model the human perception of quality. However, in most applications the representation of human quality perception given by these metrics is far from the desired one. Therefore, this work proposes the use of machine learning models that allow for a better approximation.
In this work, definitions for image and quality are given and some of the difficulties of the study of image quality are discussed. Moreover, three metrics are initially explained: one uses the image's original quality as a reference (SSIM), while the other two are no-reference metrics (BRISQUE and QAC). A comparison between them shows a large discrepancy of values between the two kinds of metrics.
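The full-reference metric mentioned above, SSIM, compares luminance, contrast and structure between a distorted image and its pristine reference. As an illustrative sketch (not the dissertation's implementation), the single-window form of the SSIM formula can be written in a few lines; note that the published metric averages this quantity over local sliding Gaussian windows rather than computing it once over the whole image.

```python
import numpy as np

def global_ssim(x, y, data_range=255.0):
    """Single-window SSIM between two grayscale images.

    Simplification: the published metric averages this quantity over
    local sliding windows; here it is computed once globally.
    """
    x = x.astype(np.float64)
    y = y.astype(np.float64)
    c1 = (0.01 * data_range) ** 2  # stabilising constants from the SSIM paper
    c2 = (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    return ((2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)) / \
           ((mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2))
```

Identical images score 1.0, and any distortion pulls the score below 1, which is the behaviour a full-reference metric needs.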
The database used for the tests is TID2013, chosen for its size and for the large number of distortions it considers. A study of each type of distortion in this database is presented.
Furthermore, some concepts of machine learning are introduced along with algorithms relevant in the context of this dissertation, notably K-means, KNN and SVM. Descriptor aggregation algorithms such as "bag of words" and "Fisher vectors" are also discussed.
This dissertation studies a new model that combines machine learning with a quality metric for quality estimation. The model is based on dividing images into cells, in each of which a specific metric is computed. This division makes it possible to obtain local quality descriptors that are then aggregated using "bag of words". An SVM with an RBF kernel is trained and tested on the same database and the results of the model are evaluated using cross-validation.
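The cell-based aggregation described above can be sketched as follows: compute a small descriptor per cell, assign each descriptor to its nearest codebook word, and build a normalised word histogram that becomes the fixed-length feature for the SVM. This is a minimal numpy sketch, not the dissertation's code: the per-cell descriptor (mean and standard deviation) stands in for the actual local quality score, the codebook is given explicitly rather than learned with K-means, and the SVM step is omitted.

```python
import numpy as np

def cell_descriptors(image, cell=8):
    """Split a grayscale image into cell x cell blocks and compute a
    tiny per-cell descriptor (mean, std) standing in for a local
    quality score."""
    h, w = image.shape
    descs = []
    for i in range(0, h - cell + 1, cell):
        for j in range(0, w - cell + 1, cell):
            block = image[i:i + cell, j:j + cell].astype(np.float64)
            descs.append([block.mean(), block.std()])
    return np.array(descs)

def bow_histogram(descs, codebook):
    """Assign each descriptor to the nearest codebook word and return
    an L1-normalised word histogram, i.e. the fixed-length "bag of
    words" feature an SVM would consume."""
    d = np.linalg.norm(descs[:, None, :] - codebook[None, :, :], axis=2)
    words = d.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(np.float64)
    return hist / hist.sum()
```

The histogram length is fixed by the codebook size, so images of any dimension map to the same feature space, which is what allows a single SVM to be trained across the whole database.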
The results are analysed using the Pearson, Spearman and Kendall correlations and the RMSE to evaluate how well the model matches the subjective results. The model improves on the results of the underlying metric and shows a new path for applying machine
learning for quality evaluation.
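The four evaluation criteria used above (Pearson, Spearman and Kendall correlations plus the RMSE) are standard and can be sketched directly; this is an illustrative numpy-only implementation (the Spearman variant assumes no tied scores, and the Kendall variant is the simple O(n²) tau-a), not the dissertation's code.

```python
import numpy as np

def pearson(a, b):
    """Linear correlation between predicted and subjective scores."""
    a, b = np.asarray(a, float), np.asarray(b, float)
    a, b = a - a.mean(), b - b.mean()
    return (a * b).sum() / np.sqrt((a * a).sum() * (b * b).sum())

def spearman(a, b):
    """Rank correlation: Pearson on ranks (assumes no ties)."""
    rank = lambda v: np.argsort(np.argsort(v)).astype(float)
    return pearson(rank(a), rank(b))

def kendall(a, b):
    """Tau-a: fraction of concordant minus discordant pairs, O(n^2)."""
    a, b = np.asarray(a, float), np.asarray(b, float)
    n = len(a)
    s = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            s += np.sign(a[i] - a[j]) * np.sign(b[i] - b[j])
    return 2.0 * s / (n * (n - 1))

def rmse(a, b):
    """Root-mean-square error against the subjective scores."""
    a, b = np.asarray(a, float), np.asarray(b, float)
    return np.sqrt(((a - b) ** 2).mean())
```

The rank-based measures reward correct ordering of image qualities even when the metric's scale is nonlinear, while Pearson and RMSE additionally penalise miscalibration, which is why quality studies typically report all four.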
Exploiting Digital Surface Models for Inferring Super-Resolution for Remotely Sensed Images
Despite the plethora of successful Super-Resolution Reconstruction (SRR) models applied to natural images, their application to remote sensing imagery tends to produce poor results. Remote sensing imagery is often more complicated than natural images and has its own peculiarities: it is of lower resolution, it contains noise, and it often depicts large textured surfaces. As a result, applying non-specialized SRR models to remote sensing imagery results in artifacts and poor reconstructions. To address these problems, this paper proposes an architecture inspired by previous research work, introducing a novel approach for forcing an SRR model to output realistic remote sensing images: instead of relying on feature-space similarities as a perceptual loss, the model considers pixel-level information inferred from the normalized Digital Surface Model (nDSM) of the image. This strategy allows better-informed updates during training, sourced from a task (elevation map inference) that is closely related to remote sensing. Nonetheless, the nDSM auxiliary information is not required in production, and thus the model infers a super-resolution image without any additional data besides its low-resolution input. We assess our model on two remotely sensed datasets of different spatial resolutions that also contain the DSM pairs of the images: the DFC2018 dataset and the dataset containing the national lidar fly-by of Luxembourg. Based on visual inspection, the inferred super-resolution images exhibit markedly superior quality. In particular, the results for the high-resolution DFC2018 dataset are realistic and almost indistinguishable from the ground truth images.
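The training objective described above mixes a pixel reconstruction term with an auxiliary elevation-inference term that is dropped at inference time. A minimal sketch, under the assumption that both terms are mean-squared errors mixed with a weight lambda; the function name, the MSE choice, and the weighting are illustrative and not taken from the paper.

```python
import numpy as np

def combined_srr_loss(sr_pred, hr_target, ndsm_pred, ndsm_target, lam=0.1):
    """Training loss mixing a pixel reconstruction term with an
    auxiliary elevation (nDSM) inference term. The nDSM branch is only
    needed during training; at inference the model consumes the
    low-resolution image alone."""
    pixel_loss = np.mean((sr_pred - hr_target) ** 2)
    ndsm_loss = np.mean((ndsm_pred - ndsm_target) ** 2)
    return pixel_loss + lam * ndsm_loss
```

Because the auxiliary term only shapes the gradients during training, the deployed network has the same inputs and outputs as a plain SRR model, which is the practical appeal of this design.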
A Review of Predictive Quality of Experience Management in Video Streaming Services
Satisfying the requirements of devices and users of online video streaming services is a challenging task. It requires not only managing the network quality of service but also exerting real-time control to address the user's quality of experience (QoE) expectations. QoE management is an end-to-end process that, due to the ever-increasing variety of video services, has become too complex for conventional "reactive" techniques. Herein, we review the most significant "predictive" QoE management methods for video streaming services, showing how different machine learning approaches may be used to perform proactive control. We pinpoint a selection of the best-suited machine learning methods, highlighting their advantages and limitations in specific service conditions. The review leads to lessons learned and guidelines to better address QoE requirements in complex video services.
A Virtual Reality Application of the Rubber Hand Illusion Induced by Ultrasonic Mid-Air Haptic Stimulation
Ultrasonic mid-air haptic technologies, which provide haptic feedback through airwaves produced using ultrasound, could be employed to investigate the sense of body ownership and immersion in virtual reality (VR) by inducing the virtual hand illusion (VHI). Ultrasonic mid-air haptic perception has so far been investigated only for glabrous (hairless) skin, which has higher tactile sensitivity than hairy skin. In contrast, the VHI paradigm typically targets hairy skin without comparisons to glabrous skin. The aim of this article was to investigate illusory body ownership, the applicability of ultrasonic mid-air haptics, and perceived immersion in VR using the VHI. Fifty participants viewed a virtual hand being stroked by a feather synchronously and asynchronously with ultrasonic stimulation applied to the glabrous skin on the palmar surface and the hairy skin on the dorsal surface of their hands. Questionnaire responses revealed that synchronous stimulation induced a stronger VHI than asynchronous stimulation. In synchronous conditions, the VHI was stronger for palmar stimulation than for dorsal stimulation. The ultrasonic stimulation was also perceived as more intense on the palmar surface than on the dorsal surface. Perceived immersion was not related to illusory body ownership per se but was enhanced by the provision of synchronous stimulation.
Adaptive Subtitles: Preferences and Trade-Offs in Real-Time Media Adaption
Subtitles can help improve the understanding of media content. People enable subtitles based on individual characteristics (e.g., language or hearing ability), viewing environment, or media context (e.g., drama, quiz show). However, some people find that subtitles can be distracting and that they negatively impact their viewing experience. We explore the challenges and opportunities surrounding interaction with real-time personalisation of subtitled content. To understand how people currently interact with subtitles, we first conducted an online questionnaire with 102 participants. We used our findings to elicit requirements for a new approach called Adaptive Subtitles that allows the viewer to alter, in real time, which speakers have subtitles displayed. We evaluated our approach with 19 participants to understand the interaction trade-offs and challenges within real-time adaptations of subtitled media. Our evaluation findings suggest that granular controls and structured onboarding allow viewers to make informed trade-offs when adapting media content, leading to improved viewing experiences.
Application of Quality of Experience in Networked Services: Review, Trend & Perspectives
Full text embargoed until 17.10.2019 (publisher's embargo period, 12 months).