22 research outputs found

Transitioning360: Content-aware NFoV Virtual Camera Paths for 360° Video Playback

Author: Hu Shi-Min
Li Yi-Jun
Richardt Christian
Wang Miao
Zhang Wen-Xuan
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 31/12/2020

Text-driven video acceleration: A weakly-supervised reinforcement learning method

Author: Araujo E.
Martins de Oliveira K.C.
Moura V.
Nascimento E.
Ramos W.L.D.S.
Silva M.M.D.
Soriano Marcolino L.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 07/03/2022

The growth of videos in our digital age and the users' limited time raise the demand for processing untrimmed videos to produce shorter versions conveying the same information. Despite the remarkable progress that summarization methods have made, most of them can only select a few frames or skims, creating visual gaps and breaking the video context. This paper presents a novel weakly-supervised methodology based on a reinforcement learning formulation to accelerate instructional videos using text. A novel joint reward function guides our agent to select which frames to remove and reduce the input video to a target length without creating gaps in the final video. We also propose the Extended Visually-guided Document Attention Network (VDAN+), which can generate a highly discriminative embedding space to represent both textual and visual data. Our experiments show that our method achieves the best performance in Precision, Recall, and F1 Score against the baselines while effectively controlling the video's output length.

arXiv.org e-Print Archive

Lancaster E-Prints

Towards Making Videos Accessible for Low Vision Screen Magnifier Users

Author: Bernard Jean-Baptiste
Chi Pei-Yu
Christen Michael
Hallett Elyse C
Seakins Paul J
Su Yu-Chuan
Publication venue: ODU Digital Commons
Publication date: 01/01/2020

People with low vision who use screen magnifiers to interact with computing devices find it very challenging to interact with dynamically changing digital content such as videos, since they do not have the luxury of time to manually move, i.e., pan the magnifier lens to different regions of interest (ROIs) or zoom into these ROIs before the content changes across frames. In this paper, we present SViM, a first-of-its-kind screen-magnifier interface for such users that leverages advances in computer vision, particularly video saliency models, to identify salient ROIs in videos. SViM's interface allows users to zoom in/out of any point of interest, switch between ROIs via mouse clicks and provides assistive panning with the added flexibility that lets the user explore other regions of the video besides the ROIs identified by SViM. Subjective and objective evaluation of a user study with 13 low vision screen magnifier users revealed that overall the participants had a better user experience with SViM over extant screen magnifiers, indicative of the former's promise and potential for making videos accessible to low vision screen magnifier users.

Crossref

Old Dominion University