Action Recognition in Videos: from Motion Capture Labs to the Web
This paper presents a survey of human action recognition approaches based on
visual data recorded from a single video camera. We propose an organizing
framework that highlights the evolution of the area, with techniques
moving from heavily constrained motion capture scenarios towards more
challenging, realistic, "in the wild" videos. The proposed organization is
based on the representation used as input for the recognition task, emphasizing
the hypotheses assumed and thus the constraints imposed on the type of video
that each technique is able to address. Making the hypotheses and
constraints explicit makes the framework particularly useful for selecting a method, given
an application. Another advantage of the proposed organization is that it
allows the newest approaches to be categorized seamlessly with traditional ones, while
providing an insightful perspective of the evolution of the action recognition
task up to now. That perspective is the basis for the discussion at the end of
the paper, where we also present the main open issues in the area.
Comment: Preprint submitted to CVIU, survey paper, 46 pages, 2 figures, 4 tables
Region of interest-based adaptive multimedia streaming scheme
Adaptive multimedia streaming aims at adjusting the transmitted content based on the available bandwidth so that losses, which often severely affect the end-user perceived quality, are minimized and consequently the transmission quality increases. Current solutions affect the whole viewing area of the multimedia frames equally, despite research showing that there are regions in which viewers are more interested than in others. This paper presents a novel region of interest-based adaptive scheme (ROIAS) for multimedia streaming that, when performing transmission-related quality adjustments, selectively affects the quality of those regions of the image in which the viewers are least interested. As the quality of the regions in which the viewers are most interested will not change (or will change only slightly), the proposed scheme provides higher overall end-user perceived quality than any of the existing adaptive solutions.
Caching-Aided Collaborative D2D Operation for Predictive Data Dissemination in Industrial IoT
Industrial automation deployments constitute challenging environments where
moving IoT machines may produce high-definition video and other heavy sensor
data during surveying and inspection operations. Transporting massive contents
to the edge network infrastructure and then eventually to the remote human
operator requires reliable and high-rate radio links supported by intelligent
data caching and delivery mechanisms. In this work, we address the challenges
of contents dissemination in characteristic factory automation scenarios by
proposing to engage moving industrial machines as device-to-device (D2D)
caching helpers. With the goal of improving the reliability of high-rate
millimeter-wave (mmWave) data connections, we introduce alternative
contents dissemination modes and then construct a novel mobility-aware
methodology that helps develop predictive mode selection strategies based on
the anticipated radio link conditions. We also conduct a thorough system-level
evaluation of representative data dissemination strategies to confirm the
benefits of predictive solutions that employ D2D-enabled collaborative caching
at the wireless edge to lower contents delivery latency and improve data
acquisition reliability
Video browsing interfaces and applications: a review
We present a comprehensive review of the state of the art in video browsing and retrieval systems, with special emphasis on interfaces and applications. There has been a significant increase in activity (e.g., storage, retrieval, and sharing) employing video data in the past decade, both for personal and professional use. The ever-growing amount of video content available for human consumption and the inherent characteristics of video data—which, if presented in its raw format, is rather unwieldy and costly—have become driving forces for the development of more effective solutions to present video contents and allow rich user interaction. As a result, there are many contemporary research efforts toward developing better video browsing solutions, which we summarize. We review more than 40 different video browsing and retrieval interfaces and classify them into three groups: applications that use video-player-like interaction, video retrieval applications, and browsing solutions based on video surrogates. For each category, we present a summary of existing work, highlight the technical aspects of each solution, and compare them against each other