9 research outputs found
Online television library: organization and content browsing for general users
This paper describes the organisational and playback features of FĂschlĂĄr, a digital video library that allows users to record, browse and watch television programmes online. Programmes that can be watched and recorded are organised by personal recommendations, genre classifications, name and other attributes for access by general television users. Motivations and interactions of users with online television libraries are outlined and they are also supported by personalised library access,
categorised programmes, a combined player browser with content viewing history and content marks. The combined player browser supports a user who watches a programme on different occasions in a non-sequential order
Collaborative video searching on a tabletop
Almost all system and application design for multimedia systems is based around a single user working in isolation to perform some task yet much of the work for which we use computers to help us, is based on working collaboratively with colleagues. Groupware systems do support user collaboration but typically this is supported through software and users still physically work independently. Tabletop systems, such as the DiamondTouch from MERL, are interface devices which support direct user collaboration on a tabletop. When a tabletop is used as the interface for a multimedia system, such as a video search system, then this kind of direct collaboration raises many questions for system design. In this paper we present a tabletop system for supporting a pair of users in a video search task and we evaluate the system not only in terms of search performance but also in terms of userâuser interaction and how different user personalities within each pair of searchers impacts search performance and user interaction. Incorporating the user into the system evaluation as we have done here reveals several interesting results and has important ramifications for the design of a multimedia search system
The TRECVID 2007 BBC rushes summarization evaluation pilot
This paper provides an overview of a pilot evaluation of
video summaries using rushes from several BBC dramatic series. It was carried out under the auspices of TRECVID.
Twenty-two research teams submitted video summaries of
up to 4% duration, of 42 individual rushes video files aimed
at compressing out redundant and insignificant material.
The output of two baseline systems built on straightforward
content reduction techniques was contributed by Carnegie
Mellon University as a control. Procedures for developing
ground truth lists of important segments from each video
were developed at Dublin City University and applied to
the BBC video. At NIST each summary was judged by
three humans with respect to how much of the ground truth
was included, how easy the summary was to understand,
and how much repeated material the summary contained.
Additional objective measures included: how long it took
the system to create the summary, how long it took the assessor to judge it against the ground truth, and what the
summary's duration was. Assessor agreement on finding desired segments averaged 78% and results indicate that while it is difficult to exceed the performance of baselines, a few systems did
Video browsing interfaces and applications: a review
We present a comprehensive review of the state of the art in video browsing and retrieval systems, with special emphasis on interfaces and applications. There has been a significant increase in activity (e.g., storage, retrieval, and sharing) employing video data in the past decade, both for personal and professional use. The ever-growing amount of video content available for human consumption and the inherent characteristics of video dataâwhich, if presented in its raw format, is rather unwieldy and costlyâhave become driving forces for the development of more effective solutions to present video contents and allow rich user interaction. As a result, there are many contemporary research efforts toward developing better video browsing solutions, which we summarize. We review more than 40 different video browsing and retrieval interfaces and classify them into three groups: applications that use video-player-like interaction, video retrieval applications, and browsing solutions based on video surrogates. For each category, we present a summary of existing work, highlight the technical aspects of each solution, and compare them against each other
VAST: A Human-Centered, Domain-Independent Video Analysis Support Tool
Providing computer-aided support for human analysis of videos has been a battle
of extremes. Powerful solutions exist, but they tend to be domain-specific and complex.
The user-friendly, simple systems provide little analysis support beyond basic media
player functionality. We propose a human-centered, domain-independent solution
between these two points.
Our proposed model and system, VAST, is based on our experience in two
diverse video analysis domains: science and athletics. Multiple-perspective location
metadata is used to group related video clips together. Users interact with these clip
groups through a novel interaction paradigm ? views. Each view provides a different
context by which users can judge and evaluate the events that are captured by the video.
Easy conversion between views allows the user to quickly switch between contexts. The
model is designed to support a variety of user goals and expertise with minimal producer
overhead.
To evaluate our model, we developed a system prototype and conducted several
rounds of user testing requiring the analysis of volleyball practice videos. The user tasks included: foreground analysis, ambiguous identification, background analysis, and
planning. Both domain novices and experts participated in the study. User feedback,
participant performance, and system logs were used to evaluate the system.
VAST successfully supported a variety of problem solving strategies employed
by participants during the course of the study. Participants had no difficulty handling
multiple views (and resulting multiple video clips) simultaneously opened in the
workspace. The capability to view multiple related clips at one time was highly
regarded.
In all tasks, except the open-ended portion of the background analysis,
participants performed well. However, performance was not significantly influenced by
domain expertise. Participants had a favorable opinion of the system?s intuitiveness, ease
of use, enjoyability, and aesthetics. The majority of participants stated a desire to use
VAST outside of the study, given the opportunity
Automatic Key-frame Extraction From Broadcast Soccer Videos
This paper presents a new approach for broadcast soccer video navigation and summarization based on specific representative images of the video. It also takes into account some soccer video features to better describe these videos. This work considers a special color reduction based on an HSV subquantization and a shot classification approach for soccer videos by exploring the dominant color related to the playground area.2216223Arman, F., Depommier, R., Hsu, A., Chiu, M.-Y., Content-based browsing of video sequences (1994) MULTIMEDIA '94: Proceedings of the Second ACM International Conference on Multimedia, pp. 97-103. , New York, NY USA. ACM PressBezerra, F.N., Leite, N.J., Video transition detection using string matching: Preliminary results (2003) SIB GRAPIXVIBrazilian Symposium on Computer Graphics and Image Procesing, pp. 339-346Brunelli, R., Mich, O., Modena, C.M., A survey on the automatic indexing of video data (1999) Journal of Visual Communication and Image Representation, 10 (2), pp. 78-112Chung, M.G., Lee, J., Kim, H., Song, S.M.-H., Kim, W.M., (1999) Automatic Video Segmentation Based on Spatio-Temporal Features, 4 (1), pp. 1-13. , Korea Telecom JournalCiocca, G., Schettini, R., Dynamic key-frame extraction for video summarization (2005) Proceedings of SPIE - The International Society for Optical Engineering, 5670, pp. 137-142. , DOI 10.1117/12.586777, 37, Proceedings of SPIE-IS and T Electronic Imaging - Internet Imaging VIDoulamis, A.D., Doulamis, N.D., Kollias, S.D., Fuzzy video content representation for video summarization and content-based retrieval (2000) Signal Processing, 80 (6), pp. 1049-1067. , DOI 10.1016/S0165-1684(00)00019-0Dufaux, F., Key frame selection to represent a video (2000) IEEE International Conference on Image Processing, 2, pp. 275-278Guimaraes, S.J.F., Couprie, M.M., De Albuquerque Araujo, A., Leite, N.J., Video segementation based on 2D image analysis (2003) Pattern Recognition Letters, 24 (7), pp. 947-957. , DOI 10.1016/S0167-8655(02)00218-0, PII S0167865502002180Kim, H., Lee, J., Yang, J.-H., Sull, S., Kim, W.M., Song, S.M.-H., Visual rhythm and shot verification (2001) Multimedia Tools and Applications, 15 (3), pp. 227-245Komlodi, A., Marchionini, G., Key frame preview techniques for video browsing (1998) DL '98: Proceedings of the Third ACM Conference on Digital Libraries, pp. 118-125. , New York, NY, USA. ACM PressKoprinska, I., Carrato, S., Temporal video segmentation (2001) Signal Processing: Image Communication, 16 (5), pp. 477-500Liu, F., Dong, D., Miao, X., Xue, X., A fast video clip retrieval algorithm based on va-file (2003) Storage and Retrieval Methods and Applications for Multimedia 2004, 5307, pp. 167-176. , Yeung, M. M., Lienhart, R. W., and Li, C.-S., editors, SPIENgo, C.W., Pong, T.C., Chin, R.T., Survey of video parsing and image indexing techniques in compressed domain (1998) Symposium on Image, Speech, Signal Processing, and Robotics (Workshop on Computer Vision), 1, pp. 231-236Pardo, A., Pixel-wise histograms for visual segment description and applications (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 4225, pp. 873-882. , Progress in Pattern Recognition, Image Analysis and Applications - 11th Iberoamerican Congress in Pattern Recognition, CIARP 2006, ProceedingsPatel, N.V., Sethi, I.K., Compressed video processing for cut detection (1996) IEE Proceedings: Vision, Image and Signal Processing, 143 (5), pp. 315-323SimĂŽes, N.C., DetecĂŁo de algumas transiĂŽes abruptas em segncias de imagens (in portuguese (2004) Master's Thesis, , Institute of Computing-IJNICAMPSmith, J.R., Natsev, A.P., Tesic, J., Xie, R.Y.L., Letz, F., Penz, C., Seidl, J., Yang, J., (2007) IBM Multimedia Analysis and Retrieval System-Marvel Lite 3, , http.-//www.alphaworks.ibm.com/tech/imars, 2aSze, K.-W., Lam, K.-M., Qiu, G., A new key frame representation for video segment retrieval (2005) Circ ii Its and Systems for Video Technology, 15 (9), pp. 1148-1155. , IEEE Transactions onTse, T., Marchionini, G., Ding, W., Slaughter, L., Komlodi, A., Dynamic key frame presentation techniques for augmenting video browsing (1998) AVI '98: Proceedings of the Working Conference on Advanced Visual Interfaces, pp. 185-194. , New York, NY, USA. ACM PressUeda, H., Miyatake, T., Yoshizawa, S., Impact: An interactive natural-motion-picture dedicated multimedia authoring system (1991) CHI '91: Proceedings Of the SIGCHI Conference on Human Factors in Computing Systems, pp. 343-350. , New York, NY, USA. ACM PressVendrig, J., Worring, M., (2003) Interactive Adaptive Movie Annotation, 10 (3), pp. 30-37. , MultiMediaWolf, W., Key frame selection by motion analysis (1996) IEEE International Conference on Acoustics, Speech, and Signal ProcessingXiong, W., Lee, J.C.-M., Ma, R.-H., Automatic video data structuring through shot partitioning and key-frame computing (1997) Machine Vision and Applications, 10 (2), pp. 51-65Yeung, M.M., Liu, B., Efficient matching and clustering of video shots (1995) ICIP '95: Proceedings of the 1995 International Conference on Image Processing, 1, p. 338. , Washington, DC, USA. IEEE Computer SocietyZhang, H., Kankanhalli, A., Smoliar, S.W., Automatic partitioning of full-motion video (1993) ACM Multimedia Systems, 1 (1), pp. 10-28Zhang, H.J., Low, C.Y., Smoliar, S.W., Wu, J.H., Video parsing, retrieval and browsing: An integrated and content-based solution (1995) MUL TIMEDIA '95: Proceedings of the third ACM International Conference on Multimedia, pp. 15-24. , New York, NY USA. ACM PressZhong, D., Zhang, H., Chang, S.-F., Clustering methods for video browsing and annotation (1996) Storage and Retrieval for Still Image and Video Databases IV, 2670, pp. 239-246. , Sethi, I. K. and Jam, R. C., editors, SPI