Modelling of content-aware indicators for effective determination of shot boundaries in compressed MPEG videos
In this paper, a content-aware approach is proposed to design multiple test conditions for shot cut detection, which are organized into a multiple-phase decision tree for abrupt cut detection and a finite state machine for dissolve detection. In comparison with existing approaches, our algorithm is characterized by two categories of content-difference indicators and tests. While the first category indicates the content changes that are directly used for shot cut detection, the second category indicates the contexts under which the content change occurs. As a result, indications of frame differences are tested with context awareness to make the detection of shot cuts adaptive to both content and context changes. Evaluations announced by TRECVID 2007 indicate that our proposed algorithm achieved performance comparable to those using machine learning approaches, yet using a simpler feature set and straightforward design strategies. This validates the effectiveness of modelling content-aware indicators for decision making, which also provides a good alternative to conventional approaches in this topic.
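The core idea above, testing a content-change indicator against a context indicator rather than a fixed threshold, can be sketched as follows. This is a simplified, hypothetical illustration, not the paper's actual decision tree or finite state machine; the function name, threshold values, and windowing are assumptions.

```python
def detect_cuts(frame_diffs, base_threshold=0.5, window=5):
    """Flag abrupt cuts where the content-change indicator (frame difference)
    exceeds a threshold adapted to the local context: the mean difference
    over a small neighbourhood, excluding the candidate frame itself."""
    cuts = []
    for i, d in enumerate(frame_diffs):
        lo = max(0, i - window)
        hi = min(len(frame_diffs), i + window + 1)
        context = [frame_diffs[j] for j in range(lo, hi) if j != i]
        ctx_mean = sum(context) / len(context)
        # A cut must stand out against both an absolute floor and the
        # surrounding activity level (the "context" indicator).
        if d > base_threshold + 2.0 * ctx_mean:
            cuts.append(i)
    return cuts
```

In a high-motion passage the context mean rises, so the effective threshold rises with it; this is the sense in which detection adapts to both content and context changes.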
Ridiculously Fast Shot Boundary Detection with Fully Convolutional Neural Networks
Shot boundary detection (SBD) is an important component of many video analysis tasks, such as action recognition, video indexing, summarization and editing. Previous work typically used a combination of low-level features like color histograms, in conjunction with simple models such as SVMs. Instead, we propose to learn shot detection end-to-end, from pixels to final shot boundaries. For training such a model, we rely on our insight that all shot boundaries are generated. Thus, we create a dataset with one million frames and automatically generated transitions such as cuts, dissolves and fades. In order to efficiently analyze hours of video, we propose a Convolutional Neural Network (CNN) which is fully convolutional in time, thus allowing the use of a large temporal context without the need to repeatedly process frames. With this architecture our method obtains state-of-the-art results while running at an unprecedented speed of more than 120x real-time.
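The efficiency argument in this abstract — compute per-frame features once, then slide a shared temporal kernel over them instead of reprocessing overlapping windows — can be illustrated with a minimal 1-D temporal convolution. This is a toy sketch with scalar per-frame features and a hand-picked difference kernel, not the paper's CNN; all names and values here are assumptions.

```python
def temporal_conv(features, kernel):
    """Slide a shared temporal kernel over a sequence of per-frame features.
    Each frame's feature is computed once and reused by every overlapping
    window, which is what makes a fully-convolutional-in-time model cheap."""
    k = len(kernel)
    scores = []
    for t in range(len(features) - k + 1):
        scores.append(sum(kernel[j] * features[t + j] for j in range(k)))
    return scores
```

With a simple difference kernel such as [-1, 1], a sudden change in the feature sequence (a cut) shows up as a spike in the score sequence; a learned model replaces the hand-picked kernel with trained weights over richer features.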
Video shot boundary detection: seven years of TRECVid activity
Shot boundary detection (SBD) is the process of automatically detecting the boundaries between shots in video. It is a problem which has attracted much attention since video became available in digital form, as it is an essential pre-processing step to almost all video analysis, indexing, summarisation, search, and other content-based operations. Automatic SBD was one of the tracks of activity within the annual TRECVid benchmarking exercise, each year from 2001 to 2007 inclusive. Over those seven years we have seen 57 different research groups from across the world work to determine the best approaches to SBD while using a common dataset and common scoring metrics. In this paper we present an overview of the TRECVid shot boundary detection task, a high-level overview of the most significant of the approaches taken, and a comparison of performances, focussing on one year (2005) as an example.
Dublin City University video track experiments for TREC 2001
Dublin City University participated in the interactive search task and Shot Boundary Detection task of the TREC Video Track. In the interactive search task experiment thirty people used three different digital video browsers to find video segments matching the given topics. Each user was under a time constraint of six minutes for each topic assigned to them. The purpose of this experiment was to compare video browsers, and so a method was developed for combining independent users' results for a topic into one set of results. Collated results based on thirty users are available herein, though individual users' and browsers' results are currently unavailable for comparison. Our purpose in participating in this TREC track was to create the ground truth within the TREC framework, which will allow us to do direct browser performance comparisons.
The TREC-2002 video track report
TREC-2002 saw the second running of the Video Track, the goal of which was to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. The track used 73.3 hours of publicly available digital video (in MPEG-1/VCD format) downloaded by the participants directly from the Internet Archive (Prelinger Archives) (internetarchive, 2002) and some from the Open Video Project (Marchionini, 2001). The material comprised advertising, educational, industrial, and amateur films produced between the 1930s and the 1970s by corporations, nonprofit organizations, trade associations, community and interest groups, educational institutions, and individuals. 17 teams representing 5 companies and 12 universities - 4 from Asia, 9 from Europe, and 4 from the US - participated in one or more of three tasks in the 2002 video track: shot boundary determination, feature extraction, and search (manual or interactive). Results were scored by NIST using manually created truth data for shot boundary determination and manual assessment of feature extraction and search results. This paper is an introduction to, and an overview of, the track framework - the tasks, data, and measures - the approaches taken by the participating groups, the results, and issues regarding the evaluation. For detailed information about the approaches and results, the reader should see the various site reports in the final workshop proceedings.
Rushes video summarization using a collaborative approach
This paper describes the video summarization system developed by the partners of the K-Space European Network of Excellence for the TRECVID 2008 BBC rushes summarization evaluation. We propose an original method based on individual content segmentation and selection tools in a collaborative system. Our system is organized in several steps. First, we segment the video; second, we identify relevant and redundant segments; and finally, we select a subset of segments to concatenate and build the final summary, with video acceleration incorporated. We analyze the performance of our system through the TRECVID evaluation.
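The final step of the pipeline above, selecting relevant, non-redundant segments under a summary duration budget, can be sketched as a greedy selection. This is a hypothetical simplification, not the K-Space system; the segment representation (id, duration, relevance score, redundancy signature) and the greedy strategy are assumptions for illustration.

```python
def select_segments(segments, budget):
    """Greedily pick segments by descending relevance, skipping segments
    whose redundancy signature was already used and those that would
    exceed the total duration budget.
    Each segment is a tuple: (id, duration, relevance, signature)."""
    chosen, used, total = [], set(), 0.0
    for sid, dur, rel, sig in sorted(segments, key=lambda s: -s[2]):
        if sig in used or total + dur > budget:
            continue  # redundant with an already-chosen segment, or too long
        chosen.append(sid)
        used.add(sig)
        total += dur
    return chosen
```

A real system would compare visual signatures with a similarity measure rather than exact equality, but the budget-and-redundancy trade-off is the same.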
Who is the director of this movie? Automatic style recognition based on shot features
We show how low-level formal features, such as shot duration, meant as the length of camera takes, and shot scale, i.e. the distance between the camera and the subject, are distinctive of a director's style in art movies. So far such features were thought not to have enough variety to be distinctive of an author. However, our investigation of the full filmographies of six different authors (Scorsese, Godard, Tarr, Fellini, Antonioni, and Bergman), for a total of 120 movies analysed second by second, confirms that these shot-related features do not appear as random patterns in movies from the same director. For feature extraction we adopt methods based on both conventional and deep learning techniques. Our findings suggest that feature sequential patterns, i.e. how features evolve in time, are at least as important as the related feature distributions. To the best of our knowledge this is the first study dealing with automatic attribution of movie authorship, which opens up interesting lines of cross-disciplinary research on the impact of style on the aesthetic and emotional effects on the viewers.