    High-level feature detection from video in TRECVid: a 5-year retrospective of achievements

    Successful and effective content-based access to digital video requires fast, accurate and scalable methods to determine the video content automatically. A variety of contemporary approaches to this rely on text taken from speech within the video, on matching one video frame against others using low-level characteristics like colour, texture or shapes, or on determining and matching objects appearing within the video. Possibly the most important technique, however, is one which determines the presence or absence of a high-level or semantic feature within a video clip or shot. By utilizing dozens, hundreds or even thousands of such semantic features we can support many kinds of content-based video navigation. Critically, however, this depends on being able to determine whether each feature is or is not present in a video clip. The last 5 years have seen much progress in the development of techniques to determine the presence of semantic features within video. This progress can be tracked in the annual TRECVid benchmarking activity, where dozens of research groups measure the effectiveness of their techniques on common data using an open, metrics-based approach. In this chapter we summarise the work done on the TRECVid high-level feature task, showing the progress made year on year. This provides a fairly comprehensive statement of where the state of the art stands on this important task, not just for one research group or one approach, but across the spectrum. We then use this past and ongoing work as a basis for highlighting the trends emerging in this area, and the questions which remain to be addressed before we can achieve large-scale, fast and reliable high-level feature detection on video.

    Automatic summarization of rushes video using bipartite graphs

    In this paper we present a new approach for automatic summarization of rushes, or unstructured video. Our approach is composed of three major steps. First, based on shot and sub-shot segmentations, we filter out sub-shots with low information content that are unlikely to be useful in a summary. Second, a method using maximal matching in a bipartite graph is adapted to measure similarity between the remaining shots and to minimize inter-shot redundancy by removing the repetitive retake shots common in rushes video. Finally, the presence of faces and the motion intensity are characterised in each sub-shot, and a measure of how representative the sub-shot is in the context of the overall video is proposed. Video summaries composed of keyframe slideshows are then generated. To evaluate the effectiveness of this approach we re-ran the evaluation carried out by TRECVid, using the same dataset and evaluation metrics as the TRECVid video summarization task in 2007, but with our own assessors. Results show that our approach leads to a significant improvement over our previous work in terms of the fraction of the TRECVid summary ground truth included, and is competitive with the best of the other approaches in TRECVid 2007.
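    A minimal sketch of the matching step described above, assuming keyframe feature vectors have already been extracted for each shot; the cosine-similarity features and the 0.8 edge threshold are illustrative assumptions, not values from the paper:

```python
# Sketch only: shot-to-shot similarity via maximum matching in a bipartite
# graph, in the spirit of the approach above. Feature extraction and the
# similarity threshold are assumptions.
import numpy as np
import networkx as nx

def shot_similarity(frames_a, frames_b, threshold=0.8):
    """frames_a, frames_b: lists of keyframe feature vectors (np.ndarray)."""
    if not frames_a or not frames_b:
        return 0.0
    g = nx.Graph()
    left = [("a", i) for i in range(len(frames_a))]
    g.add_nodes_from(left, bipartite=0)
    g.add_nodes_from((("b", j) for j in range(len(frames_b))), bipartite=1)
    for i, fa in enumerate(frames_a):
        for j, fb in enumerate(frames_b):
            # add an edge when the two keyframes are visually close enough
            sim = float(np.dot(fa, fb) / (np.linalg.norm(fa) * np.linalg.norm(fb)))
            if sim >= threshold:
                g.add_edge(("a", i), ("b", j))
    matching = nx.bipartite.maximum_matching(g, top_nodes=left)
    pairs = len(matching) // 2  # the dict stores each matched pair twice
    return pairs / max(len(frames_a), len(frames_b))
```

    A high score flags the second shot as a likely retake of the first, so one of the pair can be dropped from the summary.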

    Semantic analysis of field sports video using a petri-net of audio-visual concepts

    The most common approach to automatic summarisation and highlight detection in sports video is to train an automatic classifier to detect semantic highlights based on occurrences of low-level features such as action replays, excited commentators or changes in a scoreboard. We propose an alternative approach based on the detection of perception concepts (PCs) and the construction of Petri-Nets, which can be used both for semantic description and for event detection within sports videos. Low-level algorithms for the detection of perception concepts using visual, aural and motion characteristics are proposed, and a series of Petri-Nets composed of perception concepts is formally defined to describe video content. We call this a Perception Concept Network-Petri Net (PCN-PN) model. Using PCN-PNs, personalised high-level semantic descriptions of video highlights can be facilitated and queries on high-level semantics can be answered. A particular strength of this framework is that we can easily build semantic detectors based on PCN-PNs to search within sports videos and locate interesting events. Experimental results based on recorded sports video data across three types of sports games (soccer, basketball and rugby), each from multiple broadcasters, illustrate the potential of this framework.
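    To make the Petri-Net idea concrete, here is a toy sketch (not the authors' PCN-PN model): places hold tokens for detected perception concepts, and an event transition fires only when all of its input concepts have been observed. The concept and event names are hypothetical.

```python
# Toy Petri net: an event transition fires only when every one of its
# input perception-concept places holds a token. Names are illustrative.
class PetriNet:
    def __init__(self):
        self.marking = {}       # place -> token count
        self.transitions = {}   # name -> (input places, output places)

    def add_transition(self, name, inputs, outputs):
        self.transitions[name] = (list(inputs), list(outputs))

    def add_token(self, place, n=1):
        self.marking[place] = self.marking.get(place, 0) + n

    def fire(self, name):
        inputs, outputs = self.transitions[name]
        if not all(self.marking.get(p, 0) > 0 for p in inputs):
            return False        # not enabled: some concept not yet detected
        for p in inputs:
            self.marking[p] -= 1
        for p in outputs:
            self.add_token(p)
        return True

net = PetriNet()
# Hypothetical rule: a "goal" event needs excited audio AND a scoreboard change.
net.add_transition("goal", ["excited_audio", "scoreboard_change"], ["goal_event"])
net.add_token("excited_audio")
net.add_token("scoreboard_change")
assert net.fire("goal") and net.marking["goal_event"] == 1
```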

    A content-based retrieval system for UAV-like video and associated metadata

    In this paper we provide an overview of a content-based retrieval (CBR) system that has been specifically designed for handling UAV video and associated metadata. Our emphasis in designing this system is on managing large quantities of such information and providing intuitive and efficient access mechanisms to this content, rather than on analysis of the video content. The retrieval unit in our system is termed a "trip". At capture time, each trip consists of an MPEG-1 video stream and a set of time-stamped GPS locations. An analysis process automatically selects and associates GPS locations with the video timeline. The indexed trip is then stored in a shared trip repository. The repository forms the backend of an MPEG-21-compliant Web 2.0 application for subsequent querying, browsing, annotation and video playback. The system interface allows users to search and browse across the entire archive of trips and, depending on their access rights, to annotate other users' trips with additional information. Interaction with the CBR system is via a novel interactive map-based interface. This interface supports content access by time, date, region of interest on the map, previously annotated locations of interest, and combinations of these. To develop such a system and investigate its practical usefulness in real-world scenarios, a significant amount of appropriate data is clearly required. In the absence of a large volume of UAV data with which to work, we have simulated UAV-like data using GPS-tagged video content captured from moving vehicles.
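    As an illustration of the GPS-to-timeline association step, here is a minimal sketch under assumptions: the trip's recording start time is known, fixes are sorted by timestamp, and a playback offset is mapped to its nearest GPS fix. The field names and tuple layout are hypothetical.

```python
# Sketch: map a video playback offset (seconds) to the nearest time-stamped
# GPS fix of the same trip. Assumes fixes are sorted by time and non-empty.
from bisect import bisect_left

def nearest_fix(fixes, video_start, offset_s):
    """fixes: sorted list of (unix_time, lat, lon) tuples."""
    target = video_start + offset_s
    times = [t for t, _, _ in fixes]
    i = bisect_left(times, target)
    candidates = [k for k in (i - 1, i) if 0 <= k < len(fixes)]
    return min((fixes[k] for k in candidates), key=lambda f: abs(f[0] - target))

# e.g. nearest_fix(trip_fixes, trip_start_unix_time, 42.0) -> (t, lat, lon)
```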

    The Físchlár digital video system: a digital library of broadcast TV programmes

    Físchlár is a system for recording, indexing, browsing and playback of broadcast TV programmes, which has been operational on our University campus for almost 18 months. In this paper we give a brief overview of how the system operates and how TV programmes are organised for browsing and playback, and a short report on the system's usage by over 900 users in our University.

    Físchlár: an on-line system for indexing and browsing broadcast television content

    This paper describes a demonstration system which automatically indexes broadcast television content for subsequent non-linear browsing. User-specified television programmes are captured in MPEG-1 format and analysed using a number of video indexing tools such as shot boundary detection, keyframe extraction, shot clustering and news story segmentation. A number of different interfaces have been developed which allow a user to browse the visual index created by these analysis tools. These interfaces are designed to help users locate video content of particular interest. Once such content is located, the MPEG-1 bitstream can be streamed to the user in real time. This paper describes both the high-level functionality of the system and the low-level indexing tools it employs, and gives an overview of the different browsing mechanisms provided.
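    As one example of the kind of indexing tool listed above, the following is a minimal colour-histogram shot boundary detector. It is not the Físchlár implementation; the histogram bin counts and the correlation threshold are assumptions that would need tuning.

```python
# Sketch: detect hard cuts by comparing HSV colour histograms of
# consecutive frames; low correlation suggests a shot boundary.
import cv2

def detect_shot_boundaries(path, threshold=0.5):
    cap = cv2.VideoCapture(path)
    boundaries, prev_hist, frame_no = [], None, 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
        hist = cv2.calcHist([hsv], [0, 1], None, [50, 60], [0, 180, 0, 256])
        cv2.normalize(hist, hist)
        if prev_hist is not None and \
                cv2.compareHist(prev_hist, hist, cv2.HISTCMP_CORREL) < threshold:
            boundaries.append(frame_no)
        prev_hist, frame_no = hist, frame_no + 1
    cap.release()
    return boundaries
```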

    Everyday concept detection in visual lifelogs: validation, relationships and trends

    The Microsoft SenseCam is a small, lightweight wearable camera used to passively capture photos and other sensor readings from a user's day-to-day activities. It can capture up to 3,000 images per day, equating to almost 1 million images per year. It is used to aid memory by creating a personal multimedia lifelog, or visual recording, of the wearer's life. However, the sheer volume of image data captured within a visual lifelog creates a number of challenges, particularly for locating relevant content. In this work, we explore the applicability of semantic concept detection, a method often used within video retrieval, to the novel domain of visual lifelogs. A concept detector models the correspondence between low-level visual features and high-level semantic concepts (such as indoors, outdoors, people, buildings, etc.) using supervised machine learning, and thereby determines the probability of a concept's presence. We apply detection of 27 everyday semantic concepts to a lifelog collection composed of 257,518 SenseCam images from 5 users. The results were then evaluated on a subset of 95,907 images to determine the precision of detection for each semantic concept. We conduct further analysis of the temporal consistency, co-occurrence and trends within the detected concepts to more extensively investigate the robustness of the detectors within this novel domain. We additionally present future applications of concept detection within the domain of lifelogging.
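    The following sketch shows the general shape of such a concept detector (one binary classifier per concept, returning a probability of presence). It is not the authors' detectors; the logistic-regression model, feature dimensionality and synthetic data are assumptions for illustration.

```python
# Sketch: one supervised detector per semantic concept, mapping low-level
# image features to the probability that the concept is present.
import numpy as np
from sklearn.linear_model import LogisticRegression

class ConceptDetector:
    def __init__(self, concept):
        self.concept = concept
        self.model = LogisticRegression(max_iter=1000)

    def train(self, features, labels):
        # features: (n_images, n_dims) descriptors; labels: 0/1 presence
        self.model.fit(features, labels)

    def presence_probability(self, features):
        return self.model.predict_proba(features)[:, 1]

# One detector per everyday concept, e.g. "indoors", "people", "buildings".
detectors = {c: ConceptDetector(c)
             for c in ["indoors", "outdoors", "people", "buildings"]}

# Toy end-to-end run on random data, just to show the call pattern.
rng = np.random.default_rng(0)
X, y = rng.normal(size=(200, 16)), rng.integers(0, 2, size=200)
detectors["indoors"].train(X, y)
probs = detectors["indoors"].presence_probability(X[:5])
```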

    The UK guidelines for management and surveillance of Tuberous Sclerosis Complex.

    Background: The severity of Tuberous Sclerosis Complex (TSC) can vary among affected individuals. Complications of TSC can be life-threatening, with significant impact on patients' quality of life. Management may vary depending on the treating physician, local and national policies, and funding. There are no current UK guidelines. We conducted a Delphi consensus process to reach agreed guidance for the management of patients with TSC in the UK. Methods: We performed a literature search and reviewed the 2012/13 international guideline for TSC management. Based on these, a Delphi questionnaire was formed. We invited 86 clinicians and medical researchers to complete an online survey in two rounds. All the people surveyed were based in the UK. Clinicians were identified through the regional TSC clinics, and researchers were identified through publications. In round one, 55 questions were asked. In round two, 18 questions were asked in order to obtain consensus on the outstanding points that had been contentious in round one. The data were analysed by a core committee and subcommittees consisting of UK experts in different aspects of TSC. The Tuberous Sclerosis Association was consulted. Results: 51 TSC experts took part in the survey. Two rounds were required to achieve consensus. The responders included neurologists, nephrologists, psychiatrists, psychologists, oncologists, general paediatricians, dermatologists, urologists, radiologists, clinical geneticists, neurosurgeons, and respiratory and neurodisability clinicians. Conclusions: These new UK guidelines for the management and surveillance of TSC provide consensus guidance for the delivery of best clinical care to individuals with TSC in the UK.