Search CORE

2,391 research outputs found

Video Shot Boundary Detection using the Scale Invariant Feature Transform and RGB Color Channels

Author: Benkaddour Abdelhamid
El khattabi Zaynab
Tabii Youness
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/10/2017
Field of study

Segmentation of the video sequence by detecting shot changes is essential for video analysis, indexing and retrieval. In this context, a shot boundary detection algorithm is proposed in this paper based on the scale invariant feature transform (SIFT). The first step of our method consists on a top down search scheme to detect the locations of transitions by comparing the ratio of matched features extracted via SIFT for every RGB channel of video frames. The overview step provides the locations of boundaries. Secondly, a moving average calculation is performed to determine the type of transition. The proposed method can be used for detecting gradual transitions and abrupt changes without requiring any training of the video content in advance. Experiments have been conducted on a multi type video database and show that this algorithm achieves well performances

Crossref

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Institute of Advanced Engineering and Science

Machine learning of hierarchical clustering to segment 2D and 3D images

Author: Chklovskii Dmitri B.
Kennedy Ryan
Nunez-Iglesias Juan
Parag Toufiq
Shi Jianbo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

We aim to improve segmentation through the use of machine learning tools during region agglomeration. We propose an active learning approach for performing hierarchical agglomerative segmentation from superpixels. Our method combines multiple features at all scales of the agglomerative process, works for data with an arbitrary number of dimensions, and scales to very large datasets. We advocate the use of variation of information to measure segmentation accuracy, particularly in 3D electron microscopy (EM) images of neural tissue, and using this metric demonstrate an improvement over competing algorithms in EM and natural images.Comment: 15 pages, 8 figure

arXiv.org e-Print Archive

Directory of Open Access Journals

University of Melbourne Institutional Repository

FigShare

TRECVID 2003 - an overview

Author: Kraaij Wessel
Over Paul
Smeaton Alan F.
Publication venue: 'University of Aden - Faculty of Economics and Administration'
Publication date: 01/11/2003
Field of study

Irish Universities

DCU Online Research Access Service

Indexing of fictional video content for event detection and summarisation

Author: Lee Hyowon
Lehane Bart
O'Connor Noel E.
Smeaton Alan F.
Publication venue: 'Hindawi Limited'
Publication date: 02/08/2007
Field of study

This paper presents an approach to movie video indexing that utilises audiovisual analysis to detect important and meaningful temporal video segments, that we term events. We consider three event classes, corresponding to dialogues, action sequences, and montages, where the latter also includes musical sequences. These three event classes are intuitive for a viewer to understand and recognise whilst accounting for over 90% of the content of most movies. To detect events we leverage traditional filmmaking principles and map these to a set of computable low-level audiovisual features. Finite state machines (FSMs) are used to detect when temporal sequences of specific features occur. A set of heuristics, again inspired by filmmaking conventions, are then applied to the output of multiple FSMs to detect the required events. A movie search system, named MovieBrowser, built upon this approach is also described. The overall approach is evaluated against a ground truth of over twenty-three hours of movie content drawn from various genres and consistently obtains high precision and recall for all event classes. A user experiment designed to evaluate the usefulness of an event-based structure for both searching and browsing movie archives is also described and the results indicate the usefulness of the proposed approach

DCU Online Research Access Service

Indexing of fictional video content for event detection and summarisation

Author: Lee Hyowon
Lehane Bart
O'Connor Noel E.
Smeaton Alan F.
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2007
Field of study

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Irish Universities

DCU Online Research Access Service

Indexing of fictional video content for event detection and summarisation

Author: Lee Hyowon
Lehane Bart
O'Connor Noel E.
Smeaton Alan F.
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2007
Field of study

CiteSeerX

Springer - Publisher Connector

Directory of Open Access Journals

Irish Universities

DCU Online Research Access Service

SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation

Author: Huang Qiangui
Neumann Ulrich
Wang Weiyue
Yu Ronald
Publication venue
Publication date: 30/05/2019
Field of study

We introduce Similarity Group Proposal Network (SGPN), a simple and intuitive deep learning framework for 3D object instance segmentation on point clouds. SGPN uses a single network to predict point grouping proposals and a corresponding semantic class for each proposal, from which we can directly extract instance segmentation results. Important to the effectiveness of SGPN is its novel representation of 3D instance segmentation results in the form of a similarity matrix that indicates the similarity between each pair of points in embedded feature space, thus producing an accurate grouping proposal for each point. To the best of our knowledge, SGPN is the first framework to learn 3D instance-aware semantic segmentation on point clouds. Experimental results on various 3D scenes show the effectiveness of our method on 3D instance segmentation, and we also evaluate the capability of SGPN to improve 3D object detection and semantic segmentation results. We also demonstrate its flexibility by seamlessly incorporating 2D CNN features into the framework to boost performance

arXiv.org e-Print Archive

Crossref

LIG and LIRIS at TRECVID 2008: High Level Feature Extraction and Collaborative Annotation

Author: Ayache Stéphane
Quénot Georges
Publication venue: HAL CCSD
Publication date: 01/01/2008
Field of study

International audienceThis paper describes our participations of LIG and LIRIS to the TRECVID 2008 High Level Features detection task. We evaluated several fusion strategies and especially rank fusion. Results show that including as many low-level and intermediate features as possible is the best strategy, that SIFT features are very important, that the way in which the fusion from the various low-level and intermediate features does matter, that the type of mean (arithmetic, geometric and harmonic) does matter. LIG and LIRIS best runs respectively have a Mean Inferred Average Precision of 0.0833 and 0.0598; both above TRECVID 2008 HLF detection task median performance. LIG and LIRIS also co-organized the TRECVID 2008 collaborative annotation. 40 teams did 1235428 annotations. The development collection was annotated at least once at 100\%, at least twice at 37.6\%, at least three times at 3.99\% and at least four times at 0.06\%. Thanks to the active learning and active cleaning used approach, the annotations that were done multiple times were those for which the risk of error was maximum

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Scene Determination based on Video and Audio Features

Author: Effelsberg Wolfgang
Lienhart Rainer
Pfeiffer Silvia
Publication venue
Publication date: 01/01/1998
Field of study

Determination of scenes from a video is a challenging task. When asking humans for it, results will be inconsistent since the term scene is not precisely defined. It leaves it up to each human to set shared attributes which integrate shots to scenes. However, consistent results can be found for certain basic attributes like dialogs, same settings and continuing sounds. We have therefore developed a scene determination scheme which clusters shots based on detected dialogs, same settings and similar audio. Our experimental results show that automatic deter mination of these types of scenes can be performed reliably

TUbiblio

Crossref

MAnnheim DOCument Server