Search CORE

1,677 research outputs found

K-Space at TRECVid 2007

Author: Adamek Tomasz
Byrne Daragh
Jones Gareth J.F.
Keenan Gordon
Lee Hyowon
McGuinness Kevin
O'Connor Noel E.
Smeaton Alan F.
Wilkins Peter
Publication venue: 'University of Aden - Faculty of Economics and Administration'
Publication date: 01/11/2007
Field of study

In this paper we describe K-Space participation in TRECVid 2007. K-Space participated in two tasks, high-level feature extraction and interactive search. We present our approaches for each of these activities and provide a brief analysis of our results. Our high-level feature submission utilized multi-modal low-level features which included visual, audio and temporal elements. Specific concept detectors (such as Face detectors) developed by K-Space partners were also used. We experimented with different machine learning approaches including logistic regression and support vector machines (SVM). Finally we also experimented with both early and late fusion for feature combination. This year we also participated in interactive search, submitting 6 runs. We developed two interfaces which both utilized the same retrieval functionality. Our objective was to measure the effect of context, which was supported to different degrees in each interface, on user performance. The first of the two systems was a ‘shot’ based interface, where the results from a query were presented as a ranked list of shots. The second interface was ‘broadcast’ based, where results were presented as a ranked list of broadcasts. Both systems made use of the outputs of our high-level feature submission as well as low-level visual features

Irish Universities

DCU Online Research Access Service

K-Space at TRECVID 2008

Author: Adamek T.
Amin A.
Avrithis Y.
Bailer W.
Benmokhtar R.
Byrne D.
Chandramouli K.
Cobet A.
Dumont E.
Goldmann L.
Goyal A.
Haller M.
Halvey M.
Hannah D.
Hopfgartner F.
Huet B.
Izquierdo E.
Jones G.
Jose J.M.
Keenan G.
Kompatsiaris I.
Lee H.
McGuinness K.
Merialdo B.
Mezaris V.
Moerzinger R.
O'Connor N.
O'Hare N.
Papadopoulous G.
Praks P.
Punitha P.
Samour A.
Schallauer P.
Sikora T.
Smeaton A.F.
Spyrou E.
Tolias G.
Troncy R.
Villa R.
Wilkins P.
Publication venue
Publication date: 01/01/2008
Field of study

In this paper we describe K-Space’s participation in TRECVid 2008 in the interactive search task. For 2008 the K-Space group performed one of the largest interactive video information retrieval experiments conducted in a laboratory setting. We had three institutions participating in a multi-site multi-system experiment. In total 36 users participated, 12 each from Dublin City University (DCU, Ireland), University of Glasgow (GU, Scotland) and Centrum Wiskunde and Informatica (CWI, the Netherlands). Three user interfaces were developed, two from DCU which were also used in 2007 as well as an interface from GU. All interfaces leveraged the same search service. Using a latin squares arrangement, each user conducted 12 topics, leading in total to 6 runs per site, 18 in total. We officially submitted for evaluation 3 of these runs to NIST with an additional expert run using a 4th system. Our submitted runs performed around the median. In this paper we will present an overview of the search system utilized, the experimental setup and a preliminary analysis of our results

DSpace at NTUA

Enlighten

K-Space at TRECVid 2008

Author: Adamek Tomasz
Byrne Daragh
Jones Gareth J.F.
Keenan Gordon
Lee Hyowon
McGuinness Kevin
O'Connor Noel E.
O'Hare Neil
Smeaton Alan F.
Wilkins Peter
Publication venue: 'University of Aden - Faculty of Economics and Administration'
Publication date: 01/11/2008
Field of study

In this paper we describe K-Space’s participation in TRECVid 2008 in the interactive search task. For 2008 the K-Space group performed one of the largest interactive video information retrieval experiments conducted in a laboratory setting. We had three institutions participating in a multi-site multi-system experiment. In total 36 users participated, 12 each from Dublin City University (DCU, Ireland), University of Glasgow (GU, Scotland) and Centrum Wiskunde & Informatica (CWI, the Netherlands). Three user interfaces were developed, two from DCU which were also used in 2007 as well as an interface from GU. All interfaces leveraged the same search service. Using a latin squares arrangement, each user conducted 12 topics, leading in total to 6 runs per site, 18 in total. We officially submitted for evaluation 3 of these runs to NIST with an additional expert run using a 4th system. Our submitted runs performed around the median. In this paper we will present an overview of the search system utilized, the experimental setup and a preliminary analysis of our results

Irish Universities

DCU Online Research Access Service

Compressed Video Action Recognition

Author: Hu Hexiang
Krähenbühl Philipp
Manmatha R.
Smola Alexander J.
Wu Chao-Yuan
Zaheer Manzil
Publication venue
Publication date: 29/03/2018
Field of study

Training robust deep video representations has proven to be much more challenging than learning deep image representations. This is in part due to the enormous size of raw video streams and the high temporal redundancy; the true and interesting signal is often drowned in too much irrelevant data. Motivated by that the superfluous information can be reduced by up to two orders of magnitude by video compression (using H.264, HEVC, etc.), we propose to train a deep network directly on the compressed video. This representation has a higher information density, and we found the training to be easier. In addition, the signals in a compressed video provide free, albeit noisy, motion information. We propose novel techniques to use them effectively. Our approach is about 4.6 times faster than Res3D and 2.7 times faster than ResNet-152. On the task of action recognition, our approach outperforms all the other methods on the UCF-101, HMDB-51, and Charades dataset.Comment: CVPR 2018 (Selected for spotlight presentation

arXiv.org e-Print Archive

Crossref

Fast compressed domain motion detection in H.264 video streams for video surveillance applications

Author: Eybye Peder Tanderup
Forchhammer Søren
Støttrup-Andersen Jesper
Szczerba Krzysztof
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

Crossref

Online Research Database In Technology

VITALAS at TRECVID-2008

Author: Aly Robin
Delopoulos Anastasios
Dimitriou Nikos
Diou Christos
Panagiotopoulos Panagiotis
Papachristou Christos
Rode Henning
Stephanopoulos George
Tsikrika Theodora
Vries Arjen P. de
Publication venue: National Institute of Standards and Technology, NIST
Publication date: 01/11/2008
Field of study

In this paper, we present our experiments in TRECVID 2008 about High-Level feature extraction task. This is the first year for our participation in TRECVID, our system adopts some popular approaches that other workgroups proposed before. We proposed 2 advanced low-level features NEW Gabor texture descriptor and the Compact-SIFT Codeword histogram. Our system applied well-known LIBSVM to train the SVM classifier for the basic classifier. In fusion step, some methods were employed such as the Voting, SVM-base, HCRF and Bootstrap Average AdaBoost(BAAB)

CWI's Institutional Repository

University of Twente Research Information

Generative Compression

Author: Budden David
Santurkar Shibani
Shavit Nir
Publication venue
Publication date: 04/06/2017
Field of study

Traditional image and video compression algorithms rely on hand-crafted encoder/decoder pairs (codecs) that lack adaptability and are agnostic to the data being compressed. Here we describe the concept of generative compression, the compression of data using generative models, and suggest that it is a direction worth pursuing to produce more accurate and visually pleasing reconstructions at much deeper compression levels for both image and video data. We also demonstrate that generative compression is orders-of-magnitude more resilient to bit error rates (e.g. from noisy wireless channels) than traditional variable-length coding schemes

arXiv.org e-Print Archive

Crossref

Source Coding and Channel Coding for Mobile Multimedia Communication

Author: Hafiz Malik
Hammad Dilpazir
Hasan Mahmood
Tariq Shah
Publication venue: 'IntechOpen'
Publication date: 20/01/2012
Field of study

IntechOpen