803 research outputs found

    An Overview of Video Shot Clustering and Summarization Techniques for Mobile Applications

    Get PDF
    The problem of content characterization of video programmes is of great interest because video appeals to large audiences and its efficient distribution over various networks should contribute to widespread usage of multimedia services. In this paper we analyze several techniques proposed in literature for content characterization of video programmes, including movies and sports, that could be helpful for mobile media consumption. In particular we focus our analysis on shot clustering methods and effective video summarization techniques since, in the current video analysis scenario, they facilitate the access to the content and help in quick understanding of the associated semantics. First we consider the shot clustering techniques based on low-level features, using visual, audio and motion information, even combined in a multi-modal fashion. Then we concentrate on summarization techniques, such as static storyboards, dynamic video skimming and the extraction of sport highlights. Discussed summarization methods can be employed in the development of tools that would be greatly useful to most mobile users: in fact these algorithms automatically shorten the original video while preserving most events by highlighting only the important content. The effectiveness of each approach has been analyzed, showing that it mainly depends on the kind of video programme it relates to, and the type of summary or highlights we are focusing on

    Advanced content-based semantic scene analysis and information retrieval: the SCHEMA project

    Get PDF
    The aim of the SCHEMA Network of Excellence is to bring together a critical mass of universities, research centers, industrial partners and end users, in order to design a reference system for content-based semantic scene analysis, interpretation and understanding. Relevant research areas include: content-based multimedia analysis and automatic annotation of semantic multimedia content, combined textual and multimedia information retrieval, semantic -web, MPEG-7 and MPEG-21 standards, user interfaces and human factors. In this paper, recent advances in content-based analysis, indexing and retrieval of digital media within the SCHEMA Network are presented. These advances will be integrated in the SCHEMA module-based, expandable reference system

    An Overview of Multimodal Techniques for the Characterization of Sport Programmes

    Get PDF
    The problem of content characterization of sports videos is of great interest because sports video appeals to large audiences and its efficient distribution over various networks should contribute to widespread usage of multimedia services. In this paper we analyze several techniques proposed in literature for content characterization of sports videos. We focus this analysis on the typology of the signal (audio, video, text captions, ...) from which the low-level features are extracted. First we consider the techniques based on visual information, then the methods based on audio information, and finally the algorithms based on audio-visual cues, used in a multi-modal fashion. This analysis shows that each type of signal carries some peculiar information, and the multi-modal approach can fully exploit the multimedia information associated to the sports video. Moreover, we observe that the characterization is performed either considering what happens in a specific time segment, observing therefore the features in a "static" way, or trying to capture their "dynamic" evolution in time. The effectiveness of each approach depends mainly on the kind of sports it relates to, and the type of highlights we are focusing on

    How are Forbush decreases related to interplanetary magnetic field enhancements ?

    Full text link
    Aims. Forbush decrease (FD) is a transient decrease followed by a gradual recovery in the observed galactic cosmic ray intensity. We seek to understand the relationship between the FDs and near-Earth interplanetary magnetic field (IMF) enhancements associated with solar coronal mass ejections (CMEs). Methods. We use muon data at cutoff rigidities ranging from 14 to 24 GV from the GRAPES-3 tracking muon telescope to identify FD events. We select those FD events that have a reasonably clean profile, and magnitude > 0.25%. We use IMF data from ACE/WIND spacecrafts. We look for correlations between the FD profile and that of the one hour averaged IMF. We ask if the diffusion of high energy protons into the large scale magnetic field is the cause of the lag observed between the FD and the IMF. Results. The enhancement of the IMF associated with FDs occurs mainly in the shock-sheath region, and the turbulence level in the magnetic field is also enhanced in this region. The observed FD profiles look remarkably similar to the IMF enhancement profiles. The FDs typically lag the IMF enhancement by a few hours. The lag corresponds to the time taken by high energy protons to diffuse into the magnetic field enhancement via cross-field diffusion. Conclusions. Our findings show that high rigidity FDs associated with CMEs are caused primarily by the cumulative diffusion of protons across the magnetic field enhancement in the turbulent sheath region between the shock and the CME.Comment: accepted in A&

    Semantic Indexing of Sport Program Sequences by Audio-Visual Analysis

    Get PDF
    Semantic indexing of sports videos is a subject of great interest to researchers working on multimedia content characterization. Sports programs appeal to large audiences and their efficient distribution over various networks should contribute to widespread usage of multimedia services. In this paper, we propose a semantic indexing algorithm for soccer programs which uses both audio and visual information for content characterization. The video signal is processed first by extracting low-level visual descriptors from the MPEG compressed bit-stream. The temporal evolution of these descriptors during a semantic event is supposed to be governed by a controlled Markov chain. This allows to determine a list of those video segments where a semantic event of interest is likely to be found, based on the maximum likelihood criterion. The audio information is then used to refine the results of the video classification procedure by ranking the candidate video segments in the list so that the segments associated to the event of interest appear in the very first positions of the ordered list. The proposed method is applied to goal detection. Experimental results show the effectiveness of the proposed cross-modal approach
    • 

    corecore