Integration Of Multimodal Features For Video Scene Classification Based On Hmm
Along with the advance in multimedia and internet technology, ahuge amount of data, including digital video and audio, are generated daily. Tools for e#cient indexing and retrieval are indispensable. With multi-modal information present in the data, e#ectiveintegration is necessary and is still a challenging problem. In this paper, we present four di#erent methods for integrating audio and visual information for video classi#cation based on Hidden Markov Model. Our results have shown signi#cant improvementover using single modality. INTRODUCTION Along with the advancementinmultimedia and internet technology, the amount of digital data, including TV programs, conference archives, and movies, grows exponentially. For e#cient access, understanding, and retrieval of digital video, tools that can automatically understand the semantic content in a video are becoming indispensable. In this paper, we consider the classi#cation of a video sequence into one of a few predetermined scene types. ..