Abstract — Nowadays, MTV has become an important favorite pastime to modern people because of its conciseness, convenience to play and the characteristic that can bring both audio and visual experiences to audiences. An affective MTV analysis framework, which realizes MTV affective state extraction, is representation and clustering. Firstly, affective features are extracted from both audio and visual signals. Affective state of each MTV is modeled with 2D dimensional affective model and visualized in the Arousal-Valence space. Finally the MTVs having similar affective states are clustered into same categories. The validity of proposed framework is proved by subjective user study and related work proves that our features improve the performance by a significant margin. Index Terms — Categorical affective content analysis, affective visualization, dimensional affective model, Hidden Markov Models(HMMs). I
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.