During the past decade, the feature extraction and the knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as CCTV data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, the video event identification and detection are the two prominent ones. Comparing with the most popular frame-to-frame processing mode of most of today’s approaches and systems, this project reorganizes the video data as a 3D volume structure which provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then moves on to highlight the volume array structure in a so called “pre-suspicion” mechanism for later process. The focus of this report is the development of an effective and efficient voxel based segmentation technique suitable to the volumetric nature of video events and is ready for deployment in 3D clustering operations. The paper concludes at the performance evaluation of the devised technique with further discussions on the future work for accelerating the pre processing of the original video data
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.