
    Unsupervised Segmentation of Action Segments in Egocentric Videos using Gaze

    Unsupervised segmentation of action segments in egocentric videos is a desirable feature in tasks such as activity recognition and content-based video retrieval. Reducing the search space to a finite set of action segments facilitates faster and less noisy matching. However, there exists a substantial gap in machine understanding of natural temporal cuts during a continuous human activity. This work reports on a novel gaze-based approach for segmenting action segments in videos captured using an egocentric camera. Gaze is used to locate the region-of-interest inside a frame. By tracking two simple motion-based parameters inside successive regions-of-interest, we discover a finite set of temporal cuts. We present several results using combinations of the two parameters on the BRISGAZE-ACTIONS dataset, which contains egocentric videos depicting several daily-living activities. The quality of the temporal cuts is further improved by implementing two entropy measures.
    Comment: To appear in 2017 IEEE International Conference on Signal and Image Processing Applications

    Graph-based clustering for identifying region of interest in eye tracker data analysis

    Localization of a viewer's region of interest (ROI) on eye gaze signal trajectories acquired by eye trackers is a widely used approach in scene analysis, image compression, and quality of experience assessment. In this paper, we propose a novel clustering approach for ROI estimation from potentially noisy raw eye gaze data, based on signal processing on graphs. The clustering approach adapts graph signal processing (GSP)-based classification by first cleverly selecting a starting data sample, and then classifying the remaining samples. Furthermore, the Graph Fourier Transform is used to adjust GSP parameters on the fly to maximise accuracy. Experimental results show that the proposed scheme achieves competitive clustering accuracy compared to Density-based spatial clustering of applications with noise (DBSCAN), Distance-Threshold Identification (I-DT), and Mean-Shift on the publicly available Shape Dataset, and indicate its potential for estimating ROI accurately on true eye tracker data.
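    For readers unfamiliar with the Graph Fourier Transform mentioned in the abstract above, the following is a minimal sketch of the general technique, not the authors' method. It assumes gaze samples are 2-D points connected by Gaussian-weighted edges; the function names, the `sigma` parameter, and the sample points are all hypothetical choices made for illustration.

```python
import numpy as np

def gaze_graph_laplacian(points, sigma=1.0):
    """Combinatorial Laplacian L = D - W of a fully connected graph
    over 2-D points, with Gaussian edge weights (an assumed choice)."""
    diff = points[:, None, :] - points[None, :, :]
    dist2 = np.sum(diff ** 2, axis=-1)
    W = np.exp(-dist2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)          # no self-loops
    return np.diag(W.sum(axis=1)) - W

def graph_fourier_transform(L, signal):
    """Project a graph signal onto the Laplacian eigenvectors.
    Eigenvalues play the role of graph frequencies."""
    freqs, U = np.linalg.eigh(L)      # L is symmetric, so eigh applies
    return freqs, U.T @ signal

# Hypothetical gaze samples: two loose groups of points.
points = np.array([[0.0, 0.0], [0.2, 0.0], [0.0, 0.2],
                   [2.0, 2.0], [2.2, 2.0]])
L = gaze_graph_laplacian(points, sigma=1.0)

# A constant signal is perfectly smooth on a connected graph, so all
# of its energy should sit at the zero graph frequency.
freqs, coeffs = graph_fourier_transform(L, np.ones(len(points)))
```

    In a sketch like this, signals that vary little between strongly connected samples concentrate their energy at low graph frequencies, which is the property a GSP-based scheme could exploit when tuning its parameters.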