6 research outputs found
Accurate foreground segmentation without pre-learning
Foreground segmentation has been widely used in many computer vision applications. However, most of the existing methods rely on a pre-learned motion or background model, which will increase the burden of users. In this paper, we present an automatic algorithm without pre-learning for segmenting foreground from background based on the fusion of motion, color and contrast information. Motion information is enhanced by a novel method called support edges diffusion (SED) , which is built upon a key observation that edges of the difference image of two adjacent frames only appear in moving regions in most of the cases. Contrasts in background are attenuated while those in foreground are enhanced using gradient of the previous frame and that of the temporal difference. Experiments on many video sequences demonstrate the effectiveness and accuracy of the proposed algorithm. The segmentation results are comparable to those obtained by other state-of-the-art methods that depend on a pre-learned background or a stereo setup. © 2011 IEEE.published_or_final_versionThe 6th International Conference on Image and Graphics (ICIG 2011), Hefei, Anhui, China, 12-15 August 2011. In Proceedings of the 6th ICIG, 2011, p. 331-33
Visual object localization in image collections
Conference Name:6th International Conference on Image and Graphics, ICIG 2011. Conference Address: Hefei, Anhui, China. Time:August 12, 2011 - August 15, 2011.National Natural Science Foundation of China; Chinese Academy of Science; Microsoft Research Asia; Xian Institute of Optics and Precision Mechanics of CAS; Anhui Crearo Technology Co., LtdThe research of object localization is active in the field of visual object category. In this paper, we focus on object localization in a given special category dataset. We propose to exploit the context aware category discovery for object localization without any labeled examples. Firstly, the image is segmented based on a multiple segmentation algorithm. Secondly, these generated regions are clustered by spectral clustering method to find the category pattern based on the context of the dataset and the saliency. Thirdly, the object is localized based on the weakly supervised learning algorithm. To justify the effectiveness of the proposed method, the detection precision is employed to evaluate the performance of our approach. The experimental results demonstrate that our approach is promising in object localization with unsupervised learning method. ? 2011 IEEE
Recommended from our members
Natural scene classification, annotation and retrieval. Developing different approaches for semantic scene modelling based on Bag of Visual Words.
With the availability of inexpensive hardware and software, digital imaging has become an important medium of communication in our daily lives. A huge amount of digital images are being collected and become available through the internet and stored in various fields such as personal image collections, medical imaging, digital arts etc. Therefore, it is important to make sure that images are stored, searched and accessed in an efficient manner. The use of bag of visual words (BOW) model for modelling images based on local invariant features computed at interest point locations has become a standard choice for many computer vision tasks. Based on this promising model, this thesis investigates three main problems: natural scene classification, annotation and retrieval. Given an image, the task is to design a system that can determine to which class that image belongs to (classification), what semantic concepts it contain (annotation) and what images are most similar to (retrieval).
This thesis contributes to scene classification by proposing a weighting approach, named keypoints density-based weighting method (KDW), to control the fusion of colour information and bag of visual words on spatial pyramid layout in a unified framework. Different configurations of BOW, integrated visual vocabularies and multiple image descriptors are investigated and analyzed. The proposed approaches are extensively evaluated over three well-known scene classification datasets with 6, 8 and 15 scene categories using 10-fold cross validation. The second contribution in this thesis, the scene annotation task, is to explore whether the integrated visual vocabularies generated for scene classification can be used to model the local semantic information of natural scenes. In this direction, image annotation is considered as a classification problem where images are partitioned into 10x10 fixed grid and each block, represented by BOW and different image descriptors, is classified into one of predefined semantic classes. An image is then represented by counting the percentage of every semantic concept detected in the image. Experimental results on 6 scene categories demonstrate the effectiveness of the proposed approach. Finally, this thesis further explores, with an extensive experimental work, the use of different configurations of the BOW for natural scene retrieval.Applied Science University in Jorda
Measurement matrix of Compressive Sensing based on Gram-Schmidt orthogonalization
Conference Name:6th International Conference on Image and Graphics, ICIG 2011. Conference Address: Hefei, Anhui, China. Time:August 12, 2011 - August 15, 2011.National Natural Science Foundation of China; Chinese Academy of Science; Microsoft Research Asia; Xian Institute of Optics and Precision Mechanics of CAS; Anhui Crearo Technology Co., LtdMeasurement matrix plays an important part in sampling data and reconstructing signal in Compressive Sensing (CS). In this paper, the common measurement matrices and the relationship between measurement number of measurement matrix and signal sparsity are researched. The performance among the common measurement matrices is compared. In order to obtain a better reconstruction result, an improved method based on Gram-Schmidt orthogonalization of row vectors for matrix is proposed. The experiments show that the improved measurement matrix is better than the original measurement matrix when used to reconstruct signal. ? 2011 IEEE
Research on license plate detection based on salient feature under complex background
Conference Name:6th International Conference on Image and Graphics, ICIG 2011. Conference Address: Hefei, Anhui, China. Time:August 12, 2011 - August 15, 2011.National Natural Science Foundation of China; Chinese Academy of Science; Microsoft Research Asia; Xian Institute of Optics and Precision Mechanics of CAS; Anhui Crearo Technology Co., LtdFor plate detection, we found that when the plate is disturbed by complex upright borderlines, location can be inaccurate or even missed. On the basis of this, a new vehicle license plate location algorithm based on salient feature is introduced. It makes use of the texture feature, geometric characteristics and color information. By raw location and precise location, it can quickly locate the license plate position accurately and distinguishes the color type. Our experiment results show that this algorithm has a fast, efficient performance of locating vehicle license plate under complex background. ? 2011 IEEE
Advances in knowledge discovery and data mining Part II
19th Pacific-Asia Conference, PAKDD 2015, Ho Chi Minh City, Vietnam, May 19-22, 2015, Proceedings, Part II</p