
    Fast unsupervised multiresolution color image segmentation using adaptive gradient thresholding and progressive region growing

    In this thesis, we propose a fast unsupervised multiresolution color image segmentation algorithm that takes advantage of gradient information in an adaptive and progressive framework. This gradient-based segmentation method is initialized by a vector gradient calculation on the full-resolution input image in the CIE L*a*b* color space. The resultant edge map is used to adaptively generate thresholds for classifying regions of varying gradient densities at different levels of the input image pyramid, obtained through a dyadic wavelet decomposition scheme. At each level, the classification obtained by a progressively thresholded growth procedure is combined with an entropy-based texture model in a statistical merging procedure to obtain an interim segmentation. Using an association of a gradient-quantized confidence map and non-linear spatial filtering techniques, regions of high confidence are passed from one level to another until the full-resolution segmentation is achieved. Evaluation of our results on several hundred images using the Normalized Probabilistic Rand (NPR) Index shows that our algorithm outperforms state-of-the-art segmentation techniques and is much more computationally efficient than its single-scale counterpart, with comparable segmentation quality.
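The initialization step described above, a vector gradient on the L*a*b* image followed by adaptive thresholding of the edge map, can be sketched in a few lines. This is a minimal illustration, not the thesis' implementation: the gradient operator is approximated with per-channel finite differences, and the percentile-based threshold is an assumed stand-in for the adaptive scheme.

```python
import numpy as np

def vector_gradient(lab):
    """Approximate vector gradient magnitude of an (H, W, 3) L*a*b* image.

    Sums squared per-channel finite-difference gradients; the thesis'
    exact vector gradient operator may differ from this sketch.
    """
    gy, gx = np.gradient(lab, axis=(0, 1))
    return np.sqrt((gx ** 2 + gy ** 2).sum(axis=-1))

def adaptive_threshold(edge_map, percentile=75):
    """Pick a gradient threshold from the edge-map distribution
    (percentile choice is an assumption for illustration)."""
    return np.percentile(edge_map, percentile)

# Toy image: left half dark, right half brighter -> edge down the middle
img = np.zeros((8, 8, 3))
img[:, 4:, 0] = 50.0          # step in the L* channel only
edges = vector_gradient(img)
t = adaptive_threshold(edges, 90)
low_gradient = edges < t      # seed regions for progressive growing
```

Pixels below the threshold would seed the progressive region-growing step, while higher-gradient pixels are deferred to later, more permissive thresholds.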

    A Novel Multi-Focus Image Fusion Method Based on Stochastic Coordinate Coding and Local Density Peaks Clustering

    The multi-focus image fusion method is used in image processing to generate all-in-focus images with a large depth of field (DOF) from the original multi-focus images. Different approaches have been used in the spatial and transform domains to fuse multi-focus images. As one of the most popular image processing methods, dictionary-learning-based sparse representation achieves great performance in multi-focus image fusion. Most existing dictionary-learning-based multi-focus image fusion methods directly use the whole source images for dictionary learning. However, using the whole source images incurs a high error rate and a high computational cost in the dictionary learning process. This paper proposes a novel stochastic coordinate coding-based image fusion framework integrated with local density peaks clustering. The proposed multi-focus image fusion method consists of three steps. First, the source images are split into small image patches, and the split image patches are classified into a few groups by local density peaks clustering. Next, the grouped image patches are used for sub-dictionary learning by stochastic coordinate coding, and the trained sub-dictionaries are combined into a single dictionary for sparse representation. Finally, the simultaneous orthogonal matching pursuit (SOMP) algorithm is used to carry out the sparse representation. After these three steps, the obtained sparse coefficients are fused following the max-L1-norm rule, and the fused coefficients are inversely transformed into an image using the learned dictionary. The results and analyses of comparison experiments demonstrate that the fused images of the proposed method have higher quality than those of existing state-of-the-art methods.
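The final max-L1-norm fusion rule can be illustrated compactly. In this hedged sketch, each source image's sparse coefficients are arranged one patch per column, and the column L1 norm is used as the activity measure; function and variable names are illustrative, not from the paper.

```python
import numpy as np

def fuse_max_l1(coeffs_a, coeffs_b):
    """Max-L1-norm fusion rule: for each patch (column), keep the sparse
    coefficient vector with the larger L1 norm, a proxy for how
    in-focus that patch is. Name is illustrative."""
    l1_a = np.abs(coeffs_a).sum(axis=0)
    l1_b = np.abs(coeffs_b).sum(axis=0)
    choose_a = l1_a >= l1_b            # per-patch selection mask
    return np.where(choose_a, coeffs_a, coeffs_b)

# Two toy coefficient matrices: 4 dictionary atoms x 3 patches
A = np.array([[2., 0., 0.],
              [0., 1., 0.],
              [0., 0., 0.],
              [0., 0., 1.]])
B = np.array([[0., 3., 0.],
              [1., 0., 0.],
              [0., 0., 2.],
              [0., 0., 0.]])
fused = fuse_max_l1(A, B)
```

Multiplying the fused coefficients by the learned dictionary (and reassembling patches) would then yield the all-in-focus image.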

    Two and three dimensional segmentation of multimodal imagery

    The role of segmentation in the realms of image understanding/analysis, computer vision, pattern recognition, remote sensing and medical imaging has been significantly augmented in recent years due to accelerated scientific advances made in the acquisition of image data. This low-level analysis protocol is critical to numerous applications, with the primary goal of expediting and improving the effectiveness of subsequent high-level operations by providing a condensed and pertinent representation of image information. In this research, we propose a novel unsupervised segmentation framework for facilitating meaningful segregation of 2-D/3-D image data across multiple modalities (color, remote-sensing and biomedical imaging) into non-overlapping partitions using several spatial-spectral attributes. Initially, our framework exploits the information obtained from detecting edges inherent in the data. To this effect, using a vector gradient detection technique, pixels without edges are grouped and individually labeled to partition some initial portion of the input image content. Pixels that contain higher gradient densities are included through the dynamic generation of segments as the algorithm progresses to generate an initial region map. Subsequently, texture modeling is performed, and the obtained gradient, texture and intensity information, along with the aforementioned initial partition map, are used in a multivariate refinement procedure to fuse groups with similar characteristics, yielding the final output segmentation. Experimental results, obtained in comparison to published state-of-the-art segmentation techniques for color as well as multi/hyperspectral imagery, demonstrate the advantages of the proposed method. Furthermore, for the purpose of achieving improved computational efficiency, we propose an extension of the aforementioned methodology in a multi-resolution framework, demonstrated on color images. Finally, this research also encompasses a 3-D extension of the aforementioned algorithm, demonstrated on medical (Magnetic Resonance Imaging / Computed Tomography) volumes.
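As a toy illustration of the final merging step, the sketch below fuses regions whose mean intensities are similar. The actual multivariate procedure also incorporates gradient and texture attributes, so treat this as a simplified stand-in; the function name and tolerance are assumptions.

```python
import numpy as np

def merge_similar_regions(label_map, image, tol=10.0):
    """Greedy merge sketch: fuse labeled regions whose mean intensities
    differ by less than `tol`. A stand-in for the multivariate merging
    step, which additionally uses gradient and texture information."""
    labels = np.unique(label_map)
    means = {l: image[label_map == l].mean() for l in labels}
    mapping = {l: l for l in labels}          # region -> merged label
    for i, li in enumerate(labels):
        for lj in labels[i + 1:]:
            if abs(means[li] - means[lj]) < tol:
                mapping[lj] = mapping[li]     # fold lj into li's group
    return np.vectorize(mapping.get)(label_map)

# Three initial regions; regions 0 and 2 have similar means and merge
seg = np.array([[0, 0, 1],
                [2, 2, 1]])
img = np.array([[10., 12., 80.],
                [11.,  9., 82.]])
merged = merge_similar_regions(seg, img)
```

In a 3-D extension the same logic applies voxel-wise, with `label_map` and `image` as volumes.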

    Sea-Surface Object Detection Based on Electro-Optical Sensors: A Review

    Sea-surface object detection is critical for the navigation safety of autonomous ships. Electro-optical (EO) sensors, such as video cameras, complement on-board radar in detecting small sea-surface obstacles. Traditionally, researchers have used horizon detection, background subtraction, and foreground segmentation techniques to detect sea-surface objects. Recently, deep-learning-based object detection technologies have gradually been applied to sea-surface object detection. This article presents a comprehensive overview of sea-surface object-detection approaches, comparing the advantages and drawbacks of each technique and covering four essential aspects: EO sensors and image types, traditional object-detection methods, deep learning methods, and maritime dataset collection. In particular, sea-surface object detection based on deep learning methods is thoroughly analyzed, with highly influential public datasets introduced as benchmarks to verify the effectiveness of these approaches.
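Of the traditional techniques mentioned, background subtraction is the easiest to sketch. The toy example below builds a per-pixel median background from a short frame stack and thresholds the deviation of the newest frame; the median model and threshold value are assumptions for illustration, and real maritime pipelines must also handle waves, wakes and sun glint.

```python
import numpy as np

def detect_foreground(frames, thresh=20.0):
    """Median-background subtraction sketch for a static EO camera.

    frames: (N, H, W) grayscale stack. The temporal median is a crude
    model of the (roughly stationary) sea surface; pixels in the latest
    frame that deviate strongly are flagged as candidate objects.
    """
    background = np.median(frames, axis=0)
    return np.abs(frames[-1] - background) > thresh

# Toy sequence: calm sea (value 100) with an object in the last frame
frames = np.full((5, 4, 4), 100.0)
frames[-1, 1, 2] = 180.0      # bright object pixel appears
mask = detect_foreground(frames)
```

The resulting binary mask would typically be cleaned with morphology and restricted to the region below a detected horizon line.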

    Real-time Video Fusion for Surveillance Applications

    Surveillance is becoming an increasingly important element of our daily lives: from urban environments, for crime prevention, detection and resolution, vandalism prevention and traffic flow control, to more remote settings such as military applications, for instance identifying and locating enemy forces. As technology develops, the surveillance field is also becoming more sophisticated: systems are improving in quality, getting safer, and costing less, and there is greater scalability and better integration between different types of surveillance systems. One of the main types of surveillance is video surveillance. As the name states, this technique consists of the constant capture of images in order to obtain a sequence of the events happening in a given location. However, one of its main disadvantages is the dependency on the visibility conditions at the location. In the scope of this dissertation, a real-time system was developed that is capable of capturing images containing useful information even in low-visibility conditions such as nighttime, fog or smoke. For this purpose, a technique known as image fusion was used: in this case, the fusion of an image in the infrared spectrum with an image in the visible spectrum. Capturing a complementary image of the environment in the infrared spectrum provides extra information, such as temperature. This extra information is then fused with the visible-spectrum image, generating a single image that contains the information from both the visible and infrared images.
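A pixel-level visible/infrared blend, the simplest possible instance of the image fusion described above, can be sketched as below. The fixed weight `alpha` is an assumption; the dissertation's real-time system may well use a more sophisticated rule (e.g. multi-scale or region-based fusion).

```python
import numpy as np

def fuse_ir_visible(visible, ir, alpha=0.5):
    """Pixel-level weighted fusion of co-registered grayscale frames.

    `alpha` weights the visible band; (1 - alpha) weights the infrared
    band. A minimal sketch, not the dissertation's actual fusion rule.
    """
    return alpha * visible + (1.0 - alpha) * ir

vis = np.array([[  0.0, 200.0],
                [100.0,  50.0]])   # visible: top-left pixel dark (night)
ir  = np.array([[180.0, 200.0],
                [100.0, 150.0]])   # IR: warm target visible everywhere
fused = fuse_ir_visible(vis, ir, alpha=0.5)
```

Note how the warm target invisible in the dark visible pixel still contributes to the fused output, which is exactly the low-visibility benefit the dissertation targets.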

    Multi-scale data fusion for surface metrology

    The major trends in manufacturing are miniaturization, convergence of the traditional research fields and creation of interdisciplinary research areas. These trends have resulted in the development of multi-scale models and multi-scale surfaces to optimize performance. Multi-scale surfaces that exhibit specific properties at different scales for a specific purpose require multi-scale measurement and characterization. Researchers and instrument developers have developed instruments that are able to perform measurements at multiple scales but lack the much-required multi-scale characterization capability. The primary focus of this research was to explore possible multi-scale data fusion strategies and options for the surface metrology domain and to develop enabling software tools in order to obtain effective multi-scale surface characterization, maximizing fidelity while minimizing measurement cost and time. This research effort explored the fusion strategies for the surface metrology domain and narrowed the focus to Discrete Wavelet Frame (DWF) based multi-scale decomposition. An optimized multi-scale data fusion strategy, the ‘FWR method’, was developed and successfully demonstrated on both high-aspect-ratio surfaces and non-planar surfaces. It was demonstrated that datum features can be effectively characterized at a lower resolution using one system (a vision CMM), while the actual features of interest can be characterized at a higher resolution using another system (a coherence scanning interferometer) with higher capability, minimizing measurement time.
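The wavelet side of such a strategy can be illustrated with a one-level Haar split of a 1-D surface profile, separating coarse form (which a lower-resolution instrument could supply) from fine texture (which needs the higher-resolution one). This is a stand-in for the Discrete Wavelet Frame decomposition, not the published FWR method; the Haar basis and even-length profile are simplifying assumptions.

```python
import numpy as np

def haar_decompose(profile):
    """One-level Haar decomposition of a 1-D surface profile (even length).
    Stand-in for the DWF decomposition referenced in the abstract."""
    pairs = profile.reshape(-1, 2)
    approx = pairs.mean(axis=1)                  # coarse shape (form/datum)
    detail = (pairs[:, 0] - pairs[:, 1]) / 2.0   # fine-scale texture
    return approx, detail

def haar_reconstruct(approx, detail):
    """Invert haar_decompose exactly."""
    out = np.empty(approx.size * 2)
    out[0::2] = approx + detail
    out[1::2] = approx - detail
    return out

profile = np.array([1.0, 3.0, 2.0, 2.0, 5.0, 1.0])
a, d = haar_decompose(profile)
restored = haar_reconstruct(a, d)
```

In a fusion setting, the approximation band could come from the vision CMM data and the detail band from the interferometer data before reconstruction; perfect reconstruction of a single source, as checked here, is the prerequisite for that substitution.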

    Detection and classification of non-stationary signals using sparse representations in adaptive dictionaries

    Automatic classification of non-stationary radio frequency (RF) signals is of particular interest in persistent surveillance and remote sensing applications. Such signals are often acquired in noisy, cluttered environments, and may be characterized by complex or unknown analytical models, making feature extraction and classification difficult. This thesis proposes an adaptive classification approach for poorly characterized targets and backgrounds based on sparse representations in non-analytical dictionaries learned from data. Conventional analytical orthogonal dictionaries, e.g., Short Time Fourier and Wavelet Transforms, can be suboptimal for classification of non-stationary signals, as they provide a rigid tiling of the time-frequency space and are not specifically designed for a particular signal class. They generally do not lead to sparse decompositions (i.e., with very few non-zero coefficients), and their use in classification requires separate feature selection algorithms. Pursuit-type decompositions in analytical overcomplete (non-orthogonal) dictionaries yield sparse representations by design, and work well for signals that are similar to the dictionary elements. The pursuit search, however, has a high computational cost, and the method can perform poorly in the presence of realistic noise and clutter. One such overcomplete analytical dictionary method is also analyzed in this thesis for comparative purposes. The main thrust of the thesis is learning discriminative RF dictionaries directly from data, without relying on analytical constraints or additional knowledge about the signal characteristics. A pursuit search is used over the learned dictionaries to generate sparse classification features in order to identify time windows that contain a target pulse. Two state-of-the-art dictionary learning methods, the K-SVD algorithm and Hebbian learning, are compared in terms of their classification performance as a function of dictionary training parameters. Additionally, a novel hybrid dictionary algorithm is introduced, demonstrating better performance and higher robustness to noise. The issue of dictionary dimensionality is explored, and this thesis demonstrates that undercomplete learned dictionaries are suitable for non-stationary RF classification. Results on simulated data sets with varying background clutter and noise levels are presented. Lastly, unsupervised classification with undercomplete learned dictionaries is also demonstrated in satellite imagery analysis.
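The pursuit search referred to above can be sketched as plain matching pursuit, a simpler relative of the orthogonal variants typically used over learned dictionaries. The toy identity dictionary and atom budget below are assumptions for illustration; dictionary columns are assumed unit-norm.

```python
import numpy as np

def matching_pursuit(signal, dictionary, n_atoms=2):
    """Greedy matching pursuit: at each step, pick the dictionary column
    most correlated with the residual and subtract its contribution.
    The resulting sparse coefficient vector can serve as a feature."""
    residual = signal.astype(float).copy()
    coeffs = np.zeros(dictionary.shape[1])
    for _ in range(n_atoms):
        corr = dictionary.T @ residual        # correlations with atoms
        k = np.argmax(np.abs(corr))           # best-matching atom
        coeffs[k] += corr[k]
        residual -= corr[k] * dictionary[:, k]
    return coeffs

# Trivial orthonormal dictionary: 3 atoms in R^3
D = np.eye(3)
x = np.array([0.0, 5.0, 2.0])
c = matching_pursuit(x, D, n_atoms=2)
```

With a learned (generally non-orthogonal) dictionary the same loop applies, but atoms can be reselected and convergence is only approximate, which is part of the computational cost discussed in the abstract.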

    A brief survey of visual saliency detection


    Event Detection from Social Media Stream: Methods, Datasets and Opportunities

    Social media streams contain a large and diverse amount of information, ranging from daily-life stories to the latest global and local events and news. Twitter, especially, allows a fast spread of events happening in real time, and enables individuals and organizations to stay informed of the events happening now. Event detection from social media data poses different challenges from traditional text and is a research area that has attracted much attention in recent years. In this paper, we survey a wide range of event detection methods for the Twitter data stream, helping readers understand the recent developments in this area. We present the datasets available to the public. Furthermore, a few research opportunities are discussed.
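As a toy illustration of frequency-based event detection, one of the simplest of the surveyed method families, the sketch below flags terms that burst in the current window relative to a historical baseline. The ratio, smoothing, and count thresholds are assumptions; real systems add clustering, spatio-temporal signals, and noise filtering.

```python
from collections import Counter

def detect_bursts(window_tokens, history_tokens, ratio=3.0, min_count=3):
    """Toy burst detector: flag terms whose relative frequency in the
    current window exceeds `ratio` times their historical rate.
    All thresholds here are illustrative assumptions."""
    cur = Counter(window_tokens)
    hist = Counter(history_tokens)
    n_cur = max(len(window_tokens), 1)
    n_hist = max(len(history_tokens), 1)
    bursts = []
    for term, c in cur.items():
        rate_cur = c / n_cur
        rate_hist = (hist[term] + 1) / n_hist   # add-one smoothing
        if c >= min_count and rate_cur > ratio * rate_hist:
            bursts.append(term)
    return bursts

# Routine chatter in history; a sudden "earthquake" burst in the window
history = ["weather", "traffic"] * 50
window = ["earthquake"] * 5 + ["weather"] * 3
events = detect_bursts(window, history)
```

A streaming system would slide the window forward and periodically refresh the historical counts.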