
167 research outputs found

Automatic near real-time flood detection in high resolution X-band synthetic aperture radar satellite data using context-based classification on irregular graphs

Author: Martinis Sandro
Publication venue: Ludwig-Maximilians-Universität München
Publication date: 01/01/2010

This thesis is an outcome of the project “Flood and damage assessment using very high resolution SAR data” (SAR-HQ), which is embedded in the interdisciplinary oriented RIMAX (Risk Management of Extreme Flood Events) programme, funded by the Federal Ministry of Education and Research (BMBF). It comprises the results of three scientific papers on automatic near real-time flood detection in high resolution X-band synthetic aperture radar (SAR) satellite data for operational rapid mapping activities in terms of disaster and crisis-management support. Flood situations seem to become more frequent and destructive in many regions of the world. A rising awareness of the availability of satellite based cartographic information has led to an increase in requests to corresponding mapping services to support civil-protection and relief organizations with disaster-related mapping and analysis activities. Due to the rising number of satellite systems with high revisit frequencies, a strengthened pool of SAR data is available during operational flood mapping activities. This offers the possibility to observe the whole extent of even large-scale flood events and their spatio-temporal evolution, but also calls for computationally efficient and automatic flood detection methods, which should drastically reduce the user input required by an active image interpreter. This thesis provides solutions for the near real-time derivation of detailed flood parameters such as flood extent, flood-related backscatter changes as well as flood classification probabilities from the new generation of high resolution X-band SAR satellite imagery in a completely unsupervised way. These data are, in comparison to images from conventional medium-resolution SAR sensors, characterized by an increased intra-class and decreased inter-class variability due to the reduced mixed pixel phenomenon. 
This problem is addressed by utilizing multi-contextual models on irregular hierarchical graphs, which consider that semantic image information is represented less in single pixels than in homogeneous image objects and their mutual relations. A hybrid Markov random field (MRF) model is developed, which integrates scale-dependent as well as spatio-temporal contextual information into the classification process by combining hierarchical causal Markov image modeling on automatically generated irregular hierarchical graphs with noncausal Markov modeling related to planar MRFs. This model is initialized in an unsupervised manner by an automatic tile-based thresholding approach, which solves the flood detection problem in large-size SAR data with small a priori class probabilities by statistical parameterization of local bi-modal class-conditional density functions in a time-efficient manner. Experiments performed on TerraSAR-X StripMap data of Southwest England and ScanSAR data of north-eastern Namibia during large-scale flooding show the effectiveness of the proposed methods in terms of classification accuracy, computational performance, and transferability. It is further demonstrated that hierarchical causal Markov models such as hierarchical maximum a posteriori (HMAP) and hierarchical marginal posterior mode (HMPM) estimation can be effectively used for modeling the inter-spatial context of X-band SAR data for flood and change detection purposes. Although the HMPM estimator is computationally more demanding than the HMAP estimator, it is found to be more suitable in terms of classification accuracy. Further, it offers the possibility to compute marginal posterior entropy-based confidence maps, which are used for the generation of flood possibility maps that express the uncertainty in the labeling of each image element.
The supplementary integration of intra-spatial and, optionally, temporal contextual information into the Markov model results in a reduction of classification errors. It is observed that the application of the hybrid multi-contextual Markov model on irregular graphs is able to enhance classification results in comparison to modeling on regular structures of quadtrees, which is the hierarchical representation of images usually used in MRF-based image analysis. X-band SAR systems are generally not suited for detecting flooding under dense vegetation canopies such as forests due to the low capability of the X-band signal to penetrate into media. Within this thesis a method is proposed for the automatic derivation of flood areas beneath shrubs and grasses from TerraSAR-X data. Furthermore, an approach is developed that combines high resolution topographic information with multi-scale image segmentation to enhance the mapping accuracy in areas consisting of flooded vegetation and anthropogenic objects as well as to remove non-water look-alike areas.
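
The abstract mentions marginal posterior entropy-based confidence maps without giving a formula. As a minimal sketch, assuming each image element carries a vector of class posteriors, a confidence value can be taken as one minus the Shannon entropy normalized by its maximum. The function name `entropy_confidence` and the nested-list layout are hypothetical and are not taken from the thesis.

```python
import math

def entropy_confidence(posteriors):
    """Map per-element class posteriors to a confidence in [0, 1].

    posteriors: list of rows; each element is a list of class
    probabilities that sum to 1 (assumed layout, not from the thesis).
    """
    confidence = []
    for row in posteriors:
        out_row = []
        for probs in row:
            # Shannon entropy; classes with p == 0 contribute nothing.
            h = -sum(p * math.log(p) for p in probs if p > 0.0)
            # Normalize by the maximum entropy log(K): 1 = certain, 0 = ambiguous.
            out_row.append(1.0 - h / math.log(len(probs)))
        confidence.append(out_row)
    return confidence

# Two classes (flood, non-flood): a certain, an ambiguous and a fairly sure element.
conf_map = entropy_confidence([[[1.0, 0.0], [0.5, 0.5], [0.9, 0.1]]])
```

Elements with low confidence are the ones a map of this kind would flag as uncertain labels.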

Institute of Transport Research:Publications

Digitale Hochschulschriften der LMU

Remote sensing satellite image processing techniques for image classification: a comprehensive survey

Author: Deepa Shenoy P., Sowmya D.R., Venugopal K.R.
Publication venue: 'Foundation of Computer Science'
Publication date: 01/03/2017

This paper is a brief survey of advanced technological aspects of digital image processing that are applied to remote sensing images obtained from various satellite sensors. In remote sensing, image processing techniques can be categorized into four main processing stages: image preprocessing, enhancement, transformation and classification. Image preprocessing is the initial stage, which deals with correcting the radiometric, atmospheric and geometric distortions present in the raw image data. Enhancement techniques are applied to the preprocessed data in order to display the image effectively for visual interpretation, including techniques that make surface features easier to distinguish. Transformation aims to identify particular features of the earth's surface, and classification is the process of grouping pixels to produce an effective thematic map of a particular land use and land cover.

ePrints@Bangalore University

Human-Centric Machine Vision

Publication venue: 'IntechOpen'
Publication date: 20/04/2021

Recently, algorithms for processing visual information have greatly evolved, providing efficient and effective solutions to cope with the variability and complexity of real-world environments. These achievements have led to the development of Machine Vision systems that go beyond typical industrial applications, where the environments are controlled and the tasks are very specific, towards innovative solutions that address the everyday needs of people. Human-Centric Machine Vision can help to solve problems raised by the needs of our society, e.g. security and safety, health care, medical imaging, and human-machine interfaces. In such applications it is necessary to handle changing, unpredictable and complex situations, and to take the presence of humans into account.

Directory of Open Access Books (DOAB)

CHANGE DETECTION BY FUSING ADVANTAGES OF THRESHOLD AND CLUSTERING METHODS

Publication venue: 'Copernicus GmbH'

Crossref

Simple and Complex Human Action Recognition in Constrained and Unconstrained Videos

Author: Mohammadi Nejad Eman
Publication venue: 'University of Windsor Leddy Library'
Publication date: 01/01/2018

Human action recognition plays a crucial role in visual learning applications such as video understanding and surveillance, video retrieval, human-computer interaction, and autonomous driving systems. A variety of methodologies have been proposed for human action recognition by developing low-level features along with bag-of-visual-words models. However, much less research has been performed on the combination of pre-processing, encoding and classification stages. This dissertation focuses on enhancing action recognition performance via ensemble learning, a hybrid classifier, hierarchical feature representation, and key action perception methodologies. Action variation is one of the crucial challenges in video analysis and action recognition. We address this problem by proposing the hybrid classifier (HC) to discriminate actions which contain similar forms of motion features, such as walking, running, and jogging. Aside from that, we show and prove that the fusion of various appearance-based and motion features can boost simple and complex action recognition performance. The next part of the dissertation introduces pooled-feature representation (PFR), which is derived from a double phase encoding framework (DPE). Considering that a given unconstrained video is composed of a sequence of simple frames, the first phase of DPE generates temporal sub-volumes from the video and represents them individually by employing the proposed improved rank pooling (IRP) method. The second phase constructs the pool of features by fusing the represented vectors from the first phase. The pool is compressed and then encoded to provide the video-parts vector (VPV). The DPE framework allows distilling the video representation and hierarchically extracting new information. Compared with recent video encoding approaches, VPV can preserve higher-level information through standard encoding of low-level features in two phases.
Furthermore, the encoded vectors from both phases of DPE are fused along with a compression stage to develop PFR.

Scholarship at UWindsor

Development of artificial neural network-based object detection algorithms for low-cost hardware devices

Author: De Benito Picazo José Jesús
Publication venue: UMA Editorial
Publication date: 21/07/2021

The human brain is the most complex, powerful and versatile learning machine ever known. Consequently, many scientists of various disciplines are fascinated by its structures and information processing methods. Due to the quality and quantity of the information extracted from the sense of sight, image is one of the main information channels used by humans. However, the massive amount of video footage generated nowadays makes it difficult to process those data fast enough manually. Thus, computer vision systems represent a fundamental tool in the extraction of information from digital images, as well as a major challenge for scientists and engineers. This thesis' primary objective is automatic foreground object detection and classification through digital image analysis, using artificial neural network-based techniques, specifically designed and optimised to be deployed in low-cost hardware devices. This objective will be complemented by developing individuals' movement estimation methods by using unsupervised learning and artificial neural network-based models. The cited objectives have been addressed through a research work illustrated in four publications supporting this thesis. The first one was published in the “ICAE” journal in 2018 and consists of a neural network-based movement detection system for Pan-Tilt-Zoom (PTZ) cameras deployed in a Raspberry Pi board. The second one was published in the “WCCI” conference in 2018 and consists of a deep learning-based automatic video surveillance system for PTZ cameras deployed in low-cost hardware.
The third one was published in the “ICAE” journal in 2020 and consists of an anomalous foreground object detection and classification system for panoramic cameras, based on deep learning and supported by low-cost hardware. Finally, the fourth work was published in the “WCCI” conference in 2020 and consisted of an individuals' position estimation algorithm based on a novel neural network model for environments with forbidden regions, named “Forbidden Regions Growing Neural Gas”.

Repositorio Institucional Universidad de Málaga

Pattern Recognition

Publication venue: 'IntechOpen'
Publication date: 20/04/2021

A wealth of advanced pattern recognition algorithms are emerging from the intersection of technologies for effective visual features and the human-brain cognition process. Effective visual features are made possible through rapid developments in appropriate sensor equipment, novel filter designs, and viable information processing architectures. Meanwhile, the understanding of the human-brain cognition process broadens the ways in which the computer can perform pattern recognition tasks. The present book is intended to collect representative research from around the globe focusing on low-level vision, filter design, features and image descriptors, data mining and analysis, and biologically inspired algorithms. The 27 chapters covered in this book disclose recent advances and new ideas in promoting the techniques, technology and applications of pattern recognition.

Directory of Open Access Books (DOAB)


Automated human fall recognition from visual data

Author: Albawendi SG
Publication date: 01/02/2019

Falls are one of the greatest risks for older adults living alone at home. This research presents a novel visual-based fall detection approach to support independent living for older adults in an indoor environment. The aim of the research was to investigate appropriate methods for detecting falls through analysing the motion and shape of the human body. Several techniques for automatically detecting falls have been proposed. The existing technologies can be classified into three main groups of fall detectors, namely: ambient device-based, wearable sensor-based and computer vision-based techniques. Ambient device-based techniques use vibration or pressure sensors to capture the sound and vibration for detecting the presence and position of a person. Although these devices are inexpensive and do not disturb the user, the detection rate is rather low and many false alarms are generated. Wearable devices use different sensors such as accelerometers and gyroscopes to capture the human body movement information and detect falls. However, older adults often forget to wear them. Wearable sensors are also known to be too invasive as they require wearing and carrying various uncomfortable devices. Much work has been undertaken to investigate the use of visual-based sensors for fall detection using single, multiple, and omnidirectional cameras. The research reported in this thesis uses a single camera to detect a moving object using a background subtraction algorithm. The next step is to extract robust features which describe the change in human shape and to discriminate falls from other activities like lying and sitting. These features are based on motion, change in the human shape feature, projection histogram features and temporal change of head position. Features extracted from the human silhouette are finally fed into various machine learning classifiers for fall detection evaluation.
The ability to distinguish a fall action depends mainly on the quality of the classifier inputs; therefore, the features of the extracted human silhouette play a key role in the effectiveness and robustness of detecting human falls. In this research, the timed Motion History Image (tMHI) method is applied for motion segmentation. In addition, the motion information was combined with other features extracted from the fitted ellipse around the human body to discriminate actual falls from other activities. Fall detection methods can be divided into two main categories: threshold-based methods and machine learning-based methods. This research presents threshold-based methods to distinguish between Activities of Daily Living (ADL) and falls. A fall event is detected if the measured feature values are higher than pre-determined threshold values. Results show that falls can be distinguished from ADL with an accuracy of 99.82% on our recorded dataset. In addition, various machine learning methods were compared to evaluate their ability to accurately detect falls. Experimental results show the efficiency and reliability of the proposed fall detection approach, with a high fall detection rate of 99.60% and a low false alarm rate of 2.62% when tested on the UR Fall Detection dataset. Additionally, a set of experiments was conducted on our recorded dataset; the results indicate that the proposed approach achieves a high fall detection rate of 99.94% and a low false alarm rate of 0.02%.
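
The abstract names the timed Motion History Image (tMHI) but does not spell out its update rule. The sketch below follows the commonly used formulation, in which moving pixels are stamped with the current time and pixels whose last motion is older than a fixed duration are cleared. The function name `update_tmhi`, the nested-list image layout and the parameter values are assumptions for illustration, not the thesis's implementation.

```python
def update_tmhi(mhi, motion_mask, timestamp, duration):
    """Return an updated timed motion history image.

    mhi: 2-D list of floats, the time each pixel last moved (0.0 = none).
    motion_mask: 2-D list of booleans, True where motion is detected now.
    """
    updated = []
    for mhi_row, mask_row in zip(mhi, motion_mask):
        new_row = []
        for last_seen, moving in zip(mhi_row, mask_row):
            if moving:
                new_row.append(timestamp)      # stamp fresh motion
            elif last_seen < timestamp - duration:
                new_row.append(0.0)            # forget stale motion
            else:
                new_row.append(last_seen)      # keep recent history
        updated.append(new_row)
    return updated

# Three frames of a one-row image; motion moves from pixel 0 to pixel 1.
mhi = [[0.0, 0.0, 0.0]]
mhi = update_tmhi(mhi, [[True, False, False]], timestamp=1.0, duration=0.5)
mhi = update_tmhi(mhi, [[False, True, False]], timestamp=1.2, duration=0.5)
mhi = update_tmhi(mhi, [[False, False, False]], timestamp=1.6, duration=0.5)
```

After the third frame the oldest motion (pixel 0) has aged out while the more recent motion (pixel 1) is still recorded, which is what lets the history image encode the direction of movement.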

Nottingham Trent Institutional Repository (IRep)

Two and three dimensional segmentation of multimodal imagery

Author: Vantaram Sreenath Rao
Publication venue: RIT Scholar Works
Publication date: 01/10/2012

The role of segmentation in the realms of image understanding/analysis, computer vision, pattern recognition, remote sensing and medical imaging in recent years has been significantly augmented due to accelerated scientific advances made in the acquisition of image data. This low-level analysis protocol is critical to numerous applications, with the primary goal of expediting and improving the effectiveness of subsequent high-level operations by providing a condensed and pertinent representation of image information. In this research, we propose a novel unsupervised segmentation framework for facilitating meaningful segregation of 2-D/3-D image data across multiple modalities (color, remote-sensing and biomedical imaging) into non-overlapping partitions using several spatial-spectral attributes. Initially, our framework exploits the information obtained from detecting edges inherent in the data. To this effect, by using a vector gradient detection technique, pixels without edges are grouped and individually labeled to partition some initial portion of the input image content. Pixels that contain higher gradient densities are included by the dynamic generation of segments as the algorithm progresses to generate an initial region map. Subsequently, texture modeling is performed and the obtained gradient, texture and intensity information along with the aforementioned initial partition map are used to perform a multivariate refinement procedure, to fuse groups with similar characteristics yielding the final output segmentation. Experimental results obtained in comparison to published/state-of-the-art segmentation techniques for color as well as multi/hyperspectral imagery demonstrate the advantages of the proposed method. Furthermore, for the purpose of achieving improved computational efficiency we propose an extension of the aforestated methodology in a multi-resolution framework, demonstrated on color images.
Finally, this research also encompasses a 3-D extension of the aforementioned algorithm demonstrated on medical (Magnetic Resonance Imaging / Computed Tomography) volumes.
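
The first step described above, grouping and labeling pixels without edges, can be illustrated with a connected-component pass over a gradient map. This is only a sketch under stated assumptions: it takes a precomputed scalar gradient magnitude per pixel (the abstract uses a vector gradient on multi-channel data), uses 4-connectivity, and the name `label_low_gradient` and the threshold value are hypothetical.

```python
from collections import deque

def label_low_gradient(gradient, threshold):
    """Label 4-connected groups of pixels whose gradient is below threshold.

    Returns a 2-D list of ints: 0 for unlabeled (high-gradient) pixels,
    and 1, 2, ... for each connected low-gradient group.
    """
    rows, cols = len(gradient), len(gradient[0])
    labels = [[0] * cols for _ in range(rows)]
    next_label = 0
    for r in range(rows):
        for c in range(cols):
            if gradient[r][c] >= threshold or labels[r][c] != 0:
                continue
            next_label += 1
            labels[r][c] = next_label
            queue = deque([(r, c)])
            while queue:  # breadth-first flood fill of one region
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and labels[ny][nx] == 0
                            and gradient[ny][nx] < threshold):
                        labels[ny][nx] = next_label
                        queue.append((ny, nx))
    return labels

gradient = [
    [0.1, 0.1, 0.9, 0.2],
    [0.1, 0.8, 0.9, 0.2],
    [0.9, 0.9, 0.9, 0.2],
]
labels = label_low_gradient(gradient, threshold=0.5)
```

Pixels left at 0 are the high-gradient ones that, in the framework described, would be absorbed into segments in later stages.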

RIT Scholar Works