12,047 research outputs found
A multi-modal event detection system for river and coastal marine monitoring applications
Abstract—This work is investigating the use of a multi-modal
sensor network where visual sensors such as cameras and
satellite imagers, along with context information can be used to complement and enhance the usefulness of a traditional in-situ sensor network in measuring and tracking some feature of a river or coastal location. This paper focuses on our work in relation to the use of an off the shelf camera as part of a multi-modal sensor network for monitoring a river environment. It outlines our results in relation to the estimation of water level using a visual sensor. It also outlines the benefits of a multi-modal sensor network for marine environmental monitoring and how this can lead to a smarter, more efficient sensing network
Complexity adaptation in H.264/AVC video coder for static cameras
H.264/AVC uses variable block size motion estimation (VBSME)
to improve coding gain. However, its complexity is significant
and fixed regardless of the required quality or of the
scene characteristics. In this paper, we propose an adaptive
complexity algorithm based on using the Walsh Hadamard
Transform (WHT). VBS automatic partition and skip mode
detection algorithms also are proposed. Experimental results
show that 70% - 5% of the computation of H.264/AVC is required
to achieve the same PSNR
Fast intra prediction in the transform domain
In this paper, we present a fast intra prediction method based on separating the transformed coefficients. The
prediction block can be obtained from the transformed and quantized neighboring block generating minimum distortion
for each DC and AC coefficients independently. Two prediction methods are proposed, one is full block search
prediction (FBSP) and the other is edge based distance prediction (EBDP), that find the best matched transformed
coefficients on additional neighboring blocks. Experimental results show that the use of transform coefficients
greatly enhances the efficiency of intra prediction whilst keeping complexity low compared to H.264/AVC
Low computational complexity variable block size (VBS) partitioning for motion estimation using the Walsh Hadamard transform (WHT)
Variable Block Size (VBS) based motion estimation has
been adapted in state of the art video coding, such as
H.264/AVC, VC-1. However, a low complexity H.264/AVC
encoder cannot take advantage of VBS due to its power consumption
requirements. In this paper, we present a VBS partition
algorithm based on a binary motion edge map without
either initial motion estimation or Rate-Distortion (R-D)
optimization for selecting modes. The proposed algorithm
uses the Walsh Hadamard Transform (WHT) to create a binary
edge map, which provides a computational complexity
cost effectiveness compared to other light segmentation
methods typically used to detect the required region
Using dempster-shafer theory to fuse multiple information sources in region-based segmentation
This paper presents a new method for segmentation of images into large regions that reflect the real world objects present in a scene. It explores the feasibility of utilizing spatial configuration of regions and their geometric properties (the so-called Syntactic Visual Features [1]) for improving the correspondence of segmentation results produced by the well-known Recursive Shortest Spanning Tree (RSST) algorithm [2] to semantic objects present in the scene. The main contribution of this paper is a novel framework for integration of evidence from multiple sources with the region merging process based on the Dempster-Shafer (DS) theory [3] that allows integration of sources providing evidence with different accuracy and reliability. Extensive experiments indicate that the proposed solution limits formation of regions spanning more than one semantic object
Using the discrete hadamard transform to detect moving objects in surveillance video
In this paper we present an approach to object detection in surveillance video based on detecting moving edges
using the Hadamard transform. The proposed method is characterized by robustness to illumination changes
and ghosting effects and provides high speed detection, making it particularly suitable for surveillance applications.
In addition to presenting an approach to moving edge detection using the Hadamard transform, we
introduce two measures to track edge history, Pixel Bit Mask Difference (PBMD) and History Update Value
(H UV ) that help reduce the false detections commonly experienced by approaches based on moving edges.
Experimental results show that the proposed algorithm overcomes the traditional drawbacks of frame differencing
and outperforms existing edge-based approaches in terms of both detection results and computational
complexity
Interactive object contour extraction for shape modeling
In this paper we present a semi-automatic segmentation approach suitable for extracting object contours as a precursor to 2D shape modeling. The approach is a modified and extended version of an existing state-of-the-art approach based on the concept of a Binary Partition Tree (BPT) [1]. The resulting segmentation tool facilitates quick and easy extraction of an object’s contour via a small amount of user interaction that is easy to perform, even in complicated scenes. Illustrative segmentation results are presented and the usefulness of the approach in generating object shape models is discussed
Multiple image view synthesis for free viewpoint video applications
Interactive audio-visual (AV) applications such as free viewpoint video (FVV) aim to enable unrestricted spatio-temporal navigation within multiple camera environments. Current virtual viewpoint view synthesis solutions for FVV are either purely image-based implying large information redundancy; or involve reconstructing complex 3D models of the scene. In this paper we present a new multiple image view synthesis algorithm that only requires camera parameters and disparity maps. The multi-view synthesis (MVS) approach can be used in any multi-camera environment and is scalable as virtual views can be created given 1 to N of the available video inputs, providing a means to gracefully handle scenarios where camera inputs decrease or increase over time. The algorithm identifies and selects only the best quality surface areas from available reference images, thereby reducing perceptual errors in virtual view reconstruction. Experimental results are presented and verified using both objective (PSNR) and subjective comparisons
Studies of Stellar Collapse and Black Hole Formation with the Open-Source Code GR1D
We discuss results from simulations of black hole formation in failing core-collapse supernovae performed with the code GR1D, a new open-source Eulerian spherically-symmetric general-relativistic hydrodynamics code. GR1D includes rotation in an approximate way (1.5D) comes with multiple finite-temperature nuclear equations of state (EOS), and treats neutrinos in the post-core-bounce phase via a 3-flavor leakage scheme and a heating prescription. We chose the favored K_0 = 220 MeV-variant of the Lattimer & Swesty (1990) EOS and present collapse calculations using the progenitor models of Limongi & Chieffi (2006). We show that there is no direct (or “prompt”) black hole formation in the collapse of ordinary massive stars (8M_☉ ≲ M_(ZAMS) ≲ 100 M_☉) present first results from black hole formation simulations that include rotation
- …