85,137 research outputs found
Image-Dependent Spatial Shape-Error Concealment
Existing spatial shape-error concealment techniques are broadly based upon either parametric curves that exploit geometric information concerning a shape's contour or object shape statistics using a combination of Markov random fields and maximum a posteriori estimation. Both categories are to some extent, able to mask errors caused by information loss, provided the shape is considered independently of the image/video. They palpably however, do not afford the best solution in applications where shape is used as metadata to describe image and video content. This paper presents a novel image-dependent spatial shape-error concealment (ISEC) algorithm that uses both image and shape information by employing the established rubber-band contour detecting function, with the novel enhancement of automatically determining the optimal width of the band to achieve superior error concealment. Experimental results corroborate both qualitatively and numerically, the enhanced performance of the new ISEC strategy compared with established techniques
Sparse visual models for biologically inspired sensorimotor control
Given the importance of using resources efficiently in the competition for survival, it is reasonable to think that natural evolution has discovered efficient cortical coding strategies for representing natural visual information. Sparse representations have intrinsic advantages in terms of fault-tolerance and low-power consumption potential, and can therefore be attractive for robot sensorimotor control with powerful dispositions for decision-making. Inspired by the mammalian brain and its visual ventral pathway, we present in this paper a hierarchical sparse coding network architecture that extracts visual features for use in sensorimotor control. Testing with natural images demonstrates that this sparse coding facilitates processing and learning in subsequent layers. Previous studies have shown how the responses of complex cells could be sparsely represented by a higher-order neural layer. Here we extend sparse coding in each network layer, showing that detailed modeling of earlier stages in the visual pathway enhances the characteristics of the receptive fields developed in subsequent stages. The yield network is more dynamic with richer and more biologically plausible input and output representation
Digital Image Access & Retrieval
The 33th Annual Clinic on Library Applications of Data Processing, held at the University of Illinois at Urbana-Champaign in March of 1996, addressed the theme of "Digital Image Access & Retrieval." The papers from this conference cover a wide range of topics concerning digital imaging technology for visual resource collections. Papers covered three general areas: (1) systems, planning, and implementation; (2) automatic and semi-automatic indexing; and (3) preservation with the bulk of the conference focusing on indexing and retrieval.published or submitted for publicatio
Recommended from our members
Very low bit-rate video coding focusing on moving regions using three-tier arbitrary-shaped pattern selection algorithm
Very low bit-rate video coding using patterns to represent moving regions in macroblocks exhibits good potential for improved coding efficiency. Recently an Arbitrary Shaped Pattern Selection (ASPS) algorithm and its Extended version(EASPS) were presented, that used a dynamically extracted set of patterns, of the two different sizes, based on actual video content. These algorithms, like other pattern matching algorithms failed to capture a large number of active-region macroblocks (RMB) especially when the object moving regions is relatively larger in a video sequence. As the size of the moving object may vary, superior coding performance is achievable by using dynamically extracted patterns of a larger size. This paper, proposes a three-tier Arbitrary Shaped Pattern Selection (ASPS-3) algorithm that uses three different pattern sizes for very low bit ate coding. Experimental results show that ASPS-3 exhibits better performance compared with other pattern matching algorithms, including the low-bit rate video coding standard H.263
A video object generation tool allowing friendly user interaction
In this paper we describe an interactive video object segmentation tool developed in the framework of the ACTS-AC098 MOMUSYS project. The Video Object Generator with User Environment (VOGUE) combines three different sets of automatic and semi-automatic-tool (spatial segmentation, object tracking and temporal segmentation) with general purpose tools for user interaction. The result is an integrated environment allowing the user-assisted segmentation of any sort of video sequences in a friendly and efficient manner.Peer ReviewedPostprint (published version
Recommended from our members
Fuzzy image segmentation of generic shaped clusters
The segmentation performance of any clustering algorithm is very sensitive to the features in an image, which ultimately restricts their generalisation capability. This limitation was the primary motivation in our investigation into using shape information to improve the generality of these algorithms. Fuzzy shape-based clustering techniques already consider ring and elliptical profiles in segmentation, though most real objects are neither ring nor elliptically shaped. This paper addresses this issue by introducing a new shape-based algorithm called fuzzy image segmentation of generic shaped clusters (FISG) that incorporates generic shape information into the framework of the fuzzy c-means (FCM) algorithm. Both qualitative and quantitative analyses confirm the superiority of FISG compared to other shape-based fuzzy clustering methods including, Gustafson-Kessel algorithm, ring-shaped, circular shell, c-ellipsoidal shells and elliptic ring-shaped clusters. The new algorithm has also been shown to be application independent so it can be applied in areas such as video object plane segmentation in MPEG-4 based coding
- …