269,329 research outputs found

    Text Extraction from Web Images Based on A Split-and-Merge Segmentation Method Using Color Perception

    No full text
    This paper describes a complete approach to the segmentation and extraction of text from Web images for subsequent recognition, to ultimately achieve both effective indexing and presentation by non-visual means (e.g., audio). The method described here (the first in the authors’ systematic approach to exploit human colour perception) enables the extraction of text in complex situations such as in the presence of varying colour (characters and background). More precisely, in addition to using structural features, the segmentation follows a split-and-merge strategy based on the Hue-Lightness- Saturation (HLS) representation of colour as a first approximation of an anthropocentric expression of the differences in chromaticity and lightness. Character-like components are then extracted as forming textlines in a number of orientations and along curves

    Color analysis of the reconstructed complex nipple-areola after a mastectomy

    Get PDF
    In this paper a colour analysis of the reconstructed NAC (nipple-areola complex) after a mastectomy is performed. First, a colour segmentation algorithm based on the Live Wire method is proposed to separate the NAC from the rest of skin. And then the colour differences between the healthy and the reconstructed NAC are measured using colour-difference formulas recommended in CIE: CIELAB, CIE94 and CIEDE2000. The application domain is analyzing how the NAC is modified after applying a new reconstructed technique of areola-nipple complex, grafted after its cryopreservation. The analysis has been performed for 20 images, and good segmentation results have been obtained and quantitative colour difference in accordance with perceptual colour difference has been obtained

    Summation of perceptual cues in natural visual scenes

    Get PDF
    Natural visual scenes are rich in information, and any neural system analysing them must piece together the many messages from large arrays of diverse feature detectors. It is known how threshold detection of compound visual stimuli (sinusoidal gratings) is determined by their components' thresholds. We investigate whether similar combination rules apply to the perception of the complex and suprathreshold visual elements in naturalistic visual images. Observers gave magnitude estimations (ratings) of the perceived differences between pairs of images made from photographs of natural scenes. Images in some pairs differed along one stimulus dimension such as object colour, location, size or blur. But, for other image pairs, there were composite differences along two dimensions (e.g. both colour and object-location might change). We examined whether the ratings for such composite pairs could be predicted from the two ratings for the respective pairs in which only one stimulus dimension had changed. We found a pooling relationship similar to that proposed for simple stimuli: Minkowski summation with exponent 2.84 yielded the best predictive power (r=0.96), an exponent similar to that generally reported for compound grating detection. This suggests that theories based on detecting simple stimuli can encompass visual processing of complex, suprathreshold stimuli

    "Visual Affluence" in social photography: applicability of image segmentation as a visually oriented approach to study Instagram hashtags

    Get PDF
    The aim of the study is to examine the applicability of image segmentation – identification of objects/regions by partitioning images – to examine online social photography. We argue that the need for a meaning-independent reading of online social photography within social markers, such as hashtags, arises due to two characteristics of social photography: 1) internal incongruence resulting from user-driven construction, and 2) variability of content in terms of visual attributes, such as colour combinations, brightness, and details in backgrounds. We suggest visual affluence- plenitude of visual stimuli, such as objects and surfaces containing a variety of colour regions, present in visual imagery- as a basis for classifying visual content and image segmentation as a technique to measure affluence. We demonstrate that images containing objects with complex texture and background patterns are more affluent, while images that include blurry backgrounds are less affluent than others. Moreover, images that contain letters and dark, single-colour backgrounds are less affluent than images that include subtle shades. Mann-Whitney U test results for nine pairs of hashtags showed that seven out of nine pairs had significant differences in visual affluence. The proposed measure can be used to encourage a ‘visually oriented’ turn in online social photography research that can benefit from hybrid methods that are able to extrapolate micro-level findings to macro-level effects

    Characterising the hippocampal response to perception, construction and complexity

    Get PDF
    The precise role played by the hippocampus in supporting cognitive functions such as episodic memory and future thinking is debated, but there is general agreement that it involves constructing representations comprised of numerous elements. Visual scenes have been deployed extensively in cognitive neuroscience because they are paradigmatic multi-element stimuli. However, questions remain about the specificity and nature of the hippocampal response to scenes. Here, we devised a paradigm in which we had participants search pairs of images for either colour or layout differences, thought to be associated with perceptual or spatial constructive processes respectively. Importantly, images depicted either naturalistic scenes or phase-scrambled versions of the same scenes, and were either simple or complex. Using this paradigm during functional MRI scanning, we addressed three questions: 1. Is the hippocampus recruited specifically during scene processing? 2. If the hippocampus is more active in response to scenes, does searching for colour or layout differences influence its activation? 3. Does the complexity of the scenes affect its response? We found that, compared to phase-scrambled versions of the scenes, the hippocampus was more responsive to scene stimuli. Moreover, a clear anatomical distinction was evident, with colour detection in scenes engaging the posterior hippocampus whereas layout detection in scenes recruited the anterior hippocampus. The complexity of the scenes did not influence hippocampal activity. These findings seem to align with perspectives that propose the hippocampus is especially attuned to scenes, and its involvement occurs irrespective of the cognitive process or the complexity of the scenes

    An Anthropocentric Approach to Text Extraction from WWW Images

    No full text
    There is a significant need to analyse the text in images on WWW pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper argues that the extraction of text from such images benefits from an anthropocentric approach in the distinction between colour regions. The novelty of the idea is the use of a human perspective of colour perception in preference to RGB colour space analysis. This enables the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background). More precisely, characters are extracted as distinct regions with separate chromaticity and/or luminance by performing a layer decomposition of the image. The method described here is the first in our systematic approach to approximate the human colour perception characteristics for the identification of character regions. In this instance, the image is decomposed by performing histogram analysis of Hue and Luminance and merging in the HLS colour space

    Two Approaches for Text Segmentation in Web Images

    No full text
    There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents and compares two novel methods for the segmentation of characters for subsequent extraction and recognition. The novelty of both approaches is the combination of (different in each case) topological features of characters with an anthropocentric perspective of colour perception— in preference to RGB space analysis. Both approaches enable the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background)

    Two Approaches for Text Segmentation in Web Images

    Get PDF
    There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents and compares two novel methods for the segmentation of characters for subsequent extraction and recognition. The novelty of both approaches is the combination of (different in each case) topological features of characters with an anthropocentric perspective of colour perception— in preference to RGB space analysis. Both approaches enable the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background)
    • …
    corecore