287 research outputs found

    Accessing Textual Information Embedded in Internet Images

    Full text link
    Indexing and searching for WWW pages is relying on analysing text. Current technology cannot process the text embedded in images on WWW pages. This paper argues that this is a significant problem as text in image form is usually semantically important (e.g. headers, titles). The results of a recent study are presented to show that the majority (76%) of words embedded in images do not appear elsewhere in the main text and that the majority (56%) of ALT tag descriptions of images are incorrect or do not exist at all. Research under way to devise tools to extract text from images based on the way humans perceive colour differences is outlined and results are presented

    An Anthropocentric Approach to Text Extraction from WWW Images

    Full text link
    There is a significant need to analyse the text in images on WWW pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper argues that the extraction of text from such images benefits from an anthropocentric approach in the distinction between colour regions. The novelty of the idea is the use of a human perspective of colour perception in preference to RGB colour space analysis. This enables the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background). More precisely, characters are extracted as distinct regions with separate chromaticity and/or luminance by performing a layer decomposition of the image. The method described here is the first in our systematic approach to approximate the human colour perception characteristics for the identification of character regions. In this instance, the image is decomposed by performing histogram analysis of Hue and Luminance and merging in the HLS colour space

    Distributed Computing in the Asynchronous LOCAL model

    Full text link
    The LOCAL model is among the main models for studying locality in the framework of distributed network computing. This model is however subject to pertinent criticisms, including the facts that all nodes wake up simultaneously, perform in lock steps, and are failure-free. We show that relaxing these hypotheses to some extent does not hurt local computing. In particular, we show that, for any construction task TT associated to a locally checkable labeling (LCL), if TT is solvable in tt rounds in the LOCAL model, then TT remains solvable in O(t)O(t) rounds in the asynchronous LOCAL model. This improves the result by Casta\~neda et al. [SSS 2016], which was restricted to 3-coloring the rings. More generally, the main contribution of this paper is to show that, perhaps surprisingly, asynchrony and failures in the computations do not restrict the power of the LOCAL model, as long as the communications remain synchronous and failure-free

    Visual Representation of Text in Web Documents and Its Interpretation

    Full text link
    This paper examines the uses of text and its representation on Web documents in terms of the challenges in its interpretation. Particular attention is paid to the significant problem of non-uniform representation of text. This non-uniformity is mainly due to the presence of semantically important text in image form as opposed to the standard encoded text. The issues surrounding text representation in Web documents are discussed in the context of colour perception and spatial representation. The characteristics of the representation of text in image form are examined and research towards interpreting these images of text is briefly described

    Visual Representation of Text in Web Documents and Its Interpretation

    Full text link
    This paper examines the uses of text and its representation on Web documents in terms of the challenges in its interpretation. Particular attention is paid to the significant problem of non-uniform representation of text. This non-uniformity is mainly due to the presence of semantically important text in image form as opposed to the standard encoded text. The issues surrounding text representation in Web documents are discussed in the context of colour perception and spatial representation. The characteristics of the representation of text in image form are examined and research towards interpreting these images of text is briefly described

    Symbolic Harmonic Analysis of Quartz Crystal Oscillators

    Full text link
    International audienceThe Nonlinear Dipolar Method is dedicated to the simulation of quartz crystal oscillator with high quality factor. In this method, the oscillators is considered as a resonator connected across an amplifier that behaves like a nonlinear dipole whose impedance evaluated at resonator's frequency depends on the current amplitude. This dipole allows us to compute very quickly the behavior of the oscillator. The computation time of the dipolar impedance by Spice is of the order of a few seconds. To gain one order of magnitude in the simulation time of the oscillator, this paper propose a modification of the Nonlinear Dipolar Method by changing the dipolar impedance Spice's calculation, that is the most time consuming part of the program, by a system of equations obtained through a symbolic manipulation of the circuit equations

    A very high speed method to simulate quartz crystal oscillator

    Full text link
    International audienceIn this paper, we present the SHA method, a Symbolic Harmonic Analysis method to simulate the behaviour of ultrastable quartz crystal oscillators. This nonlinear method is aimed to compute very quickly the steady state as well as amplitude and frequency transients. The ultimate goal is to see instantaneously the influence of a parameter change on the oscillator's features thank to the computation speed. The method proposed here is a mixing of the nonlinear dipolar method previouly developed in our team and the harmonic method. It allows to replace the set of algebro-differential equation of the circuit by a nonlinear system of the Fourier coefficients of the circuit unknowns

    Two Approaches for Text Segmentation in Web Images

    Get PDF
    There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents and compares two novel methods for the segmentation of characters for subsequent extraction and recognition. The novelty of both approaches is the combination of (different in each case) topological features of characters with an anthropocentric perspective of colour perception— in preference to RGB space analysis. Both approaches enable the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background)

    Two Approaches for Text Segmentation in Web Images

    Full text link
    There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents and compares two novel methods for the segmentation of characters for subsequent extraction and recognition. The novelty of both approaches is the combination of (different in each case) topological features of characters with an anthropocentric perspective of colour perception— in preference to RGB space analysis. Both approaches enable the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background)

    Mengelmaren: Hermans Torreken te Nieuwpoort

    Get PDF
    • …
    corecore