6 research outputs found

    Merging Grid Technologies

    Get PDF
    This paper reports the integration of the astronomical Grid solution realised in the Astro-WISE information system with the EGEE Grid and the porting of Astro-WISE applications on EGEE. We review the architecture of the Astro-WISE Grid, define the problems for the integration of the Grid infrastructures and our solution to these problems. We give examples of applications running on Astro-WISE and EGEE and review future development of the merged system

    Word mining in a sparsely-labeled handwritten collection

    No full text
    Word-spotting techniques are usually based on detailed modeling of target words, followed by search for the locations of such a target word in images of handwriting. In this study, the focus is on deciding for the presence of target words in lines of text, regardless and disregarding their horizontal position. Line strips are modeled using a Bag-of-Glyphs approach using a self-organized map. This approach uses the presence of fragmented-connected component shapes (glyphs) in a line strip to characterize this text passage, similar to the Bag-of-Words approach for 'ASCII'-encoded documents in regular Information Retrieval. Subsequently, the presence of a word or word category is trained to a support-vector machine in an iterative setup which involves an active group of users. Results are promising for a large proportion of words and are dependent both on the amount of labeled lines as well as shape uniqueness. Particularly useful is the ability to train on abstract content classes such as proper names, municipalities or word-bigram presence in the line-strip images.</p

    Word mining in a sparsely-labeled handwritten collection

    No full text
    Word-spotting techniques are usually based on detailed modeling of target words, followed by search for the locations of such a target word in images of handwriting. In this study, the focus is on deciding for the presence of target words in lines of text, regardless and disregarding their horizontal position. Line strips are modeled using a Bag-of-Glyphs approach using a self-organized map. This approach uses the presence of fragmented-connected component shapes (glyphs) in a line strip to characterize this text passage, similar to the Bag-of-Words approach for 'ASCII'-encoded documents in regular Information Retrieval. Subsequently, the presence of a word or word category is trained to a support-vector machine in an iterative setup which involves an active group of users. Results are promising for a large proportion of words and are dependent both on the amount of labeled lines as well as shape uniqueness. Particularly useful is the ability to train on abstract content classes such as proper names, municipalities or word-bigram presence in the line-strip images

    <title>Word mining in a sparsely labeled handwritten collection</title>

    No full text
    corecore