90 research outputs found

    User-driven Page Layout Analysis of historical printed Books

    Get PDF
    International audienceIn this paper, based on the study of the specificity of historical printed books, we first explain the main error sources in classical methods used for page layout analysis. We show that each method (bottom-up and top-down) provides different types of useful information that should not be ignored, if we want to obtain both a generic method and good segmentation results. Next, we propose to use a hybrid segmentation algorithm that builds two maps: a shape map that focuses on connected components and a background map, which provides information about white areas corresponding to block separations in the page. Using this first segmentation, a classification of the extracted blocks can be achieved according to scenarios produced by the user. These scenarios are defined very simply during an interactive stage. The user is able to make processing sequences adapted to the different kinds of images he is likely to meet and according to the user needs. The proposed “user-driven approach” is capable of doing segmentation and labelling of the required user high level concepts efficiently and has achieved above 93% accurate results over different data sets tested. User feedbacks and experimental results demonstrate the effectiveness and usability of our framework mainly because the extraction rules can be defined without difficulty and parameters are not sensitive to page layout variation

    A Framework for the Automatic Analysis and Interactive Exploration of Document Aesthetics. Technical Report April 21, 2016

    Get PDF
    Modern word processing software and typesetting systems such as TeX enable the quick creation of documents of various kinds. Although the quality of the software packages varies, all can produce aesthetically pleasing documents in terms of layout and type setting. Problems typically originate from the large number of parameters which are exposed to the user. These range from simple settings like typeface, font size and column width to more elaborate ones, such as kerning and leading. Most often default values are modified without grasping the consequences for readability and aesthetic appeal of the resulting document.In this paper, we present a system for interactive visualization and exploration of quantifiable aspects of document aesthetics such as alignment, spacing, gray values, but also of image color harmony. This system also allows for comparative analysis of multiple documents and document versions side-by-side. The documents are rated using an extensible and parameterizable plug-in system allowing the user to define a task-specific processing pipeline interactively. The rating is hierarchically organized such that the user can drill down into the different aspects that influence the final score. Our system takes standard document formats such as Adobe PDF or Microsoft XPS as input. Our system serves as a platform for further research on document aesthetics as well as a utility to sensibilize authors for these often underestimated aspects of scientific publishing

    The contour tree image encoding technique and file format

    Get PDF
    The process of contourization is presented which converts a raster image into a discrete set of plateaux or contours. These contours can be grouped into a hierarchical structure, defining total spatial inclusion, called a contour tree. A contour coder has been developed which fully describes these contours in a compact and efficient manner and is the basis for an image compression method. Simplification of the contour tree has been undertaken by merging contour tree nodes thus lowering the contour tree's entropy. This can be exploited by the contour coder to increase the image compression ratio. By applying general and simple rules derived from physiological experiments on the human vision system, lossy image compression can be achieved which minimises noticeable artifacts in the simplified image. The contour merging technique offers a complementary lossy compression system to the QDCT (Quantised Discrete Cosine Transform). The artifacts introduced by the two methods are very different; QDCT produces a general blurring and adds extra highlights in the form of overshoots, whereas contour merging sharpens edges, reduces highlights and introduces a degree of false contouring. A format based on the contourization technique which caters for most image types is defined, called the contour tree image format. Image operations directly on this compressed format have been studied which for certain manipulations can offer significant operational speed increases over using a standard raster image format. A couple of examples of operations specific to the contour tree format are presented showing some of the features of the new format.Science and Engineering Research Counci

    Structure out of sound

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Program in Media Arts & Sciences, 1993.Vita.Includes bibliographical references (p. 155-170).Michael Jerome Hawley.Ph.D

    On the Recognition of Emotion from Physiological Data

    Get PDF
    This work encompasses several objectives, but is primarily concerned with an experiment where 33 participants were shown 32 slides in order to create ‗weakly induced emotions‘. Recordings of the participants‘ physiological state were taken as well as a self report of their emotional state. We then used an assortment of classifiers to predict emotional state from the recorded physiological signals, a process known as Physiological Pattern Recognition (PPR). We investigated techniques for recording, processing and extracting features from six different physiological signals: Electrocardiogram (ECG), Blood Volume Pulse (BVP), Galvanic Skin Response (GSR), Electromyography (EMG), for the corrugator muscle, skin temperature for the finger and respiratory rate. Improvements to the state of PPR emotion detection were made by allowing for 9 different weakly induced emotional states to be detected at nearly 65% accuracy. This is an improvement in the number of states readily detectable. The work presents many investigations into numerical feature extraction from physiological signals and has a chapter dedicated to collating and trialing facial electromyography techniques. There is also a hardware device we created to collect participant self reported emotional states which showed several improvements to experimental procedure

    2005-2007 Course Catalog

    Get PDF
    2005-2007 Course Catalo

    2022-2023 Course Catalog

    Get PDF
    2022-2023 Course Catalo

    2012-2013 Course Catalog

    Get PDF
    2012-2013 Course Catalo

    2013-2014 Course Catalog

    Get PDF
    2013-2014 Course Catalo

    2023-2024 Course Catalog

    Get PDF
    2023-2024 Course Catalo
    • 

    corecore