6 research outputs found

    IMPACT Best Practice Guide: Metadata for Text Digitisation and OCR

    Get PDF

    IMPACT Best Practice Guide: Metadata for Text Digitisation and OCR

    Get PDF

    Adaptive Methods for Robust Document Image Understanding

    Get PDF
    A vast amount of digital document material is continuously being produced as part of major digitization efforts around the world. In this context, generic and efficient automatic solutions for document image understanding represent a stringent necessity. We propose a generic framework for document image understanding systems, usable for practically any document types available in digital form. Following the introduced workflow, we shift our attention to each of the following processing stages in turn: quality assurance, image enhancement, color reduction and binarization, skew and orientation detection, page segmentation and logical layout analysis. We review the state of the art in each area, identify current defficiencies, point out promising directions and give specific guidelines for future investigation. We address some of the identified issues by means of novel algorithmic solutions putting special focus on generality, computational efficiency and the exploitation of all available sources of information. More specifically, we introduce the following original methods: a fully automatic detection of color reference targets in digitized material, accurate foreground extraction from color historical documents, font enhancement for hot metal typesetted prints, a theoretically optimal solution for the document binarization problem from both computational complexity- and threshold selection point of view, a layout-independent skew and orientation detection, a robust and versatile page segmentation method, a semi-automatic front page detection algorithm and a complete framework for article segmentation in periodical publications. The proposed methods are experimentally evaluated on large datasets consisting of real-life heterogeneous document scans. The obtained results show that a document understanding system combining these modules is able to robustly process a wide variety of documents with good overall accuracy

    European Curriculum Reflections on Library and Information Science Education

    Get PDF
    The project behind this book has been carried out with the support of the European Community in the framework of the Socrates programme. The European Curriculum Reflections on Library and Information Science Education project has been inspired by curriculum discussions on the Bologna Declaration that was initiated at a EUCLID conference in Thessaloniki 2002. EUCLID (European Association for Library & Information Education and Research) is an independent European non-governmental and non-profit organisation existing for the purpose of promoting European co-operation within library and information education and research

    8th. International congress on archaeology computer graphica. Cultural heritage and innovation

    Full text link
    El lema del Congreso es: 'Documentación 3D avanzada, modelado y reconstrucción de objetos patrimoniales, monumentos y sitios.Invitamos a investigadores, profesores, arqueólogos, arquitectos, ingenieros, historiadores de arte... que se ocupan del patrimonio cultural desde la arqueología, la informática gráfica y la geomática, a compartir conocimientos y experiencias en el campo de la Arqueología Virtual. La participación de investigadores y empresas de prestigio será muy apreciada. Se ha preparado un atractivo e interesante programa para participantes y visitantes.Lerma García, JL. (2016). 8th. International congress on archaeology computer graphica. Cultural heritage and innovation. Editorial Universitat Politècnica de València. http://hdl.handle.net/10251/73708EDITORIA

    Metadatos y recuperación de información: estándares, problemas y aplicabilidad en bibliotecas digitales

    Get PDF
    Programa de Doctorado en DocumentaciónPresidente: Mercedes Caridad Sebastián. - Secretario: Antonio Hernández Pérez. - Vocales: José Carlos Rovira Soler, Eulalia Fuentes i Pujol, José Antonio Gómez Hernánde
    corecore