14 research outputs found
A structural approach for library references recognition
This paper presents a library references recognition system for retrospective
conversion of catalogues . The system is guided by a structure model of a reference
class, described by an attribute grammar. The analysis method is based on
prediction and verification of segmentation hypotheses proposed by the model . The
result, given in UNIMARC format, contains the different sub-fields of the reference
with their confidence score . This method is enough general to be adapted on any
document having a micro-structure. This method has been also used on other kind
of documents such as author index and subjects .Cet article présente un système de reconnaissance de la structure logique de notices bibliographiques en vue de la conversion rétrospective de catalogues de bibliothèques. Le système est guidé par un modèle de structures de la classe des notices, construit sur la base de spécifications détaillées par la bibliothèque. Le modèle fait intervenir aussi bien des connaissances sur la macro-structure des notices que sur la micro-structure de leur contenu. La reconnaissance de la structure d'une notice consiste à retrouver, à partir d'un flux OCR (Optical Character Recognition), sa structure logique spécifique, conformément aux descriptions du modèle. Le résultat est un flux structuré hiérarchiquement, présentant dans le format UNIMARC, les différents champs de la notice, accompagnés de leur score de confiance. Ce travail a été réalisé dans le cadre du projet européen LIB-MORE associant la société JOUVE et la Bibliothèque Royale de Belgique
Document Analysis for Retrospective Conversion of Library Reference catalogues
International audienc
Constraint Propagation vs Syntactical Analysis for the Logical Structure Recognition of Library References
. This paper describes a constraint propagation method for logical structure extraction of Library references without the use of ocr. The accent is put on the search of anchor points from visual indices extraction. A mixed strategy is performed. For each anchor points. the system proposes in a bottom-up manner the most probable model hypothesis and tries to verify in a top-down manner its left and right contexts. 1 Introduction As part of the European project lib-more 1 [Mor 92], we were interested on the retrospective conversion of pre-isbd library catalogues. Libraries are faced with this problem to convert their old paper catalogues into a data processing format in which they are more readily accessible to the readers. Since 1976, bibliographic references are normalized by the isbd 2 according to a common formalism called unimarc 3 . However, the contents of catalogues written before this date does not totally obey this standard. Ambiguities and exceptions remain embedded ..
Document Analysis for Retrospective Conversion of Library Reference catalogues
International audienc
Une approche structurelle pour la reconnaissance de notices bibliographiques
National audienceCet article présente un système de reconnaissance de la structure logique de notices bibliographiques en vue de la conversion rétrospective de catalogues de bibliothèques. Le système est guidé par un modèle de structures de la classe des notices, construit sur la base de spécifications détaillées par la bibliothèque. Le modèle fait intervenir aussi bien des connaissances sur la macro-structure des notices que sur la micro-structure de leur contenu. La reconnaissance de la structure d'une notice consiste à retrouver, à partir d'un flux OCR (Optical Character Recognition), sa structure logique spécifique, conformément aux descriptions du modèle. Le résultat est un flux structuré hiérarchiquement, présentant dans le format UNIMARC, les différents champs de la notice, accompagnés de leur score de confiance. Ce travail a été réalisé dans le cadre du projet européen LIB-MORE associant la société JOUVE et la Bibliothèque Royale de Belgique
Knowledge-Based System for Structured Document Recognition
This paper discribes a document analysis system broadly consisting of a knowledge base, a blackboard and a set of tasks having their own set of spacialists for segmentation, recognition and for inheritance. The knowledge base contains a generic hierarchical description of the document structure in terms of layout objects labeled logically. This allows the generation of hypothetic networks of linked objects in the blackboard. The specialists cooperate indirectly through the blackboard by updating the layout object descriptors. A blackboard modification causes an "event" to propagate up to some specific tasks. A task could then choose another subset of specialists to carry on with the process. Finally, a synthesized blackboard summary allows a task selector to focus efficiently on the most useful layout object to process
A knowledge-based system for contextual text recognition
Publie dans : International conference on pattern recognition 1990, 1990SIGLEAvailable at INIST (FR), Document Supply Service, under shelf-number : RP 10755 / INIST-CNRS - Institut de l'Information Scientifique et TechniqueFRFranc
A model-based system for the recognition of structured documents
Available at INIST (FR), Document Supply Service, under shelf-number : RP 11414 / INIST-CNRS - Institut de l'Information Scientifique et TechniqueSIGLEFRFranc