Use of colour for hand-filled form analysis and recognition
Colour information is currently underutilised in form analysis. As technology has advanced and computing costs have fallen, processing forms in colour has become practicable. This paper describes a novel colour-based approach to extracting filled-in data from colour form images. Images are first quantised to reduce their colour complexity, and data is then extracted by examining the colour characteristics of the images. The improved performance of the proposed method has been verified by comparing its processing time, recognition rate, extraction precision and recall rate with those of an equivalent black and white system.
A modular methodology for converting large, complex books into usable, accessible and standards-compliant ebooks
This report describes the methodology used for ebook creation for the Glasgow Digital Library (GDL), and provides detailed instructions on how the same methodology could be used elsewhere. The document includes a description and explanation of the processes for ebook creation, followed by a tutorial.
Program analysis for documentation
A program analysis for documentation (PAD) written in FORTRAN has three steps: listing the variables, describing the structure and writing the program specifications. Technical notes on editing criteria for reviewing program documentation, technical notes for PAD, and FORTRAN program analyzer for documentation are appended.
Applying Online: Technological Innovation for Income Support Programs in Four States
A study examining the development, implementation, and best practices for online applications for public benefits programs in California, Georgia, Pennsylvania, and Washington, based on interviews with state agencies and community-based organizations.
Automatic detection of change in address blocks for reply forms processing
This paper presents an automatic method to detect the presence of on-line erasures, scribbles, corrections and over-writing in the address block of various types of subscription and utility payment forms. The proposed approach employs bottom-up segmentation of the address block. Heuristic rules based on structural features are used to automate the detection process. The algorithm is applied to a large dataset of 5,780 real-world document forms at a resolution of 200 dots per inch. The proposed algorithm performs well, with an average processing time of 108 milliseconds per document and a detection accuracy of 98.96%.
Design issues in the production of hyper-books and visual-books
This paper describes an ongoing research project in the area of electronic books. After a brief overview of the state of the art in this field, two new forms of electronic book are presented: hyper-books and visual-books. A flexible environment allows them to be produced in a semi-automatic way starting from different sources: electronic texts (as input for hyper-books) and paper books (as input for visual-books). The translation process is driven by the philosophy of preserving the book metaphor in order to guarantee that electronic information is presented in a familiar way. Another important feature of our research is that hyper-books and visual-books are conceived not as isolated objects but as entities within an electronic library, which inherits most of the features of a paper-based library but introduces a number of new properties resulting from its non-physical nature.
Special Libraries, November 1980
Volume 71, Issue 11
The NASA Astrophysics Data System: Architecture
The powerful discovery capabilities available in the ADS bibliographic
services are possible thanks to the design of a flexible search and retrieval
system based on a relational database model. Bibliographic records are stored
as a corpus of structured documents containing fielded data and metadata, while
discipline-specific knowledge is segregated in a set of files independent of
the bibliographic data itself.
The creation and management of links to both internal and external resources
associated with each bibliography in the database is made possible by
representing them as a set of document properties and their attributes.
To improve global access to the ADS data holdings, a number of mirror sites
have been created by cloning the database contents and software on a variety of
hardware and software platforms.
The procedures used to create and manage the database and its mirrors have
been written as a set of scripts that can be run in either an interactive or
unsupervised fashion.
The ADS can be accessed at http://adswww.harvard.edu
Comment: 25 pages, 8 figures, 3 tables
Information System for NGO Libraries in Pakistan: A Proposed Model for Organizing the Grey Literature by Syed Attaullah Shah and Humera Ilhaq
Abstract
In recent years, especially in developed countries, various systems have been created to advance the management and organization of grey literature. Such systems use the latest communication technology and electronic and digital resources, and have developed huge networking systems to distribute and manage grey literature. Because of the scarcity of a global standardized organization system for grey literature and often limited access to computer technology, however, awareness of the existence of grey literature, and access to it, is still seriously lacking, particularly in developing countries. Based on a survey of selected Pakistani NGOs from various sectors, this study proposes a new model. This paper explains the current usage patterns of grey literature in Pakistani organizations, then assesses their needs and resources for grey literature, and finally recommends a new standardized model for organizing grey literature in the developing world. In this model, a separate subject and classification scheme to control various types of grey literature, a shelving arrangement system and a networking system have been introduced.