
    Groundtruth Generation and Document Image Degradation

    The problem of generating synthetic data for the training and evaluation of document analysis systems has been widely addressed in recent years. With the increased interest in processing multilingual sources, however, there is a tremendous need to be able to rapidly generate data in new languages and scripts without developing specialized systems. We have developed a system that uses the language support of the MS Windows operating system, combined with custom print drivers, to render TIFF images simultaneously with Windows Enhanced Metafile directives. The metafile information is parsed to generate zone, line, word, and character ground truth, including location, font information, and content, in any language supported by Windows. The resulting images can be physically or synthetically degraded by our degradation modules and used for training and evaluating Optical Character Recognition (OCR) systems. Our document image degradation methodology incorporates several often-encountered types of noise at the page and pixel levels. Examples of OCR evaluation and synthetically degraded document images are given to demonstrate the effectiveness of the approach.
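
    The degradation modules are only summarized above; as an illustrative sketch (not the authors' actual implementation), pixel-level noise followed by a page-level blur and re-binarization could be applied to a rendered page image roughly as follows, assuming NumPy arrays and SciPy's Gaussian filter.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def degrade_page(image, flip_prob=0.02, blur_sigma=0.8, seed=None):
    """Illustrative pixel- and page-level degradation of a bilevel page image.

    image      : 2-D uint8 array (0 = black ink, 255 = white background).
    flip_prob  : probability of flipping any given pixel (salt-and-pepper noise).
    blur_sigma : std. dev. of a Gaussian blur simulating ink spread / defocus.
    """
    rng = np.random.default_rng(seed)
    out = image.astype(np.float64)

    # Pixel-level noise: randomly invert a small fraction of pixels.
    flips = rng.random(out.shape) < flip_prob
    out[flips] = 255.0 - out[flips]

    # Page-level blur, then re-threshold back to a bilevel image.
    out = gaussian_filter(out, sigma=blur_sigma)
    return (out > 127).astype(np.uint8) * 255
```

    A fuller degradation model would also add page-level distortions such as skew, but the structure of noise injection followed by re-binarization is the same.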

    Eye Detection and Face Recognition Across the Electromagnetic Spectrum

    Biometrics, or the science of identifying individuals based on their physiological or behavioral traits, has increasingly been used to replace typical identifying markers such as passwords, PINs, passports, etc. Different modalities, such as face, fingerprint, iris, and gait, can be used for this purpose. One of the most studied forms of biometrics is face recognition (FR). Due to a number of advantages over typical visible-to-visible FR, recent trends have been pushing the FR community to perform cross-spectral matching of visible images to face images from higher spectra in the electromagnetic spectrum. In this work, the SWIR band of the EM spectrum is the primary focus. Four main contributions relating to automatic eye detection and cross-spectral FR are discussed. First, a novel eye localization algorithm for the purpose of geometrically normalizing a face across multiple SWIR bands for FR algorithms is introduced. Using a template-based scheme and a novel summation range filter, an extensive experimental analysis shows that this algorithm is fast, robust, and highly accurate when compared to other available eye detection methods. The eye locations produced by this algorithm also yield higher FR accuracy than all other tested approaches. This algorithm is then augmented and updated to quickly and accurately detect eyes in more challenging unconstrained datasets spanning the EM spectrum. Additionally, a novel cross-spectral matching algorithm is introduced that attempts to bridge the gap between the visible and SWIR spectra. By fusing multiple photometric normalization combinations, the proposed algorithm is not only more efficient than other visible-SWIR matching algorithms but also more accurate on multiple challenging datasets. Finally, a novel pre-processing algorithm is discussed that bridges the gap between document (passport) and live face images. It is shown that the proposed pre-processing scheme, using inpainting and denoising techniques, significantly increases cross-document face recognition performance.
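
    The abstract does not define the summation range filter precisely; one plausible reading, sketched below purely as an assumption, is a sliding-window sum of local intensity ranges (maximum minus minimum), which responds strongly to the high-contrast pupil/sclera region of a face image. Candidate eye locations would then be taken from local maxima of the response, constrained by a face template.

```python
import numpy as np

def summation_range_filter(image, win=5):
    """Assumed interpretation of a summation range filter.

    For each pixel, sum the intensity range (max - min) of every row
    segment inside a win x win neighbourhood. High responses indicate
    high-contrast regions such as the eyes in a grayscale face image.
    """
    h, w = image.shape
    pad = win // 2
    padded = np.pad(image.astype(np.float64), pad, mode="edge")
    response = np.zeros((h, w), dtype=np.float64)
    for r in range(h):
        for c in range(w):
            block = padded[r:r + win, c:c + win]
            # Per-row range inside the window, summed over all rows.
            response[r, c] = np.sum(block.max(axis=1) - block.min(axis=1))
    return response
```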

    Design and Real-World Application of Novel Machine Learning Techniques for Improving Face Recognition Algorithms

    Recent progress in machine learning has made possible the development of real-world face recognition applications that can match face images as well as or better than humans. However, several challenges remain unsolved. In this PhD thesis, some of these challenges are studied, and novel machine learning techniques to improve the performance of real-world face recognition applications are proposed. Current face recognition algorithms based on deep learning techniques are able to achieve outstanding accuracy when dealing with face images taken in unconstrained environments. However, training these algorithms is often costly due to the very large datasets and the high computational resources needed. On the other hand, traditional methods for face recognition are better suited when these requirements cannot be satisfied. This PhD thesis presents new techniques for both traditional and deep learning methods. In particular, a novel traditional face recognition method that combines texture and shape features with subspace representation techniques is first presented. The proposed method is lightweight and can be trained quickly with small datasets. This method is used for matching face images scanned from identity documents against face images stored in the biometric chip of such documents. Next, two new techniques to increase the performance of face recognition methods based on convolutional neural networks are presented. Specifically, a novel training strategy that increases face recognition accuracy when dealing with face images presenting occlusions, and a new loss function that improves on the triplet loss function, are proposed. Finally, the problem of collecting large face datasets is considered, and a novel method based on generative adversarial networks is proposed to synthesize both face images of existing subjects in a dataset and face images of new subjects. The accuracy of existing face recognition algorithms can be increased by training with datasets augmented with the synthetic face images generated by the proposed method. In addition to the main contributions, this thesis provides a comprehensive literature review of face recognition methods and their evolution over the years. A significant amount of the work presented in this PhD thesis is the outcome of a 3-year research project partially funded by Innovate UK as part of a Knowledge Transfer Partnership between the University of Hertfordshire and IDscan Biometrics Ltd (partnership number: 009547).
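
    The improved loss function itself is not described in this abstract; for reference, the standard triplet loss that it builds on can be written, in a minimal PyTorch-style sketch, as follows (tensor shapes and the margin value are assumptions).

```python
import torch
import torch.nn.functional as F

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet loss: push the anchor-positive distance to be at
    least `margin` smaller than the anchor-negative distance.

    anchor, positive, negative : (batch, embedding_dim) face embeddings
    produced by the CNN, typically L2-normalized.
    """
    d_ap = F.pairwise_distance(anchor, positive)   # same identity
    d_an = F.pairwise_distance(anchor, negative)   # different identity
    return F.relu(d_ap - d_an + margin).mean()
```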

    Digital imaging technology assessment: Digital document storage project

    An ongoing technical assessment and requirements definition project is examining the potential role of digital imaging technology at NASA's STI facility. The focus is on the basic components of imaging technology in today's marketplace as well as the components anticipated in the near future. Presented are a requirements specification for a prototype project, an initial examination of current image processing at the STI facility, and an initial summary of image processing projects at other sites. Operational imaging systems incorporate scanners, optical storage, high-resolution monitors, processing nodes, magnetic storage, jukeboxes, specialized boards, optical character recognition equipment, pixel-addressable printers, communications, and complex software processes.

    Telemedicine

    Telemedicine is a rapidly evolving field as new technologies are implemented, for example in the development of wireless sensors and high-quality data transmission. Internet-based applications such as counseling, clinical consultation support, and home care monitoring and management are increasingly being realized, which improves access to high-level medical care in underserved areas. The 23 chapters of this book present manifold examples of telemedicine, treating both theoretical and practical foundations as well as application scenarios.

    Advanced document data extraction techniques to improve supply chain performance

    In this thesis, a novel machine learning technique to extract text-based information from scanned images has been developed. This information extraction is performed in the context of scanned invoices and bills used in financial transactions. These financial transactions contain a considerable amount of data that must be extracted, refined, and stored digitally before it can be used for analysis. Converting this data into a digital format is often a time-consuming process. Automation and data optimisation show promise as methods for reducing the time required and the cost of Supply Chain Management (SCM) processes, especially Supplier Invoice Management (SIM), Financial Supply Chain Management (FSCM), and Supply Chain procurement processes. This thesis uses a cross-disciplinary approach involving Computer Science and Operational Management to explore the benefit of automated invoice data extraction in business and its impact on SCM. The study adopts a multimethod approach based on empirical research, surveys, and interviews performed on selected companies.

    The expert system developed in this thesis focuses on two distinct areas of research: text/object detection and text extraction. For text/object detection, the Faster R-CNN model was analysed. While this model yields outstanding results in terms of object detection, it is limited by poor performance when image quality is low. The Generative Adversarial Network (GAN) model is proposed in response to this limitation. The GAN model is a generator network that is implemented with the help of the Faster R-CNN model and a discriminator that relies on PatchGAN. The output of the GAN model is text data with bounding boxes. For text extraction from the bounding boxes, a novel data extraction framework was designed, consisting of various processes including XML processing in the case of an existing OCR engine, bounding-box pre-processing, text clean-up, OCR error correction, spell checking, type checking, pattern-based matching, and finally a learning mechanism for automating future data extraction. Fields that the system extracts successfully are provided in key-value format.

    The efficiency of the proposed system was validated using existing datasets such as SROIE and VATI. Real-time data was validated using invoices collected by two companies that provide invoice automation services in various countries. Currently, these scanned invoices are sent to an OCR system such as OmniPage, Tesseract, or ABBYY FRE to extract text blocks, and later a rule-based engine is used to extract relevant data. While this methodology is robust, the companies surveyed were not satisfied with its accuracy and therefore sought out new, optimized solutions. To confirm the results, the engines were used to return XML-based files with text and metadata identified. The output XML data was then fed into the new system for information extraction. This system uses the existing OCR engine and a novel, self-adaptive, learning-based OCR engine. This new engine is based on the GAN model for better text identification. Experiments were conducted on various invoice formats to further test and refine its extraction capabilities. For cost optimisation and the analysis of spend classification, additional data were provided by another company in London that holds expertise in reducing its clients' procurement costs. This data was fed into the system to obtain a deeper level of spend classification and categorisation. This helped the company reduce its reliance on human effort and allowed for greater efficiency compared with performing similar tasks manually using Excel sheets and Business Intelligence (BI) tools.

    The intention behind the development of this novel methodology was twofold: first, to develop and test a novel solution that does not depend on any specific OCR technology; second, to increase information extraction accuracy over that of existing methodologies. The thesis also evaluates the real-world need for the system and the impact it would have on SCM. The newly developed method is generic and can extract text from any given invoice, making it a valuable tool for optimizing SCM. In addition, the system uses a template-matching approach to ensure the quality of the extracted information.
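
    The pattern-based matching and key-value output stages are described only at a high level; a minimal illustrative sketch of that step, with field names and regular expressions that are assumptions rather than the system's actual rules, might look like this.

```python
import re

# Illustrative patterns only; the described system learns and refines its
# field rules per invoice layout. Field names here are assumptions.
FIELD_PATTERNS = {
    "invoice_number": re.compile(r"invoice\s*(?:no|number)[:.]?\s*([A-Z0-9-]+)", re.I),
    "invoice_date":   re.compile(r"date[:.]?\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})", re.I),
    "total_amount":   re.compile(r"total(?:\s*due)?[:.]?\s*([\d,]+\.\d{2})", re.I),
}

def extract_fields(ocr_text):
    """Return successfully matched fields in key-value form.

    Fields that fail to match are omitted, mirroring the behaviour
    described above where only successfully extracted fields are returned.
    """
    result = {}
    for field, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        if match:
            result[field] = match.group(1)
    return result
```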

    “I’d like to thank the Academy”: an analysis of the awards discourse at the Atlantic Schools of Business conference

    The awarding of prizes has become embedded in all aspects of our society, including academic conferences. This paper views the awards discourse at the Atlantic Schools of Business Conference through a poststructural lens, with an eye to understanding how the presentation of awards at the conference can aid in, or possibly detract from, the continued success of this long-lasting, unique, and much-loved academic event.

    Space and Earth Sciences, Computer Systems, and Scientific Data Analysis Support, Volume 1

    This Final Progress Report covers the specific technical activities of Hughes STX Corporation for the last contract triannual period of 1 June through 30 September 1993, in support of assigned task activities at Goddard Space Flight Center (GSFC). It also provides a brief summary of work throughout the contract period of performance on each active task. Technical activity is presented in Volume 1, while financial and level-of-effort data are presented in Volume 2. Technical support was provided to all Divisions and Laboratories of Goddard's Space Sciences and Earth Sciences Directorates. Types of support include scientific programming, systems programming, computer management, mission planning, scientific investigation, data analysis, data processing, database creation and maintenance, instrumentation development, and management services. Missions and instruments supported include ROSAT, Astro-D, BBXRT, XTE, AXAF, GRO, COBE, WIND, UIT, SMM, STIS, HEIDI, DE, URAP, CRRES, the Voyagers, ISEE, San Marco, LAGEOS, TOPEX/Poseidon, Pioneer-Venus, Galileo, Cassini, Nimbus-7/TOMS, Meteor-3/TOMS, FIFE, BOREAS, TRMM, AVHRR, and Landsat. Accomplishments include development of computing programs for mission science and data analysis, supercomputer applications support, computer network support, computational upgrades for data archival and analysis centers, end-to-end management of mission data flow, scientific modeling and results in the fields of space and Earth physics, planning and design of the GSFC VO DAAC and VO IMS, fabrication, assembly, and testing of mission instrumentation, and design of a mission operations center.

    Energy efficiency in office technology

    Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Architecture, 1994. Includes bibliographical references (leaves 204-210). This thesis, directed toward a wide variety of persons interested in energy efficiency issues with office technology, explores several issues relating to reducing energy use and improving the energy efficiency of office equipment. Chapter 2 compares policies and programs in several European countries and the United States that could enhance the energy efficiency of office technology. Different programs are examined, including federal government programs where, in some cases, target values for the power usage of office equipment have already been set. Large customer procurement programs, industry involvement (with emphasis on voluntary labeling programs), and research projects are also examined. Procedures that provide energy consumption measurements of various types of equipment are important for providing information to emerging procurement programs. Two sets of proposed test procedures for testing the energy consumption of copiers, fax machines, and printers are examined and compared. In Chapter 3, comparisons are made of the electrical power and energy used by computers, displays, copiers, printers, and facsimile machines, both while operating and while idle. Technology options for reduced energy and power consumption and improved energy efficiency are examined. As the capability of office equipment has increased, there has been a trend toward increased electrical power requirements and energy consumption while equipment is in active operation. Computer power continues to grow rapidly. These factors will combine to exert an upward pressure on electrical power demand. However, some emerging technologies are lessening or, in some cases, reversing this trend, with little or no penalty in performance or production. The overall balance between increased service and efficiency is uncertain. Chapter 3 also examines the embodied energy of paper and office equipment, comparing it to the total energy required to produce a printed page of information, or required over the lifetime of the machine. Many difficulties were encountered in collecting and comparing data on the power requirements of various machines. Procedures for testing the energy usage of office equipment are needed to make valid comparisons between machines. Chapter 4 describes modifications to the procedure issued by the American Society for Testing and Materials (ASTM) for testing energy consumption in copiers, to account for energy-saver modes and double-sided copying. It also presents new procedures submitted to the ASTM committee for printers and fax machines. A fourth procedure is also presented here, one to test the energy consumption of personal computers. Typically, office equipment is not in use for much of the time it is turned on. Use of power management in office equipment can considerably decrease overall energy consumption. While energy-saver modes are more prevalent in copiers, those printers that have incorporated this feature achieve more dramatic power reductions. Fax machines do not seem to utilize this technology at all, even though many have high power consumption when they are idle. How energy-saving modes affect the overall energy consumption of machines is largely determined by the operating profiles of the machines. The effect of operating profiles on energy usage with imaging equipment has not yet been examined.
    Methods of determining operating profiles of office equipment are presented in Chapter 5. A comparison is made between the energy use predicted by the ASTM procedures alone, the energy use predicted by the ASTM procedures combined with actual operating profiles, and the actual measured energy usage of several copiers and printers. For copiers, the ASTM-rated energy use per page was 10-161% different from the actual measured energy use per page. Using the ASTM method with the measured operating profiles of the machine gave a 7-22% difference in energy use per page. For printers, the rated values using the ASTM method gave a 61-317% difference from the actual measured energy use per page, while using actual usage profiles with the ASTM method gave a 0-6% difference. This thesis provides information on a variety of subjects in the area of energy use and energy efficiency in office technology. The results provide information for emerging programs and a strong basis for a variety of further research. by Cyane Bemiss Dandridge. M.S.
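
    The per-page comparison in Chapter 5 reduces to a simple energy-accounting calculation; the sketch below illustrates the arithmetic with assumed power levels, operating hours, and page counts rather than the thesis's measured values.

```python
# Illustrative arithmetic only: power levels, hours, and page counts are
# assumed values, not measurements from the thesis.
ACTIVE_POWER_W = 900.0   # power while copying
IDLE_POWER_W   = 150.0   # standby, ready to copy
SAVER_POWER_W  = 40.0    # energy-saver mode

def energy_per_page_wh(pages_per_day, active_h, idle_h, saver_h):
    """Energy use per page (Wh/page) for a given daily operating profile.

    The Chapter 5 comparison amounts to evaluating this quantity once with
    the profile assumed by the ASTM procedure and once with the profile
    actually measured for the machine, then comparing both to metered use.
    """
    daily_energy_wh = (ACTIVE_POWER_W * active_h
                       + IDLE_POWER_W * idle_h
                       + SAVER_POWER_W * saver_h)
    return daily_energy_wh / pages_per_day

# Example: 500 pages/day, 1 h active, 7 h idle, 16 h in energy-saver mode.
print(round(energy_per_page_wh(500, 1.0, 7.0, 16.0), 2), "Wh/page")  # 5.18
```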