    Data Extraction from Hand-filled Form using Form Template

    Databases are vital for day-to-day decision making and, in the long run, help an organization formulate policies and strategies. Considerable effort, time, and money are spent collecting, storing, and processing data. To collect data from a user, an interface known as a form is designed; forms range from paper-based to online. Manually processing paper-based forms is prone to errors, so it is useful to deploy automated systems that read data from paper-based forms and store it in a database, where it can subsequently be modified, processed, and analyzed. In this paper, we propose a method to extract data from hand-filled, pre-designed forms based on form templates. DOI: 10.17762/ijritcc2321-8169.15084
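    As a rough illustration of template-based field extraction (not necessarily the paper's actual method), the sketch below registers a scanned, hand-filled form to its blank template with OpenCV and crops the handwriting from field regions assumed to be defined in template coordinates; the field names and box coordinates are hypothetical.

```python
# Hypothetical sketch: extract handwritten field regions from a filled form
# by registering it to a blank template and cropping template-defined boxes.
import cv2
import numpy as np

# Field regions in template coordinates (x, y, w, h) -- illustrative values only.
FIELDS = {"name": (120, 80, 400, 60), "date": (120, 160, 200, 60)}

def register_to_template(filled, template):
    """Estimate a homography mapping the filled form onto the template."""
    orb = cv2.ORB_create(2000)
    k1, d1 = orb.detectAndCompute(filled, None)
    k2, d2 = orb.detectAndCompute(template, None)
    matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
    matches = sorted(matcher.match(d1, d2), key=lambda m: m.distance)[:200]
    src = np.float32([k1[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    dst = np.float32([k2[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)
    h, w = template.shape[:2]
    return cv2.warpPerspective(filled, H, (w, h))

def extract_fields(filled_path, template_path):
    filled = cv2.imread(filled_path, cv2.IMREAD_GRAYSCALE)
    template = cv2.imread(template_path, cv2.IMREAD_GRAYSCALE)
    aligned = register_to_template(filled, template)
    crops = {}
    for name, (x, y, w, h) in FIELDS.items():
        # Subtracting the template leaves (mostly) the user-added handwriting.
        region = cv2.absdiff(aligned[y:y+h, x:x+w], template[y:y+h, x:x+w])
        crops[name] = region  # hand off to an OCR/ICR engine downstream
    return crops
```

    The cropped regions would typically be passed to a handwriting recognizer; the absdiff step is a simple way to suppress pre-printed template ink before recognition.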

    Recognition and identification of form document layouts

    In this thesis, a hierarchical tree representation is introduced to represent the logical structure of a form document. Because different forms may share the same logical structure, this representation alone can be ambiguous; an improvement is therefore proposed that resolves the ambiguity using the physical information of the blocks. To support the hierarchical tree representation and extract that physical information, a pixel tracing approach is used to extract form layout structures from form documents. Compared with the Hough transform, the pixel tracing algorithm requires less computation. The algorithm has been tested on 50 different table forms: it effectively extracts all the line information required for the hierarchical tree representation, represents each form by a hierarchical tree, and distinguishes the different forms. The algorithm applies to table form documents.
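    The thesis's pixel tracing algorithm is not detailed here; as a hedged sketch of the general idea, the code below scans a binarized table-form image row by row and column by column for long ink runs, which is cheaper than a Hough transform for axis-aligned ruling lines. The run-length threshold and merging gap are assumptions.

```python
# Hypothetical sketch of a pixel-tracing style line extractor: scan each row
# and column of a binarized form image for long runs of foreground pixels,
# a cheaper alternative to a Hough transform for axis-aligned table lines.
import numpy as np

def trace_lines(binary, min_len_ratio=0.5):
    """Return row/column indices containing an ink run longer than
    min_len_ratio of the image width/height."""
    h, w = binary.shape
    ink = binary > 0  # assume foreground (line) pixels are nonzero
    rows = [y for y in range(h) if _longest_run(ink[y, :]) >= min_len_ratio * w]
    cols = [x for x in range(w) if _longest_run(ink[:, x]) >= min_len_ratio * h]
    return _merge_adjacent(rows), _merge_adjacent(cols)

def _longest_run(line):
    """Length of the longest consecutive run of True values."""
    best = cur = 0
    for v in line:
        cur = cur + 1 if v else 0
        best = max(best, cur)
    return best

def _merge_adjacent(indices, gap=2):
    """Collapse neighbouring indices (a thick line spans several pixels)."""
    merged = []
    for i in indices:
        if merged and i - merged[-1][-1] <= gap:
            merged[-1].append(i)
        else:
            merged.append([i])
    return [int(np.mean(g)) for g in merged]
```

    The detected horizontal and vertical line positions define a cell grid from which a hierarchical tree of blocks could then be built.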

    Information Preserving Processing of Noisy Handwritten Document Images

    Many pre-processing techniques that normalize artifacts and clean noise induce anomalies due to discretization of the document image, and important information that could be used at later stages may be lost. A proposed composite-model framework takes into account pre-printed information, user-added data, and digitization characteristics, and its benefits are demonstrated by experiments with statistically significant results. Separating pre-printed ruling lines from user-added handwriting shows how ruling lines affect people's handwriting and how they can be exploited for identifying writers. Ruling line detection based on multi-line linear regression reduces the mean error of counting the lines from 0.10 to 0.03, 6.70 to 0.06, and 0.13 to 0.02, compared to an HMM-based approach on three standard test datasets, thereby reducing human correction time by 50%, 83%, and 72% on average. On 61 page images from 16 rule-form templates, the precision and recall of form cell recognition are increased by 2.7% and 3.7%, compared to a cross-matrix approach. Compensating for and exploiting ruling lines during feature extraction rather than pre-processing raises writer identification accuracy from 61.2% to 67.7% on a 61-writer noisy Arabic dataset. Similarly, counteracting page-wise skew by subtracting it or transforming contours in a continuous coordinate system during feature extraction improves writer identification accuracy. An implementation study of contour-hinge features reveals that utilizing the full probability distribution function matrix improves writer identification accuracy from 74.9% to 79.5%.
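    Contour-hinge features are a standard writer-identification descriptor; the sketch below is an assumption-laden illustration (not the thesis's implementation) that accumulates the joint histogram of the two hinge-leg angles at each contour point and normalizes the full matrix into a probability distribution, in the spirit of the "full probability distribution function matrix" variant mentioned above. The leg length and bin count are arbitrary choices, and OpenCV 4's findContours signature is assumed.

```python
# Hypothetical sketch of a contour-hinge feature: for each contour point,
# measure the angles of two "legs" reaching a few pixels forward and backward
# along the contour, and accumulate a joint (angle, angle) histogram.
import numpy as np
import cv2

def contour_hinge(binary, leg=5, bins=12):
    """Return a flattened, normalized joint-angle histogram (bins x bins)."""
    contours, _ = cv2.findContours(binary, cv2.RETR_LIST, cv2.CHAIN_APPROX_NONE)
    hist = np.zeros((bins, bins), dtype=np.float64)
    for c in contours:
        pts = c.reshape(-1, 2)
        n = len(pts)
        if n < 2 * leg + 1:
            continue
        for i in range(n):
            p, a, b = pts[i], pts[(i + leg) % n], pts[(i - leg) % n]
            phi1 = np.arctan2(a[1] - p[1], a[0] - p[0])
            phi2 = np.arctan2(b[1] - p[1], b[0] - p[0])
            i1 = int((phi1 + np.pi) / (2 * np.pi) * bins) % bins
            i2 = int((phi2 + np.pi) / (2 * np.pi) * bins) % bins
            hist[i1, i2] += 1
    total = hist.sum()
    # Normalize to a probability distribution; keep the full matrix rather
    # than only the upper triangle, per the "full PDF matrix" idea above.
    return (hist / total).ravel() if total else hist.ravel()
```

    Writer identification would then compare these normalized histograms across documents, for example with a chi-squared or Euclidean distance.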