Search CORE

102 research outputs found

A System for Bangla Handwritten Numeral Recognition

Author: Belaïd Abdel
Chaudhuri B.B.
Pal Umapada
Publication venue: HAL CCSD
Publication date: 01/03/2003
Field of study

Colloque avec actes et comité de lecture. internationale.International audienceThis paper deals with a recognition system for unconstrained off-line Bangla handwritten numerals. To take care of variability involved in the writing style of different individuals, a robust scheme is presented here. The scheme is mainly based on new features obtained from the concept of water overflow from the reservoir as well as topological and structural features of the numerals. The proposed scheme is tested on data collected from different individuals of various background and we obtained an overall recognition accuracy of about 92.8% from 12000 data

INRIA a CCSD electronic archive server

A System for Bangla Handwritten Numeral Recognition

Author: Belaïd Abdel
Chaudhuri B.B.
Pal Umapada
Publication venue: HAL CCSD
Publication date: 01/01/2002
Field of study

INRIA a CCSD electronic archive server

A System for Bangla Handwritten Numeral Recognition

Author: Belaïd Abdel
Bidyut B. Chaudhuri
Pal Umapada
Publication venue: Institution of Electronics and Telecommunication Engineers, CDRAP Sharma, IN
Publication date: 01/01/2006
Field of study

International audienceThis paper deals with a recognition system for unconstrained off-line Bangla handwritten numerals. To take care of variability involved in the writing style of different individuals, a robust scheme is presented here. The scheme is mainly based on new features obtained from the concept of water overflow from the reservoir as well as topological and structural features of the numerals. The proposed scheme is tested on data collected from different individuals of various background and we obtained an overall recognition accuracy of about 92.8% from 12000 data

INRIA a CCSD electronic archive server

MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten Compound Characters

Author: Ferdous Jannatul
Hossain Syed Akhter
Karmaker Suvrajit
Rabby A K M Shahariar Azad
Publication venue
Publication date: 06/05/2020
Field of study

At present, recognition of the Bangla handwriting compound character has been an essential issue for many years. In recent years there have been application-based researches in machine learning, and deep learning, which is gained interest, and most notably is handwriting recognition because it has a tremendous application such as Bangla OCR. MatrriVasha, the project which can recognize Bangla, handwritten several compound characters. Currently, compound character recognition is an important topic due to its variant application, and helps to create old forms, and information digitization with reliability. But unfortunately, there is a lack of a comprehensive dataset that can categorize all types of Bangla compound characters. MatrriVasha is an attempt to align compound character, and it's challenging because each person has a unique style of writing shapes. After all, MatrriVasha has proposed a dataset that intends to recognize Bangla 120(one hundred twenty) compound characters that consist of 2552(two thousand five hundred fifty-two) isolated handwritten characters written unique writers which were collected from within Bangladesh. This dataset faced problems in terms of the district, age, and gender-based written related research because the samples were collected that includes a verity of the district, age group, and the equal number of males, and females. As of now, our proposed dataset is so far the most extensive dataset for Bangla compound characters. It is intended to frame the acknowledgment technique for handwritten Bangla compound character. In the future, this dataset will be made publicly available to help to widen the research.Comment: 19 fig, 2 tabl

arXiv.org e-Print Archive

A fuzzy approach to segment touching characters

Author: AIRO' FARULLA Giuseppe
Murru Nadir
Rossini Rosaria
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Institutional Research Information System University of Turin

Segmentation of Isolated and Touching Characters in Offline Handwritten Gurmukhi Script Recognition

Author
Publication venue: 'MECS Publisher'
Publication date
Field of study

Crossref

Recognition-based Approach of Numeral Extraction in Handwritten Chemistry Documents using Contextual Knowledge

Author: Belaid Abdel
Ghanmi Nabil
Publication venue: HAL CCSD
Publication date: 11/04/2016
Field of study

International audienceThis paper presents a complete procedure that uses contextual and syntactic information to identify and recognize amount fields in the table regions of chemistry documents. The proposed method is composed of two main modules. Firstly, a structural analysis based on connected component (CC) dimensions and positions identifies some special symbols and clusters other CCs into three groups: fragment of characters, isolated characters or connected characters. Then, a specific processing is performed on each group of CCs. The fragment of characters are merged with the nearest character or string using geometric relationship based rules. The characters are sent to a recognition module to identify the numeral components. For the connected characters, the final decision on the string nature (numeric or non-numeric) is made based on a global score computed on the full string using the height regularity property and the recognition probabilities of its segmented fragments. Finally, a simple syntactic verification at table row level is conducted in order to correct eventual errors. The experimental tests are carried out on real-world chemistry documents provided by our industrial partner eNovalys. The obtained results show the effectiveness of the proposed system in extracting amount fields

INRIA a CCSD electronic archive server

Automation of Indian Postal Documents written in Bangla and English

Author: B Chaudhuri Bidyut
Belaïd Abdel
Pal Umapada
Roy Kaushik
Vajda Szilárd
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 02/12/2009
Field of study

International audienceIn this paper, we present a system towards Indian postal automation based on pin-code and city name recognition. Here, at first, using Run Length Smoothing Approach (RLSA), non-text blocks (postal stamp, postal seal, etc.) are detected and using positional information Destination Address Block (DAB) is identified from postal documents. Next, lines and words of the DAB are segmented. In India, the address part of a postal document may be written by combination of two scripts: Latin (English) and a local (State/region) script. It is very difficult to identify the script by which pin-code part is written. To overcome this problem on pin-code part, we have used two-stage artificial neural network based general scheme to recognize pin-code numbers written in any of the two scripts. To identify the script by which a word/city name is written, we propose a water reservoir concept based feature. For recognition of city names, we propose an NSHP-HMM (Non- Symmetric Half Plane-Hidden Markov Model) based technique. At present, the accuracy of the proposed digit numeral recognition module is 93.14% while that of city name recognition scheme is 86.44%

INRIA a CCSD electronic archive server