226 research outputs found
Radical Recognition in Off-Line Handwritten Chinese Characters Using Non-Negative Matrix Factorization
In the past decade, handwritten Chinese character recognition has received renewed interest with the emergence of touch screen devices. Other popular applications include on-line Chinese character dictionary look-up and visual translation in mobile phone applications. Due to the complex structure of Chinese characters, this classification task is not exactly an easy one, as it involves knowledge from mathematics, computer science, and linguistics.
Given a large image database of handwritten character data, the goal of my senior project is to use Non-Negative Matrix Factorization (NMF), a recent method for finding a suitable representation (parts-based representation) of image data, to detect specific sub-components in Chinese characters. NMF has only been applied to typed (printed) Chinese characters in different fonts. This project focuses specifically on how well NMF works on handwritten characters. In addition, research in Chinese character classification has mainly been done using holistic approaches - treating each character as an inseparable unit. By using NMF, this project takes a different approach by focusing on a more specific problem in Chinese character classification: radical (sub-component) detection.
Finally, a possible application of radical detection will be proposed. This interactive application can potentially help Chinese language learners better recognize characters by radicals
Use of prior knowledge in classification of similar and structured objects
Statistical machine learning has achieved great success in many fields in the last few decades. However, there remain classification problems that computers still struggle to match human performance. Many such problems share the same properties---large within class variability and complex structure in the examples, which is often true for real world objects. This does not mean lack of information for classification in the examples. On the contrary, there is still a clear pattern in the examples, but hidden behind a many-way covariance structure such that useful information is too dilute for conventional statistical machine learners to pick up. However, if we can exploit the structural nature of the objects and concentrate information about the classification, the problem can become much easier. In this dissertation we propose a framework using prior knowledge about modeling the structures in the examples to concentrate information for classification. The framework is instantiated to the task of classifying pairs of similar offline handwritten Chinese characters. We empirically demonstrate that our proposed framework indeed concentrates useful information for classification and makes the classification problem easier for statistical learning. Our approach advances the state of the art both in offline handwritten character recognition and in machine learning
Chinese calligraphy: character style recognition based on full-page document
Calligraphy plays a very important role in the history of China. From ancient times to
modern times, the beauty of calligraphy has been passed down to the present. Different
calligraphy styles and structures have made calligraphy a beauty and embodiment in the
field of writing. However, the recognition of calligraphy style and fonts has always been
a blank in the computer field. The structural complexity of different calligraphy also
brings a lot of challenges to the recognition technology of computers. In my research, I
mainly discussed some of the main recognition techniques and some popular machine
learning algorithms in this field for more than 20 years, trying to find a new method of
Chinese calligraphy styles recognition and exploring its feasibility.
In our research, we searched for research papers 20 years ago. Most of the results are
about the content recognition of modern Chinese characters. At first, we analyze the
development of Chinese characters and the basic Chinese character theory. In the
analysis of the current recognition of Chinese characters (including handwriting online
and offline) in the computer field, it is more important to analyze various algorithms
and results, and to analyze how to use the experimental data, besides how they construct
the data set used for their test.
The research on the method of image processing based on Chinese calligraphy works
is very limited, and the data collection for calligraphy test is very limited also. The test
of dataset that used between different recognition technologies is also very different.
However, it has far-reaching significance for inheriting and carrying forward the
traditional Chinese culture. It is very necessary to develop and promote the recognition
of Chinese characters by means of computer tecnchque. In the current application field,
the font recognition of Chinese calligraphy can effectively help the library
administrators to identify the problem of the classification of the copybook, thus
avoiding the recognition of the calligraphy font which is difficult to perform manually
only through subjective experience.
In the past 10 years of technology, some techniques for the recognition of single
Chinese calligraphy fonts have been given. Most of them are the pre-processing of
calligraphy characters, the extraction of stroke primitives, the extraction of style
features, and the final classification of machine learning. The probability of the
classification of the calligraphy works. Such technical requirements are very large for
complex Chinese characters, the result of splitting and recognition is very large, and it
is difficult to accurately divide many complex font results. As a result, the recognition
rate is low, or the accuracy of recognition of a specific word is high, but the overall font
recognition accuracy is low.
We understand that Chinese calligraphy is a certain research value. In the field of
recognition, many research papers on the analysis of Chinese calligraphy are based on
the study of calligraphy and stroke. However, we have proposed a new method for
dealing with font recognition. The recognition technology is based on the whole page
of the document. It is studied in three steps: the first step is to use Fourier transform and
some Chinese calligraphy images and analyze the results. The second is that CNN is
based on different data sets to get some results. Finally, we made some improvements
to the CNN structure. The experimental results of the thesis show that the full-page
documents recognition method proposed can achieve high accuracy with the support of
CNN technology, and can effectively identify the different styles of Chinese calligraphy
in 5 styles. Compared with the traditional analysis methods, our experimental results
show that the method based on the full-page document is feasible, avoiding the
cumbersome font segmentation problem. This is more efficient and more accurate
Template Based Recognition of On-Line Handwriting
Software for recognition of handwriting has been available for several decades now and research on the subject have produced several different strategies for producing competitive recognition accuracies, especially in the case of isolated single characters. The problem of recognizing samples of handwriting with arbitrary connections between constituent characters (emph{unconstrained handwriting}) adds considerable complexity in form of the segmentation problem. In other words a recognition system, not constrained to the isolated single character case, needs to be able to recognize where in the sample one letter ends and another begins. In the research community and probably also in commercial systems the most common technique for recognizing unconstrained handwriting compromise Neural Networks for partial character matching along with Hidden Markov Modeling for combining partial results to string hypothesis. Neural Networks are often favored by the research community since the recognition functions are more or less automatically inferred from a training set of handwritten samples. From a commercial perspective a downside to this property is the lack of control, since there is no explicit information on the types of samples that can be correctly recognized by the system. In a template based system, each style of writing a particular character is explicitly modeled, and thus provides some intuition regarding the types of errors (confusions) that the system is prone to make. Most template based recognition methods today only work for the isolated single character recognition problem and extensions to unconstrained recognition is usually not straightforward. This thesis presents a step-by-step recipe for producing a template based recognition system which extends naturally to unconstrained handwriting recognition through simple graph techniques. A system based on this construction has been implemented and tested for the difficult case of unconstrained online Arabic handwriting recognition with good results
Information Preserving Processing of Noisy Handwritten Document Images
Many pre-processing techniques that normalize artifacts and clean noise induce anomalies due to discretization of the document image. Important information that could be used at later stages may be lost. A proposed composite-model framework takes into account pre-printed information, user-added data, and digitization characteristics. Its benefits are demonstrated by experiments with statistically significant results. Separating pre-printed ruling lines from user-added handwriting shows how ruling lines impact people\u27s handwriting and how they can be exploited for identifying writers. Ruling line detection based on multi-line linear regression reduces the mean error of counting them from 0.10 to 0.03, 6.70 to 0.06, and 0.13 to 0.02, com- pared to an HMM-based approach on three standard test datasets, thereby reducing human correction time by 50%, 83%, and 72% on average. On 61 page images from 16 rule-form templates, the precision and recall of form cell recognition are increased by 2.7% and 3.7%, compared to a cross-matrix approach. Compensating for and exploiting ruling lines during feature extraction rather than pre-processing raises the writer identification accuracy from 61.2% to 67.7% on a 61-writer noisy Arabic dataset. Similarly, counteracting page-wise skew by subtracting it or transforming contours in a continuous coordinate system during feature extraction improves the writer identification accuracy. An implementation study of contour-hinge features reveals that utilizing the full probabilistic probability distribution function matrix improves the writer identification accuracy from 74.9% to 79.5%
Hierarchichal attributed graph representation and recognition of handwritten Chinese characters
This thesis presents a system which is capable of recognizing handwritten Chinese characters. The hierarchical attributed graph representation (HAGR), a two-level graph, is introduced to describe the structural and statistical information of handwritten Chinese characters. The first level describes radicals and relations between radicals within a character, the second level describes strokes and relations between strokes in a radical. With HAGR, the recognition process becomes a simple task of graph matching. A cost function mapping a candidate to a model graph is introduced. This approach can tolerate the variations of HAGR which reflect the instablities and variabilities of handwritten Chinese characters resulting from different writing styles. Several rules have been used to re-arrange the order of the vertices of the graphs in order to avoid the combinatorial explosion inherent in graph matching. Based on HAGR, the model database is organized as a heterogeneous multi-way tree structure. For an input character, the search process can be divided into a number of simple and local decisions at different levels of the tree to find a corresponding model character in the database. The matching process is very efficient and accurate, and as well the system can acquire representations of characters by a learning process. Several HAGRs of samples of a character can be synthesized into a single HAGR of the character which can then be included in the model database. In addition, the learning process can update the models of characters with the HAGRs of their samples. The system is implemented in C on a MIPS/M-120 running RISC/OS (Version-3.1)
- …