Search CORE

1,897 research outputs found

Text Classification Aided by Clustering: a Literature Review

Author: Kyriakopoulou Antonia
Publication venue: 'IntechOpen'
Publication date: 01/08/2008
Field of study

IntechOpen

Crossref

A Review of Codebook Models in Patch-Based Visual Object Recognition

Author: Niranjan Mahesan
Ramanan Amirthalingam
Publication venue
Publication date: 22/09/2011
Field of study

The codebook model-based approach, while ignoring any structural aspect in vision, nonetheless provides state-of-the-art performances on current datasets. The key role of a visual codebook is to provide a way to map the low-level features into a fixed-length vector in histogram space to which standard classifiers can be directly applied. The discriminative power of such a visual codebook determines the quality of the codebook model, whereas the size of the codebook controls the complexity of the model. Thus, the construction of a codebook is an important step which is usually done by cluster analysis. However, clustering is a process that retains regions of high density in a distribution and it follows that the resulting codebook need not have discriminant properties. This is also recognised as a computational bottleneck of such systems. In our recent work, we proposed a resource-allocating codebook, to constructing a discriminant codebook in a one-pass design procedure that slightly outperforms more traditional approaches at drastically reduced computing times. In this review we survey several approaches that have been proposed over the last decade with their use of feature detectors, descriptors, codebook construction schemes, choice of classifiers in recognising objects, and datasets that were used in evaluating the proposed methods

Southampton (e-Prints Soton)

Novel Heuristic Recurrent Neural Network Framework to Handle Automatic Telugu Text Categorization from Handwritten Text Image

Author: Boddu Rajasekhar
Reddy Edara Sreenivasa
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 12/05/2023
Field of study

In the near future, the digitization and processing of the current paper documents describe efficient role in the creation of a paperless environment. Deep learning techniques for handwritten recognition have been extensively studied by various researchers. Deep neural networks can be trained quickly thanks to a lot of data and other algorithmic advancements. Various methods for extracting text from handwritten manuscripts have been developed in literature. To extract features from written Telugu Text image having some other neural network approaches like convolution neural network (CNN), recurrent neural networks (RNN), long short-term memory (LSTM). Different deep learning related approaches are widely used to identification of handwritten Telugu Text; various techniques are used in literature for the identification of Telugu Text from documents. For automatic identification of Telugu written script efficiently to eliminate noise and other semantic features present in Telugu Text, in this paper, proposes Novel Heuristic Advanced Neural Network based Telugu Text Categorization Model (NHANNTCM) based on sequence-to-sequence feature extraction procedure. Proposed approach extracts the features using RNN and then represents Telugu Text in sequence-to-sequence format for the identification advanced neural network performs both encoding and decoding to identify and explore visual features from sequence of Telugu Text in input data. The classification accuracy rates for Telugu words, Telugu numerals, Telugu characters, Telugu sentences, and the corresponding Telugu sentences were 99.66%, 93.63%, 91.36%, 99.05%, and 97.73% consequently. Experimental evaluation describe extracted with revealed which are textured i.e. TENG shown considerable operations in applications such as private information protection, security defense, and personal handwriting signature identification

International Journal on Recent and Innovation Trends in Computing and Communication

Personalized driver workload inference by learning from vehicle related measurements

Author: Cunjia Liu (1176420)
Dewei Yi (7121501)
Jinya Su (1260438)
Wen-Hua Chen (1251597)
Publication venue
Publication date: 01/01/2017
Field of study

Adapting in-vehicle systems (e.g. Advanced Driver Assistance Systems, In-Vehicle Information Systems) to individual drivers’ workload can enhance safety and convenience. To make this possible, it is a prerequisite to infer driver workload so that adaptive aiding can be provided to the driver at the right time and in a proper manner. Rather than developing an average model for all drivers, a Personalized Driver Workload Inference (PDWI) system considering individual drivers’ driving characteristics is developed using machine learning techniques via easily accessed Vehicle Related Measurements (VRMs). The proposed PDWI system comprises two stages. In offline training, individual drivers’ workload is first automatically splitted into different categories according to its inherent data characteristics using Fuzzy C means clustering. Then an implicit mapping between VRMs and different levels of workload is constructed via classification algorithms. In online implementation, VRMs samples are classified into different clusters, consequently driver workload can be successfully inferred. A recently collected dataset from real-world naturalistic driving experiments is drawn to validate the proposed PDWI system. Comparative experimental results indicate that the proposed framework integrating Fuzzy C-means clustering and Support Vector Machine classifier provides a promising workload recognition performance in terms of accuracy, precision, recall, F1-score and prediction time. The inter-individual differences in term of workload are also identified and can be accommodated by the proposed framework due to its adaptiveness

Loughborough University Institutional Repository

Ensembles of Novel Visual Keywords Descriptors for Image Categorization

Author: Abdullah Azizi
Veltkamp Remco C.
Wiering Marco
Publication venue
Publication date: 01/01/2010
Field of study

Dissertations of the University of Groningen

テキスト自動分類のクラスインデックスとクラスセマンティックインデックスに基づく用語重み付けアプローチ

Author: Sohrab Mohammad Golam
Publication venue
Publication date: 29/10/2017
Field of study

Tokushima University Institutional Repository

Effect of Term Weighting on Keyword Extraction in Hierarchical Category Structure

Author: Boonbrahm Salin
Chiraratanasopha Boonthida
Theeramunkong Thanaruk
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 03/08/2021
Field of study

While there have been several studies related to the effect of term weighting on classification accuracy, relatively few works have been conducted on how term weighting affects the quality of keywords extracted for characterizing a document or a category (i.e., document collection). Moreover, many tasks require more complicated category structure, such as hierarchical and network category structure, rather than a flat category structure. This paper presents a qualitative and quantitative study on how term weighting affects keyword extraction in the hierarchical category structure, in comparison to the flat category structure. A hierarchical structure triggers special characteristic in assigning a set of keywords or tags to represent a document or a document collection, with support of statistics in a hierarchy, including category itself, its parent category, its child categories, and sibling categories. An enhancement of term weighting is proposed particularly in the form of a series of modified TFIDF's, for improving keyword extraction. A text collection of public-hearing opinions is used to evaluate variant TFs and IDFs to identify which types of information in hierarchical category structure are useful. By experiments, we found that the most effective IDF family, namely TF-IDFr, is identity>sibling>child>parent in order. The TF-IDFr outperforms the vanilla version of TFIDF with a centroid-based classifier

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)