18 research outputs found

    Recognition of similar characters using gradient features of discriminative regions

    No full text
    One important and challenging issue in handwritten character recognition is the discrimination of visually similar characters. In this paper, we propose a character recognition method for distinguishing similar characters by augmenting commonly used image feature with gradient features from potentially discriminative image regions. The discriminative regions of similar characters sets are automatically detected by analysing the weight vectors of the sparsity promoting logistic fused Lasso method. The histogram of oriented gradients is adopted to compactly represent the gradient features. Additionally, the locality preserving projection method is employed to alleviate the high dimensional nature of the resulting feature vectors. Experimental results on handwritten Lanna Dhamma and Thai characters datasets demonstrate the capability of the proposed method in discriminating visually similar characters. The method also outperforms existing character recognition methods by considerable margins. It has a great potential for character recognition of other alphabets

    Relationship between Height-Weight Difference Index and Body-Fat Percentage Estimated by Bioelectrical Impedance Analysis in Thai Adults

    No full text
    Introduction. The height-weight difference index (HWDI) is a new indicator for evaluating obesity status. While body-fat percentage (BF%) is considered to be the most accurate obesity evaluation tool, it is a more expensive method and more difficult to measure than the others. Objective. Our objectives were to find the relationship between HWDI and BF% and to find a BF% prediction model from HWDI in relation to age and gender. Method. Bioelectrical impedance analysis was used to measure BF% in 2,771 healthy adult Thais. HWDI was calculated as the difference between height and weight. Pearson’s correlation coefficient was used to assess the relationship between HWDI and BF%. Multiple linear and nonlinear regression analysis were used to construct the BF% prediction model. Results. HWDI and BF% were found to be inverse which related to a tendency toward a linear relationship. Results of a multivariate linear regression analysis, which included HWDI and age as variables in the model, predicted BF% to be 34.508 − 0.159 (HWDI) + 0.161 (age) for men and 53.35 − 0.265 (HWDI) + 0.132 (age) for women. Conclusions. The prediction model provides an easy-to-use obesity evaluation tool that should help awareness of underweight and obesity conditions

    RNA secondary structure prediction using conditional random fields model

    No full text
    Non-coding RNAs (ncRNAs) have important biological functions in living cells dependent on their conserved secondary structures. Here, we focus on computational RNA secondary structure prediction by exploring primary sequences and complementary base pair interactions using the Conditional Random Fields (CRFs) model, which treats RNA prediction as a sequence labelling problem. Proposing suitable feature extraction from known RNA secondary structures, we developed a feature extraction based on natural RNA's loop and stem characteristics. Our CRFs models can predict the secondary structures of the test RNAs with optimal F-score prediction between 56.61 and 98.20% for different RNA families

    SCMCRYS: predicting protein crystallization using an ensemble scoring card method with estimating propensity scores of P-collocated amino acid pairs.

    Get PDF
    Existing methods for predicting protein crystallization obtain high accuracy using various types of complemented features and complex ensemble classifiers, such as support vector machine (SVM) and Random Forest classifiers. It is desirable to develop a simple and easily interpretable prediction method with informative sequence features to provide insights into protein crystallization. This study proposes an ensemble method, SCMCRYS, to predict protein crystallization, for which each classifier is built by using a scoring card method (SCM) with estimating propensity scores of p-collocated amino acid (AA) pairs (p=0 for a dipeptide). The SCM classifier determines the crystallization of a sequence according to a weighted-sum score. The weights are the composition of the p-collocated AA pairs, and the propensity scores of these AA pairs are estimated using a statistic with optimization approach. SCMCRYS predicts the crystallization using a simple voting method from a number of SCM classifiers. The experimental results show that the single SCM classifier utilizing dipeptide composition with accuracy of 73.90% is comparable to the best previously-developed SVM-based classifier, SVM_POLY (74.6%), and our proposed SVM-based classifier utilizing the same dipeptide composition (77.55%). The SCMCRYS method with accuracy of 76.1% is comparable to the state-of-the-art ensemble methods PPCpred (76.8%) and RFCRYS (80.0%), which used the SVM and Random Forest classifiers, respectively. This study also investigates mutagenesis analysis based on SCM and the result reveals the hypothesis that the mutagenesis of surface residues Ala and Cys has large and small probabilities of enhancing protein crystallizability considering the estimated scores of crystallizability and solubility, melting point, molecular weight and conformational entropy of amino acids in a generalized condition. The propensity scores of amino acids and dipeptides for estimating the protein crystallizability can aid biologists in designing mutation of surface residues to enhance protein crystallizability. The source code of SCMCRYS is available at http://iclab.life.nctu.edu.tw/SCMCRYS/

    Distribution of locations of high-score dipeptides on the two typical sequences 3K9I and Q4V970.

    No full text
    <p>The distribution of locations of high-score dipeptides on the two typical sequences 3K9I and Q4V970 correctly predicted as crystallizable and non-crystallizable proteins, respectively.</p
    corecore