1,043,409 research outputs found
On the Analysis of Neural Networks for Image Processing
This paper illustrates a novel method to analyze artificial neural networks so as to gain insight into their internal functionality. To this purpose, we will show analysis results of some feed-forwardÂżerror-back-propagation neural networks for image processing. We will describe them in terms of domain-dependent basic functions, which are, in the case of the digital image processing domain, differential operators of various orders and with various angles of operation. Some other pixel classification techniques are analyzed in the same way, enabling easy comparison
Naive Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages
Web classification has been attempted through many different technologies. In this study we concentrate on the comparison of Neural Networks (NN), NaĂŻve Bayes (NB) and Decision Tree (DT) classifiers for the automatic analysis and classification of attribute data from training course web pages. We introduce an enhanced NB classifier and run the same data sample through the DT and NN classifiers to determine the success rate of our classifier in the training courses domain. This research shows that our enhanced NB classifier not only outperforms the traditional NB classifier, but also performs similarly as good, if not better, than some more popular, rival techniques. This paper also shows that, overall, our NB classifier is the best choice for the training courses domain, achieving an impressive F-Measure value of over 97%, despite it being trained with fewer samples than any of the classification systems we have encountered
Handwritten digit classification
Pattern recognition is one of the major challenges in statistics framework. Its goal is the feature extraction to classify the patterns into categories. A well-known example in this field is the handwritten digit recognition where digits have to be assigned into one of the 10 classes using some classification method. Our purpose is to present alternative classification methods based on statistical techniques. We show a comparison between a multivariate and a probabilistic approach, concluding that both methods provide similar results in terms of test-error rate. Experiments are performed on the known MNIST and USPS databases in binary-level image. Then, as an additional contribution we introduce a novel method to binarize images, based on statistical concepts associated to the written trace of the digitDigit, Classification, Images
Robust Adaptive Median Binary Pattern for noisy texture classification and retrieval
Texture is an important cue for different computer vision tasks and
applications. Local Binary Pattern (LBP) is considered one of the best yet
efficient texture descriptors. However, LBP has some notable limitations,
mostly the sensitivity to noise. In this paper, we address these criteria by
introducing a novel texture descriptor, Robust Adaptive Median Binary Pattern
(RAMBP). RAMBP based on classification process of noisy pixels, adaptive
analysis window, scale analysis and image regions median comparison. The
proposed method handles images with high noisy textures, and increases the
discriminative properties by capturing microstructure and macrostructure
texture information. The proposed method has been evaluated on popular texture
datasets for classification and retrieval tasks, and under different high noise
conditions. Without any train or prior knowledge of noise type, RAMBP achieved
the best classification compared to state-of-the-art techniques. It scored more
than under impulse noise densities, more than under
Gaussian noised textures with standard deviation , and more than
under Gaussian blurred textures with standard deviation .
The proposed method yielded competitive results and high performance as one of
the best descriptors in noise-free texture classification. Furthermore, RAMBP
showed also high performance for the problem of noisy texture retrieval
providing high scores of recall and precision measures for textures with high
levels of noise
Uncovering protein interaction in abstracts and text using a novel linear model and word proximity networks
We participated in three of the protein-protein interaction subtasks of the
Second BioCreative Challenge: classification of abstracts relevant for
protein-protein interaction (IAS), discovery of protein pairs (IPS) and text
passages characterizing protein interaction (ISS) in full text documents. We
approached the abstract classification task with a novel, lightweight linear
model inspired by spam-detection techniques, as well as an uncertainty-based
integration scheme. We also used a Support Vector Machine and the Singular
Value Decomposition on the same features for comparison purposes. Our approach
to the full text subtasks (protein pair and passage identification) includes a
feature expansion method based on word-proximity networks. Our approach to the
abstract classification task (IAS) was among the top submissions for this task
in terms of the measures of performance used in the challenge evaluation
(accuracy, F-score and AUC). We also report on a web-tool we produced using our
approach: the Protein Interaction Abstract Relevance Evaluator (PIARE). Our
approach to the full text tasks resulted in one of the highest recall rates as
well as mean reciprocal rank of correct passages. Our approach to abstract
classification shows that a simple linear model, using relatively few features,
is capable of generalizing and uncovering the conceptual nature of
protein-protein interaction from the bibliome. Since the novel approach is
based on a very lightweight linear model, it can be easily ported and applied
to similar problems. In full text problems, the expansion of word features with
word-proximity networks is shown to be useful, though the need for some
improvements is discussed
Text based classification of companies in CrunchBase
This paper introduces two fuzzy fingerprint based text classification techniques that were successfully applied to automatically label companies from CrunchBase, based purely on their unstructured textual description. This is a real and very challenging problem due to the large set of possible labels (more than 40) and also to the fact that the textual descriptions do not have to abide by any criteria and are, therefore, extremely heterogeneous. Fuzzy fingerprints are a recently introduced technique that can be used for performing fast classification. They perform well in the presence of unbalanced datasets and can cope with a very large number of classes. In the paper, a comparison is performed against some of the best text classification techniques commonly used to address similar problems. When applied to the CrunchBase dataset, the fuzzy fingerprint based approach outperformed the other techniques.info:eu-repo/semantics/submittedVersio
Improvisation of classification performance based on feature optimization for differentiation of Parkinsonâs disease from other neurological diseases using gait characteristics
Most neurological disorders that include Parkinsonâs disease (PD) as well as other neurological diseases such as Amyotrophic Lateral Sclerosis (ALS) and Huntingtonâs disease (HD) have some common abnormalities regarding the movement, vocal, and cognitive behaviors of sufferers. Variations in the manifestation of these types of abnormality help distinguish one disorder from another. In this study, differentiation was performed based on the gait characteristics of patients afflicted by different neurological disorders. In the recent past, many researchers have applied different machine learning and feature selection techniques to the classification of different groups of patients based on common abnormalities. However, in an era of modernization where the focus is on timely low-cost automatization and pattern recognition, such techniques require improvisation to provide high performance. We attempted to improve the performance of such techniques using different feature optimization methods, such as a genetic algorithm (GA) and principal component analysis (PCA), and applying different classification approaches, i.e., linear, nonlinear, and probabilistic classifiers. In this study, gait dynamics data of patients suffering with PD, ALS, and HD were collated from a public database, and a binary classification approach was used by taking PD as one group and adopting ALS+HD as another group. Performance comparison was achieved using different classification techniques that incorporated optimized feature sets obtained from GA and PCA. In comparison with other classifiers using different feature sets, the highest accuracy (97.87%) was obtained using random forest combined with GA-based feature sets. The results provide evidence that could assist medical practitioners in differentiating PD from other neurological diseases using gait characteristics
- âŠ