12,463 research outputs found

    Least squares support vector machines for direction of arrival estimation

    Get PDF
    Machine learning research has largely been devoted to binary and multiclass problems relating to data mining, text categorization, and pattern/facial recognition. Recently, popular machine learning algorithms, including support vector machines (SVM), have successfully been applied to wireless communication problems. The paper presents a multiclass least squares SVM (LS-SVM) architecture for direction of arrival (DOA) estimation as applied to a CDMA cellular system. Simulation results show a high degree of accuracy, as related to the DOA classes, and prove that the LS-SVM DDAG (decision directed acyclic graph) system has a wide range of performance capabilities. The multilabel capability for multiple DOAs is discussed. Multilabel classification is possible with the LS-SVM DDAG algorithm presented

    Automatic document classification of biological literature

    Get PDF
    Background: Document classification is a wide-spread problem with many applications, from organizing search engine snippets to spam filtering. We previously described Textpresso, a text-mining system for biological literature, which marks up full text according to a shallow ontology that includes terms of biological interest. This project investigates document classification in the context of biological literature, making use of the Textpresso markup of a corpus of Caenorhabditis elegans literature. Results: We present a two-step text categorization algorithm to classify a corpus of C. elegans papers. Our classification method first uses a support vector machine-trained classifier, followed by a novel, phrase-based clustering algorithm. This clustering step autonomously creates cluster labels that are descriptive and understandable by humans. This clustering engine performed better on a standard test-set (Reuters 21578) compared to previously published results (F-value of 0.55 vs. 0.49), while producing cluster descriptions that appear more useful. A web interface allows researchers to quickly navigate through the hierarchy and look for documents that belong to a specific concept. Conclusions: We have demonstrated a simple method to classify biological documents that embodies an improvement over current methods. While the classification results are currently optimized for Caenorhabditis elegans papers by human-created rules, the classification engine can be adapted to different types of documents. We have demonstrated this by presenting a web interface that allows researchers to quickly navigate through the hierarchy and look for documents that belong to a specific concept
    • …
    corecore