13 research outputs found

    Reinforced Angle-Based Multicategory Support Vector Machines

    Get PDF
    The Support Vector Machine (SVM) is a very popular classification tool with many successful applications. It was originally designed for binary problems with desirable theoretical properties. Although there exist various Multicategory SVM (MSVM) extensions in the literature, some challenges remain. In particular, most existing MSVMs make use of k classification functions for a k-class problem, and the corresponding optimization problems are typically handled by existing quadratic programming solvers. In this paper, we propose a new group of MSVMs, namely the Reinforced Angle-based MSVMs (RAMSVMs), using an angle-based prediction rule with k − 1 functions directly. We prove that RAMSVMs can enjoy Fisher consistency. Moreover, we show that the RAMSVM can be implemented using the very efficient coordinate descent algorithm on its dual problem. Numerical experiments demonstrate that our method is highly competitive in terms of computational speed, as well as classification prediction performance. Supplemental materials for the article are available online

    GenSVM: a generalized multiclass support vector machine

    Get PDF
    Traditional extensions of the binary support vector machine (SVM) to multiclass problems are either heuristics or require solving a large dual optimization problem. Here, a generalized multiclass SVM is proposed called GenSVM. In this method classification boundaries for a K-class problem are constructed in a (K - 1)-dimensional space using a simplex encoding. Additionally, several different weightings of the misclassification errors are incorporated in the loss function, such that it generalizes three existing multiclass SVMs through a single optimization problem. An iterative majorization algorithm is derived that solves the optimization problem without the need of a dual formulation. This algorithm has the advantage that it can use warm starts during cross validation and during a grid search, which signifficantly speeds up the training phase. Rigorous numerical experiments compare linear GenSVM with seven existing multiclass SVMs on both small and large data sets. These comparisons show that the proposed method is competitive with existing methods in both predictive accuracy and training time, and that it signiffcantly outperforms several existing methods on these criteria

    Algorithms for Multiclass Classification and Regularized Regression

    Get PDF

    Algorithms for Multiclass Classification and Regularized Regression

    Get PDF

    Algorithms for Multiclass Classification and Regularized Regression

    Get PDF
    Multiclass classification and regularized regression problems are very common in modern statistical and machine learning applications. On the one hand, multiclass classification problems require the prediction of class labels: given observations of objects that belong to certain classes, can we predict to which class a new object belongs? On the other hand, the reg

    MLweb: A toolkit for machine learning on the web

    Get PDF
    International audienceThis paper describes MLweb, an open source software toolkit for machine learning on the web. The specificity of MLweb is that all computations are performed on the client side without the need to send data to a third-party server. MLweb includes three main components: a JavaScript API for scientific computing (LALOLib), an extension of this library with machine learning tools (ML.js) and an online development environment (LALOLab) with many examples

    Multiclass optimal classification trees with SVM‑splits

    Get PDF
    In this paper we present a novel mathematical optimization-based methodology to construct tree-shaped classification rules for multiclass instances. Our approach consists of building Classification Trees in which, except for the leaf nodes, the labels are temporarily left out and grouped into two classes by means of a SVM separating hyperplane. We provide a Mixed Integer Non Linear Programming formulation for the problem and report the results of an extended battery of computational experiments to assess the performance of our proposal with respect to other benchmarking classification methods.Universidad de Sevilla/CBUASpanish Ministerio de Ciencia y TecnologĂ­a, Agencia Estatal de InvestigaciĂłn, and Fondos Europeos de Desarrollo Regional (FEDER) via project PID2020-114594GB-C21Junta de AndalucĂ­a projects FEDER-US-1256951, P18-FR-1422, CEI-3-FQM331, B-FQM-322-UGR20AT 21_00032; FundaciĂłn BBVA through project NetmeetData: Big Data 2019UE-NextGenerationEU (ayudas de movilidad para la recualificaciĂłn del profesorado universitario)IMAG-Maria de Maeztu grant CEX2020- 001105-M /AEI /10.13039/50110001103

    Machine à vecteurs de support hyperbolique et ingénierie du noyau

    Get PDF
    Statistical learning theory is a field of inferential statistics whose foundations were laid by Vapnik at the end of the 1960s. It is considered a subdomain of artificial intelligence. In machine learning, support vector machines (SVM) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis. In this thesis, our aim is to propose two new statistical learning problems: one on the conception and evaluation of a multi-class SVM extension and another on the design of a new kernel for support vectors machines. First, we introduced a new kernel machine for multi-class pattern recognition : the hyperbolic support vector machine. Geometrically, it is characterized by the fact that its decision boundaries in the feature space are defined by hyperbolic functions. We then established its main statistical properties. Among these properties we showed that the classes of component functions are uniform Glivenko-Cantelli, this by establishing an upper bound of the Rademacher complexity. Finally, we establish a guaranteed risk for our classifier. Second, we constructed a new kernel based on the Fourier transform of a Gaussian mixture model. We proceed in the following way: first, each class is fragmented into a number of relevant subclasses, then we consider the directions given by the vectors obtained by taking all pairs of subclass centers of the same class. Among these are excluded those allowing to connect two subclasses of two different classes. We can also see this as the search for translation invariance in each class. It successfully on several datasets in the context of machine learning using multiclass support vector machines.La thĂ©orie statistique de l’apprentissage est un domaine de la statistique infĂ©rentielle dont les fondements ont Ă©tĂ© posĂ©s par Vapnik Ă  la fin des annĂ©es 60. Il est considĂ©rĂ© comme un sous-domaine de l’intelligence artificielle. Dans l’apprentissage automatique, les machines Ă  vecteurs de support (SVM) sont un ensemble de techniques d’apprentissage supervisĂ© destinĂ©es Ă  rĂ©soudre des problĂšmes de discrimination et de rĂ©gression. Dans cette thĂšse, notre objectif est de proposer deux nouveaux problĂšmes d’apprentissage statistique: Un portant sur la conception et l’évaluation d’une extension des SVM multiclasses et un autre sur la conception d’un nouveau noyau pour les machines Ă  vecteurs de support. Dans un premier temps, nous avons introduit une nouvelle machine Ă  noyau pour la reconnaissance de modĂšle multi-classe: la machine Ă  vecteur de support hyperbolique. GĂ©omĂ©triquement, il est caractĂ©risĂ© par le fait que ses surfaces de dĂ©cision dans l’espace de redescription sont dĂ©finies par des fonctions hyperboliques. Nous avons ensuite Ă©tabli ses principales propriĂ©tĂ©s statistiques. Parmi ces propriĂ©tĂ©s nous avons montrĂ© que les classes de fonctions composantes sont des classes de Glivenko-Cantelli uniforme, ceci en Ă©tablissant un majorant de la complexitĂ© de Rademacher. Enfin, nous Ă©tablissons un risque garanti pour notre classifieur.Dans un second temps, nous avons crĂ©er un nouveau noyau s’appuyant sur la transformation de Fourier d’un modĂšle de mĂ©lange gaussien. Nous procĂ©dons de la maniĂšre suivante: d’abord, chaque classe est fragmentĂ©e en un nombre de sous-classes pertinentes, ensuite on considĂšre les directions donnĂ©es par les vecteurs obtenus en prenant toutes les paires de centres de sous-classes d’une mĂȘme classe. Parmi celles-ci, sont exclues celles permettant de connecter deux sous-classes de deux classes diffĂ©rentes. On peut aussi voir cela comme la recherche d’invariance par translation dans chaque classe. Nous l’avons appliquĂ© avec succĂšs sur plusieurs jeux de donnĂ©es dans le contexte d’un apprentissage automatique utilisant des machines Ă  vecteurs support multi-classes

    Integrative Systems Approaches Towards Brain Pharmacology and Polypharmacology

    Get PDF
    Polypharmacology is considered as the future of drug discovery and emerges as the next paradigm of drug discovery. The traditional drug design is primarily based on a “one target-one drug” paradigm. In polypharmacology, drug molecules always interact with multiple targets, and therefore it imposes new challenges in developing and designing new and effective drugs that are less toxic by eliminating the unexpected drug-target interactions. Although still in its infancy, the use of polypharmacology ideas appears to already have a remarkable impact on modern drug development. The current thesis is a detailed study on various pharmacology approaches at systems level to understand polypharmacology in complex brain and neurodegnerative disorders. The research work in this thesis focuses on the design and construction of a dedicated knowledge base for human brain pharmacology. This pharmacology knowledge base, referred to as the Human Brain Pharmacome (HBP) is a unique and comprehensive resource that aggregates data and knowledge around current drug treatments that are available for major brain and neurodegenerative disorders. The HBP knowledge base provides data at a single place for building models and supporting hypotheses. The HBP also incorporates new data obtained from similarity computations over drugs and proteins structures, which was analyzed from various aspects including network pharmacology and application of in-silico computational methods for the discovery of novel multi-target drug candidates. Computational tools and machine learning models were developed to characterize protein targets for their polypharmacological profiles and to distinguish indications specific or target specific drugs from other drugs. Systems pharmacology approaches towards drug property predictions provided a highly enriched compound library that was virtually screened against an array of network pharmacology based derived protein targets by combined docking and molecular dynamics simulation workflows. The developed approaches in this work resulted in the identification of novel multi-target drug candidates that are backed up by existing experimental knowledge, and propose repositioning of existing drugs, that are undergoing further experimental validations
    corecore