Tensorized LSSVMs for Multitask Regression
Multitask learning (MTL) can utilize the relatedness between multiple tasks
for performance improvement. The advent of multimodal data allows tasks to be
referenced by multiple indices. High-order tensors are capable of providing
efficient representations for such tasks, while preserving structural
task-relations. In this paper, a new MTL method is proposed by leveraging
low-rank tensor analysis and constructing tensorized Least Squares Support
Vector Machines, namely the tLSSVM-MTL, where multilinear modelling and its
nonlinear extensions can be flexibly applied. We employ a high-order tensor for
all the weights with each mode relating to an index and factorize it with CP
decomposition, assigning a shared factor for all tasks and retaining
task-specific latent factors along each index. Then an alternating algorithm is
derived for the nonconvex optimization, where each resulting subproblem is
solved by a linear system. Experimental results demonstrate promising
performance of our tLSSVM-MTL.
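The factorization described above can be sketched as follows. All shapes and the CP rank are hypothetical, and the sketch only shows how one shared factor and per-index latent factors compose into the full weight tensor; the alternating LS-SVM training itself is omitted:

```python
import numpy as np

rng = np.random.default_rng(0)
d, m1, m2, R = 8, 3, 4, 2   # feature dim, two task indices, CP rank (made-up sizes)

# CP factors: U is shared across all tasks; V and S are task-specific
# latent factors, one row per value of each task index.
U = rng.standard_normal((d, R))    # shared factor over features
V = rng.standard_normal((m1, R))   # latent factor along task index 1
S = rng.standard_normal((m2, R))   # latent factor along task index 2

# Weight tensor W[:, i, j] = sum_r U[:, r] * V[i, r] * S[j, r]
W = np.einsum('dr,ir,jr->dij', U, V, S)

# The weight vector of task (i, j) is a mixture of R shared directions:
w_ij = W[:, 1, 2]
assert np.allclose(w_ij, U @ (V[1] * S[2]))
```

The point of the shared factor `U` is that every task's weight vector lives in the same R-dimensional subspace, which is how the tensorized model couples the tasks.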
Learning Using Privileged Information: SVM+ and Weighted SVM
Prior knowledge can be used to improve predictive performance of learning
algorithms or reduce the amount of data required for training. The same goal is
pursued within the learning using privileged information paradigm which was
recently introduced by Vapnik et al. and is aimed at utilizing additional
information available only at training time -- a framework implemented by SVM+.
We relate the privileged information to importance weighting and show that the
prior knowledge expressible with privileged features can also be encoded by
weights associated with every training example. We show that a weighted SVM can
always replicate an SVM+ solution, while the converse is not true and we
construct a counterexample highlighting the limitations of SVM+. Finally, we
touch on the problem of choosing weights for weighted SVMs when privileged
features are not available.Comment: 18 pages, 8 figures; integrated reviewer comments, improved
typesettin
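The weighted-SVM side of this correspondence is easy to sketch with scikit-learn's `sample_weight` argument. The weights below are illustrative placeholders standing in for the knowledge a privileged feature would supply, not weights derived from an actual SVM+ solution:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.standard_normal((40, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # linearly separable toy labels

# Per-example weights play the role of privileged information: examples
# deemed more reliable get more influence on the decision boundary.
w = np.where(y == 1, 2.0, 1.0)

clf = SVC(kernel='linear', C=1.0)
clf.fit(X, y, sample_weight=w)
print(clf.score(X, y))
```

The paper's claim is that for any SVM+ solution there exist weights `w` making the weighted SVM reproduce it; the converse direction fails.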
Enhanced default risk models with SVM+
Default risk models have lately raised great interest due to the recent world economic crisis. In spite of the many advanced techniques that have been proposed, no comprehensive method incorporating a holistic perspective has hitherto been considered. Thus, the existing models for bankruptcy prediction lack full coverage of contextual knowledge, which may prevent decision makers such as investors and financial analysts from taking the right decisions. Recently, SVM+ has provided a formal way to incorporate additional information (beyond the training data) into learning models, improving generalization. In financial settings, examples of such non-financial (though relevant) information are marketing reports, the competitor landscape, the economic environment, customer screening, industry trends, etc. By exploiting additional information able to improve classical inductive learning, we propose a prediction model where the data are naturally separated into several structured groups clustered by the size and annual turnover of the firms. Experimental results on a heterogeneous data set of French companies demonstrate that the proposed default risk model shows better predictive performance than the baseline SVM and multi-task learning with SVM.
Convex formulation for multi-task L1-, L2-, and LS-SVMs
Quite often a machine learning problem lends itself to be split into several well-defined subproblems, or tasks. The goal of Multi-Task Learning (MTL) is to leverage the joint learning of the problem from two different perspectives: on the one hand, a single, overall model, and on the other, task-specific models. In this way, the solution found by MTL may be better than those of either the common or the task-specific models. Starting with the work of Evgeniou et al., support vector machines (SVMs) have lent themselves naturally to this approach. This paper proposes a convex formulation of MTL for the L1-, L2- and LS-SVM models that results in dual problems quite similar to the single-task ones, but with multi-task kernels; in turn, this makes it possible to train the convex MTL models using standard solvers. As an alternative approach, the direct optimal combination of the already trained common and task-specific models can also be considered. In this paper, a procedure to compute the optimal combining parameter with respect to four different error functions is derived. As shown experimentally, the proposed convex MTL approach generally performs better than the alternative optimal convex combination, and both of them are better than the straight use of either common or task-specific models.

With partial support from Spain's grant TIN2016-76406-P. Work supported also by the UAM-ADIC Chair for Data Science and Machine Learning.
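A minimal sketch of the multi-task kernel idea in the style of Evgeniou et al.: the kernel between two examples receives an extra boost when they come from the same task, so a standard solver with a precomputed kernel trains the multi-task model. The data, the task assignments, and the mixing constant `mu` are hypothetical:

```python
import numpy as np
from sklearn.svm import SVC

def multitask_kernel(X1, t1, X2, t2, mu=1.0):
    """Common linear kernel plus a task-specific boost when tasks match:
    K((x, s), (x', t)) = (mu + [s == t]) * <x, x'>."""
    base = X1 @ X2.T                        # base kernel k(x, x')
    same = (t1[:, None] == t2[None, :])     # indicator of matching tasks
    return (mu + same) * base

rng = np.random.default_rng(0)
X = rng.standard_normal((60, 3))
tasks = rng.integers(0, 2, size=60)
y = (X[:, 0] > 0).astype(int)

K = multitask_kernel(X, tasks, X, tasks)
clf = SVC(kernel='precomputed').fit(K, y)
print(clf.score(K, y))
```

Because the boosted kernel is a sum of two positive semidefinite kernels, any off-the-shelf SVM solver that accepts a precomputed Gram matrix can be used unchanged, which is the practical point of the convex formulation.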
Application of Synthetic Images for Solving the Classification Problem in the Diagnosis of Lung Cancer
Background: From a mathematical point of view, problems of medical diagnostics are data classification tasks. It is important to understand how much the distortions introduced when collecting primary diagnostic information, in particular the results of biochemical tests, can contribute to classification errors.

Aims: To determine the dependence of the prediction result on the variability of the primary diagnostic information, using a model classifier as an example.

Materials and methods: The case-control study enrolled patients who were divided into 2 groups: the main group (diagnosed with lung cancer, n=200) and the control group (conditionally healthy, n=500). All participants completed a questionnaire and underwent a biochemical study of saliva. Patients of the main group and the comparison group were hospitalized for surgical treatment, after which histological verification of the diagnosis was carried out. The biochemical composition of saliva was determined spectrophotometrically. Based on the data obtained, a model classifier for the diagnosis of lung cancer (a random forest) was constructed. Deviations within specified ranges (±1–5%, ±5–10%, ±10–15%) were introduced into each parameter underlying the classifier, creating synthetic images. The classification results were then evaluated by cross-validation.

Results: The basic diagnostic characteristics of the model classifier were determined (sensitivity 72.5%, specificity 86.0%). As the deviations of the synthetic images from the baseline increase, the diagnostic characteristics under general classification deteriorate. The result of confident classification, on the contrary, gives higher values (sensitivity 81.8%, specificity 93.1%). Under confident classification, similar images that fall into different classes are deleted, whereas under general classification they are taken into account. The difference between the two classification modes is associated with the presence of images for which the classifier outputs a class-membership score in the range 0.45–0.55. It is therefore necessary to introduce a third class into the classifier, the so-called gray zone (0.4–0.6), since the probability of an erroneous diagnosis in this region is significantly increased.

Conclusions: The obtained results allow us to conclude that measurement error in the range ±1–15% does not significantly affect the quality of the classification.
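The gray-zone rule can be sketched with a random forest whose probability outputs in 0.4–0.6 are routed to a third "uncertain" class. The synthetic data, noise level, and thresholds below mirror the abstract but are purely illustrative:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.standard_normal((700, 5))
# Labels driven by the first feature, with noise standing in for assay error.
y = (X[:, 0] + 0.5 * rng.standard_normal(700) > 0).astype(int)
Xtr, Xte, ytr, yte = train_test_split(X, y, random_state=0)

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(Xtr, ytr)
p = clf.predict_proba(Xte)[:, 1]

# Three-way decision: class 0, class 1, or the gray zone (-1) for p in 0.4-0.6.
decision = np.where(p < 0.4, 0, np.where(p > 0.6, 1, -1))
confident = decision != -1
acc_confident = (decision[confident] == yte[confident]).mean()
print(f"confident on {confident.mean():.0%} of cases, accuracy {acc_confident:.2f}")
```

Restricting the evaluation to the confident subset is exactly the mechanism by which "confident classification" reports higher sensitivity and specificity than general classification.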
Learning with privileged and sensitive information: a gradient-boosting approach
We consider the problem of learning with sensitive features under the privileged information setting, where the goal is to exploit features that are unavailable (or too sensitive to collect) at test/deployment time in order to learn a better model at training time. We focus on tree-based learners, specifically gradient-boosted decision trees, for learning with privileged information. Our methods use privileged features as knowledge to guide the algorithm when learning from the fully observed (usable) features. We derive the theory, empirically validate the effectiveness of our algorithms, and verify them on standard fairness metrics.
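One common way to realize this idea (not necessarily the exact algorithm of this paper) is generalized distillation: a teacher learns from the privileged features, and the student, which sees only the usable features, is fit against labels softened toward the teacher's scores. All data and the mixing weight `lam` below are assumptions:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier, GradientBoostingRegressor

rng = np.random.default_rng(0)
n = 400
x_priv = rng.standard_normal((n, 3))                       # privileged: training only
x_reg = x_priv[:, :1] + 0.5 * rng.standard_normal((n, 1))  # noisy usable view
y = (x_priv[:, 0] > 0).astype(int)

# Teacher: learns the task from the privileged features.
teacher = GradientBoostingClassifier(random_state=0).fit(x_priv, y)
soft = teacher.predict_proba(x_priv)[:, 1]

# Student: sees only the usable feature, fit against labels softened toward
# the teacher's scores (lam is an assumed mixing weight).
lam = 0.5
target = lam * y + (1 - lam) * soft
student = GradientBoostingRegressor(random_state=0).fit(x_reg, target)
pred = (student.predict(x_reg) > 0.5).astype(int)
print((pred == y).mean())
```

At deployment only `student` is needed, so nothing sensitive has to be collected at test time.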
Advanced Learning Methodologies for Biomedical Applications
University of Minnesota Ph.D. dissertation. October 2017. Major: Electrical/Computer Engineering. Advisor: Vladimir Cherkassky. 1 computer file (PDF); ix, 109 pages. There has been a dramatic increase in application of statistical and machine learning methods for predictive data-analytic modeling of biomedical data. Most existing work in this area involves application of standard supervised learning techniques. Typical methods include standard classification or regression techniques, where the goal is to estimate an indicator function (classification decision rule) or a real-valued function of input variables from a finite training sample. However, real-world data often contain additional information besides labeled training samples. Incorporating this additional information into learning (model estimation) leads to nonstandard/advanced learning formalizations that represent extensions of standard supervised learning. Recent examples of such advanced methodologies include semi-supervised learning (or transduction) and learning through contradiction (or Universum learning). This thesis investigates two new advanced learning methodologies along with their biomedical applications. The first is motivated by modeling complex survival data, which can incorporate future, censored, or unknown data in addition to (traditional) labeled training data. Here we propose an original formalization for predictive modeling of survival data, under the framework of Learning Using Privileged Information (LUPI) proposed by Vapnik. Survival data represents a collection of time observations about events. Our modeling goal is to predict the state (alive/dead) of a subject at a pre-determined future time point. We explore modeling of survival data as a binary classification problem that incorporates additional information (such as time of death, censored/uncensored status, etc.) under the LUPI framework.
Then we propose two advanced constructive Support Vector Machine (SVM)-based formulations: SVM+ and Loss-Order SVM (LO-SVM). Empirical results using simulated and real-life survival data indicate that the proposed LUPI-based methods are very effective (versus classical Cox regression) when the survival time does not follow classical probabilistic assumptions. The second advanced methodology investigates a new learning paradigm for classification called Group Learning. This approach is motivated by modeling high-dimensional data when the number of input features is much larger than the number of training samples. There are two main approaches to solving such ill-posed problems: (a) selecting a small number of informative features via feature selection; (b) using all features but imposing additional complexity constraints, e.g., ridge regression, SVM, LASSO, etc. The proposed Group Learning method takes a different approach, splitting all features into many (t) groups and then estimating a classifier in a reduced space (of dimensionality d/t). This approach effectively uses all features, but implements training in a lower-dimensional input space. Note that the formation of groups reflects application-domain knowledge. For example, when classifying two-dimensional images represented as a set of pixels (the original high-dimensional input space), appropriate groups can be formed by grouping adjacent pixels or "local patches", because adjacent pixels are known to be highly correlated. We provide empirical validation of this new methodology for two real-life applications: (a) handwritten digit recognition, and (b) predictive classification of univariate signals, e.g., prediction of epileptic seizures from intracranial electroencephalogram (iEEG) signals. Prediction of epileptic seizures is particularly challenging, due to highly unbalanced data (just 4-5 observed seizures) and patient-specific modeling.
In a joint project with Mayo Clinic, we have incorporated the Group Learning approach into an SVM-based system for seizure prediction. This system performs subject-specific modeling and achieves robust prediction performance.
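A sketch of the group-splitting idea, under one plausible reading of the description above (contiguous feature groups as "local patches", one reduced-dimension training example per group, group scores averaged at prediction time); all sizes are made up:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n, d, t = 30, 64, 8                  # few samples, many features (made-up sizes)
X = rng.standard_normal((n, d))
beta = rng.standard_normal(d)
y = (X @ beta > 0).astype(int)

# Split the d features into t contiguous groups and treat each group-vector
# as its own training example in the reduced d/t-dimensional space.
Xg = X.reshape(n * t, d // t)        # row-major reshape: t chunks per sample
yg = np.repeat(y, t)

clf = LogisticRegression().fit(Xg, yg)

# Aggregate the t group-level scores back into one prediction per sample.
scores = clf.predict_proba(Xg)[:, 1].reshape(n, t).mean(axis=1)
pred = (scores > 0.5).astype(int)
```

The classifier now sees n*t examples of dimension d/t instead of n examples of dimension d, which is how the method eases the ill-posedness while still using every feature.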
Toxicity prediction using multi-disciplinary data integration and novel computational approaches
Current predictive tools used for human health assessment of potential chemical hazards rely primarily on either chemical structural information (i.e., cheminformatics) or bioassay data (i.e., bioinformatics). Emerging data sources such as chemical libraries, high-throughput assays and health databases offer new possibilities for evaluating chemical toxicity as an integrated system and for overcoming the limited predictive power of current fragmented efforts; yet few studies have combined the new data streams. This dissertation tested the hypothesis that integrative computational toxicology approaches drawing upon diverse data sources would improve the prediction and interpretation of chemically induced diseases. First, chemical structures and toxicogenomics data were used to predict hepatotoxicity. Compared with conventional cheminformatics or toxicogenomics models, interpretation was enriched by the chemical and biological insights, even though prediction accuracy did not improve. This motivated the second project, which developed a novel integrative method, chemical-biological read-across (CBRA), that led to predictive and interpretable models amenable to visualization. CBRA was consistently among the most accurate models on four chemical-biological data sets. It highlighted chemical and biological features for interpretation, and the visualizations aided transparency. Third, we developed an integrative workflow that interfaced cheminformatics prediction with pharmacoepidemiology validation using a case study of Stevens-Johnson syndrome (SJS), an adverse drug reaction (ADR) of major public health concern. Cheminformatics models first predicted potential SJS inducers and non-inducers, prioritizing them for subsequent pharmacoepidemiology evaluation, which then confirmed that predicted non-inducers were statistically associated with fewer SJS occurrences.
By combining cheminformatics' ability to predict SJS as soon as drug structures are known with pharmacoepidemiology's statistical rigor, we have provided a universal scheme for more effective study of SJS and other ADRs. Overall, this work demonstrated that integrative approaches can deliver more predictive and interpretable models. These models can then reliably prioritize high-risk chemicals for further testing, allowing optimization of testing resources. A broader implication of this research is the growing role we envision for integrative methods that take advantage of the various emerging data sources.
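The read-across idea behind CBRA can be sketched as nearest-neighbour prediction under a blended chemical + biological similarity. The equal 50/50 blend, the cosine similarity, and the random data are assumptions for illustration; the published method tunes these choices:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50
chem = rng.standard_normal((n, 10))   # chemical descriptors (hypothetical)
bio = rng.standard_normal((n, 20))    # bioassay profiles (hypothetical)
y = rng.integers(0, 2, n)             # toxicity labels (hypothetical)

def cosine_sim(A):
    """Pairwise cosine similarity between the rows of A."""
    A = A / np.linalg.norm(A, axis=1, keepdims=True)
    return A @ A.T

# Read-across: predict each chemical from its nearest neighbours under a
# combined chemical + biological similarity.
S = 0.5 * cosine_sim(chem) + 0.5 * cosine_sim(bio)
np.fill_diagonal(S, -np.inf)          # exclude self-similarity
k = 5
nbrs = np.argsort(-S, axis=1)[:, :k]  # k most similar compounds per chemical
pred = (y[nbrs].mean(axis=1) > 0.5).astype(int)
```

Keeping the two similarity matrices separate until the final blend is what lets the method report which chemical and which biological neighbours drove each prediction, supporting the interpretability claims above.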