7 research outputs found
Incremental Sparse Bayesian Ordinal Regression
Ordinal Regression (OR) aims to model the ordering information between
different data categories, which is a crucial topic in multi-label learning. An
important class of approaches to OR models the problem as a linear combination
of basis functions that map features to a high dimensional non-linear space.
However, most of the basis function-based algorithms are time consuming. We
propose an incremental sparse Bayesian approach to OR tasks and introduce an
algorithm to sequentially learn the relevant basis functions in the ordinal
scenario. Our method, called Incremental Sparse Bayesian Ordinal Regression
(ISBOR), automatically optimizes the hyper-parameters via the type-II maximum
likelihood method. By exploiting fast marginal likelihood optimization, ISBOR
can avoid big matrix inverses, which is the main bottleneck in applying basis
function-based algorithms to OR tasks on large-scale datasets. We show that
ISBOR can make accurate predictions with parsimonious basis functions while
offering automatic estimates of the prediction uncertainty. Extensive
experiments on synthetic and real word datasets demonstrate the efficiency and
effectiveness of ISBOR compared to other basis function-based OR approaches
Neural network for ordinal classification of imbalanced data by minimizing a Bayesian cost
Ordinal classification of imbalanced data is a challenging problem that appears in many real world applications. The challenge is to simultaneously consider the order of the classes and the class imbalance, which can notably improve the performance metrics. The Bayesian formulation allows to deal with these two characteristics jointly: It takes into account the prior probability of each class and the decision costs, which can be used to include the imbalance and the ordinal information, respectively. We propose to use the Bayesian formulation to train neural networks, which have shown excellent results in many classification tasks. A loss function is proposed to train networks with a single neuron in the output layer and a threshold based decision rule. The loss is an estimate of the Bayesian classification cost, based on the Parzen windows estimator, which is fitted for a thresholded decision. Experiments with several real datasets show that the proposed method provides competitive results in different scenarios, due to its high flexibility to specify the relative importance of the errors in the classification of patterns of different classes, considering the order and independently of the probability of each class.This work was partially supported by Spanish Ministry of Science and Innovation through Thematic Network "MAPAS"(TIN2017-90567-REDT) and by BBVA Foundation through "2-BARBAS" research grant. Funding for APC: Universidad Carlos III de Madrid (Read & Publish Agreement CRUE-CSIC 2023)
Galaxy classification: A machine learning analysis of GAMA catalogue data
We present a machine learning analysis of five labelled galaxy catalogues from the Galaxy And Mass Assembly (GAMA): The SersicCatVIKING and SersicCatUKIDSS catalogues containing morphological features, the GaussFitSimplecatalogue containing spectroscopic features, the MagPhys catalogue including physical parameters for galaxies, and the Lambdar catalogue, which contains photometric measurements. Extending work previously presented at the ESANN 2018 conference – in an analysis based on Generalized Relevance Matrix Learning Vector Quantization and Random Forests – we find that neither the data from the individual catalogues nor a combined dataset based on all 5 catalogues fully supports the visual-inspection-based galaxy classification scheme employed to categorise the galaxies. In particular, only one class, the Little Blue Spheroids, is consistently separable from the other classes. To aid further insight into the nature of the employed visual-based classification scheme with respect to physical and morphological features, we present the galaxy parameters that are discriminative for the achieved class distinctions
Galaxy classification: A machine learning analysis of GAMA catalogue data
We present a machine learning analysis of five labelled galaxy catalogues
from the Galaxy And Mass Assembly (GAMA): The SersicCatVIKING and
SersicCatUKIDSS catalogues containing morphological features, the
GaussFitSimple catalogue containing spectroscopic features, the MagPhys
catalogue including physical parameters for galaxies, and the Lambdar
catalogue, which contains photometric measurements. Extending work previously
presented at the ESANN 2018 conference - in an analysis based on Generalized
Relevance Matrix Learning Vector Quantization and Random Forests - we find that
neither the data from the individual catalogues nor a combined dataset based on
all 5 catalogues fully supports the visual-inspection-based galaxy
classification scheme employed to categorise the galaxies. In particular, only
one class, the Little Blue Spheroids, is consistently separable from the other
classes. To aid further insight into the nature of the employed visual-based
classification scheme with respect to physical and morphological features, we
present the galaxy parameters that are discriminative for the achieved class
distinctions.Comment: Accepted for the ESANN 2018 Special Issue of Neurocomputin
Cardiometabolic burden and biomarkers of autonomous cortisol secretion
The overwhelming majority of incidentally discovered adrenal tumours are benign adrenocortical tumours. These can be non-functioning (NFAT) or associated with autonomous cortisol secretion on a spectrum ranging from rare clinically overt adrenal Cushing’s syndrome (CS) to much more prevalent mild autonomous cortisol secretion (MACS) without signs of CS. Here I present the characteristics of a large cohort of persons with newly diagnosed benign adrenocortical tumours.
In 1305 prospectively recruited cases, almost every second person with benign adrenocortical tumours was diagnosed with MACS. Persons with MACS had rates of cardiometabolic disease similar to CS, particularly increased prevalence and severity of hypertension and type 2 diabetes.
Urine multi-steroid profiling of these persons revealed a gradual increase in glucocorticoid excretion from NFAT over MACS to CS, whilst androgen excretion decreased. Increased glucocorticoid and 11-oxygenated androgen metabolite excretion predicted clinical outcomes including hypertension, type 2 diabetes, and the presence of bilateral adrenal tumours.
A representative group of 291 persons underwent untargeted serum metabolome profiling. Prototype-based supervised machine learning identified progressive metabolic changes in MACS and CS suggestive of lipotoxicity, dysfunctional lipid β-oxidation, and oxidative stress across the spectrum of autonomous cortisol secretion.
These results show that MACS is a prevalent cardiometabolic risk condition associated with distinct changes in the steroid and untargeted metabolome. Observed changes offer the prospect of risk stratification of affected individuals