7 research outputs found

    Incremental Sparse Bayesian Ordinal Regression

    Get PDF
    Ordinal Regression (OR) aims to model the ordering information between different data categories, which is a crucial topic in multi-label learning. An important class of approaches to OR models the problem as a linear combination of basis functions that map features to a high dimensional non-linear space. However, most of the basis function-based algorithms are time consuming. We propose an incremental sparse Bayesian approach to OR tasks and introduce an algorithm to sequentially learn the relevant basis functions in the ordinal scenario. Our method, called Incremental Sparse Bayesian Ordinal Regression (ISBOR), automatically optimizes the hyper-parameters via the type-II maximum likelihood method. By exploiting fast marginal likelihood optimization, ISBOR can avoid big matrix inverses, which is the main bottleneck in applying basis function-based algorithms to OR tasks on large-scale datasets. We show that ISBOR can make accurate predictions with parsimonious basis functions while offering automatic estimates of the prediction uncertainty. Extensive experiments on synthetic and real word datasets demonstrate the efficiency and effectiveness of ISBOR compared to other basis function-based OR approaches

    Neural network for ordinal classification of imbalanced data by minimizing a Bayesian cost

    Get PDF
    Ordinal classification of imbalanced data is a challenging problem that appears in many real world applications. The challenge is to simultaneously consider the order of the classes and the class imbalance, which can notably improve the performance metrics. The Bayesian formulation allows to deal with these two characteristics jointly: It takes into account the prior probability of each class and the decision costs, which can be used to include the imbalance and the ordinal information, respectively. We propose to use the Bayesian formulation to train neural networks, which have shown excellent results in many classification tasks. A loss function is proposed to train networks with a single neuron in the output layer and a threshold based decision rule. The loss is an estimate of the Bayesian classification cost, based on the Parzen windows estimator, which is fitted for a thresholded decision. Experiments with several real datasets show that the proposed method provides competitive results in different scenarios, due to its high flexibility to specify the relative importance of the errors in the classification of patterns of different classes, considering the order and independently of the probability of each class.This work was partially supported by Spanish Ministry of Science and Innovation through Thematic Network "MAPAS"(TIN2017-90567-REDT) and by BBVA Foundation through "2-BARBAS" research grant. Funding for APC: Universidad Carlos III de Madrid (Read & Publish Agreement CRUE-CSIC 2023)

    Galaxy classification: A machine learning analysis of GAMA catalogue data

    Get PDF
    We present a machine learning analysis of five labelled galaxy catalogues from the Galaxy And Mass Assembly (GAMA): The SersicCatVIKING and SersicCatUKIDSS catalogues containing morphological features, the GaussFitSimplecatalogue containing spectroscopic features, the MagPhys catalogue including physical parameters for galaxies, and the Lambdar catalogue, which contains photometric measurements. Extending work previously presented at the ESANN 2018 conference – in an analysis based on Generalized Relevance Matrix Learning Vector Quantization and Random Forests – we find that neither the data from the individual catalogues nor a combined dataset based on all 5 catalogues fully supports the visual-inspection-based galaxy classification scheme employed to categorise the galaxies. In particular, only one class, the Little Blue Spheroids, is consistently separable from the other classes. To aid further insight into the nature of the employed visual-based classification scheme with respect to physical and morphological features, we present the galaxy parameters that are discriminative for the achieved class distinctions

    Galaxy classification: A machine learning analysis of GAMA catalogue data

    Get PDF
    We present a machine learning analysis of five labelled galaxy catalogues from the Galaxy And Mass Assembly (GAMA): The SersicCatVIKING and SersicCatUKIDSS catalogues containing morphological features, the GaussFitSimple catalogue containing spectroscopic features, the MagPhys catalogue including physical parameters for galaxies, and the Lambdar catalogue, which contains photometric measurements. Extending work previously presented at the ESANN 2018 conference - in an analysis based on Generalized Relevance Matrix Learning Vector Quantization and Random Forests - we find that neither the data from the individual catalogues nor a combined dataset based on all 5 catalogues fully supports the visual-inspection-based galaxy classification scheme employed to categorise the galaxies. In particular, only one class, the Little Blue Spheroids, is consistently separable from the other classes. To aid further insight into the nature of the employed visual-based classification scheme with respect to physical and morphological features, we present the galaxy parameters that are discriminative for the achieved class distinctions.Comment: Accepted for the ESANN 2018 Special Issue of Neurocomputin

    Cardiometabolic burden and biomarkers of autonomous cortisol secretion

    Get PDF
    The overwhelming majority of incidentally discovered adrenal tumours are benign adrenocortical tumours. These can be non-functioning (NFAT) or associated with autonomous cortisol secretion on a spectrum ranging from rare clinically overt adrenal Cushing’s syndrome (CS) to much more prevalent mild autonomous cortisol secretion (MACS) without signs of CS. Here I present the characteristics of a large cohort of persons with newly diagnosed benign adrenocortical tumours. In 1305 prospectively recruited cases, almost every second person with benign adrenocortical tumours was diagnosed with MACS. Persons with MACS had rates of cardiometabolic disease similar to CS, particularly increased prevalence and severity of hypertension and type 2 diabetes. Urine multi-steroid profiling of these persons revealed a gradual increase in glucocorticoid excretion from NFAT over MACS to CS, whilst androgen excretion decreased. Increased glucocorticoid and 11-oxygenated androgen metabolite excretion predicted clinical outcomes including hypertension, type 2 diabetes, and the presence of bilateral adrenal tumours. A representative group of 291 persons underwent untargeted serum metabolome profiling. Prototype-based supervised machine learning identified progressive metabolic changes in MACS and CS suggestive of lipotoxicity, dysfunctional lipid β-oxidation, and oxidative stress across the spectrum of autonomous cortisol secretion. These results show that MACS is a prevalent cardiometabolic risk condition associated with distinct changes in the steroid and untargeted metabolome. Observed changes offer the prospect of risk stratification of affected individuals
    corecore