Probabilistic Fisher discriminant analysis: A robust and flexible alternative to Fisher discriminant analysis
Fisher discriminant analysis (FDA) is a popular and powerful method for dimensionality reduction and classification. Unfortunately, the optimality of the dimension reduction provided by FDA is only proved in the homoscedastic case. In addition, FDA is known to perform poorly in the presence of label noise and sparse labeled data. To overcome these limitations, this work proposes a probabilistic framework for FDA which relaxes the homoscedastic assumption on the class covariance matrices and adds a term to explicitly model the non-discriminative information. This allows the proposed method to be robust to label noise and to be used in the semi-supervised context. Experiments on real-world datasets show that the proposed approach works at least as well as FDA in standard situations and outperforms it in the label-noise and sparse-label cases.
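The classical FDA criterion this abstract builds on has a closed-form two-class solution; a minimal NumPy sketch on synthetic Gaussian data (illustrative only, not the paper's probabilistic variant):

```python
import numpy as np

rng = np.random.default_rng(0)

# Two synthetic Gaussian classes in 3-D (made-up data for illustration).
X0 = rng.normal(loc=[0, 0, 0], scale=1.0, size=(100, 3))
X1 = rng.normal(loc=[3, 1, 0], scale=1.0, size=(100, 3))
X = np.vstack([X0, X1])
y = np.array([0] * 100 + [1] * 100)

def fisher_direction(X, y):
    """Two-class FDA: w maximises between-class over within-class scatter."""
    m0, m1 = X[y == 0].mean(axis=0), X[y == 1].mean(axis=0)
    Sw = np.cov(X[y == 0].T) + np.cov(X[y == 1].T)  # pooled within-class scatter
    w = np.linalg.solve(Sw, m1 - m0)                # closed-form: Sw^{-1}(m1 - m0)
    return w / np.linalg.norm(w)

w = fisher_direction(X, y)
z = X @ w  # 1-D discriminant scores; class means are well separated along w
```

The probabilistic FDA proposed in the paper replaces this point estimate with a latent-variable model, which is what makes the homoscedasticity relaxation and the semi-supervised extension possible.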
Bankruptcy Prediction: A Comparison of Some Statistical and Machine Learning Techniques
We are interested in forecasting bankruptcies in a probabilistic way. Specifically, we compare the classification performance of several statistical and machine-learning techniques, namely discriminant analysis (Altman's Z-score), logistic regression, least-squares support vector machines, and different instances of Gaussian processes (GPs), that is, GP classifiers, the Bayesian Fisher discriminant, and warped GPs. Our contribution to the field of computational finance is to introduce GPs as a potentially competitive probabilistic framework for bankruptcy prediction. Data from the information repository of the US Federal Deposit Insurance Corporation is used to test the predictions.
Keywords: bankruptcy prediction, artificial intelligence, supervised learning, Gaussian processes, Z-score.
Predicción de bancarrota: Una comparación de técnicas estadísticas y de aprendizaje supervisado para computadora [Bankruptcy prediction: A comparison of statistical and supervised machine-learning techniques]
We are interested in forecasting bankruptcies in a probabilistic way. Specifically, we compare the classification performance of several statistical and machine-learning techniques, namely discriminant analysis (Altman's Z-score), logistic regression, least-squares support vector machines, and different instances of Gaussian processes (GPs), that is, GP classifiers, the Bayesian Fisher discriminant, and warped GPs. Our contribution to the field of computational finance is to introduce GPs as a potentially competitive probabilistic framework for bankruptcy prediction. Data from the information repository of the US Federal Deposit Insurance Corporation is used to test the predictions.
Joint Bayesian Gaussian discriminant analysis for speaker verification
State-of-the-art i-vector based speaker verification relies on variants of Probabilistic Linear Discriminant Analysis (PLDA) for discriminant analysis. We are mainly motivated by the recent work on the joint Bayesian (JB) method, which was originally proposed for discriminant analysis in face verification. We apply JB to speaker verification and make three contributions beyond the original JB. 1) In contrast to the EM iterations with approximated statistics in the original JB, EM iterations with exact statistics are employed and give better performance. 2) We propose to perform simultaneous diagonalization (SD) of the within-class and between-class covariance matrices to achieve efficient testing, which has broader application scope than the SVD-based efficient testing method in the original JB. 3) We scrutinize similarities and differences between various Gaussian PLDAs and JB, complementing the previous analysis that compared JB only with the Prince-Elder PLDA. Extensive experiments are conducted on NIST SRE10 core condition 5, empirically validating the superiority of JB with a faster convergence rate and a 9-13% EER reduction compared with state-of-the-art PLDA.
Comment: accepted by ICASSP201
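The simultaneous diagonalization in contribution 2 can be sketched with NumPy: whiten the within-class covariance, then eigendecompose the between-class covariance in the whitened space. The matrices below are hypothetical stand-ins, not estimates from a speaker-verification model:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical SPD within-class (Sw) and between-class (Sb) covariances.
A = rng.normal(size=(4, 4))
Sw = A @ A.T + 4 * np.eye(4)
B = rng.normal(size=(4, 2))
Sb = B @ B.T  # rank-deficient between-class covariance is fine

# Step 1: whiten Sw via its eigendecomposition.
d, U = np.linalg.eigh(Sw)
W = U / np.sqrt(d)            # columns scaled so that W.T @ Sw @ W = I

# Step 2: eigendecompose Sb in the whitened space.
Sb_w = W.T @ Sb @ W
_, V = np.linalg.eigh(Sb_w)   # V is orthogonal, so the whitening is preserved

T = W @ V                     # simultaneous diagonalizer:
                              # T.T @ Sw @ T = I, T.T @ Sb @ T diagonal
```

Once both covariances are diagonal in the same basis, verification scores reduce to sums over independent dimensions, which is what makes testing efficient.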
Dimensionality reduction of clustered data sets
We present a novel probabilistic latent variable model to perform linear dimensionality reduction on data sets which contain clusters. We prove that the maximum likelihood solution of the model is an unsupervised generalisation of linear discriminant analysis. This provides a completely new approach to one of the most established and widely used classification algorithms. The performance of the model is then demonstrated on a number of real and artificial data sets.
Probabilistic classification of acute myocardial infarction from multiple cardiac markers
Logistic regression and Gaussian mixture model (GMM) classifiers have been trained to estimate the probability of acute myocardial infarction (AMI) in patients based upon the concentrations of a panel of cardiac markers. The panel consists of two new markers, fatty acid binding protein (FABP) and glycogen phosphorylase BB (GPBB), in addition to the traditional cardiac troponin I (cTnI), creatine kinase MB (CKMB) and myoglobin. The effect of using principal component analysis (PCA) and Fisher discriminant analysis (FDA) to preprocess the marker concentrations was also investigated. The need for classifiers to give an accurate estimate of the probability of AMI is argued and three categories of performance measure are described, namely discriminatory ability, sharpness, and reliability. Numerical performance measures for each category are given and applied. The optimum classifier, based solely upon the samples taken on admission, was the logistic regression classifier using FDA preprocessing. This gave an accuracy of 0.85 (95% confidence interval: 0.78–0.91) and a normalised Brier score of 0.89. When samples at both admission and a further time, 1–6 h later, were included, the performance increased significantly, showing that logistic regression classifiers can indeed use the information from the five cardiac markers to accurately and reliably estimate the probability of AMI.
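The Brier score underlying the reliability measure above is simply the mean squared error between predicted probabilities and binary outcomes; a minimal sketch with made-up predictions (the paper's normalised variant additionally rescales the raw score, commonly against a reference forecast):

```python
import numpy as np

def brier_score(p, y):
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    p, y = np.asarray(p, dtype=float), np.asarray(y, dtype=float)
    return float(np.mean((p - y) ** 2))

# Hypothetical predicted AMI probabilities and true labels (not the paper's data).
p = np.array([0.9, 0.2, 0.7, 0.1])
y = np.array([1, 0, 1, 0])
score = brier_score(p, y)  # lower is better; 0 is a perfect probabilistic forecast
```

Unlike accuracy, the Brier score rewards well-calibrated probabilities rather than just correct hard decisions, which is why it suits the probabilistic framing argued for in the abstract.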