Search CORE

48 research outputs found

Dimensionality reduction by clustering of variables while setting aside atypical variables

Author: Vigneau Evelyne
Publication venue: Coordinamento SIBA - Università del Salento
Publication date: 26/04/2016
Field of study

Clustering of variables is one possible approach for reducing the dimensionality of a dataset. However, all the variables are usually assigned to one of the clusters, even the scattered variables associated with atypical or noise information. The presence of this type of information could obscure the interpretation of the latent variables associated with the clusters, or even give rise to artificial clusters. We propose two strategies to address this problem. The first is a "K +1" strategy, which consists of introducing an additional group of variables, called the "noise cluster" for simplicity. The second is based on the definition of sparse latent variables. Both strategies result in refined clusters for the identification of more relevant latent variables

ESE - Salento University Publishing

Università del Salento: ESE - Salento University Publishing

Clustering of variables for enhanced interpretability of predictive models

Author: Vigneau Evelyne
Publication venue
Publication date: 18/08/2020
Field of study

A new strategy is proposed for building easy to interpret predictive models in the context of a high-dimensional dataset, with a large number of highly correlated explanatory variables. The strategy is based on a first step of variables clustering using the CLustering of Variables around Latent Variables (CLV) method. The exploration of the hierarchical clustering dendrogram is undertaken in order to sequentially select the explanatory variables in a group-wise fashion. For model setting implementation, the dendrogram is used as the base-learner in an L2-boosting procedure. The proposed approach, named lmCLV, is illustrated on the basis of a toy-simulated example when the clusters and predictive equation are already known, and on a real case study dealing with the authentication of orange juices based on 1H-NMR spectroscopic analysis. In both illustrative examples, this procedure was shown to have similar predictive efficiency to other methods, with additional interpretability capacity. It is available in the R package ClustVarLV.Comment: 24 pages, 7 figure

arXiv.org e-Print Archive

La baisse du contentieux est-elle le signe d'une pacification de la relation de travail ?

Author: Serverin Evelyne
Vigneau Christophe
Publication venue: Dalloz
Publication date: 27/04/2019
Field of study

International audienc

HAL-Paris1

Application of procrustean methods to mid- and near-infrared spectral data

Author: Devaux M.F.
Safar M.
Vigneau Evelyne
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 01/01/1995
Field of study

International audienc

HAL Descartes

A new algorithm for latent root regression analysis

Author: Mostafa Qannari El
Vigneau Evelyne
Publication venue
Publication date
Field of study

Research Papers in Economics

Functional Approach for the analysis of time intensity curves using B-splines

Author: Causeur David
Ledauphin Stéphanie
Vigneau Evelyne
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 01/01/2005
Field of study

International audienceThis article deals with a functional approach based on the projection upon a beta-spline basis in order to analyze Time Intensity curves. The modelization is followed, on the one hand, by the assessment of the repeatability and the discrimination ability of the panelists, and on the other hand, by the determination of a good compromise over repetitions. Finally, a multidimensional analysis enables the comparison of the shapes of the curves associated with the assessors (assessors' signature) and the characterization of the products. The properties of this functional approach are illustrated with TI curves describing sweetness variations of drinks

HAL Descartes