SymScal: symbolic multidimensional scaling of interval dissimilarities
Multidimensional scaling aims at reconstructing dissimilarities
between pairs of objects by distances in a low-dimensional space.
However, in some cases the dissimilarity itself is unknown, but the
range of the dissimilarity is given. Such fuzzy data fall in the
wider class of symbolic data (Bock and Diday, 2000).
Denoeux and Masson (2000) have proposed to model an interval
dissimilarity by a range of the distance defined as the minimum and
maximum distance between two rectangles representing the objects. In
this paper, we provide a new algorithm called SymScal that is based
on iterative majorization. The advantage is that each iteration is
guaranteed to improve the solution until no improvement is possible.
In a simulation study, we investigate the quality of this
algorithm. We discuss the use of SymScal on empirical dissimilarity
intervals of sounds.
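The rectangle model admits a short illustration: in each dimension, the closest and farthest points of two axis-aligned rectangles are determined by the gap between their centres and by their half-widths. The sketch below is an illustrative helper (the representation by centres and half-widths is an assumption for this example, not SymScal's actual code):

```python
import math

def interval_distance(c1, r1, c2, r2):
    """Minimum and maximum Euclidean distance between two axis-aligned
    rectangles, each given by a centre c and per-dimension half-widths r.
    Illustrative sketch of the rectangle model, not SymScal itself."""
    lo = hi = 0.0
    for a, ra, b, rb in zip(c1, r1, c2, r2):
        gap = abs(a - b)
        lo += max(0.0, gap - (ra + rb)) ** 2   # closest faces (0 if overlapping)
        hi += (gap + ra + rb) ** 2             # farthest corners
    return math.sqrt(lo), math.sqrt(hi)
```

For overlapping rectangles the minimum distance is zero, as the `max(0.0, ...)` term makes explicit.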
On central tendency and dispersion measures for intervals and hypercubes
The uncertainty or variability of the data may be handled by recording, rather
than a single value for each observation, the interval of values in which it
may fall. This paper studies the derivation of basic descriptive statistics for
interval-valued datasets. We propose a geometrical approach to the
determination of summary statistics (central tendency and dispersion measures)
for interval-valued variables.
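One simple convention illustrates what such statistics can look like; the definitions below (element-wise mean interval, and a dispersion that combines midpoint variance with mean squared half-range) are assumptions chosen for this sketch, not necessarily the measures derived in the paper:

```python
def interval_mean(intervals):
    """Element-wise mean interval of a list of (low, high) pairs.
    An illustrative convention, not the paper's definition."""
    n = len(intervals)
    return (sum(a for a, b in intervals) / n,
            sum(b for a, b in intervals) / n)

def interval_spread(intervals):
    """Dispersion sketch: variance of the midpoints plus the mean squared
    half-range, so both location and interval width contribute."""
    n = len(intervals)
    mids = [(a + b) / 2 for a, b in intervals]
    halfs = [(b - a) / 2 for a, b in intervals]
    m = sum(mids) / n
    return (sum((x - m) ** 2 for x in mids) / n
            + sum(h * h for h in halfs) / n)
```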
Linear regression for numeric symbolic variables: an ordinary least squares approach based on Wasserstein Distance
In this paper we present a linear regression model for modal symbolic data.
The observed variables are histogram variables according to the definition
given in the framework of Symbolic Data Analysis and the parameters of the
model are estimated using the classic Least Squares method. An appropriate
metric is introduced in order to measure the error between the observed and the
predicted distributions. In particular, the Wasserstein distance is proposed.
Some properties of this metric are exploited to predict the response variable
as a direct linear combination of the independent histogram variables. Measures
of goodness of fit are discussed. An application to real data corroborates the
proposed method.
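In one dimension the 2-Wasserstein distance between two distributions is the L2 distance between their quantile functions, which is what makes a least-squares treatment of histogram variables tractable. The sketch below evaluates it numerically for histograms with uniform density within bins; this is a generic numerical illustration, not the closed-form expression used in the paper:

```python
def quantile(bins, weights, t):
    """Quantile function of a histogram: bins are (low, high) pairs,
    weights are positive and sum to 1; density is uniform within a bin."""
    cum = 0.0
    for (lo, hi), w in zip(bins, weights):
        if t <= cum + w:
            return lo + (hi - lo) * (t - cum) / w  # linear within the bin
        cum += w
    return bins[-1][1]

def wasserstein2_sq(bins1, w1, bins2, w2, n=2000):
    """Squared 2-Wasserstein distance, approximated by averaging the
    squared quantile difference over a midpoint grid on (0, 1)."""
    ts = [(k + 0.5) / n for k in range(n)]
    return sum((quantile(bins1, w1, t) - quantile(bins2, w2, t)) ** 2
               for t in ts) / n
```

For two unit-mass bins shifted by a constant, the quantile difference is that constant everywhere, so the squared distance is its square.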
Representing complex data using localized principal components with application to astronomical data
Often the relation between the variables constituting a multivariate data
space might be characterized by one or more of the terms: ``nonlinear'',
``branched'', ``disconnected'', ``bent'', ``curved'', ``heterogeneous'', or,
more generally, ``complex''. In these cases, simple principal component analysis
(PCA) as a tool for dimension reduction can fail badly. Of the many alternative
approaches proposed so far, local approximations of PCA are among the most
promising. This paper will give a short review of localized versions of PCA,
focusing on local principal curves and local partitioning algorithms.
Furthermore, we discuss projections other than the local principal components.
When performing local dimension reduction for regression or classification
problems it is important to focus not only on the manifold structure of the
covariates, but also on the response variable(s). Local principal components
only achieve the former, whereas localized regression approaches concentrate on
the latter. Local projection directions derived from the partial least squares
(PLS) algorithm offer an interesting trade-off between these two objectives. We
apply these methods to several real data sets. In particular, we consider
simulated astrophysical data from the future Galactic survey mission Gaia.
Comment: 25 pages. In "Principal Manifolds for Data Visualization and
Dimension Reduction", A. Gorban, B. Kegl, D. Wunsch, and A. Zinovyev (eds),
Lecture Notes in Computational Science and Engineering, Springer, 2007, pp.
180--204.
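The basic idea of localizing PCA can be sketched in a few lines: partition the data, then fit a principal direction per part. The sketch below assumes the partition (`labels`) is already given, e.g. by k-means; this is a deliberately minimal stand-in for the paper's local principal curves and partitioning algorithms, not their implementation:

```python
def leading_direction(points, iters=100):
    """Leading principal direction of a point cloud, found by power
    iteration on the scatter matrix X^T X (pure Python, no dependencies)."""
    n, d = len(points), len(points[0])
    mean = [sum(p[i] for p in points) / n for i in range(d)]
    X = [[p[i] - mean[i] for i in range(d)] for p in points]
    v = [1.0 / d ** 0.5] * d
    for _ in range(iters):
        s = [sum(row[j] * v[j] for j in range(d)) for row in X]        # X v
        w = [sum(X[r][i] * s[r] for r in range(n)) for i in range(d)]  # X^T X v
        norm = sum(x * x for x in w) ** 0.5
        if norm == 0.0:  # degenerate cluster: keep the current direction
            break
        v = [x / norm for x in w]
    return v

def local_pca(points, labels):
    """One leading principal direction per cluster: a minimal sketch of
    localized PCA over a precomputed partition."""
    return {k: leading_direction([p for p, l in zip(points, labels) if l == k])
            for k in set(labels)}
```

The point of the sketch is the failure mode it avoids: a global PCA of two orthogonal filaments averages their directions, while the per-cluster fit recovers each one.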
On the equivalence between hierarchical segmentations and ultrametric watersheds
We study hierarchical segmentation in the framework of edge-weighted graphs.
We define ultrametric watersheds as topological watersheds null on the minima.
We prove that there exists a bijection between the set of ultrametric
watersheds and the set of hierarchical segmentations. We end this paper by
showing how to use the proposed framework in practice on the example of
constrained connectivity; in particular, the framework allows such a hierarchy
to be computed following a classical watershed-based morphological scheme,
which provides an efficient algorithm for the whole hierarchy.
Comment: 19 pages, double-column
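A concrete example of an ultrametric on an edge-weighted graph is the single-linkage (minimax) distance: d(u, v) is the smallest weight threshold at which u and v lie in one connected component. The sketch below computes it with a union-find pass over sorted edges; it illustrates the hierarchy-as-ultrametric correspondence in its simplest form and is not the paper's watershed algorithm:

```python
def minimax_ultrametric(n, edges):
    """Single-linkage ultrametric on a graph with n vertices and
    (u, v, weight) edges: d[u][v] is the lowest threshold merging u and v."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    d = [[0.0 if i == j else float("inf") for j in range(n)] for i in range(n)]
    comps = {i: {i} for i in range(n)}
    for w, u, v in sorted((w, u, v) for u, v, w in edges):
        ru, rv = find(u), find(v)
        if ru == rv:
            continue
        for a in comps[ru]:                # every cross pair merges at weight w
            for b in comps[rv]:
                d[a][b] = d[b][a] = float(w)
        parent[rv] = ru
        comps[ru] |= comps.pop(rv)
    return d
```

The resulting matrix satisfies the strong triangle inequality d(u, v) <= max(d(u, x), d(x, v)), the defining property of an ultrametric, and cutting it at any threshold yields one level of the corresponding hierarchical segmentation.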