Search CORE

99 research outputs found

Relational Data Mining Through Extraction of Representative Exemplars

Author: Blanchard Frédéric
Herbin Michel
Publication venue
Publication date: 01/01/2012
Field of study

With the growing interest on Network Analysis, Relational Data Mining is becoming an emphasized domain of Data Mining. This paper addresses the problem of extracting representative elements from a relational dataset. After defining the notion of degree of representativeness, computed using the Borda aggregation procedure, we present the extraction of exemplars which are the representative elements of the dataset. We use these concepts to build a network on the dataset. We expose the main properties of these notions and we propose two typical applications of our framework. The first application consists in resuming and structuring a set of binary images and the second in mining co-authoring relation in a research team

arXiv.org e-Print Archive

CiteSeerX

HAL Descartes

Outlier detection in high-dimensional spaces using one-dimensional neighborhoods

Author: Blanchard Frédéric
Falip Joris
Herbin Michel
Publication venue: HAL CCSD
Publication date: 01/01/2018
Field of study

International audienceDetecting outliers in a dataset is a problem with numerous applications in data analysis for fields such as medical care, finance, and banking or network surveillance. But in a majority of use-cases, data points are described by a lot of features, making outlier detection more complicated. As the number of dimensions increases, the notion of proximity becomes less meaningful: this is due to sparser data and elements becoming almost equally distant from each other. Medical datasets add a layer of complexity caused by their heterogeneous nature. Because of these caveats, standard algorithms become less relevant when hundred of dimensions are involved. This paper discusses the benefits of an outlier detection algorithm that uses a simple concept of one-dimensional neighborhood observations to circumvent the problems mentioned previously

Le vecteur de meilleur rang moyen : une statistique pour l'analyse de données multidimensionnelles -Application au filtrage d'images couleurs

Author: BLANCHARD Frédéric
DE RUNZ Cyril
HERBIN Michel
HUSSENET Laurent
VAUTROT Philippe
VRABIE Valeriu
Publication venue: GRETSI, Groupe d’Etudes du Traitement du Signal et des Images
Publication date: 01/01/2007
Field of study

L'obtention du meilleur représentant d'un échantillon est une question ouverte et dépend de la statistique utilisée. Cette communication propose une statistique, qui repose sur le calcul des rangs, et qui permet en outre d'introduire la notion de données représentant le mieux son échantillon. La donnée représentant le mieux son échantillon est nommée donnée de meilleur rang moyen. En appliquant notre statistique au filtrage d'images couleurs, nous illustrons l'efficacité et l'intérêt de notre procédure, visuellement et numériquement

I-Revues

HAL Descartes

L'image numerique en vraies couleurs lors du suivi de lesions macroscopiques en Dermatologie: aspects methodologiques et applications

Author: Herbin Michel
Publication venue
Publication date: 01/01/1989
Field of study

SIGLEINIST T 73054 / INIST-CNRS - Institut de l'Information Scientifique et TechniqueFRFranc

OpenGrey Repository

Rank-Based Similarity Index (RBSI) in a Multidimensional DataSet

Author: Herbin Michel
Publication venue: HAL CCSD
Publication date: 01/01/2019
Field of study

International audienc

Visualisation d'images multicomposantes par une image couleur

Author: Blanchard Frédéric
Herbin Michel
Publication venue: HAL CCSD
Publication date: 01/01/2003
Field of study

International audienc

L'image couleur pour visualiser des données multidimensionnelles

Author: Blanchard Frédéric
Herbin Michel
Publication venue: HAL CCSD
Publication date: 01/11/2004
Field of study

International audienc

HAL Descartes

Visualisation et classification de données multidimensionnelles (Application aux images multicomposantes)

Author: BLANCHARD Frédéric
HERBIN Michel
Publication venue
Publication date: 01/01/2005
Field of study

L'analyse des images multicomposantes est un problème crucial. Les questions de la visualisation et de la classification pour ces images sont importantes. Nous nous sommes intéressés à ces deux problèmes en nous plaçant dans le cadre plus général de l'analyse des données multidimensionnelles, et avons apporté deux éléments de réponses. Avant de traiter ces questions, nous nous sommes intéressés aux problèmes pratiques et théoriques liés à la dimensionnalité et étudions quelques unes des techniques courantes de réduction de dimensionnalité. La question de la visualisation est alors exposée et une nouvelle méthode utilisant l'image couleur est proposée. Cette technique permet une visualisation immédiate et synthétique des données, sans connaissance a priori. Elle est illustrée par des applications. Nous présentons également une contribution à la classification non supervisée de données qui se situe en amont du processus de classification proprement dit. Nous avons conçu une nouvelle façon de représenter les données et leurs liens à l'aide de la théorie des ensembles flous. Cette méthode permet, en classification, de traiter avec succès des échantillons de données dont les classes sont d'effectifs et de densités différents, sans faire d'a priori sur leur forme. Un algorithme de classification et des exemples de son application sont proposés. Ce travail présente deux contributions importantes aux problématiques de la visualisation et la classification, et fait intervenir des concepts issus de thématiques diverses comme l'analyse de données ou la théorie des ensembles flous. Il peut ainsi être utilisé dans d'autres contextes que celui de l'analyse d'images multicomposantes.The analysis of multicomponent images is a crucial problem. Visualization and clustering problem are two relevant questions about it. We decided to work in the more general frame of data analysis to answer to these questions. The preliminary step of this work is describing the problems induced by the dimensionality and studying the current dimensionality reduction methods. The visualization problem is then considered and a contribution is exposed. We propose a new method of visualization through color image that provides an immediate and sythetic image od data. Applications are presented. The second contribution lies upstream with the clustering procedure strictly speaking. We etablish a new kind of data representation by using rank transformation, fuzziness and agregation procedures. Its use inprove the clustering procedures by dealing with clusters with dissimilar density or variant effectives and by making them more robust. This work presents two important contributions to the field of data analysis applied to multicomponent image. The variety of the tools involved (originally from decision theory, uncertainty management, data mining or image processing) make the presented methods usable in many diversified areas as well as multicomponent images analysis.REIMS-BU Sciences (514542101) / SudocSudocFranceF

OpenGrey Repository

VIMCI: Visualization of Multidimensional images through Color Image

Author: Blanchard Frédéric
Herbin Michel
Publication venue: HAL CCSD
Publication date: 01/01/2004
Field of study

International audienc

HAL Descartes

Visualisation d'Images Multicomposantes par une Image Couleur

Author: Blanchard Frédéric
Herbin Michel
Publication venue: Fontis/Formatis
Publication date: 01/01/2003
Field of study

International audienc

HAL Descartes