54 research outputs found

    A methodology to compare dimensionality reduction algorithms in terms of loss of quality

    Dimensionality Reduction (DR) is attracting increasing attention as a result of the growing need to handle huge amounts of data effectively. DR methods allow the number of initial features to be reduced considerably, until a subset is found that preserves the original properties of the data. However, their use entails an inherent loss of quality that is likely to affect the understanding of the data during analysis. This loss of quality can be a determining factor when selecting a DR method, because of the nature of each method. In this paper, we propose a methodology that allows different DR methods to be analyzed and compared with regard to the loss of quality they produce. The methodology uses the concept of preservation of geometry (quality assessment criteria) to assess this loss of quality. Experiments have been carried out using the most well-known DR algorithms and quality assessment criteria from the literature, applied to 12 real-world datasets. The results obtained so far show that it is possible to establish a method for selecting the most appropriate DR technique in terms of minimum loss of quality. The experiments also highlight some interesting relationships between the quality assessment criteria. Finally, the methodology allows the appropriate target dimensionality for reducing the data to be established while incurring a minimum loss of quality.
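
    A minimal sketch of the selection idea, assuming scikit-learn: several DR methods are applied to the same data and ranked by a geometry-preservation score. Trustworthiness stands in here for the paper's quality assessment criteria, and the example dataset, method list, and neighbourhood size are illustrative assumptions, not the paper's setup.

```python
# Hypothetical sketch: rank DR methods by a geometry-preservation criterion.
# Assumes scikit-learn; its trustworthiness score stands in for the paper's
# quality assessment criteria, and the digits data for its 12 datasets.
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.manifold import MDS, Isomap, trustworthiness

X = load_digits().data[:300]  # illustrative subset, not one of the paper's datasets

methods = {
    "PCA": PCA(n_components=2),
    "Isomap": Isomap(n_components=2),
    "MDS": MDS(n_components=2),
}

scores = {}
for name, reducer in methods.items():
    X_low = reducer.fit_transform(X)
    # Trustworthiness lies in [0, 1]; higher means less loss of local geometry.
    scores[name] = trustworthiness(X, X_low, n_neighbors=12)

best = max(scores, key=scores.get)
print(scores, "-> minimum loss of quality:", best)
```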

    Image processing system based on similarity/dissimilarity measures to classify binary images from contour-based features

    Image Processing Systems (IPS) try to solve tasks such as image classification or segmentation based on image content. Many authors have proposed a variety of techniques to tackle the image classification task. Plenty of methods address the performance of the IPS [1], as well as the influence of external circumstances such as illumination, rotation, and noise [2]. However, there is increasing interest in classifying shapes from binary images (BI). Shape Classification (SC) from BI takes a segmented image as a sample (background segmentation [3]) and aims to identify objects based on their shape.
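
    As a hedged illustration of the contour-based approach (not the paper's actual system), the sketch below extracts the main contour of each binary image with OpenCV and classifies a query by nearest-neighbour search under a Hu-moment dissimilarity; the reference images, labels, and the OpenCV 4.x API are assumptions.

```python
# Hypothetical sketch of contour-based shape classification on binary images.
# cv2.matchShapes (Hu-moment based) stands in for the paper's
# similarity/dissimilarity measures; assumes OpenCV 4.x.
import cv2
import numpy as np

def main_contour(binary_img):
    """Largest external contour of a binary (0/255) image."""
    contours, _ = cv2.findContours(binary_img, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_SIMPLE)
    return max(contours, key=cv2.contourArea)

def classify(query_img, reference_imgs, reference_labels):
    """Assign the label of the reference shape least dissimilar to the query."""
    q = main_contour(query_img)
    dissimilarities = [
        cv2.matchShapes(q, main_contour(ref), cv2.CONTOURS_MATCH_I1, 0.0)
        for ref in reference_imgs
    ]
    return reference_labels[int(np.argmin(dissimilarities))]
```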

    Making nonlinear manifold learning models interpretable: The manifold grand tour

    Dimensionality reduction is required to produce visualisations of high-dimensional data. In this framework, one of the most straightforward approaches to visualising high-dimensional data is based on reducing complexity and applying linear projections while tumbling the projection axes in a defined sequence, which generates a Grand Tour of the data. We propose using smooth nonlinear topographic maps of the data distribution to guide the Grand Tour, increasing the effectiveness of this approach by prioritising the linear views of the data that are most consistent with the global data structure in these maps. A further consequence of this approach is to enable direct visualisation of the topographic map onto projective spaces that discern structure in the data. The experimental results on standard databases reported in this paper, using self-organising maps and generative topographic mapping, illustrate the practical value of the proposed approach. The main novelty of our proposal is the definition of a systematic way to guide the search for data views in the Grand Tour, selecting and prioritising some of them based on nonlinear manifold models.
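
    A minimal sketch of the guiding idea, assuming the minisom package: candidate Grand Tour frames are scored by how faithfully they preserve the pairwise distances among the SOM prototypes, and the best-scoring views are prioritised. The map size, number of views, and scoring rule are illustrative assumptions rather than the paper's criterion.

```python
# Hypothetical sketch: prioritise Grand Tour views using a topographic map.
# Assumes the `minisom` package; scoring a view by how well it preserves the
# pairwise distances of the SOM prototypes is an illustrative stand-in for the
# paper's consistency criterion.
import numpy as np
from scipy.spatial.distance import pdist
from minisom import MiniSom

def ranked_views(X, n_views=50, seed=0):
    rng = np.random.default_rng(seed)

    # Fit a small self-organising map of the data distribution.
    som = MiniSom(8, 8, X.shape[1], sigma=1.0, learning_rate=0.5, random_seed=seed)
    som.train_random(X, 2000)
    prototypes = som.get_weights().reshape(-1, X.shape[1])
    d_map = pdist(prototypes)

    views = []
    for _ in range(n_views):
        # One Grand Tour frame: a random orthonormal 2-D projection basis.
        basis, _ = np.linalg.qr(rng.standard_normal((X.shape[1], 2)))
        d_view = pdist(prototypes @ basis)
        score = np.corrcoef(d_map, d_view)[0, 1]  # structure kept by this view
        views.append((score, basis))

    # Frames most consistent with the map's structure are shown first.
    return sorted(views, key=lambda v: -v[0])
```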

    MedVir: 3D visual interface applied to gene profile analysis

    The use of data mining techniques for the discovery of gene profiles of diseases such as cancer is becoming common in many research studies. These techniques do not usually analyze in depth the relationships between genes across the different manifestations of the disease in patients. This kind of analysis takes a considerable amount of time and is not always the focus of the research. However, it is crucial for generating personalized treatments to fight the disease. Thus, this research focuses on finding a mechanism for gene profile analysis that can be used by medical and biology experts. Results: In this research, the MedVir framework is proposed. It is an intuitive mechanism based on the visualization of medical data such as gene profiles, patients, and clinical data. MedVir, which is based on an Evolutionary Optimization technique, is a Dimensionality Reduction (DR) approach that presents the data in a three-dimensional space. Furthermore, thanks to Virtual Reality technology, MedVir allows experts to interact with the data in order to tailor the analysis to their experience and knowledge.
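
    A minimal sketch of an evolutionary-optimisation DR into three dimensions, under stated assumptions: a linear projection matrix is evolved to minimise a stress-like distortion of pairwise distances. The fitness function, mutation scheme, and all parameters are illustrative; MedVir's actual algorithm is not reproduced here.

```python
# Hypothetical sketch of evolutionary-optimisation-based DR into 3-D space.
# It evolves a linear mapping that minimises a stress-like distortion of pairwise
# distances; this is an illustration only, not MedVir's actual algorithm.
import numpy as np
from scipy.spatial.distance import pdist

def evolve_projection(X, generations=200, pop_size=30, seed=0):
    rng = np.random.default_rng(seed)
    d_high = pdist(X)

    def fitness(w):
        # Mismatch between original and projected pairwise distances (lower is better).
        return np.mean((pdist(X @ w) - d_high) ** 2)

    parent = rng.standard_normal((X.shape[1], 3)) * 0.1
    best_fit = fitness(parent)
    for _ in range(generations):
        offspring = [parent + 0.05 * rng.standard_normal(parent.shape)
                     for _ in range(pop_size)]
        fits = [fitness(w) for w in offspring]
        i = int(np.argmin(fits))
        if fits[i] < best_fit:
            best_fit, parent = fits[i], offspring[i]
    return X @ parent  # three-dimensional embedding for interactive exploration
```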

    Exploratory visualization of misclassified GPCRs from their transformed unaligned sequences using manifold learning techniques

    Class C G-protein-coupled receptors (GPCRs) are cell membrane proteins of great relevance to biology and pharmacology. Previous research has revealed an upper bound on the accuracy that can be achieved in their classification into subtypes from the unaligned transformation of their sequences. To investigate this, we focus on sequences that have been misclassified using supervised methods. These are visualized, using a nonlinear dimensionality reduction technique and phylogenetic trees, and then characterized against the rest of the data and, particularly, against the rest of the cases of their own subtype. This should help to discriminate between different types of misclassification and to build hypotheses about database quality problems and the extent to which GPCR sequence transformations limit subtype discriminability. The reported experiments provide a proof of concept for the proposed method.
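
    A minimal sketch of the visualisation step under stated assumptions: misclassified cases are flagged via cross-validated predictions of a standard classifier, and all sequences are then embedded with t-SNE (standing in for the paper's nonlinear DR technique) so the errors can be inspected against their own subtype. The feature matrix X and subtype labels y are assumed to be given.

```python
# Hypothetical sketch: highlight misclassified sequences in a nonlinear 2-D embedding.
# X (features from the transformed unaligned sequences) and y (subtype labels) are
# assumed to be given; t-SNE stands in for the paper's nonlinear DR technique.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE
from sklearn.model_selection import cross_val_predict
from sklearn.svm import SVC

def plot_misclassified(X, y):
    y = np.asarray(y)
    # Flag sequences that a supervised classifier gets wrong under cross-validation.
    wrong = cross_val_predict(SVC(kernel="rbf"), X, y, cv=5) != y

    # Nonlinear dimensionality reduction of all sequences for visual inspection.
    emb = TSNE(n_components=2, init="pca", random_state=0).fit_transform(X)
    plt.scatter(emb[~wrong, 0], emb[~wrong, 1], c="lightgrey", label="correctly classified")
    plt.scatter(emb[wrong, 0], emb[wrong, 1], c="crimson", label="misclassified")
    plt.legend()
    plt.show()
```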