Search CORE

225,987 research outputs found

Learning Graph Embeddings from WordNet-based Similarity Measures

Author: Biemann Chris
Dorgham Mohammad
Kutuzov Andrey
Oliynyk Oleksiy
Panchenko Alexander
Publication venue
Publication date: 01/01/2019
Field of study

We present path2vec, a new approach for learning graph embeddings that relies on structural measures of pairwise node similarities. The model learns representations for nodes in a dense space that approximate a given user-defined graph distance measure, such as e.g. the shortest path distance or distance measures that take information beyond the graph structure into account. Evaluation of the proposed model on semantic similarity and word sense disambiguation tasks, using various WordNet-based similarity measures, show that our approach yields competitive results, outperforming strong graph embedding baselines. The model is computationally efficient, being orders of magnitude faster than the direct computation of graph-based distances.Comment: Accepted to StarSem 201

arXiv.org e-Print Archive

Crossref

Principal Component Analysis Using Structural Similarity Index for Images

Author: D Brunet
D Brunet
D Brunet
D Otero
D Otero
I Goodfellow
I Jolliffe
J Ah-Pine
JL Alperin
Neal Parikh
S Boyd
Stephen Boyd
T Hofmann
Z Wang
Z Wang
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/08/2019
Field of study

Despite the advances of deep learning in specific tasks using images, the principled assessment of image fidelity and similarity is still a critical ability to develop. As it has been shown that Mean Squared Error (MSE) is insufficient for this task, other measures have been developed with one of the most effective being Structural Similarity Index (SSIM). Such measures can be used for subspace learning but existing methods in machine learning, such as Principal Component Analysis (PCA), are based on Euclidean distance or MSE and thus cannot properly capture the structural features of images. In this paper, we define an image structure subspace which discriminates different types of image distortions. We propose Image Structural Component Analysis (ISCA) and also kernel ISCA by using SSIM, rather than Euclidean distance, in the formulation of PCA. This paper provides a bridge between image quality assessment and manifold learning opening a broad new area for future research.Comment: Paper for the methods named "Image Structural Component Analysis (ISCA)" and "Kernel Image Structural Component Analysis (Kernel ISCA)

arXiv.org e-Print Archive

Crossref

Metrics for Graph Comparison: A Practitioner's Guide

Author: Meyer Francois G.
Wills Peter
Publication venue
Publication date: 16/12/2019
Field of study

Comparison of graph structure is a ubiquitous task in data analysis and machine learning, with diverse applications in fields such as neuroscience, cyber security, social network analysis, and bioinformatics, among others. Discovery and comparison of structures such as modular communities, rich clubs, hubs, and trees in data in these fields yields insight into the generative mechanisms and functional properties of the graph. Often, two graphs are compared via a pairwise distance measure, with a small distance indicating structural similarity and vice versa. Common choices include spectral distances (also known as

\lambda

distances) and distances based on node affinities. However, there has of yet been no comparative study of the efficacy of these distance measures in discerning between common graph topologies and different structural scales. In this work, we compare commonly used graph metrics and distance measures, and demonstrate their ability to discern between common topological features found in both random graph models and empirical datasets. We put forward a multi-scale picture of graph structure, in which the effect of global and local structure upon the distance measures is considered. We make recommendations on the applicability of different distance measures to empirical graph data problem based on this multi-scale view. Finally, we introduce the Python library NetComp which implements the graph distances used in this work

arXiv.org e-Print Archive

Learning Genomic Sequence Representations using Graph Neural Networks over De Bruijn Graphs

Author: Burger Manuel
Joudaki Amir
Kapuśniak Kacper
Rätsch Gunnar
Publication venue
Publication date: 06/12/2023
Field of study

The rapid expansion of genomic sequence data calls for new methods to achieve robust sequence representations. Existing techniques often neglect intricate structural details, emphasizing mainly contextual information. To address this, we developed k-mer embeddings that merge contextual and structural string information by enhancing De Bruijn graphs with structural similarity connections. Subsequently, we crafted a self-supervised method based on Contrastive Learning that employs a heterogeneous Graph Convolutional Network encoder and constructs positive pairs based on node similarities. Our embeddings consistently outperform prior techniques for Edit Distance Approximation and Closest String Retrieval tasks.Comment: Poster at "NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)

arXiv.org e-Print Archive

Unsupervised segmentation of mitochondria using model-based spectral clustering

Author: Dietlmeier Julia
Ghita Ovidiu
Whelan Paul F.
Publication venue
Publication date: 25/03/2013
Field of study

Segmentation of mitochondria in microscopic images represents a significant challenge that is motivated by the wide morphological and structural variations that are characteristic for this category of membrane enclosed sub cellular organelles. To address the drawbacks associated with manual mark-up procedures (which are common in current clinical evaluations), a recent direction of research investigate the application of statistical machine learning methods to mitochondria segmentation. Within this field of research the main issue was generated by the complexity of the training set that is able to describe the vast structural variation that is associated with mitochondria. To avoid this problem, in this paper we apply perceptual organization models such as Figure-Ground, Similarity, Proximity and Closure which target the identification of the closed membranes in EM images using multistage spectral clustering [1,2]. Our unsupervised mitochondria segmentation algorithm is outlined in Fig. 1. The first stage of the spectral clustering implements foreground segmentation with the similarity model S1 that aims to identify the dark contours that are given by the outer membrane of the mitochondrion. In the second stage, the foreground data is re-clustered with a different similarity model S2 to identify the inner membrane of the mitochondrion. The last stage involves a contour processing step that eliminates the pixels that are not consistent with the minimum distance between the inner and outer membranes of the mitochondrion. The algorithm has been tested on a suite of EM images provided by the American Society of Cell Biology and a number of experimental results are presented in Fig. 2

CiteSeerX

Irish Universities

DCU Online Research Access Service