225,987 research outputs found
Learning Graph Embeddings from WordNet-based Similarity Measures
We present path2vec, a new approach for learning graph embeddings that relies
on structural measures of pairwise node similarities. The model learns
representations for nodes in a dense space that approximate a given
user-defined graph distance measure, such as e.g. the shortest path distance or
distance measures that take information beyond the graph structure into
account. Evaluation of the proposed model on semantic similarity and word sense
disambiguation tasks, using various WordNet-based similarity measures, show
that our approach yields competitive results, outperforming strong graph
embedding baselines. The model is computationally efficient, being orders of
magnitude faster than the direct computation of graph-based distances.Comment: Accepted to StarSem 201
Principal Component Analysis Using Structural Similarity Index for Images
Despite the advances of deep learning in specific tasks using images, the
principled assessment of image fidelity and similarity is still a critical
ability to develop. As it has been shown that Mean Squared Error (MSE) is
insufficient for this task, other measures have been developed with one of the
most effective being Structural Similarity Index (SSIM). Such measures can be
used for subspace learning but existing methods in machine learning, such as
Principal Component Analysis (PCA), are based on Euclidean distance or MSE and
thus cannot properly capture the structural features of images. In this paper,
we define an image structure subspace which discriminates different types of
image distortions. We propose Image Structural Component Analysis (ISCA) and
also kernel ISCA by using SSIM, rather than Euclidean distance, in the
formulation of PCA. This paper provides a bridge between image quality
assessment and manifold learning opening a broad new area for future research.Comment: Paper for the methods named "Image Structural Component Analysis
(ISCA)" and "Kernel Image Structural Component Analysis (Kernel ISCA)
Metrics for Graph Comparison: A Practitioner's Guide
Comparison of graph structure is a ubiquitous task in data analysis and
machine learning, with diverse applications in fields such as neuroscience,
cyber security, social network analysis, and bioinformatics, among others.
Discovery and comparison of structures such as modular communities, rich clubs,
hubs, and trees in data in these fields yields insight into the generative
mechanisms and functional properties of the graph.
Often, two graphs are compared via a pairwise distance measure, with a small
distance indicating structural similarity and vice versa. Common choices
include spectral distances (also known as distances) and distances
based on node affinities. However, there has of yet been no comparative study
of the efficacy of these distance measures in discerning between common graph
topologies and different structural scales.
In this work, we compare commonly used graph metrics and distance measures,
and demonstrate their ability to discern between common topological features
found in both random graph models and empirical datasets. We put forward a
multi-scale picture of graph structure, in which the effect of global and local
structure upon the distance measures is considered. We make recommendations on
the applicability of different distance measures to empirical graph data
problem based on this multi-scale view. Finally, we introduce the Python
library NetComp which implements the graph distances used in this work
Learning Genomic Sequence Representations using Graph Neural Networks over De Bruijn Graphs
The rapid expansion of genomic sequence data calls for new methods to achieve
robust sequence representations. Existing techniques often neglect intricate
structural details, emphasizing mainly contextual information. To address this,
we developed k-mer embeddings that merge contextual and structural string
information by enhancing De Bruijn graphs with structural similarity
connections. Subsequently, we crafted a self-supervised method based on
Contrastive Learning that employs a heterogeneous Graph Convolutional Network
encoder and constructs positive pairs based on node similarities. Our
embeddings consistently outperform prior techniques for Edit Distance
Approximation and Closest String Retrieval tasks.Comment: Poster at "NeurIPS 2023 New Frontiers in Graph Learning Workshop
(NeurIPS GLFrontiers 2023)
Unsupervised segmentation of mitochondria using model-based spectral clustering
Segmentation of mitochondria in microscopic images represents a significant challenge that is motivated by the wide morphological and structural variations that are characteristic for this category of membrane enclosed sub cellular organelles. To address the drawbacks associated with manual mark-up procedures (which are common in current clinical evaluations), a recent direction of research investigate the application of statistical machine learning methods to mitochondria segmentation. Within this field of research the main issue was generated by the complexity of the training set that is able to describe the vast structural variation that is associated with mitochondria. To avoid this problem, in this paper we apply perceptual organization models such as Figure-Ground, Similarity, Proximity and Closure which target the identification of the closed membranes in EM images using multistage spectral clustering [1,2].
Our unsupervised mitochondria segmentation algorithm is outlined in Fig. 1. The first stage of the spectral clustering implements foreground segmentation with the similarity model S1 that aims to identify the dark contours that are given by the outer membrane of the mitochondrion. In the second stage, the foreground data is re-clustered with a different similarity model S2 to identify the inner membrane of the mitochondrion. The last stage involves a contour processing step that eliminates the pixels that are not consistent with the minimum distance between the inner and outer membranes of the mitochondrion. The algorithm has been tested on a suite of EM images provided by the American Society of Cell Biology and a number of experimental results are presented in Fig. 2
- …