122 research outputs found
Latent Graphs for Semi-Supervised Learning on Biomedical Tabular Data
In the domain of semi-supervised learning, the current approaches
insufficiently exploit the potential of considering inter-instance
relationships among (un)labeled data. In this work, we address this limitation
by providing an approach for inferring latent graphs that capture the intrinsic
data relationships. By leveraging graph-based representations, our approach
facilitates the seamless propagation of information throughout the graph,
effectively incorporating global and local knowledge. Through evaluations on
biomedical tabular datasets, we compare the capabilities of our approach to
other contemporary methods. Our work demonstrates the significance of
inter-instance relationship discovery as practical means for constructing
robust latent graphs to enhance semi-supervised learning techniques. The
experiments show that the proposed methodology outperforms contemporary
state-of-the-art methods for (semi-)supervised learning on three biomedical
datasets.Comment: Accepted at IJCLR 202
Discovery of relevant response in infected potato plants from time series of gene expression data
The paper presents a methodology for analyzing time series of gene expression data collected from the leaves of potato virus Y (PVY) infected and non-infected potato plants, with the aim to identify significant differences between the two sets of potato plants’ characteristic for various time points. We aim at identifying differentially- expressed genes whose expression values are statistically significantly different in the set of PVY infected potato plants compared to non- infected plants, and which demonstrate also statistically significant changes of expression values of genes of PVY infected potato plants in time. The novelty of the approach includes stratified data randomization used in estimating the statistical properties of gene expression of the samples in the control set of non-infected potato plants. A novel estimate that computes the relative minimal distance between the samples has been defined that enables reliable identification of the differences between the target and control datasets when these sets are small. The relevance of the outcomes is demonstrated by visualizing the relative minimal distance of gene expression changes in time for three different types of potato leaves for the genes that have been identified as relevant by the proposed methodology
- …