Search CORE

39 research outputs found

Linear Bandits with Feature Feedback

Author: Bhargava Aniruddha
Nowak Robert
Oswal Urvashi
Publication venue
Publication date: 11/03/2019
Field of study

This paper explores a new form of the linear bandit problem in which the algorithm receives the usual stochastic rewards as well as stochastic feedback about which features are relevant to the rewards, the latter feedback being the novel aspect. The focus of this paper is the development of new theory and algorithms for linear bandits with feature feedback. We show that linear bandits with feature feedback can achieve regret over time horizon

T

that scales like

k\sqrt{T}

, without prior knowledge of which features are relevant nor the number

k

of relevant features. In comparison, the regret of traditional linear bandits is

d\sqrt{T}

, where

d

is the total number of (relevant and irrelevant) features, so the improvement can be dramatic if

k\ll d

. The computational complexity of the new algorithm is proportional to

k

rather than

d

, making it much more suitable for real-world applications compared to traditional linear bandits. We demonstrate the performance of the new algorithm with synthetic and real human-labeled data

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Knowledge Base Population using Semantic Label Propagation

Author: Deleu Johannes
Demeester Thomas
Develder Chris
Sterckx Lucas
Publication venue
Publication date: 01/01/2016
Field of study

A crucial aspect of a knowledge base population system that extracts new facts from text corpora, is the generation of training data for its relation extractors. In this paper, we present a method that maximizes the effectiveness of newly trained relation extractors at a minimal annotation cost. Manual labeling can be significantly reduced by Distant Supervision, which is a method to construct training data automatically by aligning a large text corpus with an existing knowledge base of known facts. For example, all sentences mentioning both 'Barack Obama' and 'US' may serve as positive training instances for the relation born_in(subject,object). However, distant supervision typically results in a highly noisy training set: many training sentences do not really express the intended relation. We propose to combine distant supervision with minimal manual supervision in a technique called feature labeling, to eliminate noise from the large and noisy initial training set, resulting in a significant increase of precision. We further improve on this approach by introducing the Semantic Label Propagation method, which uses the similarity between low-dimensional representations of candidate training instances, to extend the training set in order to increase recall while maintaining high precision. Our proposed strategy for generating training data is studied and evaluated on an established test collection designed for knowledge base population tasks. The experimental results show that the Semantic Label Propagation strategy leads to substantial performance gains when compared to existing approaches, while requiring an almost negligible manual annotation effort.Comment: Submitted to Knowledge Based Systems, special issue on Knowledge Bases for Natural Language Processin

arXiv.org e-Print Archive

Ghent University Academic Bibliography

Automatic Segmentation of Land Cover in Satellite Images

Author: Bağcı Furkan Burak
Kındıroglu Ahmet Alp
Uluyağmur Öztürk Mahiye
Uyan Ufuk
Yalçın Metehan
Publication venue: Mohammad Nassar for Researches (MNFR)
Publication date: 11/08/2023
Field of study

Semantic segmentation problems such as landcover segmentation rely on large amounts of annotated images to excel. Without such data for target regions, transfer learning methods are widely used to incorporate knowledge from other areas and domains to improve performance. In this study, we analyze the performance of landcover segmentation models trained on low-resolution images with insufficient data for the targeted region or zoom level. In order to boost performance on target data, we experiment with models trained with unsupervised, semi-supervised, and supervised transfer learning approaches, including satellite images from public datasets and other unlabeled sources.According to experimental results, transfer learning improves segmentation performance by 3.4% MIoU (mean intersection over union) in rural regions and 12.9% MIoU in urban regions. We observed that transfer learning is more effective when two datasets share a comparable zoom level and are labeled with identical rules; otherwise, semi-supervised learning is more effective using unlabeled data. Pseudo labeling based unsupervised domain adaptation method improved building detection performance in urban cities. In addition, experiments showed that HRNet outperformed building segmentation approaches in multi-class segmentation

American Scientific Research Journal for Engineering, Technology, and Sciences (ASRJETS)