
45,578 research outputs found

Co-regularized Alignment for Unsupervised Domain Adaptation

Author: Feris Rogerio
Freeman William T.
Karlinsky Leonid
Kumar Abhishek
Sattigeri Prasanna
Wadhawan Kahini
Wornell Gregory
Publication date: 13/11/2018

Deep neural networks, trained with a large amount of labeled data, can fail to generalize well when tested with examples from a target domain whose distribution differs from the training data distribution, referred to as the source domain. It can be expensive or even infeasible to obtain the required amount of labeled data in all possible domains. Unsupervised domain adaptation sets out to address this problem, aiming to learn a good predictive model for the target domain using labeled examples from the source domain but only unlabeled examples from the target domain. Domain alignment approaches this problem by matching the source and target feature distributions, and has been used as a key component in many state-of-the-art domain adaptation methods. However, matching the marginal feature distributions does not guarantee that the corresponding class conditional distributions will be aligned across the two domains. We propose co-regularized domain alignment for unsupervised domain adaptation, which constructs multiple diverse feature spaces and aligns source and target distributions in each of them individually, while encouraging the alignments to agree with each other with regard to the class predictions on the unlabeled target examples. The proposed method is generic and can be used to improve any domain adaptation method that uses domain alignment. We instantiate it in the context of a recent state-of-the-art method and observe that it provides significant performance improvements on several domain adaptation benchmarks. Comment: NIPS 2018 accepted version

arXiv.org e-Print Archive

DSpace@MIT

Challenges and solutions for Latin named entity recognition

Author: Ajaka Petra
Brown Christopher
de Marneffe Marie-Catherine
Elsner Micha
Erdmann Alex
Janse Mark
Joseph Brian D.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2016

Although spanning thousands of years and genres as diverse as liturgy, historiography, lyric and other forms of prose and poetry, the body of Latin texts is still relatively sparse compared to English. Data sparsity in Latin presents a number of challenges for traditional Named Entity Recognition techniques. Solving such challenges and enabling reliable Named Entity Recognition in Latin texts can facilitate many downstream applications, from machine translation to digital historiography, enabling Classicists, historians, and archaeologists, for instance, to track the relationships of historical persons, places, and groups on a large scale. This paper presents the first annotated corpus for evaluating Named Entity Recognition in Latin, as well as a fully supervised model that achieves over 90% F-score on a held-out test set, significantly outperforming a competitive baseline. We also present a novel active learning strategy that predicts how many and which sentences need to be annotated for named entities in order to attain a specified degree of accuracy when recognizing named entities automatically in a given text. This maximizes the productivity of annotators while simultaneously controlling quality.

Ghent University Academic Bibliography

Advances in Hyperspectral Image Classification: Earth monitoring with statistical learning methods

Author: Benediktsson Jón Atli
Bruzzone Lorenzo
Camps-Valls Gustavo
Tuia Devis
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/10/2013

Hyperspectral images show similar statistical properties to natural grayscale or color photographic images. However, the classification of hyperspectral images is more challenging because of the very high dimensionality of the pixels and the small number of labeled examples typically available for learning. These peculiarities lead to particular signal processing problems, mainly characterized by indetermination and complex manifolds. The framework of statistical learning has gained popularity in the last decade. New methods have been presented to account for the spatial homogeneity of images, to include user's interaction via active learning, to take advantage of the manifold structure with semisupervised learning, to extract and encode invariances, or to adapt classifiers and image representations to unseen yet similar scenes. This tutorial reviews the main advances for hyperspectral remote sensing image classification through illustrative examples. Comment: IEEE Signal Processing Magazine, 201

arXiv.org e-Print Archive

CiteSeerX

Wageningen University & Research Publications

Informative sample generation using class aware generative adversarial networks for classification of chest Xrays

Author: Bozorgtabar Behzad
Ebner Lukas
Mahapatra Dwarikanath
Pollinger Alexander
Reyes Mauricio
Thiran Jean-Phillipe
von Teng Hendrik
Publication date: 01/01/2019

Training robust deep learning (DL) systems for disease detection from medical images is challenging due to limited images covering different disease types and severity. The problem is especially acute where there is a severe class imbalance. We propose an active learning (AL) framework to select the most informative samples for training our model using a Bayesian neural network. Informative samples are then used within a novel class aware generative adversarial network (CAGAN) to generate realistic chest X-ray images for data augmentation by transferring characteristics from one class label to another. Experiments show our proposed AL framework is able to achieve state-of-the-art performance by using about 35% of the full dataset, thus saving significant time and effort over conventional methods.

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Bern Open Repository and Information System (BORIS)