Search CORE

68 research outputs found

Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification

Author: Adel Heike
Schütze Hinrich
Publication venue
Publication date: 01/01/2017
Field of study

We introduce globally normalized convolutional neural networks for joint entity classification and relation extraction. In particular, we propose a way to utilize a linear-chain conditional random field output layer for predicting entity types and relations between entities at the same time. Our experiments show that global normalization outperforms a locally normalized softmax layer on a benchmark dataset.Comment: EMNLP 201

arXiv.org e-Print Archive

Crossref

Exploring Different Dimensions of Attention for Uncertainty Detection

Author: Adel Heike
Schütze Hinrich
Publication venue
Publication date: 01/01/2017
Field of study

Neural networks with attention have proven effective for many natural language processing tasks. In this paper, we develop attention mechanisms for uncertainty detection. In particular, we generalize standardly used attention mechanisms by introducing external attention and sequence-preserving attention. These novel architectures differ from standard approaches in that they use external resources to compute attention weights and preserve sequence information. We compare them to other configurations along different dimensions of attention. Our novel architectures set the new state of the art on a Wikipedia benchmark dataset and perform similar to the state-of-the-art model on a biomedical benchmark which uses a large set of linguistic features.Comment: accepted at EACL 201

arXiv.org e-Print Archive

Crossref

GradSim: Gradient-Based Language Grouping for Effective Multilingual Training

Author: Adel Heike
Lange Lukas
Schütze Hinrich
Strötgen Jannik
Wang Mingyang
Publication venue
Publication date: 23/10/2023
Field of study

Most languages of the world pose low-resource challenges to natural language processing models. With multilingual training, knowledge can be shared among languages. However, not all languages positively influence each other and it is an open research question how to select the most suitable set of languages for multilingual training and avoid negative interference among languages whose characteristics or data distributions are not compatible. In this paper, we propose GradSim, a language grouping method based on gradient similarity. Our experiments on three diverse multilingual benchmark datasets show that it leads to the largest performance gains compared to other similarity measures and it is better correlated with cross-lingual model performance. As a result, we set the new state of the art on AfriSenti, a benchmark dataset for sentiment analysis on low-resource African languages. In our extensive analysis, we further reveal that besides linguistic features, the topics of the datasets play an important role for language grouping and that lower layers of transformer models encode language-specific features while higher layers capture task-specific information

arXiv.org e-Print Archive

NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis

Author: Adel Heike
Lange Lukas
Schütze Hinrich
Strötgen Jannik
Wang Mingyang
Publication venue
Publication date: 28/04/2023
Field of study

This paper describes our system developed for the SemEval-2023 Task 12 "Sentiment Analysis for Low-resource African Languages using Twitter Dataset". Sentiment analysis is one of the most widely studied applications in natural language processing. However, most prior work still focuses on a small number of high-resource languages. Building reliable sentiment analysis systems for low-resource languages remains challenging, due to the limited training data in this task. In this work, we propose to leverage language-adaptive and task-adaptive pretraining on African texts and study transfer learning with source language selection on top of an African language-centric pretrained language model. Our key findings are: (1) Adapting the pretrained model to the target language and task using a small yet relevant corpus improves performance remarkably by more than 10 F1 score points. (2) Selecting source languages with positive transfer gains during training can avoid harmful interference from dissimilar languages, leading to better results in multilingual and cross-lingual settings. In the shared task, our system wins 8 out of 15 tracks and, in particular, performs best in the multilingual evaluation

arXiv.org e-Print Archive

Intramuscular DNA Vaccination of Juvenile Carp against Spring Viremia of Carp Virus Induces Full Protection and Establishes a Virus-Specific B and T Cell Response

Author: Boudinot Pierre
Dauber Malte
Embregts Carmen W. E.
Forlenza Maria
Houel Armel
Lorenzen Niels
Petit Jules
Pokorová Dagmar
Rigaudeau Dimitri
Schütze Heike
Veselý Tomas
Wiegertjes Geert F.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

Although spring viremia of carp virus (SVCV) can cause high mortalities in common carp, a commercial vaccine is not available for worldwide use. Here, we report a DNA vaccine based on the expression of the SVCV glycoprotein (G) which, when injected in the muscle even at a single low dose of 0.1 µg DNA/g of fish, confers up to 100% protection against a subsequent bath challenge with SVCV. Importantly, to best validate vaccine efficacy, we also optimized a reliable bath challenge model closely mimicking a natural infection, based on a prolonged exposure of carp to SVCV at 15°C. Using this optimized bath challenge, we showed a strong age-dependent susceptibility of carp to SVCV, with high susceptibility at young age (3 months) and a full resistance at 9 months. We visualized local expression of the G protein and associated early inflammatory response by immunohistochemistry and described changes in the gene expression of pro-inflammatory cytokines, chemokines, and antiviral genes in the muscle of vaccinated fish. Adaptive immune responses were investigated by analyzing neutralizing titers against SVCV in the serum of vaccinated fish and the in vitro proliferation capacity of peripheral SVCV-specific T cells. We show significantly higher serum neutralizing titers and the presence of SVCV-specific T cells in the blood of vaccinated fish, which proliferated upon stimulation with SVCV. Altogether, this is the first study reporting on a protective DNA vaccine against SVCV in carp and the first to provide a detailed characterization of local innate as well as systemic adaptive immune responses elicited upon DNA vaccination that suggest a role not only of B cells but also of T cells in the protection conferred by the SVCV-G DNA vaccine

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

Wageningen University & Research Publications

ProdInra

Online Research Database In Technology

Hal-Diderot

Phylogeny of the Viral Hemorrhagic Septicemia Virus in European Aquaculture

Author: Baud Marine
Cieslak Michael
Diserens Nicolas
Engelsma Marc Y
Haenen Olga L M
Mikkelsen Susie Sommer
Mousakhani Shirin
Olesen Niels Jørgen
Panzarin Valentina
Schütze Heike
Skall Helle F.
Wahli Thomas
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

<p>One of the most valuable aquaculture fish in Europe is the rainbow trout, Oncorhynchus mykiss, but the profitability of trout production is threatened by a highly lethal infectious disease, viral hemorrhagic septicemia (VHS), caused by the VHS virus (VHSV). For the past few decades, the subgenogroup Ia of VHSV has been the main cause of VHS outbreaks in European freshwater-farmed rainbow trout. Little is currently known, however, about the phylogenetic radiation of this Ia lineage into subordinate Ia clades and their subsequent geographical spread routes. We investigated this topic using the largest Ia-isolate dataset ever compiled, comprising 651 complete G gene sequences: 209 GenBank Ia isolates and 442 Ia isolates from this study. The sequences come from 11 European countries and cover the period 1971-2015. Based on this dataset, we documented the extensive spread of the Ia population and the strong mixing of Ia isolates, assumed to be the result of the Europe-wide trout trade. For example, the Ia lineage underwent a radiation into nine Ia clades, most of which are difficult to allocate to a specific geographic distribution. Furthermore, we found indications for two rapid, large-scale population growth events, and identified three polytomies among the Ia clades, both of which possibly indicate a rapid radiation. However, only about 4% of Ia haplotypes (out of 398) occur in more than one European country. This apparently conflicting finding regarding the Europe-wide spread and mixing of Ia isolates can be explained by the high mutation rate of VHSV. Accordingly, the mean period of occurrence of a single Ia haplotype was less than a full year, and we found a substitution rate of up to 7.813 × 10<sup>-4</sup> nucleotides per site per year. Finally, we documented significant differences between Germany and Denmark regarding their VHS epidemiology, apparently due to those countries' individual handling of VHS.</p

Crossref

Directory of Open Access Journals

PubMed Central

Bern Open Repository and Information System (BORIS)

Online Research Database In Technology

FigShare