
    Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data

    In sequence modeling, we often wish to represent complex interactions between labels, such as when performing multiple, cascaded labeling tasks on the same sequence, or when long-range dependencies exist. We present dynamic conditional random fields (DCRFs), a generalization of linear-chain conditional random fields (CRFs) in which each time slice contains a set of state variables and edges, giving a distributed state representation as in dynamic Bayesian networks (DBNs), with parameters tied across slices. Since exact inference can be intractable in such models, we perform approximate inference using several schedules for belief propagation, including tree-based reparameterization (TRP). On a natural-language chunking task, we show that a DCRF performs better than a series of linear-chain CRFs, achieving comparable performance using only half the training data.
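
    The factored structure can be made concrete with a small scoring function. Below is a minimal sketch, in Python with NumPy, of the unnormalized log-score of a joint labeling in a two-chain factorial model (one chain of part-of-speech labels, one of chunk labels) with within-slice and across-slice edges and a single set of parameters shared by every time slice; the parameter names and shapes are illustrative assumptions, and the belief-propagation inference the abstract describes is not shown.

        import numpy as np

        # Hypothetical parameterization of a two-level factorial CRF slice;
        # one copy of each parameter is shared (tied) across all time slices.
        rng = np.random.default_rng(0)
        n_pos, n_chunk, n_feats, T = 5, 3, 8, 6

        W_pos   = rng.normal(size=(n_pos, n_feats))    # observation -> POS factors
        W_chunk = rng.normal(size=(n_chunk, n_feats))  # observation -> chunk factors
        A_pos   = rng.normal(size=(n_pos, n_pos))      # POS chain transitions
        A_chunk = rng.normal(size=(n_chunk, n_chunk))  # chunk chain transitions
        C       = rng.normal(size=(n_pos, n_chunk))    # within-slice POS-chunk edge

        def joint_score(x, pos, chunk):
            """Unnormalized log-score of a joint (POS, chunk) labeling of x."""
            s = 0.0
            for t in range(len(pos)):
                s += W_pos[pos[t]] @ x[t] + W_chunk[chunk[t]] @ x[t]
                s += C[pos[t], chunk[t]]               # cotemporal edge
                if t > 0:                              # across-slice edges
                    s += A_pos[pos[t-1], pos[t]] + A_chunk[chunk[t-1], chunk[t]]
            return s

        x = rng.normal(size=(T, n_feats))
        print(joint_score(x, rng.integers(n_pos, size=T), rng.integers(n_chunk, size=T)))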

    Local String Transduction as Sequence Labeling

    We show that the general problem of string transduction can be reduced to the problem of sequence labeling. While character deletions and insertions are allowed in string transduction, they do not exist in sequence labeling; we show how to overcome this difference. Our approach can be used with any sequence labeling algorithm, and it works best for problems in which string transduction imposes a strong notion of locality (no long-range dependencies). We experiment with spelling correction for social media, OCR correction, and morphological inflection, and find that our approach performs better than seq2seq models and yields state-of-the-art results in several cases.
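
    One plausible form of the reduction (the exact label scheme and the difflib-based alignment below are assumptions, not necessarily the paper's) labels every source character with the string it should become: a character mapping to itself is kept, an empty label is a deletion, and a multi-character label encodes an insertion attached to a neighboring character, so the target is recovered by concatenating the labels.

        import difflib

        def transduction_to_labels(source: str, target: str) -> list[str]:
            """Label each source character with its replacement string.
            Assumes a non-empty source; insertions attach to a neighbor."""
            labels = [""] * len(source)
            ops = difflib.SequenceMatcher(a=source, b=target, autojunk=False).get_opcodes()
            for op, i1, i2, j1, j2 in ops:
                if op == "equal":        # kept characters map to themselves
                    for k in range(i2 - i1):
                        labels[i1 + k] += source[i1 + k]
                elif op == "delete":     # label stays "": delete these characters
                    pass
                elif op == "replace":    # spread the target span over the source span
                    span, n = target[j1:j2], i2 - i1
                    for k in range(n):
                        labels[i1 + k] = span[k:k + 1] if k < n - 1 else span[k:]
                elif op == "insert":     # attach inserted characters to a neighbor
                    labels[max(i1 - 1, 0)] += target[j1:j2]
            return labels

        labels = transduction_to_labels("recieve", "receive")
        print(labels)                              # per-character replacement strings
        assert "".join(labels) == "receive"        # decoding is just concatenation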

    Towards Automatic Creation of Annotations to Foster Development of Named Entity Recognizers

    Named Entity Recognition (NER) is an essential step for many natural language processing tasks, including Information Extraction. Despite recent advances, particularly using deep learning techniques, the creation of accurate named entity recognizers remains a complex task, highly dependent on the availability of annotated data. To foster the existence of NER systems for new domains, it is crucial to obtain the required large volumes of annotated data with little or no manual labor. In this paper we propose a system that creates the annotated data automatically, by resorting to a set of existing NERs and information sources (DBpedia). The approach was tested with documents from the Tourism domain. Distinct methods were applied for deciding the final named entities and their respective tags. The results show that this approach can increase the confidence in annotations and/or increase the number of categories that can be annotated. The paper also presents examples of new NERs that can be rapidly created with the obtained annotated data. The annotated data, combined with the possibility of applying both the ensemble of NER systems and the new Gazetteer-based NERs to large corpora, creates the necessary conditions to explore recent state-of-the-art neural deep learning approaches to NER (e.g., BERT) in domains with scarce or nonexistent training data.
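
    A minimal sketch of one way the final entities might be decided: majority voting over (span, tag) candidates from several systems. The voting rule, threshold, and annotation format below are illustrative assumptions, since the paper compares several distinct decision methods.

        from collections import Counter

        def merge_annotations(system_outputs, min_votes=2):
            """system_outputs: one set of (start, end, tag) annotations per NER
            system, all over the same document; keep spans enough systems agree on."""
            votes = Counter()
            for annotations in system_outputs:
                votes.update(annotations)
            return {span_tag for span_tag, n in votes.items() if n >= min_votes}

        ner_a = {(0, 6, "LOC"), (24, 31, "ORG")}
        ner_b = {(0, 6, "LOC"), (24, 31, "PER")}   # one system disagrees on the tag
        ner_c = {(0, 6, "LOC"), (24, 31, "ORG")}
        print(merge_annotations([ner_a, ner_b, ner_c]))
        # {(0, 6, 'LOC'), (24, 31, 'ORG')}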

    Improving historical spelling normalization with bi-directional LSTMs and multi-task learning

    Natural-language processing of historical documents is complicated by the abundance of variant spellings and the lack of annotated data. A common approach is to normalize the spelling of historical words to modern forms. We explore the suitability of a deep neural network architecture for this task, particularly a deep bi-LSTM network applied at the character level. Our model compares well to previously established normalization algorithms when evaluated on a diverse set of texts from Early New High German. We show that multi-task learning with additional normalization data can further improve our model's performance.
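
    The shared encoder with task-specific output layers can be sketched briefly in PyTorch; the sizes, layer count, and per-character output formulation below are assumptions rather than the paper's exact configuration, illustrating hard parameter sharing between the normalization task and an auxiliary task.

        import torch
        import torch.nn as nn

        class CharBiLSTMNormalizer(nn.Module):
            """Character-level bi-LSTM shared by two tasks (multi-task learning)."""
            def __init__(self, n_chars, n_aux_labels, emb_dim=64, hidden=128):
                super().__init__()
                self.embed = nn.Embedding(n_chars, emb_dim)
                self.encoder = nn.LSTM(emb_dim, hidden, num_layers=2,
                                       bidirectional=True, batch_first=True)
                self.norm_head = nn.Linear(2 * hidden, n_chars)      # main task
                self.aux_head = nn.Linear(2 * hidden, n_aux_labels)  # auxiliary task

            def forward(self, char_ids):
                h, _ = self.encoder(self.embed(char_ids))  # (batch, len, 2*hidden)
                return self.norm_head(h), self.aux_head(h)

        model = CharBiLSTMNormalizer(n_chars=60, n_aux_labels=10)
        x = torch.randint(0, 60, (4, 12))          # batch of character-id sequences
        norm_logits, aux_logits = model(x)
        print(norm_logits.shape, aux_logits.shape)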