Search CORE

23 research outputs found

Measuring alignment bias in neural Seq2Seq semantic parsers

Author: Locatelli Davide
Quattoni Ariadna Julieta
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2022
Field of study

Prior to deep learning the semantic parsing community has been interested in understanding and modeling the range of possible word alignments between natural language sentences and their corresponding meaning representations. Sequence-to-sequence models changed the research landscape suggesting that we no longer need to worry about alignments since they can be learned automatically by means of an attention mechanism. More recently, researchers have started to question such premise. In this work we investigate whether seq2seq models can handle both simple and complex alignments. To answer this question we augment the popular GEO semantic parsing dataset with alignment annotations and create GEO-ALIGNED. We then study the performance of standard seq2seq models on the examples that can be aligned monotonically versus examples that require more complex alignments. Our empirical study shows that performance is significantly better over monotonic alignments.This work is supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No.853459).Peer ReviewedPostprint (published version

UPCommons. Portal del coneixement obert de la UPC

Spectral learning of transducers over continuous sequences

Author: Quattoni Ariadna Julieta
Recasens Adria
Publication venue
Publication date: 01/01/2013
Field of study

In this paper we present a spectral algorithm for learning weighted nite state transducers (WFSTs) over paired input-output sequences, where the input is continuous and the output discrete. WFSTs are an important tool for modeling paired input-output sequences and have numerous applications in real-world problems. Recently, Balle et al (2011) proposed a spectral method for learning WFSTs that overcomes some of the well known limitations of gradient-based or EM optimizations which can be computationally expensive and su er from local optima issues. Their algorithm can model distributions where both inputs and outputs are sequences from a discrete alphabet. However, many real world problems require modeling paired sequences where the inputs are not discrete but continuos sequences. Modelling continuous sequences with spectral methods has been studied in the context of HMMs (Song et al 2010), where a spectral algorithm for this case was derived. In this paper we follow that line of work and propose a spectral learning algorithm for modelling paired input-output sequences where the inputs are continuous and the outputs are discrete. Our approach is based on generalizing the class of weighted nite state transducers over discrete input-output sequences to a class where transitions are linear combinations of elementary transitions and the weights of this linear combinations are determined by dynamic features of the continuous input sequence. At its core, the algorithm is simple and scalable to large data sets. We present experiments on a real task that validate the eff ectiveness of the proposed approach.Postprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Learning task-specific bilexical embeddings

Author: Carreras Pérez Xavier
Madhyastha Pranava S.
Quattoni Ariadna Julieta
Publication venue
Publication date: 01/01/2014
Field of study

We present a method that learns bilexical operators over distributional representations of words and leverages supervised data for a linguistic relation. The learning algorithm exploits lowrank bilinear forms and induces low-dimensional embeddings of the lexical space tailored for the target linguistic relation. An advantage of imposing low-rank constraints is that prediction is expressed as the inner-product between low-dimensional embeddings, which can have great computational benefits. In experiments with multiple linguistic bilexical relations we show that our method effectively learns using embeddings of a few dimensions.Peer ReviewedPostprint (published version

UPCommons. Portal del coneixement obert de la UPC

Unsupervised spectral learning of WCFG as low-rank matrix completion

Author: Bailly Raphaël
Carreras Pérez Xavier
Luque Franco M.
Quattoni Ariadna Julieta
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2013
Field of study

We derive a spectral method for unsupervised learning ofWeighted Context Free Grammars. We frame WCFG induction as finding a Hankel matrix that has low rank and is linearly constrained to represent a function computed by inside-outside recursions. The proposed algorithm picks the grammar that agrees with a sample and is the simplest with respect to the nuclear norm of the Hankel matrix.Peer ReviewedPreprin

UPCommons. Portal del coneixement obert de la UPC

Spectral regularization for max-margin sequence tagging

Author: Balle Pigem Borja de
Carreras Pérez Xavier
Globerson Amir
Quattoni Ariadna Julieta
Publication venue
Publication date: 01/01/2014
Field of study

We frame max-margin learning of latent variable structured prediction models as a convex optimization problem, making use of scoring functions computed by input-output observable operator models. This learning problem can be expressed as an optimization problem involving a low-rank Hankel matrix that represents the inputoutput operator model. The direct outcome of our work is a new spectral regularization method for max-margin structured prediction. Our experiments confirm that our proposed regularization framework leads to an effective way of controlling the capacity of structured prediction models.Peer ReviewedPostprint (published version

UPCommons. Portal del coneixement obert de la UPC

Semantic tuples for evaluation of image sentence generation

Author: Cordero Rama Jose Alejandro
Ellebracht Lily Delores
Moreno-Noguer Francesc
Quattoni Ariadna Julieta
Ramisa Ayats Arnau
Shantharam Madhyastha Pranava Swaroop
Publication venue
Publication date: 01/01/2015
Field of study

The automatic generation of image captions has received considerable attention. The problem of evaluating caption generation systems, though, has not been that much explored. We propose a novel evaluation approach based on comparing the underlying visual semantics of the candidate and ground-truth captions. With this goal in mind we have defined a semantic representation for visually descriptive language and have augmented a subset of the Flickr-8K dataset with semantic annotations. Our evaluation metric (BAST) can be used not only to compare systems but also to do error analysis and get a better understanding of the type of mistakes a system does. To compute BAST we need to predict the semantic representation for the automatically generated captions. We use the Flickr-ST dataset to train classifiers that predict STs so that evaluation can be fully automated.Peer ReviewedPostprint (author's final draft

UPCommons. Portal del coneixement obert de la UPC

Spectral learning of transducers over continuous sequences

Author: Quattoni Ariadna Julieta
Recasens Adria
Publication venue
Publication date: 01/01/2013
Field of study

Spectral learning of sequence taggers over continuous sequences

Author: Quattoni Ariadna Julieta
Recasens Adria
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

In this paper we present a spectral algorithm for learning weighted finite-state sequence taggers (WFSTs) over paired input-output sequences, where the input is continuous and the output discrete. WFSTs are an important tool for modelling paired input-output sequences and have numerous applications in real-world problems. Our approach is based on generalizing the class of weighted finite-state sequence taggers over discrete input-output sequences to a class where transitions are linear combinations of elementary transitions and the weights of the linear combination are determined by dynamic features of the continuous input sequence. The resulting learning algorithm is efficient and accurate.Peer Reviewe

UPCommons. Portal del coneixement obert de la UPC

Spectral learning of transducers over continuous sequences

Author: Quattoni Ariadna Julieta
Recasens Adria
Publication venue
Publication date
Field of study

RECERCAT

Spectral learning of sequence taggers over continuous sequences

Author: Quattoni Ariadna Julieta
Recasens Adria
Publication venue: Springer-Verlag
Publication date
Field of study

RECERCAT