Methods to integrate a language model with semantic information for a word prediction component
Most current word prediction systems make use of n-gram language models (LM)
to estimate the probability of the following word in a phrase. In recent
years there have been many attempts to enrich such language models with further
syntactic or semantic information. We want to explore the predictive powers of
Latent Semantic Analysis (LSA), a method that has been shown to provide
reliable information on long-distance semantic dependencies between words in a
context. We present and evaluate here several methods that integrate LSA-based
information with a standard language model: a semantic cache, partial
reranking, and different forms of interpolation. We found that all methods show
significant improvements compared to the 4-gram baseline, and most of them improve over a simple cache model as well.
Comment: 10 pages; EMNLP'2007 Conference (Prague)
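One of the integration methods named above, interpolation, can be sketched in a few lines. This is a minimal illustration, not the paper's exact formulation: the LSA vectors, the rescaling of the cosine score, and the interpolation weight `lam` are all assumptions made for the example.

```python
import numpy as np

def lsa_similarity(word_vec, context_vec):
    """Cosine similarity between a candidate word's LSA vector and a
    running context vector (both hypothetical here)."""
    denom = np.linalg.norm(word_vec) * np.linalg.norm(context_vec)
    return 0.0 if denom == 0 else float(word_vec @ context_vec / denom)

def interpolate(p_ngram, sim, lam=0.7):
    """Linearly interpolate the n-gram probability with an LSA score
    rescaled from [-1, 1] to [0, 1]; lam is an illustrative weight."""
    lsa_score = (sim + 1.0) / 2.0
    return lam * p_ngram + (1.0 - lam) * lsa_score
```

In a real system the LSA scores would additionally be normalized over the whole vocabulary so that the interpolated values remain a proper probability distribution.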
ASR error management for improving spoken language understanding
This paper addresses the problem of automatic speech recognition (ASR) error
detection and the use of detected errors for improving spoken language
understanding (SLU) systems. In this study, the SLU task consists in
automatically extracting semantic concepts and concept/value pairs from ASR
transcriptions, e.g. in a touristic information system. An approach is
proposed for enriching the set of semantic labels with error-specific labels
and for using a recently proposed neural approach based on word embeddings to
compute well-calibrated ASR confidence measures. Experimental results show
that it is possible to significantly decrease the Concept/Value Error Rate
with a state-of-the-art system, outperforming previously published results on
the same experimental data. It is also shown that by combining an SLU approach
based on conditional random fields with a neural attention-based
encoder/decoder architecture, it is possible to effectively identify
confidence islands and uncertain semantic output segments, useful for deciding
appropriate error handling actions by the dialogue manager strategy.
Comment: Interspeech 2017, Aug 2017, Stockholm, Sweden
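The idea of enriching semantic labels with error-specific labels can be illustrated with a small sketch. The `_ERR` suffix, the label scheme, and the confidence threshold are assumptions chosen for the example, not the paper's actual tag set.

```python
def enrich_labels(tokens, labels, confidences, threshold=0.5):
    """Append a hypothetical error-specific suffix to the semantic label
    of each ASR token whose confidence falls below a threshold."""
    enriched = []
    for tok, lab, conf in zip(tokens, labels, confidences):
        # low-confidence tokens get an error-marked variant of their label
        enriched.append(lab + "_ERR" if conf < threshold else lab)
    return enriched
```

An SLU tagger trained on such enriched labels can then learn to emit error-aware concepts directly, which is one plausible reading of the approach described above.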
SINVAD: Search-based Image Space Navigation for DNN Image Classifier Test Input Generation
The testing of Deep Neural Networks (DNNs) has become increasingly important
as DNNs are widely adopted by safety critical systems. While many test adequacy
criteria have been suggested, automated test input generation for many types of
DNNs remains a challenge because the raw input space is too large to randomly
sample or to navigate and search for plausible inputs. Consequently, current
testing techniques for DNNs depend on small local perturbations to existing
inputs, based on the metamorphic testing principle. We propose new ways to
search not over the entire image space, but rather over a plausible input space
that resembles the true training distribution. This space is constructed using
Variational Autoencoders (VAEs), and navigated through their latent vector
space. We show that this space helps efficiently produce test inputs that can
reveal information about the robustness of DNNs when dealing with realistic
tests, opening the field to meaningful exploration through the space of highly
structured images.
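A search over a VAE latent space, as described above, can be sketched as simple hill climbing. The `decode` and `fitness` callables, the step size, and the iteration budget are all placeholders; the actual search operators used by SINVAD may differ.

```python
import numpy as np

def latent_search(decode, fitness, z0, step=0.1, iters=100, rng=None):
    """Hill-climbing over a VAE latent vector: perturb z with Gaussian
    noise, decode to an image, keep the move if fitness improves.
    decode/fitness are assumed user-supplied callables."""
    rng = rng or np.random.default_rng(0)
    z = z0.copy()
    best = fitness(decode(z))
    for _ in range(iters):
        candidate = z + step * rng.standard_normal(z.shape)
        score = fitness(decode(candidate))
        if score > best:          # accept only improving moves
            z, best = candidate, score
    return z, best
```

Because every candidate is decoded by the VAE, each generated test input stays close to the learned training distribution rather than wandering into implausible regions of raw pixel space.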
Distributional analyses in the picture-word interference paradigm: Exploring the semantic interference and the distractor frequency effects.
The present study explores the distributional features of two important effects within the picture-word interference paradigm: the semantic interference and the distractor frequency effects. These two effects display different and specific distributional profiles. Semantic interference appears greatly reduced in faster response times, while it reaches its full magnitude only in slower responses. This can be interpreted as a sign of fluctuating attentional efficiency in resolving response conflict. In contrast, the distractor frequency effect is mediated mainly by a distributional shift, with low-frequency distractors uniformly shifting the reaction time distribution towards a slower range of latencies. This finding fits with the idea that distractor frequency exerts its effect by modulating the point in time at which the operations required to discard the distractor can start. Taken together, these results are congruent with current theoretical accounts of both the semantic interference and distractor frequency effects. Critically, distributional analyses highlight and further describe the different cognitive dynamics underlying these two effects, suggesting that this analytical tool is able to offer important insights about lexical access during speech production.
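The distributional (Vincentile/delta-plot) analysis the abstract relies on can be sketched as follows. This is a generic illustration of the technique, not the study's analysis code; the bin count is an arbitrary choice.

```python
import numpy as np

def vincentiles(rts, n_bins=5):
    """Mean reaction time within each of n_bins rank-ordered bins
    (Vincentizing): sort the RTs, split into equal bins, average each."""
    rts = np.sort(np.asarray(rts, dtype=float))
    return [chunk.mean() for chunk in np.array_split(rts, n_bins)]

def delta_plot(rts_slow_cond, rts_fast_cond, n_bins=5):
    """Per-bin effect size (slow condition minus fast condition); a
    growing delta indicates an effect concentrated in slow responses."""
    return [a - b for a, b in zip(vincentiles(rts_slow_cond, n_bins),
                                  vincentiles(rts_fast_cond, n_bins))]
```

A semantic interference effect that "reaches its full magnitude only in slower responses" would show deltas that increase across bins, whereas a pure distributional shift (the frequency effect) would show roughly constant deltas.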
Zero-Shot Event Detection by Multimodal Distributional Semantic Embedding of Videos
We propose a new zero-shot Event Detection method by Multi-modal
Distributional Semantic embedding of videos. Our model embeds object and action
concepts as well as other available modalities from videos into a
distributional semantic space. To our knowledge, this is the first Zero-Shot
event detection model that is built on top of distributional semantics and
extends it in the following directions: (a) semantic embedding of multimodal
information in videos (with focus on the visual modalities), (b) automatically
determining relevance of concepts/attributes to a free text query, which could
be useful for other applications, and (c) retrieving videos by free text event
query (e.g., "changing a vehicle tire") based on their content. We embed videos
into a distributional semantic space and then measure the similarity between
videos and the event query in a free text form. We validated our method on the
large TRECVID MED (Multimedia Event Detection) challenge. Using only the event
title as a query, our method outperformed the state of the art, which relies
on full event descriptions, improving the MAP metric from 12.6% to 13.5% and
ROC-AUC from 0.73 to 0.83. It is also an order of magnitude faster.
Comment: To appear in AAAI 201
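The core retrieval step, measuring similarity between videos and a free-text query in a shared distributional space, can be sketched as below. The concept-score weighting and the cosine ranking are plausible assumptions for the example; the paper's embedding pipeline is more elaborate.

```python
import numpy as np

def embed_video(concept_vectors, concept_scores):
    """Embed a video as the detector-score-weighted average of the word
    vectors of its detected concepts (a hypothetical pooling scheme)."""
    w = np.asarray(concept_scores, dtype=float)
    v = (w[:, None] * np.asarray(concept_vectors, dtype=float)).sum(axis=0)
    return v / w.sum()

def rank_videos(video_embeddings, query_embedding):
    """Return video indices sorted by cosine similarity to the query."""
    def cos(a, b):
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
    sims = [cos(v, query_embedding) for v in video_embeddings]
    return sorted(range(len(sims)), key=lambda i: -sims[i])
```

Because both the query (e.g., "changing a vehicle tire") and the videos live in the same distributional semantic space, ranking requires no event-specific training examples, which is what makes the detection zero-shot.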