485 research outputs found
The Limiting Spectra of Girko's Block-Matrix
To analyze the limiting spectral distribution of some random block-matrices,
Girko [Girko, 2000] uses a system of canonical equations from [Girko, 98]. In
this paper, we use the method of moments to give an integral form for the
almost sure limiting spectral distribution of such matrices.Comment: 10 page
The spectral laws of Hermitian block-matrices with large random blocks
We are going to study the limiting spectral measure of fixed dimensional
Hermitian block-matrices with large dimensional Wigner blocks. We are going
also to identify the limiting spectral measure when the Hermitian
block-structure is Circulant. Using the limiting spectral measure of a
Hermitian Circulant block-matrix we will show that the spectral measure of a
Wigner matrix with weakly dependent entries need not to be the semicircle
law in the limit
Harvesting Creative Templates for Generating Stylistically Varied Restaurant Reviews
Many of the creative and figurative elements that make language exciting are
lost in translation in current natural language generation engines. In this
paper, we explore a method to harvest templates from positive and negative
reviews in the restaurant domain, with the goal of vastly expanding the types
of stylistic variation available to the natural language generator. We learn
hyperbolic adjective patterns that are representative of the strongly-valenced
expressive language commonly used in either positive or negative reviews. We
then identify and delexicalize entities, and use heuristics to extract
generation templates from review sentences. We evaluate the learned templates
against more traditional review templates, using subjective measures of
"convincingness", "interestingness", and "naturalness". Our results show that
the learned templates score highly on these measures. Finally, we analyze the
linguistic categories that characterize the learned positive and negative
templates. We plan to use the learned templates to improve the conversational
style of dialogue systems in the restaurant domain.Comment: 9 pages, 2 figures, Stylistic Variation Workshop at EMNLP 201
On slow-fading non-separable correlation MIMO systems
In a frequency selective slow-fading channel in a MIMO system, the channel
matrix is of the form of a block matrix. We propose a method to calculate the
limit of the eigenvalue distribution of block matrices if the size of the
blocks tends to infinity. We will also calculate the asymptotic eigenvalue
distribution of , where the entries of are jointly Gaussian, with a
correlation of the form (where is fixed and does not increase
with the size of the matrix). We will use an operator-valued free probability
approach to achieve this goal. Using this method, we derive a system of
equations, which can be solved numerically to compute the desired eigenvalue
distribution.Comment: 24 pages and 3 figure
Are you serious?: Rhetorical Questions and Sarcasm in Social Media Dialog
Effective models of social dialog must understand a broad range of rhetorical
and figurative devices. Rhetorical questions (RQs) are a type of figurative
language whose aim is to achieve a pragmatic goal, such as structuring an
argument, being persuasive, emphasizing a point, or being ironic. While there
are computational models for other forms of figurative language, rhetorical
questions have received little attention to date. We expand a small dataset
from previous work, presenting a corpus of 10,270 RQs from debate forums and
Twitter that represent different discourse functions. We show that we can
clearly distinguish between RQs and sincere questions (0.76 F1). We then show
that RQs can be used both sarcastically and non-sarcastically, observing that
non-sarcastic (other) uses of RQs are frequently argumentative in forums, and
persuasive in tweets. We present experiments to distinguish between these uses
of RQs using SVM and LSTM models that represent linguistic features and
post-level context, achieving results as high as 0.76 F1 for "sarcastic" and
0.77 F1 for "other" in forums, and 0.83 F1 for both "sarcastic" and "other" in
tweets. We supplement our quantitative experiments with an in-depth
characterization of the linguistic variation in RQs.Comment: 10 pages, 1 figure, SIGDIAL 201
Learning Lexico-Functional Patterns for First-Person Affect
Informal first-person narratives are a unique resource for computational
models of everyday events and people's affective reactions to them. People
blogging about their day tend not to explicitly say I am happy. Instead they
describe situations from which other humans can readily infer their affective
reactions. However current sentiment dictionaries are missing much of the
information needed to make similar inferences. We build on recent work that
models affect in terms of lexical predicate functions and affect on the
predicate's arguments. We present a method to learn proxies for these functions
from first-person narratives. We construct a novel fine-grained test set, and
show that the patterns we learn improve our ability to predict first-person
affective reactions to everyday events, from a Stanford sentiment baseline of
.67F to .75F.Comment: 7 pages, Association for Computational Linguistics (ACL) 201
Neural MultiVoice Models for Expressing Novel Personalities in Dialog
Natural language generators for task-oriented dialog should be able to vary
the style of the output utterance while still effectively realizing the system
dialog actions and their associated semantics. While the use of neural
generation for training the response generation component of conversational
agents promises to simplify the process of producing high quality responses in
new domains, to our knowledge, there has been very little investigation of
neural generators for task-oriented dialog that can vary their response style,
and we know of no experiments on models that can generate responses that are
different in style from those seen during training, while still maintain- ing
semantic fidelity to the input meaning representation. Here, we show that a
model that is trained to achieve a single stylis- tic personality target can
produce outputs that combine stylistic targets. We carefully evaluate the
multivoice outputs for both semantic fidelity and for similarities to and
differences from the linguistic features that characterize the original
training style. We show that contrary to our predictions, the learned models do
not always simply interpolate model parameters, but rather produce styles that
are distinct, and novel from the personalities they were trained on.Comment: Interspeech 201
And That's A Fact: Distinguishing Factual and Emotional Argumentation in Online Dialogue
We investigate the characteristics of factual and emotional argumentation
styles observed in online debates. Using an annotated set of "factual" and
"feeling" debate forum posts, we extract patterns that are highly correlated
with factual and emotional arguments, and then apply a bootstrapping
methodology to find new patterns in a larger pool of unannotated forum posts.
This process automatically produces a large set of patterns representing
linguistic expressions that are highly correlated with factual and emotional
language. Finally, we analyze the most discriminating patterns to better
understand the defining characteristics of factual and emotional arguments.Comment: 11 pages, 6 figures, Proceedings of the 2nd Workshop on Argumentation
Mining at NAACL 201
Combining Search with Structured Data to Create a More Engaging User Experience in Open Domain Dialogue
The greatest challenges in building sophisticated open-domain conversational
agents arise directly from the potential for ongoing mixed-initiative
multi-turn dialogues, which do not follow a particular plan or pursue a
particular fixed information need. In order to make coherent conversational
contributions in this context, a conversational agent must be able to track the
types and attributes of the entities under discussion in the conversation and
know how they are related. In some cases, the agent can rely on structured
information sources to help identify the relevant semantic relations and
produce a turn, but in other cases, the only content available comes from
search, and it may be unclear which semantic relations hold between the search
results and the discourse context. A further constraint is that the system must
produce its contribution to the ongoing conversation in real-time. This paper
describes our experience building SlugBot for the 2017 Alexa Prize, and
discusses how we leveraged search and structured data from different sources to
help SlugBot produce dialogic turns and carry on conversations whose length
over the semi-finals user evaluation period averaged 8:17 minutes.Comment: SCAI 201
Exploring Conversational Language Generation for Rich Content about Hotels
Dialogue systems for hotel and tourist information have typically simplified
the richness of the domain, focusing system utterances on only a few selected
attributes such as price, location and type of rooms. However, much more
content is typically available for hotels, often as many as 50 distinct
instantiated attributes for an individual entity. New methods are needed to use
this content to generate natural dialogues for hotel information, and in
general for any domain with such rich complex content. We describe three
experiments aimed at collecting data that can inform an NLG for hotels
dialogues, and show, not surprisingly, that the sentences in the original
written hotel descriptions provided on webpages for each hotel are
stylistically not a very good match for conversational interaction. We quantify
the stylistic features that characterize the differences between the original
textual data and the collected dialogic data. We plan to use these in stylistic
models for generation, and for scoring retrieved utterances for use in hotel
dialoguesComment: This version contains updates to the version published at LREC '1
- …