Predicting the Quality of Short Narratives from Social Media
An important and difficult challenge in building computational models for
narratives is the automatic evaluation of narrative quality. Quality evaluation
connects narrative understanding and generation as generation systems need to
evaluate their own products. To circumvent difficulties in acquiring
annotations, we employ upvotes in social media as an approximate measure for
story quality. We collected 54,484 answers from a crowd-powered
question-and-answer website, Quora, and then used active learning to build a
classifier that labeled 28,320 answers as stories. To predict the number of
upvotes without the use of social network features, we create neural networks
that model textual regions and the interdependence among regions, which serve
as strong benchmarks for future research. To the best of our knowledge, this is the
first large-scale study for automatic evaluation of narrative quality.
Comment: 7 pages, 2 figures. Accepted at the 2017 IJCAI conferenc
Two-Sample Tests for High Dimensional Means with Thresholding and Data Transformation
We consider testing for two-sample means of high dimensional populations by
thresholding. Two tests are investigated, which are designed for better power
performance when the two population mean vectors differ only in sparsely
populated coordinates. The first test is constructed by carrying out
thresholding to remove the non-signal bearing dimensions. The second test
combines data transformation via the precision matrix with the thresholding.
The benefits of the thresholding and the data transformations are shown by a
reduced variance of the thresholding test statistics, improved power, and a
wider detection region of the tests. Simulation experiments and an empirical
study are performed to confirm the theoretical findings and to demonstrate the
practical implementations.
Comment: 64 page
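The thresholding idea behind the first test can be illustrated with a minimal sketch. This is not the authors' procedure: the threshold level 2 s log p, the function names, and the permutation calibration are all assumptions made for illustration (the abstract does not state how the statistic is calibrated, and the precision-matrix transformation of the second test is omitted). The sketch computes a squared two-sample t-statistic per coordinate, discards coordinates below the threshold, and sums the rest.

```python
import math
import random
from statistics import mean, variance


def thresholded_statistic(x, y, s=1.0):
    """Sum squared per-coordinate t-statistics that exceed a threshold.

    x, y: lists of samples, each sample a list of p coordinates.
    s: hypothetical tuning constant; the threshold 2*s*log(p) is an
       assumed choice, not taken from the abstract.
    Returns (statistic, indices of coordinates kept).
    """
    n1, n2, p = len(x), len(y), len(x[0])
    lam = 2.0 * s * math.log(p)
    total, kept = 0.0, []
    for k in range(p):
        xk = [row[k] for row in x]
        yk = [row[k] for row in y]
        se2 = variance(xk) / n1 + variance(yk) / n2
        if se2 == 0:
            continue  # skip degenerate (constant) coordinates
        t2 = (mean(xk) - mean(yk)) ** 2 / se2
        if t2 > lam:  # keep only signal-bearing coordinates
            total += t2
            kept.append(k)
    return total, kept


def permutation_p_value(x, y, s=1.0, n_perm=200, seed=0):
    """Calibrate the statistic by permuting group labels.

    Permutation calibration is swapped in here for simplicity; it is
    an assumption of this sketch, not the method of the paper.
    """
    rng = random.Random(seed)
    observed, _ = thresholded_statistic(x, y, s)
    pooled = x + y
    n1 = len(x)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        stat, _ = thresholded_statistic(pooled[:n1], pooled[n1:], s)
        if stat >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)
```

Calling `thresholded_statistic(x, y)` on two samples whose means differ in only a few coordinates should keep mostly those coordinates, which is the sparse setting the abstract targets.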