149 research outputs found

    Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading

    Full text link
    Eye movements during reading offer insights into both the reader's cognitive processes and the characteristics of the text that is being read. Hence, the analysis of scanpaths in reading have attracted increasing attention across fields, ranging from cognitive science over linguistics to computer science. In particular, eye-tracking-while-reading data has been argued to bear the potential to make machine-learning-based language models exhibit a more human-like linguistic behavior. However, one of the main challenges in modeling human scanpaths in reading is their dual-sequence nature: the words are ordered following the grammatical rules of the language, whereas the fixations are chronologically ordered. As humans do not strictly read from left-to-right, but rather skip or refixate words and regress to previous words, the alignment of the linguistic and the temporal sequence is non-trivial. In this paper, we develop Eyettention, the first dual-sequence model that simultaneously processes the sequence of words and the chronological sequence of fixations. The alignment of the two sequences is achieved by a cross-sequence attention mechanism. We show that Eyettention outperforms state-of-the-art models in predicting scanpaths. We provide an extensive within- and across-data set evaluation on different languages. An ablation study and qualitative analysis support an in-depth understanding of the model's behavior

    Is media just noise? The link between media factors and stock performance

    Get PDF
    PURPOSE OF THE STUDY Interest towards media analytics has increased significantly by both practitioners and academia alike. The hot topic is whether or not qualitative texts contain information relevant to stock financials, and if they do, whether the impact can be used to earn abnormal returns. In order to answer this, we study the impact media factors have on financial metrics in a novel specification that combines all the major media factors in a holistic media model. To transform qualitative texts information into a "sentiment score", we develop a new methodology to estimate sentiment more accurately than currently prevailing methods. DATA AND METHODOLOGY Our study focuses on the S&P 100 constituents between the time period of 2006 and 2011. As a source of qualitative texts, we use major news publications and earnings announcements retrieved from LexisNexis -database using a web scraper program developed for the purpose of this study. We retrieve the financials data for our study using Thomson Reuters Datastream -database. In order to estimate investor sentiment, we employ both the customary word count, as well as our novel Linearized Phrase-Structure -methodology. For word count, we test the Harvard Psychological -dictionary and a finance-specific dictionary by Loughran and McDonald (2011). As our data is panel in nature, we analyze the correlations in our error terms in line with Petersen (2009), first without clustering and then clustering by firm and by time. We find time-effect in our error terms, and therefore employ a Fama-Macbeth (1973) methodology with clustering done in quarters. To mitigate a methodological choice driving our results, we run our specifications with a multitude of alternative specifications. RESULTS We find that Linearized Phrase-Structure (LPS) outperforms the predominant naïve word count methodology. Also, we find that if employing word counts, researchers should employ context dependent dictionaries, such as Loughran and McDonald's (2011). In terms of our main variables, we find that the existing media factors are not mutually exclusive, and impact financial metrics in chorus. Alas, we do not find statistically significant relationship between sentiment and abnormal returns. However, we find a relationship between aggregate market news volume and abnormal returns, and also between sentiment and abnormal volatility. We infer that our findings support limited attention -theory, and provide evidence against market efficiency

    Sentiment Analysis for Social Media

    Get PDF
    Sentiment analysis is a branch of natural language processing concerned with the study of the intensity of the emotions expressed in a piece of text. The automated analysis of the multitude of messages delivered through social media is one of the hottest research fields, both in academy and in industry, due to its extremely high potential applicability in many different domains. This Special Issue describes both technological contributions to the field, mostly based on deep learning techniques, and specific applications in areas like health insurance, gender classification, recommender systems, and cyber aggression detection
    • …
    corecore