Search CORE

1,734 research outputs found

Speeding up Context-based Sentence Representation Learning with Non-autoregressive Convolutional Decoding

Author: de Sa Virginia R.
Fang Chen
Jin Hailin
Tang Shuai
Wang Zhaowen
Publication venue
Publication date: 01/01/2018
Field of study

Context plays an important role in human language understanding, thus it may also be useful for machines learning vector representations of language. In this paper, we explore an asymmetric encoder-decoder structure for unsupervised context-based sentence representation learning. We carefully designed experiments to show that neither an autoregressive decoder nor an RNN decoder is required. After that, we designed a model which still keeps an RNN as the encoder, while using a non-autoregressive convolutional decoder. We further combine a suite of effective designs to significantly improve model efficiency while also achieving better performance. Our model is trained on two different large unlabelled corpora, and in both cases the transferability is evaluated on a set of downstream NLP tasks. We empirically show that our model is simple and fast while producing rich sentence representations that excel in downstream tasks

arXiv.org e-Print Archive

Crossref

Econometrics meets sentiment : an overview of methodology and applications

Author: Algaba Andres
Ardia David
Bluteau Keven
Borms Samuel
Boudt Kris
Publication venue: 'Wiley'
Publication date: 01/01/2020
Field of study

The advent of massive amounts of textual, audio, and visual data has spurred the development of econometric methodology to transform qualitative sentiment data into quantitative sentiment variables, and to use those variables in an econometric analysis of the relationships between sentiment and other variables. We survey this emerging research field and refer to it as sentometrics, which is a portmanteau of sentiment and econometrics. We provide a synthesis of the relevant methodological approaches, illustrate with empirical results, and discuss useful software

VU Research Portal

Crossref

Ghent University Academic Bibliography

Volatility Prediction using Financial Disclosures Sentiments with Word Embedding-based IR Models

Author: Anderson Linda
Baklanov Artem
Duer Alexander
Hanbury Allan
Lupu Mihai
Rekabsaz Navid
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

Volatility prediction--an essential concept in financial markets--has recently been addressed using sentiment analysis methods. We investigate the sentiment of annual disclosures of companies in stock markets to forecast volatility. We specifically explore the use of recent Information Retrieval (IR) term weighting models that are effectively extended by related terms using word embeddings. In parallel to textual information, factual market data have been widely used as the mainstream approach to forecast market risk. We therefore study different fusion methods to combine text and market data resources. Our word embedding-based approach significantly outperforms state-of-the-art methods. In addition, we investigate the characteristics of the reports of the companies in different financial sectors

arXiv.org e-Print Archive

Crossref

International Institute for Applied Systems Analysis (IIASA)

The impact of news narrative on the economy and financial markets

Author: Tilly Sonja
Publication venue: UCL (University College London)
Publication date: 28/05/2022
Field of study

This thesis investigates the impact of news narrative on socio-economic systems across four experiments. Recent years have witnessed a rise in the use of so-called alternative data sources to model and predict dynamics in socio-economic systems. Notably, sources such as newspaper text allow researchers to quantify the elusive concept of narrative, to incorporate text-based features into forecasting frameworks and thus to evaluate the impact of narrative on economic events. The first experiment proposes a new method of incorporating a wide array of sentiment scores from global newspaper articles into macroeconomic forecasts, attempting to forecast industrial production and consumer prices leveraging narrative and sentiment from global newspapers. I model industrial production and consumer prices across a diverse range of economies using an autoregressive framework. The second experiment uses narrative from global newspapers to construct themebased knowledge graphs about world events, demonstrating that features extracted from such graphs improve forecasts of industrial production in three large economies. The third experiment proposes a novel method of including news themes and their associated sentiment into predictions of changes in breakeven inflation rates (BEIR) for eight diverse economies with mature fixed income markets. I utilise five types of machine learning algorithms incorporating narrative-based features for each economy. In the above experiments, models incorporating narrative-based features generally outperform their benchmarks that do not contain such variables, demonstrating the predictive power of features derived from news narrative. The fourth experiment utilises GDELT data and the filtering methodology introduced in the first experiment to create a profitable systematic trading strategy based on the average tone scores for 15 diverse economies

UCL Discovery

Benchmarking down-scaled versions of large pre-trained language models

Author: Schulze Patrick
Publication venue
Publication date: 17/12/2020
Field of study

Open Access LMU

Analyzing and Interpreting Neural Networks for NLP: A Report on the First BlackboxNLP Workshop

Author: Alishahi Afra
Chrupała Grzegorz
Linzen Tal
Publication venue
Publication date: 05/04/2019
Field of study

The EMNLP 2018 workshop BlackboxNLP was dedicated to resources and techniques specifically developed for analyzing and understanding the inner-workings and representations acquired by neural models of language. Approaches included: systematic manipulation of input to neural networks and investigating the impact on their performance, testing whether interpretable knowledge can be decoded from intermediate representations acquired by neural networks, proposing modifications to neural network architectures to make their knowledge state or generated output more explainable, and examining the performance of networks on simplified or formal languages. Here we review a number of representative studies in each category

arXiv.org e-Print Archive

Tilburg University Repository