6 research outputs found
Stock portfolio selection using learning-to-rank algorithms with news sentiment
In this study, we apply learning-to-rank algorithms to design trading strategies
using relative performance of a group of stocks based on investors' sentiment
toward these stocks. We show that learning-to-rank algorithms are effective in
producing reliable rankings of the best and the worst performing stocks based
on investors' sentiment. More specifically, we use the sentiment shock and trend
indicators introduced in the previous studies, and we design stock selection rules
of holding long positions of the top 25% stocks and short positions of the bottom
25% stocks according to rankings produced by learning-to-rank algorithms.
We then apply two learning-to-rank algorithms, ListNet and RankNet, in stock
selection processes and test long-only and long-short portfolio selection strategies
using 10 years of market and news sentiment data. Through backtesting of
these strategies from 2006 to 2014, we demonstrate that our portfolio strategies
produce risk-adjusted returns superior to the S&P500 index return, the hedge
fund industry average performance - HFRIEMN, and some sentiment-based approaches
without learning-to-rank algorithm during the same period
The role of investor attention in global asset price variation during the invasion of Ukraine
We study the impact of event-specific attention indices -- based on Google
Trends -- in predictive price variation models before and during the Russian
invasion of Ukraine in February 2022. We extend our analyses to the importance
of geographical proximity and economic openness to Russia within 51 global
equity markets. Our results demonstrate that 36 countries show significant
attention to the conflict at the onset of and during the invasion, which helps
predict volatility. We find that the impact of attention is more significant in
countries with a higher degree of economic openness to Russia and those nearer
to it.Comment: This paper includes 17 pages, 3 figures and 8 table
๋คํธ์ํฌ ํ๋ํ ์ฑ์ง์ ๊ธฐ๋ฐํ ๊ธ์ต์์ฅ ์์ธก ๋ฐ ๊ฑฐ๋์ ๋ต
ํ์๋
ผ๋ฌธ (๋ฐ์ฌ) -- ์์ธ๋ํ๊ต ๋ํ์ : ๊ณต๊ณผ๋ํ ์ฐ์
๊ณตํ๊ณผ, 2020. 8. ์ฅ์ฐ์ง.Extensive academic research was performed for the financial market as it is closely connected to practical economy. Research in traditional financial economics resulted in economic indicators and the econometrics was instrumental for quantitative research in financial market. However, it proved to be difficult to predict the market behavior as it is a result of complex interaction among many agents with their own agenda. An effective tool to predict a change in market would be beneficial for policy makers and market participants to assist them with rational and consistent decision making. On the other hand, inconsistent prediction would lead to a suboptimal and inconsistent market activity which sometimes result in sudden collapse in the market as it did in 2008 Financial crisis and 1997 Asian financial crisis. The purpose of this dissertation is to develop approach based on econophysics and machine learning to systematically analyze the financial market.
The main focus of this dissertation involves the network structure of stock market. To predict the change in market behavior, it is critical to understand the relationship or correlation among the market participants beforehand, and complex network analysis is one of the most prominent methods for such study. The fractal theory was employed as the primary approach to analyze the network structure of financial market. The empirical study shows that the network of financial market exhibits fractal properties. Also, analysis of fractal dimension and network topology led to two key discoveries. First, the fractal dimension and the Strong effective repulsion between distinct network nodes known as the hub are closely related. Second, the fractal dimension reveals the shortcut of network structure. Through further analysis, these two properties were proved to be useful for risk management in financial market. Three fractal measures were proposed to specify network structure for ease of implementation in future studies.
In the second step, the fractal measures were implemented in a financial market to assess its ability to predict the market movement. Recently, studies were conducted to determine if a new measure or index improves the prediction accuracy for financial time series. These studies are advantageous for future studies as it proposes new indices for other implementation and further analysis rather than studying the precision of their own method. In this paper, machine learning algorithms were employed to assess the predictive properties of fractal measures. Empirical experiments were performed to predict direction of market movement, which is effectively a classification task, and prediction for returns, a regression task. The studies concludes that the fractal measure proposed were effective in prediction for long-term stock returns of more than three months period.
Finally, a model to improve trading strategy based on learning-to-rank algorithm and the fractal measures was introduced. Previous studies are often based on the modern portfolio theory(MPT), but it is insufficient for real-world application as it doesnt provide any implication for rebalancing period of portfolio. The optimal rebalancing model proposed in this study allows its application with traditional portfolio methods. The experiments were carried out in two steps. The model learns to predict the better time period to perform rebalancing between two time periods in the future, followed by the empirical simulation to apply the model in real world trading scenario. Two traditional portfolio methods, equal weighted and maximized Sharpe ratio, were taken for experiment. The result affirms that the optimal rebalancing model was able to capture the better time period of rebalancing portfolio. In addition, the model outperformed a simple rebalancing method of fixed time period. When the fractal measures were employed as an input variable, the model performance was further improved. The primary contribution achieved through this model is that it allows application and expansion into all traditional portfolio models. Also, the fractal measures observed in the network structure grants insight regarding the market behavior and empirically proved that the measure provides benefit in prediction for the real-world stock market.๊ธ์ต์์ฅ์ ๋ํ ์ฐ๊ตฌ๋ ์ ๋ฐ์ ์ธ ๊ฒฝ์ ํ๋๊ณผ ๋ฐ์ ํ ์ฐ๊ด์ฑ์ด ์๊ธฐ ๋๋ฌธ์, ๋ค์ํ ํ๊ณ์ ์ง์๋ค๊ณผ ์ฐ๊ณ๋์ด ๊ด๋ฒ์ํ๊ฒ ์ฐ๊ตฌ๋๊ณ ์๋ค. ์ ํต์ ์ธ ๊ฒฝ์ ํ ์ด๋ก ์ ๋ฐํ์ผ๋ก ์ฌ๋ฌ๊ฐ์ง ๊ฒฝ์ ์งํ๋ค์ด ๊ฐ๋ฐ๋์๊ณ , ๊ณ๋๊ฒฝ์ ํ์ ๋ฐ์ ์ผ๋ก ์ด๋ฅผ ์ ๋์ ์ผ๋ก ๋ถ์ํ๋ ์ฐ๊ตฌ๊ฐ ์งํ๋์๋ค. ํ์ง๋ง, ์๋ก ๋ค๋ฅธ ํน์ง์ ๊ฐ๋ ์์ฅ์ฐธ์ฌ์๋ค์ ํ์๋ก ์ด๋ฃจ์ด์ง ๊ธ์ต์์ฅ์ ๋ณต์กํ ํน์ฑ ๋๋ฌธ์, ๊ธฐ์กด์ ๊ฒฝ์ ํ ๊ธฐ๋ฐ์ ๋ฐฉ๋ฒ๋ก ๋ค๋ง์ผ๋ก ๊ธ์ต์์ฅ์ ๋ณํ๋ฅผ ์ ๋ฐํ๊ฒ ์์ธกํ๊ธฐ์๋ ํ๊ณ๊ฐ ์์๋ค. ๋ง์ฝ ๊ธ์ต์์ฅ์ ๋ณํ๋ฅผ ํจ์จ์ ์ผ๋ก ์์ธก ํ ์ ์๋ค๋ฉด, ๊ตญ๊ฐ ์ ์ฑ
์ด๋ ๊ธฐ์
๋ค ๋ฐ ์์ฅ ์ฐธ์ฌ์๋ค์ ํฉ๋ฆฌ์ ์ธ ์์ฌ๊ฒฐ์ ์ ํตํด์ ๊ฑด์ ํ ๊ธ์ต ํ๋์ ํ ์ ์์ ๊ฒ์ด๋ค. ๋ฐ๋ฉด์ ์ด๋ฌํ ๊ธ์ต์์ฅ์ ๋ณํ๋ฅผ ํจ์จ์ ์ผ๋ก ์์ธกํ์ง ๋ชปํด ๋น์ด์์ ์ธ ๊ธ์ต ํ๋์ด ์ง์๋๋ค๋ฉด, ์ต์
์ ๊ฒฝ์ฐ์๋ ๊ธ๋ก๋ฒ ๊ธ์ต ์๊ธฐ์ ๊ฐ์ ๋๊ท๋ชจ ์์ฅ ๋ถ๊ดด ํ์์ด ๋ฐ์ํ ์ ์์ ๊ฒ์ด์. ๋ฐ๋ผ์ ๋ณธ ํ์๋
ผ๋ฌธ์์๋ ๊ฒฝ์ ๋ฌผ๋ฆฌํ๊ณผ ๋จธ์ ๋ฌ๋์ ์ตํฉํ์ฌ ์ฒด๊ณ์ ์ผ๋ก ๊ธ์ต์์ฅ ๋ถ์์ ์งํํ๊ณ ์ ํ๋ค.
๋ณธ ํ์๋
ผ๋ฌธ์์๋ ๊ธ์ต์์ฅ์ ๋ค์ํ ์นํฐ ์ค์์ ์ฃผ์์์ฅ ๋คํธ์ํฌ ๊ตฌ์กฐ๋ฅผ ๋ถ์ํ๋๋ฐ ์ด์ ์ ๋ง์ถ๋ค. ๋ฏธ๋์ ์ฃผ์์์ฅ์ ๋ณํ๋ฅผ ์ฌ๋ฐ๋ฅด๊ฒ ์์ธกํ๊ธฐ ์ํด์๋ ์ฃผ์์์ฅ ๊ตฌ์ฑ์๋ค๊ฐ์ ๊ด๊ณ ํ์
์ด ์ ํ๋์ด์ผ ํ๋๋ฐ, ์ด์ ๋ํ์ ์ธ ๋ถ์ ๋ฐฉ๋ฒ์ด ๋ณต์ก๊ณ ๋คํธ์ํฌ ๋ถ์(Complex network analysis)์ด๊ธฐ ๋๋ฌธ์ด๋ค. ๋ณธ ์ฐ๊ตฌ์์๋ ์ฃผ์์์ฅ ๋คํธ์ํฌ ๊ตฌ์กฐ๋ฅผ ๋ถ์ํ๋ ์ฌ๋ฌ ๋ฐฉ๋ฒ๋ก ๋ค ์ค ํ๋ํ ์ด๋ก (Fractal theory)์ ๋์
์ ์ ์ํ๋ค. ์คํ ๊ฒฐ๊ณผ ์ฃผ์์์ฅ ๋คํธ์ํฌ์ ๊ตฌ์กฐ๋ ํ๋ํ ํน์ฑ์ ๊ฐ์ง์ ๋ฐํ๋๋ค. ๋ํ, ์ธก์ ๋ ํ๋ํ ์ฐจ์(Fractal dimension)๊ณผ ๋คํธ์ํฌ์ ํ ํด๋ก์ง(Topology)์์ ๊ด๊ณ๋ฅผ ์ดํด๋ณธ ๊ฒฐ๊ณผ ๋ ๊ฐ์ง ์ฃผ์ํ ์ฃผ์์์ฅ ๋คํธ์ํฌ์ ๊ตฌ์กฐ์ ์ธ ํน์ง์ ๋ฐ๊ฒฌํ ์ ์์๋ค. ์ฒซ๋ฒ์งธ๋, ํ๋ํ ์ฐจ์๊ณผ ์์ ํ๋ธ(Hub)๋ผ๊ณ ๋ถ๋ฆฌ์ฐ๋ ๋คํธ์ํฌ ์์์ ์ฐ๊ฒฐ์ด ๋ง์ด๋ ๋
ธ๋๋ค๊ฐ์ ๊ฐํ ๋ฐ๋ฐ(Strong effective repulsion) ํ์๊ณผ ์ฐ๊ด์ฑ์ด ์๋ค๋ ์ ์ด๋ค. ๋๋ฒ์งธ๋, ํ๋ํ ์ฐจ์์ผ๋ก ๋คํธ์ํฌ์ ์ง๋ฆ๊ธธ(Shortcut) ๊ตฌ์กฐ๋ฅผ ๊ด์ฐฐํ ์ ์์๋ค. ๋ํ ์ด ๋ ๊ฐ์ง ๋คํธ์ํฌ์ ๊ตฌ์กฐ์ ์ธ ํน์ฑ์ ์ฃผ์์์ฅ์ ์ํ ๊ด๋ฆฌ(Risk management) ๊ด์ ์์ ์ ์ฉํ๊ฒ ์ฐ์ผ ์ ์์์ ๋ถ์ํ๋ค. ๊ทธ๋ฆฌ๊ณ , ์ ํน์ฑ๋ค์ ๋ค๋ฅธ ์ฐ๊ตฌ๋ค์ ์ฝ๊ฒ ์ ์ฉ ๊ฐ๋ฅํ๋๋ก ๋คํธ์ํฌ ๊ตฌ์กฐ๋ฅผ ํํํ๋ 3๊ฐ์ง ํ๋ํ ์งํ(Fractal measures)๋ค์ ์ ์ํ๋ค.
๋ค์ ๋จ๊ณ๋ก ์ฃผ์์์ฅ์์ ์ธก์ ํ ํ๋ํ ์งํ๊ฐ ๋ฏธ๋์ ์ฃผ๊ฐ ์ง์์ ์์ธก๋ ฅ ํฅ์์ ๋์์ด ๋๋์ง๋ฅผ ๊ฒ์ฆํ๋ค. ์ต๊ทผ ๋ค์ํ ๋ถ์ผ์์ ์๋กญ๊ฒ ๋ฐ๊ฒฌํ ์งํ๋ค์ด ๊ธ์ต ์๊ณ์ด ๋ฐ์ดํฐ์ ๋ํ์ฌ ์์ธก๋ ฅ ํฅ์์ ๋์์ด ๋๋์ง๋ฅผ ๊ฒ์ฆํ๋ ์ฐ๊ตฌ๋ค์ด ์งํ๋๊ณ ์๋ค. ์ด๋ฌํ ์ฐ๊ตฌ๋ค์ ๋ฐ๊ฒฌํ ์งํ๋ค ๋ง์ ์ฌ์ฉํ์ฌ ์ ๋ฐํ ์์ธก์ ํ๋ ๋ชฉ์ ์ด ์๋, ๋ฐ๊ฒฌํ ์งํ๋ค์ด ์์ธก๋ ฅ ํฅ์์ ๋์์ด ๋๋ค๋ ์ ์ ๋ฐํ๋ด๋๋ฐ ์ฃผ ๋ชฉ์ ์ด ์๋ค. ์ด๋ ๊ฒ ์์ธก๋ ฅ ํฅ์์ด ์๋๊ฒ์ด ๋ฐํ์ง ์งํ๋ค์ ๋ค๋ฅธ ์ฐ๊ตฌ๋ ์ฐ์
์ ์ฝ๊ฒ ์ ์ฉํ ์ ์๋ ์ฅ์ ์ด ์๋ค. ๋ณธ ํ์๋
ผ๋ฌธ์์๋ ๋ช ๊ฐ์ง ๋จธ์ ๋ฌ๋ ์๊ณ ๋ฆฌ์ฆ์ ํ์ฉํ์ฌ ์ธก์ ํ ํ๋ํ ์งํ๊ฐ ๋ฏธ๋์ ์ฃผ๊ฐ ์ง์์ ์์ธก๋ ฅ ํฅ์์ ๋์์ด ๋๋์ง๋ฅผ ๊ฒ์ฆํ๋ค. ๊ฒ์ฆ ์คํ์ ๊ฐ์ฅ ๋จ์ํ ๋ฏธ๋ ์ฃผ๊ฐ ์ง์์ ๋ฐฉํฅ ๋ถ๋ฅ(Classification) ๋ถํฐ, ์ฃผ๊ฐ ์ง์ ์์ต๋ฅ ์ ์์ธก(Prediction) ๊น์ง ์ด๋ฃจ์ด์ง๋ค. ๊ทธ ๊ฒฐ๊ณผ ์ ์ํ ํ๋ํ ์งํ๋ค์ ์ฝ 3๊ฐ์ ์ดํ์ ์ฅ๊ธฐ ๋ฏธ๋์ ์ฃผ๊ฐ ์ง์์ ๋ํด ์ผ๊ด์ฑ์๋ ์์ธก๋ ฅ ํฅ์ ํจ๊ณผ๊ฐ ์์์ ๋ฐํ๋๋ค.
๋ง์ง๋ง์ผ๋ก ์ ์ํ ํ๋ํ ์งํ๋ค๊ณผ Learning-to-rank ์๊ณ ๋ฆฌ์ฆ์ ํ์ฉํ์ฌ ๊ธฐ์กด์ ์ฃผ์์์ฅ์ ์ฐ๊ตฌ๋์๋ ๊ฑฐ๋ ์ ๋ต(Trading strategy)์ ์ฑ๋ฅ์ ๊ฐ์ ํ ์ ์๋ ๋ชจ๋ธ์ ์ ์ํ๋ค. ๊ธฐ์กด์ ์ฃผ์์์ฅ์์ ์ฐ๊ตฌ๋ ๊ฑฐ๋ ์ ๋ต๋ค ์ค ํฐ ๋น์จ์ ์ฐจ์งํ๋ ์ฐ๊ตฌ๋ค์ ํ๋ ํฌํธํด๋ฆฌ์ค ์ด๋ก (Modern portfolio theory)์ ๊ธฐ๋ฐํ ํฌํธํด๋ฆฌ์ค ๊ตฌ์ฑ ๋ฐฉ๋ฒ์ ๋ํ ์ฐ๊ตฌ๋ค์ด๋ค. ํ์ง๋ง, ์ค์ ํฌ์์ ์ ์ฉํ๊ธฐ ์ํด์๋ ํฌํธํด๋ฆฌ์ค๋ฅผ ๊ตฌ์ฑํ๋ ๋ฐฉ๋ฒ๋ก ๋ฟ ๋ง ์๋๋ผ, ์ธ์ ํฌํธํด๋ฆฌ์ค๋ฅผ ์ฌ๊ตฌ์ฑํด์ผ ํ๋์ง๋ฅผ ํ๋จํ๋๊ฒ ๋ํ ์ค์ํ ์์ฌ๊ฒฐ์ ์์์ด๋ค. ๋ณธ ํ์๋
ผ๋ฌธ์์ ์ ์ํ๋ ๋ชจ๋ธ์ ๊ธฐ์กด์ ์ฐ๊ตฌ๋ ๋ฐฉ๋ฒ๋ก ๋ค์ ์ ์ฐํ๊ฒ ์ ๋ชฉํ์ฌ ํ์ฉํ ์ ์๋ ์ต์ ๋ฆฌ๋ฐธ๋ฐ์ฑ ์์ ํ๋จ ๋ชจ๋ธ(Optimal rebalancing model)์ด๋ค. ์คํ์ ๋๋จ๊ณ๋ก ์งํ๋๋ค. ๋จผ์ , ์ ์ํ ๋ชจ๋ธ๋ก ํ์ต ๋ฐ์ดํฐ ๋ด์ ์๋ก ๋ค๋ฅธ ๋ ์์ ์ค ๋ฏธ๋์ ๋ ๋์ ์ฑ๋ฅ์ ๋ณด์ด๋ ๋ฆฌ๋ฐธ๋ฐ์ฑ ์ง์ ์ ์์ธกํ ์ ์๋์ง๋ฅผ ํ์ตํ๋ค. ๊ทธํ์, ํ์ต๋ ๋ชจ๋ธ๋ค ์ค ์ข์ ์ฑ๋ฅ์ ๊ฐ๋ ํ๋ผ๋ฏธํฐ๋ฅผ ์ ํํ๊ณ , ์๋ฎฌ๋ ์ด์
๋ถ์์ ํตํด์ ์ค์ ๊ฑฐ๋์ ๋ต์ ์ ์ฉ ๊ฐ๋ฅ์ฑ์ ํ๊ฐํ๋ค. ์คํ์์ ์ฌ์ฉ๋ ๊ธฐ์กด์ ํฌํธํด๋ฆฌ์ค ๊ตฌ์ฑ ๋ฐฉ๋ฒ๋ก ์, ๊ด๋ จ ์ฐ๊ตฌ๋ค์์ ๊ฐ์ฅ ๋ํ์ ์ธ ๋ฒค์น๋งํฌ๋ก ํ์ฉ๋๋ ์์ฐ ๊ท ๋ฑ ๋ถ๋ฐฐ ํฌํธํด๋ฆฌ์ค ๋ฐฉ์๊ณผ ์คํ ๋น์จ(Sharpe ratio) ์ต๋ํ ํฌํธํด๋ฆฌ์ค ๋ฐฉ์์ด๋ค. ์คํ๊ฒฐ๊ณผ ๋ ๋ฐฉ์ ๋ชจ๋์์ ๋ณธ ์ฐ๊ตฌ์์ ์ ์ํ ์ต์ ๋ฆฌ๋ฐธ๋ฐ์ฑ ์์ ํ๋จ ๋ชจ๋ธ์ด ๋ ๋์ ํฌํธํด๋ฆฌ์ค ๊ตฌ์ฑ ์์ ์ ๊ตฌํด๋๋ค. ๋ํ ์๋ฎฌ๋ ์ด์
๊ฒฐ๊ณผ ์ผ์ ์ฃผ๊ธฐ๋ก ๋ฆฌ๋ฐธ๋ฐ์ฑํ๋ ๋์ผํ ํฌํธํด๋ฆฌ์ค ๊ตฌ์ฑ๋ฐฉ์๋ณด๋ค ๋ ๋์ ์ฑ๋ฅ์ ๋ณด์๋ค. ํนํ ์
๋ ฅ ๋ณ์๋ก ํ๋ํ ์งํ๋ค์ ์ถ๊ฐํ์ ๋ ๊ฐ์ฅ ์ข์ ์ฑ๋ฅ์ ๋ณด์์ ๊ด์ฐฐํ๋ค. ๋ณธ ๋ชจ๋ธ์ ์ฐ๊ตฌ๋ ๊ธฐ์กด์ ๋ชจ๋ ํฌํธํด๋ฆฌ์ค ๊ตฌ์ฑ ๋ฐฉ๋ฒ๋ก ๋ค์ ์ ์ฉํ ์ ์๋ ํ์ฅ์ฑ์ ๊ด์ ์์ ์ค์ํ ๊ธฐ์ฌ๊ฐ ์๋ค. ๊ทธ๋ฆฌ๊ณ ํ๋ํ ์งํ๋ฅผ ํตํด์ ๊ด์ฐฐ๋๋ ๋คํธ์ํฌ์ ๊ตฌ์กฐ์ ํน์ง๋ค์ด ๋ฏธ๋ ์์ฅ์ ํ๋จํ๋๋ฐ ๋์์ด ๋จ์ ๋ณด์์ผ๋ก์จ, ์ ์ํ ํ๋ํ ์งํ๋ค์ด ์ค์ ์ฃผ์์์ฅ์ ์ ์ฉ ๊ฐ๋ฅํ ์ค์ฉ์ ์ธ ํน์ฑ์ ๋ํ๋๋ ๊ฒ์ฆํ๋ค.Chapter 1 Introduction 1
1.1 Research Motivation and Purpose 1
1.2 Organization of the Research 5
Chapter 2 Literature Review 7
2.1 Complex Network 7
2.2 Market Prediction with Machine Learning 8
2.3 Trading Strategies 11
Chapter 3 Fractal Structure in Stock Market 13
3.1 Network Fractality 13
3.1.1 Threshold Network 13
3.1.2 Fractal Dimension 15
3.1.3 Fractal Measures 17
3.2 Fractal Analysis on Stock Market 21
3.2.1 Data Description 21
3.2.2 Fractality of S&P500 Network 25
3.2.3 Network Topology and Fractal Measures 27
3.3 Summary and Discussion 43
Chapter 4 Stock Market Prediction with Fractality 45
4.1 Classification of Stock Market 45
4.1.1 Classification Model 45
4.1.2 Classification Results 50
4.2 Fractal Measures and Predictive Power 55
4.2.1 Prediction of Stock Market Return 55
4.2.2 Parameter Analysis 59
4.2.3 Predictive Power Results 60
4.3 Summary and Discussion 66
Chapter 5 Trading Strategy with Optimal Rebalancing Model 69
5.1 Optimal Rebalancing Model 69
5.1.1 Portfolio Selection Method 69
5.1.2 Learning-to-rank algorithm 71
5.1.3 Proposed Modeling Method 73
5.1.4 Model Results 77
5.2 Simulation Analysis 84
5.2.1 Simulation Structure 84
5.2.2 Simulation Results 88
5.3 Summary and Discussion 101
Chapter 6 Conclusion 103
6.1 Conclusions 103
6.2 Future Works 106
Bibliography 107
๊ตญ๋ฌธ์ด๋ก 117Docto
Predictive Analytics on Emotional Data Mined from Digital Social Networks with a Focus on Financial Markets
This dissertation is a cumulative dissertation and is comprised of five articles. User-Generated Content (UGC) comprises a substantial part of communication via social media. In this dissertation, UGC that carries and facilitates the exchange of emotions is referred to as โemotional data.โ People โproduceโ emotional data, that is, they express their emotions via tweets, forum posts, blogs, and so on, or they โconsumeโ it by being influenced by expressed sentiments, feelings, opinions, and the like. Decisions often depend on shared emotions and data โ which again lead to new data because decisions may change behaviors or results. โEmotional Data Intelligenceโ ultimately seeks an answer to the question of how all the different emotions expressed in public online sources influence decision-making processes.
The overarching research topic of this dissertation follows the question whether network structures and emotional sentiment data extracted from digital social networks contain predictive information or they are just noise. Underlying data was collected from different social media sources, such as Twitter, blogs, message boards, or online news and social networking sites, such as Xing. By means of methodologies of social network analysis (SNA), sentiment analysis, and predictive analysis the individual contributions of this dissertation study whether sentiment data from social media or online social networking structures can predict real-world behaviors. The focus lies on the analysis of emotional data and network structures and its predictive power for financial markets. With the formal construction of the data analyses methodologies introduced in the individual contributions this dissertation contributes to the theories of social network analysis, sentiment analysis, and predictive analytics
Stock market movement prediction using machine learning techniques and graph-based approaches
Machine learning techniques are preferred now than the statistical methods for stock movement prediction due to their efficiency and effectiveness. Stock market movement prediction is impacted significantly by choice of input features and prediction algorithms. We focus on a specific event of ex-dividend day and use event-specific input features of cum-dividend period for predicting price movement on the ex-dividend day. Performance improves significantly when these event-specific optimum input features are used along with machine learning models.
The relative order or ranking of stocks is more important than the price or return of a single stock for better investment decisions. Stock ranking performance can be improved by incorporating the stock relationship information in the prediction task. We employ a graph-based approach for stock ranking prediction and use the stock relationship information as the input of the machine learning model.
Investing in the top-k stocks is more profitable than the others. Thus, the performance measure for stock ranking prediction should be top-weighted and bounded for any value of k. Existing evaluation measures lack these properties, and we propose normalized rank biased overlap for top-k (NRBO@k) stocks for stock ranking prediction. Moreover, we show that the list-wise loss function can significantly improve the stock ranking performance in a graph-based approach. We find that node embedding techniques such as Node2Vec can significantly reduce graph-based approachesโ training time for the stock ranking prediction.
In our survey study, we discuss the existing graph-based works from five perspectives: i) stock market graph formulation, ii) stock market graph filtering, iii) stock market graph clustering, iv) stock movement prediction, and v) portfolio optimization. This survey contains a concise description of major techniques and algorithms relevant to graph-based approaches for the stock market