128,820 research outputs found

    Time series prediction and forecasting using Deep learning Architectures

    Get PDF
    Nature brings time series data everyday and everywhere, for example, weather data, physiological signals and biomedical signals, financial and business recordings. Predicting the future observations of a collected sequence of historical observations is called time series forecasting. Forecasts are essential, considering the fact that they guide decisions in many areas of scientific, industrial and economic activity such as in meteorology, telecommunication, finance, sales and stock exchange rates. A massive amount of research has already been carried out by researchers over many years for the development of models to improve the time series forecasting accuracy. The major aim of time series modelling is to scrupulously examine the past observation of time series and to develop an appropriate model which elucidate the inherent behaviour and pattern existing in time series. The behaviour and pattern related to various time series may possess different conventions and infact requires specific countermeasures for modelling. Consequently, retaining the neural networks to predict a set of time series of mysterious domain remains particularly challenging. Time series forecasting remains an arduous problem despite the fact that there is substantial improvement in machine learning approaches. This usually happens due to some factors like, different time series may have different flattering behaviour. In real world time series data, the discriminative patterns residing in the time series are often distorted by random noise and affected by high-frequency perturbations. The major aim of this thesis is to contribute to the study and expansion of time series prediction and multistep ahead forecasting method based on deep learning algorithms. Time series forecasting using deep learning models is still in infancy as compared to other research areas for time series forecasting.Variety of time series data has been considered in this research. We explored several deep learning architectures on the sequential data, such as Deep Belief Networks (DBNs), Stacked AutoEncoders (SAEs), Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs). Moreover, we also proposed two different new methods based on muli-step ahead forecasting for time series data. The comparison with state of the art methods is also exhibited. The research work conducted in this thesis makes theoretical, methodological and empirical contributions to time series prediction and multi-step ahead forecasting by using Deep Learning Architectures

    닀차원 거래 데이터와 λ”₯λŸ¬λ‹ λͺ¨ν˜•μ„ μ΄μš©ν•œ 금육 μ‹œμž₯ λ³΅μž‘κ³„ λ„€νŠΈμ›Œν¬ 뢄석

    Get PDF
    ν•™μœ„λ…Όλ¬Έ(박사) -- μ„œμšΈλŒ€ν•™κ΅λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ 산업곡학과, 2021.8. μž₯μš°μ§„.주식은 독립적 μš”μ†Œκ°€ μ•„λ‹Œ, λ‹€λ₯Έ μžμ‚°λ“€μ„ ν¬ν•¨ν•œ λ‹€μ–‘ν•œ μš”μ†Œλ“€κ³Ό μƒν˜Έ 연관성이 μžˆλ‹€. λ”°λΌμ„œ μ£Όμ‹μ˜ μƒν˜Έ 연관성을 λΆ„μ„ν•˜λŠ” 것은 주식 μ‹œμž₯의 ꡬ쑰λ₯Ό λ¬˜μ‚¬ν•˜κ³  μ΄ν•΄ν•˜κΈ° μœ„ν•œ ν•„μˆ˜μ  μš”μ†ŒμΌ 뿐만 μ•„λ‹ˆλΌ, νˆ¬μžμ™€ μœ„ν—˜κ΄€λ¦¬μ™€ 같은 싀무적 κ΄€μ μ—μ„œλ„ μ€‘μš”ν•œ μš”μ†Œμ΄λ‹€. 주식 μ‹œμž₯은 κ°œλ³„ μš”μΈμ˜ μƒν˜Έ μž‘μš©μ„ 톡해 ν˜•μ„±λœ λ³΅μž‘κ³„ λ„€νŠΈμ›Œν¬λΌλŠ” κ΄€μ ν•˜μ—, λ§Žμ€ 연ꡬ듀이 주식 κ°„μ˜ μƒν˜Έ 연관성을 λΆ„μ„ν•˜κΈ° μœ„ν•΄ κ²½μ œλ¬Όλ¦¬ν•™μ˜ λ„€νŠΈμ›Œν¬ 뢄석을 μ‚¬μš©ν•˜κ³  μžˆλ‹€. 주식 κ°„μ˜ μƒν˜Έ 관계λ₯Ό μ •μ˜ ν•  λ•Œ, λ§Žμ€ μ—°κ΅¬μ—μ„œ μ£Όκ°€ μ‹œκ³„μ—΄μ—λ§Œ μ΄ˆμ μ„ λ‘”λ‹€. κ·ΈλŸ¬λ‚˜, 주식을 λ‚˜νƒ€λ‚΄λŠ” μ •λ³΄λŠ” μ£Όκ°€λΏλ§Œ μ•„λ‹ˆλΌ κ±°λž˜λŸ‰, 변동성, 그리고 νšŒκ³„ 정보와 같은 λ‹€μ–‘ν•œ 데이터가 μžˆλ‹€. λ”°λΌμ„œ μ£Όκ°€λ§Œμ„ μ΄μš©ν•˜μ—¬ 주식 κ°„μ˜ μƒν˜Έ 연관성을 λΆ„μ„ν•˜λŠ” 것은, 금육 μ‹œμž₯에 λŒ€ν•œ λ‹¨νŽΈμ μΈ μ‹œκ°λ§Œμ„ μ˜λ―Έν•  수 μžˆλ‹€. κ²Œλ‹€κ°€, λ”₯λŸ¬λ‹κ³Ό 같은 기술λ ₯의 λ°œμ „μ— 따라, ν•™λ¬Έκ³Ό λ°μ΄ν„°μ˜ μ’…λ₯˜μ˜ 경계λ₯Ό ν—ˆλ¬΄λŠ” ν•™μ œκ°„ μœ΅ν•©μ˜ μ€‘μš”μ„±μ΄ λŒ€λ‘λ˜κ³  μžˆλ‹€. μ΄λŸ¬ν•œ λ§₯λ½μ—μ„œ λ³Έ λ…Όλ¬Έμ˜ λͺ©μ μ€ 경제 물리학과 λ”₯λŸ¬λ‹μ˜ ν•™μ œ κ°„ μœ΅ν•©μ„ 톡해 금육 μ‹œμž₯ 뢄석에 λŒ€ν•œ μƒˆλ‘œμš΄ μ‹œκ°μ„ μ œμ‹œν•˜λŠ”λ° μžˆλ‹€. μš°λ¦¬λŠ” S\&P500 주식 μ‹œμž₯을 λ‹€λ³€λŸ‰ κ΄€μ μ—μ„œ λΆ„μ„ν•˜κΈ° μœ„ν•΄ 이차원 νžˆμŠ€ν† κ·Έλž¨(Bi-dimensional histogram)κ³Ό μ˜€ν† μΈμ½”λ”(Autoencoder)에 κΈ°λ°˜ν•œ λ”₯λŸ¬λ‹κ³Ό κ²½μ œλ¬Όλ¦¬ν•™μ΄ μœ΅ν•©λœ ν”„λ ˆμž„μ›Œν¬(Framework)λ₯Ό μ œμ•ˆν•œλ‹€. λ¨Όμ €, μš°λ¦¬λŠ” 주식을 λ‹€λ³€λŸ‰μœΌλ‘œ ν‘œν˜„ν•˜κΈ° μœ„ν•΄ 주가와 κ±°λž˜λŸ‰μ„ 톡해 κ΅¬μ„±λœ 이차원 νžˆμŠ€ν† κ·Έλž¨μ„ μ •μ˜ν•œλ‹€. 각 주식 별 이차원 νžˆμŠ€ν† κ·Έλž¨μ€ 일별 μ£Όκ°€ μ‹œκ³„μ—΄κ³Ό κ±°λž˜λŸ‰ μ‹œκ³„μ—΄λ‘œ κ΅¬μ„±λ˜λ©°, 주식을 뢄포 κ΄€μ μ—μ„œ μ§κ΄€μ μœΌλ‘œ ν‘œν˜„ν•˜λŠ”λ° λͺ©μ μ΄ μžˆλ‹€. κ·Έ λ‹€μŒ, 이차원 νžˆμŠ€ν† κ·Έλž¨μ˜ 차원을 μΆ•μ†Œν•˜κ³  μ˜λ―ΈμžˆλŠ” λ³€μˆ˜λ₯Ό μΆ”μΆœν•˜κΈ° μœ„ν•΄ μ˜€ν† μΈμ½”λ”κ°€ μ μš©λœλ‹€. μΆ”μΆœλœ λ³€μˆ˜λ₯Ό 톡해 이차원 νžˆμŠ€ν† κ·Έλž¨ κ°„μ˜ 거리 ν–‰λ ¬(Distance matrix)이 μ •μ˜λ˜κ³ , 거리행렬에 κ²½μ œλ¬Όλ¦¬ν•™ 방법둠인 PMFG(Planar Maximally Filtered Graph) μ•Œκ³ λ¦¬μ¦˜μ„ μ μš©ν•˜μ—¬ 금육 μ‹œμž₯을 λ‚˜νƒ€λ‚΄λŠ” νžˆμŠ€ν† κ·Έλž¨ λ„€νŠΈμ›Œν¬κ°€ κ΅¬μΆ•λœλ‹€. κ΅¬μΆ•λœ λ„€νŠΈμ›Œν¬λŠ” 이차원 νžˆμŠ€ν† κ·Έλž¨μ˜ 잠재 곡간(Latent space)λ₯Ό μ˜λ―Έν•˜λ©°, 금육 μ‹œμž₯의 ꡬ쑰적 νŠΉμ„±μ„ λΆ„μ„ν•˜κΈ° μœ„ν•œ λ„€νŠΈμ›Œν¬ 뢄석이 μˆ˜ν–‰λœλ‹€. μš°λ¦¬λŠ” νžˆμŠ€ν† κ·Έλž¨μ— κΈ°λ°˜ν•œ λ„€νŠΈμ›Œν¬μ˜ ꡬ쑰적 νŠΉμ„±μ΄ νžˆμŠ€ν† κ·Έλž¨μ˜ 산포도와 연관이 μžˆμŒμ„ λ°œκ²¬ν•˜μ˜€λ‹€. 이λ₯Ό 톡해 μš°λ¦¬λŠ” μ˜€ν† μΈμ½”λ”κ°€ 이차원 νžˆμŠ€ν† κ·Έλž¨μ˜ λ‚΄μž¬λœ λ³€μˆ˜λ₯Ό μΆ”μΆœν•˜λŠ”λ° νš¨κ³Όμ μž„μ„ ν™•μΈν•˜μ˜€λ‹€. λ˜ν•œ μš°λ¦¬λŠ” νžˆμŠ€ν† κ·Έλž¨ λ„€νŠΈμ›Œν¬λŠ” κΈ°μ‘΄ 방법둠인 μ£Όκ°€ μ‹œκ³„μ—΄ 기반의 λ„€νŠΈμ›Œν¬μ™€ μœ„μƒν•™μ  νŠΉμ§•μ΄ μ„œλ‘œ 상이함을 λ°œκ²¬ν–ˆλ‹€. μΆ”κ°€μ μœΌλ‘œ, νžˆμŠ€ν† κ·Έλž¨ λ„€νŠΈμ›Œν¬λŠ” κΈ°μ‘΄ 방법둠 보닀 변동성과 μ‹œμž₯ 베타(Beta)와 같은 금육 μ‹œμž₯에 λŒ€ν•œ 외생적인 νŠΉμ„±μ΄ μœ μ‚¬ν•œ 주식듀을 ꡬ쑰적으둜 κ΅°μ§‘ν™”ν•˜λŠ”λ° 더 νš¨κ³Όκ°€ μžˆλ‹€. 더 λ‚˜μ•„κ°€, μš°λ¦¬λŠ” νžˆμŠ€ν† κ·Έλž¨ λ„€νŠΈμ›Œν¬μ— κΈ°λ°˜ν•œ κ·Έλž˜ν”„ λ‰΄λŸ΄ λ„€νŠΈμ›Œν¬(Graph Neural Network) μ£Όκ°€ μΆ”μ„Έ 예츑 λͺ¨ν˜•μ„ μ œμ•ˆν•œλ‹€. μ£Όκ°€ μΆ”μ„Έ μ˜ˆμΈ‘μ€ 싀무적 μ€‘μš”μ„±μ„ λ°”νƒ•μœΌλ‘œ, κ³„λŸ‰μ  λͺ¨ν˜•λΆ€ν„° κΈ°κ³„ν•™μŠ΅ λͺ¨ν˜•κ³Ό λ”₯λŸ¬λ‹ λͺ¨ν˜•κΉŒμ§€ κ΄‘λ²”μœ„ν•˜κ²Œ μ—°κ΅¬λ˜κ³  μžˆλŠ” μ£Όμ œμ΄λ‹€. 졜근 κ·Έλž˜ν”„ λ‰΄λŸ΄ λ„€νŠΈμ›Œν¬ λͺ¨ν˜•μ€ μš”μ†Œ κ°„μ˜ μƒν˜Έ 연관성을 반영 ν•  수 μžˆλŠ” νŠΉμ§•μ„ λ°”νƒ•μœΌλ‘œ μ‹œκ³„μ—΄ 예츑 λΆ„μ•Όμ—μ„œ μ£Όλͺ© λ°›κ³  μžˆλ‹€. 졜근, μ—¬λŸ¬ 연ꡬ듀이 κ·Έλž˜ν”„ λ‰΄λŸ΄ λ„€νŠΈμ›Œν¬μ— 주식 κ°„μ˜ μƒν˜Έ 연관성을 λ°˜μ˜ν•˜μ—¬ μ£Όκ°€ μΆ”μ„Έ μ˜ˆμΈ‘μ— λŒ€ν•œ 연ꡬλ₯Ό 진행헀닀. ν•˜μ§€λ§Œ, μ΄λŸ¬ν•œ μ„ ν–‰ 연ꡬ듀은 주식 κ°„μ˜ μƒν˜Έ 연관성을 μ •λ¦½ν•˜λŠ”λ° 산업ꡬ쑰, λΉ„μ¦ˆλ‹ˆμŠ€ μ˜μ—­κ³Ό 같은 정적이고 사전 μ •μ˜κ°€ μš”κ΅¬λ˜λŠ” κ΄€κ³„λ§Œμ„ μ΄μš©ν•œλ‹€. ν•˜μ§€λ§Œ λ§Žμ€ 연ꡬ듀을 톡해 금육 μ‹œμž₯의 ꡬ쑰 및 주식 κ°„μ˜ μƒν˜Έ μ—°κ΄€μ„±μ˜ μ‹œλ³€ 속성(Time-varying property)이 μž…μ¦λ˜μ—ˆλ‹€. λ”°λΌμ„œ μš°λ¦¬λŠ” 금육 μ‹œμž₯의 μ‹œλ³€ 속성을 λͺ¨ν˜•μ— λ°˜μ˜ν•˜κ³ μž, νžˆμŠ€ν† κ·Έλž¨ λ„€νŠΈμ›Œν¬λ₯Ό 주식 κ°„ μƒν˜Έ κ΄€κ³„λ‘œ ν™œμš©ν•˜λŠ” κ·Έλž˜ν”„ λ‰΄λŸ΄ λ„€νŠΈμ›Œν¬ λͺ¨ν˜•μ„ μ œμ•ˆν•œλ‹€. μ£Όκ°€ μΆ”μ„Έ 예츑 μ‹€ν—˜μ€ 10일과 20일 λ’€μ˜ μ£Όκ°€μ˜ λ°©ν–₯ λΆ„λ₯˜(Classification) 문제둜 이루어진닀. μ‹€ν—˜ κ²°κ³Ό, μš°λ¦¬λŠ” λ³Έ ν•™μœ„ λ…Όλ¬Έμ—μ„œ μ œμ•ˆν•˜λŠ” λͺ¨ν˜•μ΄ 벀치마크(Benchmark) λŒ€λΉ„ μš°μ›”ν•œ 예츑 정확도와, F1-Scoreλ₯Ό κ°–λŠ” 것을 ν™•μΈν–ˆλ‹€. λ§ˆμ§€λ§‰μœΌλ‘œ μš°λ¦¬λŠ” μ£Όκ°€ μΆ”μ„Έ 예츑 λͺ¨ν˜•μ˜ 싀무적 ν™œμš©λ„λ₯Ό κ²€μ¦ν•˜κΈ° μœ„ν•΄ 예츑 λͺ¨ν˜•μ— κΈ°λ°˜ν•œ 투자 μ „λž΅μ„ μ œμ•ˆν•œλ‹€. μ œμ•ˆν•˜λŠ” 투자 μ „λž΅μ€ 전체 주식을 λŒ€μƒμœΌλ‘œ μΆ”μ„Έ μ˜ˆμΈ‘μ„ μˆ˜ν–‰ν–ˆμ„ λ•Œ, μƒμŠΉ ν™•λ₯ μ΄ κ°€μž₯ 높은 NN개의 주식을 투자 포트폴리였둜 κ΅¬μ„±ν•˜λŠ” 것이닀. 즉, μš°λ¦¬λŠ” νžˆμŠ€ν† κ·Έλž¨ λ„€νŠΈμ›Œν¬μ— κΈ°λ°˜ν•œ κ·Έλž˜ν”„ λ‰΄λŸ΄ λ„€νŠΈμ›Œν¬ λͺ¨ν˜•μœΌλ‘œ μΆ”μ„Έ μ˜ˆμΈ‘μ„ μˆ˜ν–‰ν•˜κ³ , 예츑 κ²°κ³Όλ₯Ό ν™œμš©ν•˜μ—¬ 투자 포트폴리였λ₯Ό κ΅¬μ„±ν•œλ‹€. 투자 μ‹€ν—˜ κ²°κ³Ό, μ œμ•ˆν•˜λŠ” 투자 μ „λž΅μ€ 전톡적 투자 μ „λž΅ λŒ€λΉ„ 샀프 μ§€μˆ˜(Sharpe ratio), μ†Œλ₯΄ν‹°λ…Έ μ§€μˆ˜(Sortino ratio)와 같은 μˆ˜μ΅μ„± μ§€ν‘œκ°€ λ›°μ–΄λ‚œ μ „λž΅μΌ 뿐만 μ•„λ‹ˆλΌ, ν•˜λ½ μΆ”μ„Έμ—μ„œ κ°€μž₯ λΉ λ₯΄κ²Œ λ²—μ–΄λ‚  수 μžˆλŠ” 투자 μ „λž΅μ΄λ‹€. μ •λ¦¬ν•˜λ©΄, λ³Έ ν•™μœ„ 논문은 κ²½μ œλ¬Όλ¦¬ν•™κ³Ό λ”₯λŸ¬λ‹μ˜ μœ΅ν•©μ— κΈ°λ°˜ν•œ 금육 μ‹œμž₯ ꡬ쑰화 방법둠 및 예츑 λͺ¨λΈμ„ μ œμ•ˆν•˜λŠ”λ° μ˜μ˜κ°€ 있으며, ꡰ집화, 예츑, 그리고 투자λ₯Ό 톡해 κ·Έ μœ μš©μ„±μ„ κ²€μ¦ν•˜μ˜€λ‹€.Stocks are correlated with a variety of factors, such as other financial assets and economic factors. Therefore, analyzing the relationship of stocks is an essential element for understanding the stock market structure and an essential element from a practical point of view, such as investment and risk management. Stock markets that fluctuate chaotically are modeled as complex networks formed through the interaction of individual factors. Many studies have used network analysis in econophysics to investigate the relationship between stocks. In defining interactions between stocks, many studies highlighted stock price time series. However, information on stocks is not only contained in the price, but also in various data such as trading volumes, volatility, and accounting variables. Therefore, the relationship between stocks using only the stock price can show a fragmentary view of the financial market. In addition, with the development of technologies such as deep learning, the boundaries between disciplines and data types are collapsing, and interdisciplinary convergence is becoming more significant. In this context, the purpose of this dissertation is to present a new perspective on financial market analysis through the interdisciplinary convergence of econophysics and deep learning. We propose a framework that combines deep learning and econophysics consisting of a bi-dimensional histogram and an autoencoder to analyze the S\&P500 stock market from a multivariate perspective. We utilize a bi-dimensional histogram to intuitively represent stock trading volume information as well as stock price information. Then, an autoencoder is applied to reduce the dimension of the bi-dimensional histogram and extract the latent vector. A distance matrix between bi-dimensional histograms is defined through the latent vectors, and a histogram network representing the financial market is constructed by applying the Planar Maximally Filtered Graph(PMFG) algorithm, an econophysics methodology, to the distance matrix. The constructed network implies the latent space of the bi-dimensional histogram, and network analysis is performed to analyze the structural properties of the financial market. We reveal that the structural properties of the histogram network are related to the dispersion of histogram, which means that the autoencoder is effective in extracting the latent vector of the bi-dimensional histogram. Additionally, the histogram network effectively clusters stocks with similar exogenous characteristics to financial markets, such as volatility and market beta. Furthermore, we propose a stock price prediction model based on a Graph Neural Network(GNN) and histogram network. Based on its practical importance, stock price trend prediction is a research topic studied extensively, from econometric models to machine learning and deep learning. Recently, GNN models are in the spotlight in time series prediction based on their properties that can reflect the relationship between elements. However, previous studies use only static and predefined relationships such as industrial structures to establish the relationship between stocks. However, many studies have demonstrated the time-varying property of the financial market. Therefore, we propose a graph neural network model that uses the histogram network as a relationship between stocks to reflect the time-varying properties of the financial market. The trend prediction experiment is performed with a classification problem of the stock price after 10 and 20 days. As a result, we confirmed that the proposed model has superior prediction accuracy and F1-Score compared to the benchmark. Finally, we propose an investment strategy based on the predictive model to verify the practical utility of the proposed model. The proposed investment strategy is to construct an investment portfolio of NN stocks that are most likely to rise in price in the future. In other words, we perform trend prediction with a graph neural network model based on a histogram network and construct an investment portfolio using the prediction results. Our results show that the proposed investment strategy is outstanding in profitability, including the Sharpe ratio and Sortino ratio and the cumulative return. In addition, our proposed investment strategy can recover from a downtrend faster than traditional investment strategies based on Modern Portfolio Theory. In summary, the novelty of the dissertation is that we construct a stock market PMFG network and build a GNN model based on the interdisciplinary convergence of econophysics and deep learning. The usefulness of the dissertation is verified through clustering, prediction, and investment experiments.Chapter 1 Introduction 1 1.1 Research Motivation and Purpose 1 1.2 Organization of the Research 8 Chapter 2 Literature Review 11 2.1 Stock market network 11 2.2 Autoencoder in Stock market 12 2.3 Predictive model in Stock market 13 Chapter 3 Bi-dimensional Histogram Network 17 3.1 Architecture of Bi-dimensional Histogram Network 17 3.1.1 Bi-dimensional Histogram 17 3.1.2 Auto-encoder 21 3.1.3 Stock market PMFG network 25 3.2 Data description 28 3.3 Experimental Results 29 3.3.1 Properties of bi-dimensional histogram 29 3.3.2 Dimensional reduction result of bi-dimensional histogram 34 3.3.3 Properties of bi-dimensional histogram network 36 3.3.4 Comparison of histogram network with price network 40 3.4 Summary and Discussion 47 Chapter 4 A hybrid graph neural network for time series trend prediction 49 4.1 Proposed Model 49 4.2 Benchmark Algorithms 58 4.3 Data Description 63 4.4 Experimental Procedure 64 4.5 Experimental Result 68 4.5.1 Properties of High-Low volatility PMFG 68 4.5.2 Prediction Performance 74 4.5.3 Robustness Test for Prediction Performance 79 4.6 Summary and Discussion 82 Chapter 5 Investment Strategy with Trend Prediction Model 85 5.1 Proposed Strategy 85 5.2 Benchmark Algorithms 87 5.3 Experiment Procedure 91 5.4 Experimental Result 94 5.5 Summary and Discussion 110 Chapter 6 Conclusion 113 6.1 Conclusions 113 6.2 Future Works 117 Appendix 119 Bibliography 129 ꡭ문초둝 145λ°•

    DeepLOB: Deep Convolutional Neural Networks for Limit Order Books

    Full text link
    We develop a large-scale deep learning model to predict price movements from limit order book (LOB) data of cash equities. The architecture utilises convolutional filters to capture the spatial structure of the limit order books as well as LSTM modules to capture longer time dependencies. The proposed network outperforms all existing state-of-the-art algorithms on the benchmark LOB dataset [1]. In a more realistic setting, we test our model by using one year market quotes from the London Stock Exchange and the model delivers a remarkably stable out-of-sample prediction accuracy for a variety of instruments. Importantly, our model translates well to instruments which were not part of the training set, indicating the model's ability to extract universal features. In order to better understand these features and to go beyond a "black box" model, we perform a sensitivity analysis to understand the rationale behind the model predictions and reveal the components of LOBs that are most relevant. The ability to extract robust features which translate well to other instruments is an important property of our model which has many other applications.Comment: 12 pages, 9 figure

    Enhanced news sentiment analysis using deep learning methods

    Get PDF
    We explore the predictive power of historical news sentiments based on financial market performance to forecast financial news sentiments. We define news sentiments based on stock price returns averaged over one minute right after a news article has been released. If the stock price exhibits positive (negative) return, we classify the news article released just prior to the observed stock return as positive (negative). We use Wikipedia and Gigaword five corpus articles from 2014 and we apply the global vectors for word representation method to this corpus to create word vectors to use as inputs into the deep learning TensorFlow network. We analyze high-frequency (intraday) Thompson Reuters News Archive as well as the high-frequency price tick history of the Dow Jones Industrial Average (DJIA 30) Index individual stocks for the period between 1/1/2003 and 12/30/2013. We apply a combination of deep learning methodologies of recurrent neural network with long short-term memory units to train the Thompson Reuters News Archive Data from 2003 to 2012, and we test the forecasting power of our method on 2013 News Archive data. We find that the forecasting accuracy of our methodology improves when we switch from random selection of positive and negative news to selecting the news with highest positive scores as positive news and news with highest negative scores as negative news to create our training data set.Published versio
    • …
    corecore