1,969 research outputs found

    Forecasting of electricity prices in the Spanish electricity market using machine learning tools

    Get PDF
    The objective of this research assignment was to forecast electricity prices in the Spanish electricity market using three different machine learning techniques: k-nearest neighbours, support vector regression and artificial neural networks. The achieved results were compared and the quality of developed models was evaluated. The project was implemented in Python3.Incomin

    Essays on Financial Applications of Nonlinear Models

    Get PDF
    In this thesis, we examine the relationship between news and the stock market. Further, we explore methods and build new nonlinear models for forecasting stock price movement and portfolio optimization based on past stock prices and on one type of big data, news items, which are obtained through the RavenPack News Analytics Global Equities editions. The thesis consists of three essays. In Essay 1, we investigate the relationship between news items and stock prices using the artificial neural network (ANN) model. First, we use Granger causality to ascertain how news items affect stock prices. The results show that news volume is not the Granger cause of stock price change; rather, news sentiment is. Second, we test the semi–strong form efficient market hypothesis, whereas most existing research testing efficient market hypothesis focuses on the weak–form version. Our ANN strategies consistently outperform the passive buy–and–hold strategy and this finding is apparently at odds with the notion of the efficient market hypothesis. Finally, using news sentiment analytics from RavenPack Dow Jones News Analytics, we show positive profitability with out–of–sample prediction using the proposed ANN strategies for Google Inc. (NASDAQ: GOOG). In Essay 2, we expand the utility of the information from news volume and news sentiments to encompass portfolio diversification. For the Dow Jones Industrial Average (DJIA) components, we assign different weights to build portfolios according to their weekly news volumes or news sentiments. Our results show that news volume contributes to portfolio variance both in–sample and out–of–sample: positive news sentiment contributes to the portfolio return in–sample, while negative contributes to the portfolio return out–of–sample, which is a consequence of investors overreacting to the news sentiment. Further, we propose a novel approach to portfolio diversification using the k–Nearest Neighbors (kNN) algorithm based on the idea that news sentiment correlates with stock returns. Out–of–sample results indicate that such strategy dominates the benchmark DJIA index portfolio. In Essay 3, we propose a new model called the Combined Markov and Hidden Markov Model (CMHMM), in which observation is affected by a Markov model and an HMM (Hidden Markov Model) model. The three fundamental questions of the CMHMM are discussed. Further, the application of the CMHMM, in which the news sentiment is one observation and the stock return is the other, is discussed. The empirical results of the trading strategy based on the CMHMM show the potential applications of the proposed model in finance. This thesis contributes to the literature in a number of ways. First, it extends the literature on financial applications of nonlinear models. We explore the applications of the ANNs and kNN in the financial market. Besides, the proposed new CMHMM model adheres to the nature of the stock market and has better potential prediction ability. Second, the empirical results from this dissertation contribute to the understanding of the relationship between news and the stock market. For instance, our research found that news volume contributes to the portfolio return and that investors overreact to news sentiment—a phenomenon that has been discussed by other scholars from different angles

    Machine learning techniques for predicting the stock market using daily market variables

    Get PDF
    Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business IntelligencePredicting the stock market was never seen as an easy task. The complexity of the financial systems makes it extremely difficult for anything or anyone to predict what the future of prices holds, let it be a day, a week, a month or even a year. Many variables influence the market’s volatility and some of these may even be the gut feeling of an investor on a specific day. Several machine learning techniques were already applied to forecast multiple stock market indexes, some presenting good values of accuracy when it comes to predict whether the prices will go up or down, and low values of error when dealing with regression data. This work aims to apply some state-of-the-art algorithms and compare their performance with Long Short-term Memory (LSTM) as well as between each other. The variables used to this empirical work were the prices of the Dow Jones Industrial Average (DJIA) registered for every business day, from January 1st of 2006 to January 1st of 2018, for 29 companies. Some changes and adjustments were made to the original variables to present different data types to the algorithms. To ensure good quality and certainty when evaluating the flexibility and stability of each model, the error measure used was the Root Mean Squared Error and the Mann-Whitney U test was also applied to assess statistical significance of the results obtained.Prever a bolsa nunca foi considerado ser uma tarefa fácil. A complexidade dos sistemas financeiros torna extremamente difícil que um ser humano ou uma máquina consigam prever o que o futuro dos preços reserva, seja para um dia, uma semana, um mês ou um ano. Muitas variáveis influenciam a volatilidade do mercado e algumas podem até ser a confiança de um investidor em apostar em determinada empresa, naquele dia específico. Várias técnicas de aprendizagem automática foram aplicadas ao longo do tempo para prever vários índices de bolsas, algumas apresentando bons valores de precisão quando se tratou de prever se os preços subiam ou desciam e outras, baixos valores de erro ao lidar com dados de regressão. Este trabalho tem como objetivo aplicar alguns dos mais conhecidos algoritmos e comparar os seus desempenhos com o Long Short-Term Memory (LSTM), e entre si. As variáveis utilizadas para a elaboração deste trabalho empírico foram os preços da Dow Jones Industrial Average (DJIA) registados para todos os dias úteis, de 1 de Janeiro de 2006 a 1 de Janeiro de 2018, para 29 empresas. Algumas alterações e ajustes foram efetuados sobre as variáveis originais de forma a construír diferentes tipos de dados para posteriormente dar aos algoritmos. Para garantir boa qualidade e veracidade ao avaliar a flexibilidade e estabilidade de cada modelo, a medida de erro utilizada foi o erro médio quadrático da raíz e, de seguida, o teste U de Mann-Whitney foi aplicado para avaliar a significância estatística dos resultados obtidos

    Using machine learning to forecast long-term equity price movement : an empirical study of the Finnish financial markets

    Get PDF
    Predicting equity price movement is one of the fundamental challenges in finance, and even small improvements in prediction performance can be highly profitable for investors. Long-term investment is one of the popular investment strategies that investors follow. However, evaluating which companies are going to perform well in the future is difficult. This research presents machine learning aided approach to forecast long-term price movement of the stocks listed on the Helsinki Stock Exchange. The purpose of the research is to find out which machine learning model performs the best in the Finnish financial markets and to understand what the key variables are, which have a major effect on the prediction accuracy of the models. The research is also testing whether the macroeconomic variables of Finland increase the accuracy of the machine learning models when forecasting long-term equity price movement. The following machine learning models are used in the research: logistic regression, support vector machine, decision tree, random forest, and k-nearest neighbors. This research produced a number of key findings: the results from the models indicated that the best performance was achieved by the random forest model, which obtained classification accuracy of 65.3% and F1 score of 60.8%; the random forest model is able to give over 60% chance for an investor to pick a stock, which will have a 10% or higher return over the period of one year; the macroeconomic variables increased the prediction performance of every machine learning model used in the research. The main conclusions drawn from this research are that the macroeconomic variables can provide new information, which is not explained by only using financial ratios in the models. Also, the equity prices in the Finnish financial markets are not equally random, meaning that they do not always follow a random walk process. Therefore, this research argues that the Finnish financial market is not highly efficient, thus stock prices are on some level predictable. These findings contribute to the financial theory of market efficiency

    Predicting stock return and volatility with machine learning and econometric models: A comparative case study of the Baltic stock market

    Get PDF
    For stock market predictions, the essence of the problem is usually predicting the magnitude and direction of the stock price movement as accurately as possible. There are different approaches (e.g., econometrics and machine learning) for predicting stock returns. However, it is non-trivial to find an approach which works the best. In this paper, we make a thorough analysis of the predictive accuracy of different machine learning and econometric approaches for predicting the returns and volatilities on the OMX Baltic Benchmark price index, which is a relatively less researched stock market. Our results show that the machine learning methods, namely the support vector regression and k-nearest neighbours, predict the returns better than autoregressive moving average models for most of the metrics, while for the other approaches, the results were not conclusive. Our analysis also highlighted that training and testing sample size plays an important role on the outcome of machine learning approaches

    Techniques for Stock Market Prediction: A Review

    Get PDF
    Stock market forecasting has long been viewed as a vital real-life topic in economics world. There are many challenges in stock market prediction systems such as the Efficient Market Hypothesis (EMH), Nonlinearity, complex, diverse datasets, and parameter optimization. A stock's value on the stock market fluctuates due to many factors like previous trends of the stock, the current news, twitter feeds, any online customer feedbacks etc. In this paper, the literature is critically analysed on approaches used for stock market prediction in terms of stock datasets, features used, evaluation metrics used, statistical, machine learning and deep learning techniques along with the directions for the future. The focus of this review is on trend and value prediction for stocks. Overall, 68 research papers have been considered for review from years 1998-2023. From the review, Indian stock market datasets are found to be most frequently used datasets. Evaluation metrics used commonly are accuracy and Mean Absolute Percentage Error. ARIMA is reported as the most used frequently statistical technique for stick market prediction. Long-Short Term Memory and Support Vector Machine are the commonly used algorithms in stock market prediction. The advantages and disadvantages of frequently used evaluation metrics, machine learning, deep learning and statistical approaches are also included in this survey

    Forecasting movements of health-care stock prices based on different categories of news articles using multiple kernel learning

    Get PDF
    —The market state changes when a new piece of information arrives. It affects decisions made by investors and is considered to be an important data source that can be used for financial forecasting. Recently information derived from news articles has become a part of financial predictive systems. The usage of news articles and their forecasting potential have been extensively researched. However, so far no attempts have been made to utilise different categories of news articles simultaneously. This paper studies how the concurrent, and appropriately weighted, usage of news articles, having different degrees of relevance to the target stock, can improve the performance of financial forecasting and support the decision-making process of investors and traders. Stock price movements are predicted using the multiple kernel learning technique which integrates information extracted from multiple news categories while separate kernels are utilised to analyse each category. News articles are partitioned according to their relevance to the target stock, its sub industry, industry, group industry and sector. The experiments are run on stocks from the Health Care sector and show that increasing the number of relevant news categories used as data sources for financial forecasting improves the performance of the predictive system in comparison with approaches based on a lower number of categories
    corecore