3,076 research outputs found
Technical and Fundamental Features Analysis for Stock Market Prediction with Data Mining Methods
Predicting stock prices is an essential objective in the financial world. Forecasting stock returns and their risk represents one of the most critical concerns of market decision makers. This thesis investigates the stock price forecasting with three approaches from the data mining concept and shows how different elements in the stock price can help to enhance the accuracy of our prediction. For this reason, the first and second approaches capture many fundamental indicators from the stocks and implement them as explanatory variables to do stock price classification and forecasting. In the third approach, technical features from the candlestick representation of the share prices are extracted and used to enhance the accuracy of the forecasting. In each approach, different tools and techniques from data mining and machine learning are employed to justify why the forecasting is working.
Furthermore, since the idea is to evaluate the potential of features in the stock trend forecasting, therefore we diversify our experiments using both technical and fundamental features. Therefore, in the first approach, a three-stage methodology is developed while in the first step, a comprehensive investigation of all possible features which can be effective on stocks risk and return are identified. Then, in the next stage, risk and return are predicted by applying data mining techniques for the given features. Finally, we develop a hybrid algorithm, based on some filters and function-based clustering; and re-predicted the risk and return of stocks.
In the second approach, instead of using single classifiers, a fusion model is proposed based on the use of multiple diverse base classifiers that operate on a common input and a meta-classifier that learns from base classifiers’ outputs to obtain a more precise stock return and risk predictions. A set of diversity methods, including Bagging, Boosting, and AdaBoost, is applied to create diversity in classifier combinations. Moreover, the number and procedure for selecting base classifiers for fusion schemes are determined using a methodology based on dataset clustering and candidate classifiers’ accuracy.
Finally, in the third approach, a novel forecasting model for stock markets based on the wrapper ANFIS (Adaptive Neural Fuzzy Inference System) – ICA (Imperialist Competitive Algorithm) and technical analysis of Japanese Candlestick is presented. Two approaches of Raw-based and Signal-based are devised to extract the model’s input variables and buy and sell signals are considered as output variables.
To illustrate the methodologies, for the first and second approaches, Tehran Stock Exchange (TSE) data for the period from 2002 to 2012 are applied, while for the third approach, we used General Motors and Dow Jones indexes.Predicting stock prices is an essential objective in the financial world. Forecasting stock returns and their risk represents one of the most critical concerns of market decision makers. This thesis investigates the stock price forecasting with three approaches from the data mining concept and shows how different elements in the stock price can help to enhance the accuracy of our prediction. For this reason, the first and second approaches capture many fundamental indicators from the stocks and implement them as explanatory variables to do stock price classification and forecasting. In the third approach, technical features from the candlestick representation of the share prices are extracted and used to enhance the accuracy of the forecasting. In each approach, different tools and techniques from data mining and machine learning are employed to justify why the forecasting is working.
Furthermore, since the idea is to evaluate the potential of features in the stock trend forecasting, therefore we diversify our experiments using both technical and fundamental features. Therefore, in the first approach, a three-stage methodology is developed while in the first step, a comprehensive investigation of all possible features which can be effective on stocks risk and return are identified. Then, in the next stage, risk and return are predicted by applying data mining techniques for the given features. Finally, we develop a hybrid algorithm, based on some filters and function-based clustering; and re-predicted the risk and return of stocks.
In the second approach, instead of using single classifiers, a fusion model is proposed based on the use of multiple diverse base classifiers that operate on a common input and a meta-classifier that learns from base classifiers’ outputs to obtain a more precise stock return and risk predictions. A set of diversity methods, including Bagging, Boosting, and AdaBoost, is applied to create diversity in classifier combinations. Moreover, the number and procedure for selecting base classifiers for fusion schemes are determined using a methodology based on dataset clustering and candidate classifiers’ accuracy.
Finally, in the third approach, a novel forecasting model for stock markets based on the wrapper ANFIS (Adaptive Neural Fuzzy Inference System) – ICA (Imperialist Competitive Algorithm) and technical analysis of Japanese Candlestick is presented. Two approaches of Raw-based and Signal-based are devised to extract the model’s input variables and buy and sell signals are considered as output variables.
To illustrate the methodologies, for the first and second approaches, Tehran Stock Exchange (TSE) data for the period from 2002 to 2012 are applied, while for the third approach, we used General Motors and Dow Jones indexes.154 - Katedra financívyhově
Financial crises and bank failures: a review of prediction methods
In this article we analyze financial and economic circumstances associated with the U.S. subprime mortgage crisis and the global financial turmoil that has led to severe crises in many countries. We suggest that the level of cross-border holdings of long-term securities between the United States and the rest of the world may indicate a direct link between the turmoil in the securitized market originated in the United States and that in other countries. We provide a summary of empirical results obtained in several Economics and Operations Research papers that attempt to explain, predict, or suggest remedies for financial crises or banking defaults; we also extensively outline the methodologies used in them. The intent of this article is to promote future empirical research for preventing financial crises.Subprime mortgage ; Financial crises
Financial crises and bank failures: a review of prediction methods
In this article we provide a summary of empirical results obtained in several economics and operations research papers that attempt to explain, predict, or suggest remedies for financial crises or banking defaults, as well as outlines of the methodologies used. We analyze financial and economic circumstances associated with the US subprime mortgage crisis and the global financial turmoil that has led to severe crises in many countries. The intent of the article is to promote future empirical research that might help to prevent bank failures and financial crises.financial crises; banking failures; operations research; early warning methods; leading indicators; subprime markets
Prediction of Banks Financial Distress
In this research we conduct a comprehensive review on the existing literature of
prediction techniques that have been used to assist on prediction of the bank distress.
We categorized the review results on the groups depending on the prediction techniques method,
our categorization started by firstly using time factors of the founded literature, so we mark the
literature founded in the period (1990-2010) as history of prediction techniques, and after this
period until 2013 as recent prediction techniques and then presented the strengths and
weaknesses of both. We came out by the fact that there was no specific type fit with all bank
distress issue although we found that intelligent hybrid techniques considered the most
candidates methods in term of accuracy and reputatio
Using Memory-Based Reasoning For Predicting Default Rates On Consumer Loans
In recent years, financial institutions have struggled with high default rates for consumer lending. An ability to reliably predict the probability of consumer loan defaults would have a significant impact of the profitability of that lending for these institutions. In response to this need, the financial institutions have employed loan analysis techniques such as logistic regression, discriminant analysis, and various machine learning techniques to improve the accuracy of detecting loan defaults. The objective of these techniques is to more precisely identify creditworthy applicants who are granted credit, thereby increasing profits, from non-creditworthy applicants who would be then denied credit, thus decreasing losses. The objective of this article is to employ an emergent data analysis technique, memory-based or case-based reasoning method, to this problem to test its accuracy in discriminating between good and bad loans. This paper examines historical data from consumer loans issued by a financial institution to individuals that the financial institution considered to be qualified customers. The data set consists of the financial attributes of each customer and includes a mixture of loans that the customers paid off or defaulted upon. The paper then compares the performance of this technique to other data mining techniques proposed in earlier works and analyzes the risk of default inherent in each loan for each technique
The influence of the smoothing component on the quality of algebraic forecasts
Short term time series forecasting model with different internal smoothing techniques is presented in this paper. Computational experiments with real world time series are used to demonstrate the influence of different smoothing techniques in fitness. Algebraic forecasting results with any internal smoothing model outperformed results of the algebraic forecasting without smoothing
Recommended from our members
How Transparent Are Central Banks?
Central bank transparency has become the topic of a lively public and academic debate on monetary policy. Unfortunately, it has been complicated by the fact that transparency is a qualitative concept that is hard to measure. This paper proposes a comprehensive index for central bank transparency that comprises the political, economic, procedural, policy and operational aspects of central banking. The index is compiled for nine major central banks. It is based on a detailed analysis of actual information disclosure and reveals a rich variety in the degree and dynamics of central bank transparency
Multiple proportion case-basing driven CBRE and its application in the evaluation of possible failure of firms
Case-based reasoning (CBR) is a unique tool for the evaluation of possible failure of firms (EOPFOF) for its eases of interpretation and implementation. Ensemble computing, a variation of group decision in society, provides a potential means of improving predictive performance of CBR-based EOPFOF. This research aims to integrate bagging and proportion case-basing with CBR to generate a method of proportion bagging CBR for EOPFOF. Diverse multiple case bases are first produced by multiple case-basing, in which a volume parameter is introduced to control the size of each case base. Then, the classic case retrieval algorithm is implemented to generate diverse member CBR predictors. Majority voting, the most frequently used mechanism in ensemble computing, is finally used to aggregate outputs of member CBR predictors in order to produce final prediction of the CBR ensemble. In an empirical experiment, we statistically validated the results of the CBR ensemble from multiple case bases by comparing them with those of multivariate discriminant analysis, logistic regression, classic CBR, the best member CBR predictor and bagging CBR ensemble. The results from Chinese EOPFOF prior to 3 years indicate that the new CBR ensemble, which significantly improved CBRs predictive ability, outperformed all the comparative methods
Risk prediction of product-harm events using rough sets and multiple classifier fusion:an experimental study of listed companies in China
With the increasing of frequency and destructiveness of product-harm events, study on enterprise crisis management becomes essentially important, but little literature thoroughly explores the risk prediction method of product-harm event. In this study, an initial index system for risk prediction was built based on the analysis of the key drivers of the product-harm event's evolution; ultimately, nine risk-forecasting indexes were obtained using rough set attribute reduction. With the four indexes of cumulative abnormal returns as the input, fuzzy clustering was used to classify the risk level of a product-harm event into four grades. In order to control the uncertainty and instability of single classifiers in risk prediction, multiple classifier fusion was introduced and combined with self-organising data mining (SODM). Further, an SODM-based multiple classifier fusion (SB-MCF) model was presented for the risk prediction related to a product-harm event. The experimental results based on 165 Chinese listed companies indicated that the SB-MCF model improved the average predictive accuracy and reduced variation degree simultaneously. The statistical analysis demonstrated that the SB-MCF model significantly outperformed six widely used single classification models (e.g. neural networks, support vector machine, and case-based reasoning) and other six commonly used multiple classifier fusion methods (e.g. majority voting, Bayesian method, and genetic algorithm)
Recommended from our members
Predicting business failure using artificial intelligence system
This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University LondonPredicting business insolvency is considered one of the main supportive sources of information
for decision making for financial institutions, investors, creditors, and other participants in the
business market. Financial reporting systems provide relevant information that can be used to
assess the financial position of firms. It is crucial to have classification and prediction models
that can analyse this financial information and provide accurate assurance for users about
business health. Recent studies have explored the use of machine learning tools as substitute
for traditional statistical methods to develop classification models to classify firm insolvency
according to financial statement information. However, these models have no ideal classifier,
since each provides a certain percentage of wrong outputs, which is a crucial consideration;
every percentage of wrong response can mean massive financial losses for stakeholders.
Therefore, this study proposes new insolvency classification and perdition models based on
machine learning modelling techniques to develop an improved classifier.
Individual modelling techniques using statistical methods and machine learning were used to
develop the classification model of business insolvency. The results showed that machine
learning method outperformed statistical methods. Deep Learning (DPL) achieved the highest
performance based on all performance measurements used in the study, and it was the best
individual classifier, with average accuracy of 97.2% using all-years dataset. Ensemble-
Boosted Decision Tree classifier ranked second, followed by Decision Tree classifier. Thus, it
has been proven that DPL modelling approach is useful for business insolvency classification.
A key contribution in enhancing individual classifier outputs is the use of traditional combining
methods with two new aggregation methods in business insolvency (Fuzzy Logic and
Consensus Approach). The Consensus Approach showed the best improvement in the results
of all individual classifiers with average accuracy of 97.7%, and it is considered the best
classification method not only in comparison with individual classifiers, but also with
traditional combiners.
This study pioneers the development of a time series business insolvency prediction model
with Big Data for UK businesses. The aim of the model is to provide early prediction about a
business health. Three prediction models were developed based on Nonlinear Autoregressive
with Exogenous Input models (NARX), Nonlinear Autoregressive Neural Network (NAR),
and Deep Learning Time-series model (DPL-SA) and achieved average accuracy rates of
83.6%, 89.5%, and 91.35%, respectively. The results show relatively high performance in
comparison with the best individual classifier (deep learning)
- …