132 research outputs found

    Sentiment Analysis Using Deep Learning: A Comparison Between Chinese And English

    Get PDF
    With the increasing popularity of opinion-rich resources, opinion mining and sentiment analysis has received increasing attention. Sentiment analysis is one of the most effective ways to find the opinion of authors. By mining what people think, sentiment analysis can provide the basis for decision making. Most of the objects of analysis are text data, such as Facebook status and movie reviews. Despite many sentiment classification models having good performance on English corpora, they are not good at Chinese or other languages. Traditional sentiment approaches impose many restrictions on the raw data, and they don't have enough capacity to deal with long-distance sequential dependencies. So, we propose a model based on recurrent neural network model using a context vector space model. Chinese information entropy is typically higher than English, we therefore hypothesise that context vector space model can be used to improve the accuracy of sentiment analysis. Our algorithm represents each complex input by a dense vector trained to translate sequence data to another sequence, like the translation of English and French. Then we build a recurrent neural network with the Long-Short-Term Memory model to deal the long-distance dependencies in input data, such as movie review. The results show that our approach has promise but still has a lot of room for improvement

    Sentiment Analysis Based on Deep Learning: A Comparative Study

    Full text link
    The study of public opinion can provide us with valuable information. The analysis of sentiment on social networks, such as Twitter or Facebook, has become a powerful means of learning about the users' opinions and has a wide range of applications. However, the efficiency and accuracy of sentiment analysis is being hindered by the challenges encountered in natural language processing (NLP). In recent years, it has been demonstrated that deep learning models are a promising solution to the challenges of NLP. This paper reviews the latest studies that have employed deep learning to solve sentiment analysis problems, such as sentiment polarity. Models using term frequency-inverse document frequency (TF-IDF) and word embedding have been applied to a series of datasets. Finally, a comparative study has been conducted on the experimental results obtained for the different models and input feature

    The impact of news narrative on the economy and financial markets

    Get PDF
    This thesis investigates the impact of news narrative on socio-economic systems across four experiments. Recent years have witnessed a rise in the use of so-called alternative data sources to model and predict dynamics in socio-economic systems. Notably, sources such as newspaper text allow researchers to quantify the elusive concept of narrative, to incorporate text-based features into forecasting frameworks and thus to evaluate the impact of narrative on economic events. The first experiment proposes a new method of incorporating a wide array of sentiment scores from global newspaper articles into macroeconomic forecasts, attempting to forecast industrial production and consumer prices leveraging narrative and sentiment from global newspapers. I model industrial production and consumer prices across a diverse range of economies using an autoregressive framework. The second experiment uses narrative from global newspapers to construct themebased knowledge graphs about world events, demonstrating that features extracted from such graphs improve forecasts of industrial production in three large economies. The third experiment proposes a novel method of including news themes and their associated sentiment into predictions of changes in breakeven inflation rates (BEIR) for eight diverse economies with mature fixed income markets. I utilise five types of machine learning algorithms incorporating narrative-based features for each economy. In the above experiments, models incorporating narrative-based features generally outperform their benchmarks that do not contain such variables, demonstrating the predictive power of features derived from news narrative. The fourth experiment utilises GDELT data and the filtering methodology introduced in the first experiment to create a profitable systematic trading strategy based on the average tone scores for 15 diverse economies

    The Stock Exchange Prediction using Machine Learning Techniques: A Comprehensive and Systematic Literature Review

    Get PDF
    This literature review identifies and analyzes research topic trends, types of data sets, learning algorithm, methods improvements, and frameworks used in stock exchange prediction. A total of 81 studies were investigated, which were published regarding stock predictions in the period January 2015 to June 2020 which took into account the inclusion and exclusion criteria. The literature review methodology is carried out in three major phases: review planning, implementation, and report preparation, in nine steps from defining systematic review requirements to presentation of results. Estimation or regression, clustering, association, classification, and preprocessing analysis of data sets are the five main focuses revealed in the main study of stock prediction research. The classification method gets a share of 35.80% from related studies, the estimation method is 56.79%, data analytics is 4.94%, the rest is clustering and association is 1.23%. Furthermore, the use of the technical indicator data set is 74.07%, the rest are combinations of datasets. To develop a stock prediction model 48 different methods have been applied, 9 of the most widely applied methods were identified. The best method in terms of accuracy and also small error rate such as SVM, DNN, CNN, RNN, LSTM, bagging ensembles such as RF, boosting ensembles such as XGBoost, ensemble majority vote and the meta-learner approach is ensemble Stacking. Several techniques are proposed to improve prediction accuracy by combining several methods, using boosting algorithms, adding feature selection and using parameter and hyper-parameter optimization

    Techniques for Stock Market Prediction: A Review

    Get PDF
    Stock market forecasting has long been viewed as a vital real-life topic in economics world. There are many challenges in stock market prediction systems such as the Efficient Market Hypothesis (EMH), Nonlinearity, complex, diverse datasets, and parameter optimization. A stock's value on the stock market fluctuates due to many factors like previous trends of the stock, the current news, twitter feeds, any online customer feedbacks etc. In this paper, the literature is critically analysed on approaches used for stock market prediction in terms of stock datasets, features used, evaluation metrics used, statistical, machine learning and deep learning techniques along with the directions for the future. The focus of this review is on trend and value prediction for stocks. Overall, 68 research papers have been considered for review from years 1998-2023. From the review, Indian stock market datasets are found to be most frequently used datasets. Evaluation metrics used commonly are accuracy and Mean Absolute Percentage Error. ARIMA is reported as the most used frequently statistical technique for stick market prediction. Long-Short Term Memory and Support Vector Machine are the commonly used algorithms in stock market prediction. The advantages and disadvantages of frequently used evaluation metrics, machine learning, deep learning and statistical approaches are also included in this survey

    A State-of-the-Art Review of Time Series Forecasting Using Deep Learning Approaches

    Get PDF
    Time series forecasting has recently emerged as a crucial study area with a wide spectrum of real-world applications. The complexity of data processing originates from the amount of data processed in the digital world. Despite a long history of successful time-series research using classic statistical methodologies, there are some limits in dealing with an enormous amount of data and non-linearity. Deep learning techniques effectually handle the complicated nature of time series data. The effective analysis of deep learning approaches like Artificial Neural Networks (ANN), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Long short-term memory (LSTM), Gated Recurrent Unit (GRU), Autoencoders, and other techniques like attention mechanism, transfer learning, and dimensionality reduction are discussed with their merits and limitations. The performance evaluation metrics used to validate the model's accuracy are discussed. This paper reviews various time series applications using deep learning approaches with their benefits, challenges, and opportunities

    Social media cross-source and cross-domain sentiment classification

    Get PDF
    Due to the expansion of Internet and Web 2.0 phenomenon, there is a growing interest in the sentiment analysis of freely opinionated text. In this paper, we propose a novel cross-source cross-domain sentiment classification, in which cross-domain labeled Web sources (Amazon and Tripadvisor) are used to train supervised learning models (including two deep learning algorithms) that are tested on typically non labeled social media reviews (Facebook and Twitter). We explored a three step methodology, in which dis- tinct balanced training, text preprocessing and machine learning methods were tested, using two languages: English and Italian. The best results were achieved when using undersampling training and a Convolutional Neural Network. Interesting cross-source classification performances were achieved, in particular when using Amazon and Tripadvisor reviews to train a model that is tested on Facebook data for both English and Italian.Research carried out with the support of resources of Big&Open Data Innovation Laboratory (BODaI-Lab), the University of Brescia, granted by Fondazione Cariplo and Regione Lombardia. The work of P. Cortez was supported by FCT - Fundacao para a Ciencia e Tecnologia within the Project Scope UID/CEC/00319/2019. We would also like to thank the three anonymous reviewers for their helpful suggestions
    • …
    corecore