Search CORE

35,188 research outputs found

Predicting FTSE 100 returns and volatility using sentiment analysis

Author: Antweiler
Blasco
Bollen
Chan
Chatrath
Coulton
Da
Das
De Long
Ferguson
Fuehres
Geva
Gidófalvi
Gregory
Grob-Klubmann
Hagenau
Huang
Khadjeh Nassirtoussi
Khadjeh Nassirtoussi
Klein
Kumari
Lee
Li
Li
Loughran
Medhat
Olaniyan
Parkinson
Patton
Ravi
Schumaker
Schumaker
Schumaker
Sinha
Smales
Soyland
Tetlock
Tetlock
Uhl
Wu
Wu
Zhang
Publication venue: 'Wiley'
Publication date: 01/11/2018
Field of study

Bond University Research Portal

Crossref

Big data techniques in auditing research and practice: Current trends and future opportunities

Author: Gepp Adrian
Linnenluecke Martina K
O'Neill Terence J
Publication venue: 'Elsevier BV'
Publication date: 01/06/2018
Field of study

Bond University Research Portal

A framework for the forensic investigation of unstructured email relationship data

Author: Haggerty J
Karran A
Lamb D
Taylor M
Publication venue: 'IGI Global'
Publication date: 01/09/2011
Field of study

Our continued reliance on email communications ensures that it remains a major source of evidence during a digital investigation. Emails comprise both structured and unstructured data. Structured data provides qualitative information to the forensics examiner and is typically viewed through existing tools. Unstructured data is more complex as it comprises information associated with social networks, such as relationships within the network, identification of key actors and power relations, and there are currently no standardised tools for its forensic analysis. Moreover, email investigations may involve many hundreds of actors and thousands of messages. This paper posits a framework for the forensic investigation of email data. In particular, it focuses on the triage and analysis of unstructured data to identify key actors and relationships within an email network. This paper demonstrates the applicability of the approach by applying relevant stages of the framework to the Enron email corpus. The paper illustrates the advantage of triaging this data to identify (and discount) actors and potential sources of further evidence. It then applies social network analysis techniques to key actors within the data set. This paper posits that visualisation of unstructured data can greatly aid the examiner in their analysis of evidence discovered during an investigation

University of Salford Institutional Repository

BCS SGAI SMA 2013: the BCS SGAI workshop on social media analysis

Author
Publication venue: M. Jeusfeld
Publication date: 01/01/2013
Field of study

Portsmouth University Research Portal (Pure)

Locating bugs without looking back

Author: CD Manning
D Poshyvanyk
EM Voorhees
G Antoniol
G Salton
J Sillito
M Petrenko
MF Porter
Michel Wermelinger
N Wilde
T Zimmermann
Tezcan Dilshener
Yijun Yu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/10/2017
Field of study

Bug localisation is a core program comprehension task in software maintenance: given the observation of a bug, e.g. via a bug report, where is it located in the source code? Information retrieval (IR) approaches see the bug report as the query, and the source code files as the documents to be retrieved, ranked by relevance. Such approaches have the advantage of not requiring expensive static or dynamic analysis of the code. However, current state-of-the-art IR approaches rely on project history, in particular previously fixed bugs or previous versions of the source code. We present a novel approach that directly scores each current file against the given report, thus not requiring past code and reports. The scoring method is based on heuristics identified through manual inspection of a small sample of bug reports. We compare our approach to eight others, using their own five metrics on their own six open source projects. Out of 30 performance indicators, we improve 27 and equal 2. Over the projects analysed, on average we find one or more affected files in the top 10 ranked files for 76% of the bug reports. These results show the applicability of our approach to software projects without history

Crossref

Open Research Online (The Open University)

Topic-dependent sentiment analysis of financial blogs

Author: Bermingham Adam
Davy Michael
Ferguson Paul
Gurrin Cathal
O'Hare Neil
Sheridan Páraic
Smeaton Alan F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2009
Field of study

While most work in sentiment analysis in the financial domain has focused on the use of content from traditional finance news, in this work we concentrate on more subjective sources of information, blogs. We aim to automatically determine the sentiment of financial bloggers towards companies and their stocks. To do this we develop a corpus of financial blogs, annotated with polarity of sentiment with respect to a number of companies. We conduct an analysis of the annotated corpus, from which we show there is a significant level of topic shift within this collection, and also illustrate the difficulty that human annotators have when annotating certain sentiment categories. To deal with the problem of topic shift within blog articles, we propose text extraction techniques to create topic-specific sub-documents, which we use to train a sentiment classifier. We show that such approaches provide a substantial improvement over full documentclassification and that word-based approaches perform better than sentence-based or paragraph-based approaches

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

Stock market random forest-text mining system mining critical indicators of stock market movements

Author: Elagamy Mazen Nabil
SHARP Bernadette
STANIER Clare
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Stock Market (SM) is believed to be a significant sector of a free market economy as it plays a crucial role in the growth of commerce and industry of a country. The increasing importance of SMs and their direct influence on economy were the main reasons for analysing SM movements. The need to determine early warning indicators for SM crisis has been the focus of study by many economists and politicians. Whilst most research into the identification of these critical indicators applied data mining to uncover hidden knowledge, very few attempted to adopt a text mining approach. This paper demonstrates how text mining combined with Random Forest algorithm can offer a novel approach to the extraction of critical indicators, and classification of related news articles. The findings of this study extend the current classification of critical indicators from three to eight classes; it also show that Random Forest can outperform other classifiers and produce high accuracy

STORE - Staffordshire Online Repository

Forecasting movements of health-care stock prices based on different categories of news articles using multiple kernel learning

Author: Ammar Belatreche
Antweiler
Chan
Cheng
Deng
Duda
Fang
Fung
Gidófalvi
Groth
Hagenau
Hagenau
Hsu
Kloptchenko
Lavrenko
Li
Luss
Mitra
Mittermayer
Mittermayer
Nassirtoussi
Porter
Rachlin
Schumaker
Schumaker
Shynkevich
Shynkevich
Simon
Sonnenburg
Sonya A. Coleman
T.M. McGinnity
Wang
Wuthrich
Yauheniya Shynkevich
Yeh
Zhai
Zhao
Publication venue: 'Elsevier BV'
Publication date: 08/03/2016
Field of study

—The market state changes when a new piece of information arrives. It affects decisions made by investors and is considered to be an important data source that can be used for financial forecasting. Recently information derived from news articles has become a part of financial predictive systems. The usage of news articles and their forecasting potential have been extensively researched. However, so far no attempts have been made to utilise different categories of news articles simultaneously. This paper studies how the concurrent, and appropriately weighted, usage of news articles, having different degrees of relevance to the target stock, can improve the performance of financial forecasting and support the decision-making process of investors and traders. Stock price movements are predicted using the multiple kernel learning technique which integrates information extracted from multiple news categories while separate kernels are utilised to analyse each category. News articles are partitioned according to their relevance to the target stock, its sub industry, industry, group industry and sector. The experiments are run on stocks from the Health Care sector and show that increasing the number of relevant news categories used as data sources for financial forecasting improves the performance of the predictive system in comparison with approaches based on a lower number of categories

Northumbria Research Link

Crossref

Nottingham Trent Institutional Repository (IRep)

Ulster University's Research Portal