183 research outputs found
News Cohesiveness: an Indicator of Systemic Risk in Financial Markets
Motivated by recent financial crises significant research efforts have been
put into studying contagion effects and herding behaviour in financial markets.
Much less has been said about influence of financial news on financial markets.
We propose a novel measure of collective behaviour in financial news on the
Web, News Cohesiveness Index (NCI), and show that it can be used as a systemic
risk indicator. We evaluate the NCI on financial documents from large Web news
sources on a daily basis from October 2011 to July 2013 and analyse the
interplay between financial markets and financially related news. We
hypothesized that strong cohesion in financial news reflects movements in the
financial markets. Cohesiveness is more general and robust measure of systemic
risk expressed in news, than measures based on simple occurrences of specific
terms. Our results indicate that cohesiveness in the financial news is highly
correlated with and driven by volatility on the financial markets
Handling Disagreement inĀ Hate Speech Modelling
Hate speech annotation for training machine learning models is an inherently ambiguous and subjective task. In this paper, we adopt a perspectivist approach to data annotation, model training and evaluation for hate speech classification. We first focus on the annotation process and argue that it drastically influences the final data quality. We then present three large hate speech datasets that incorporate annotator disagreement and use them to train and evaluate machine learning models. As the main point, we propose to evaluate machine learning models through the lens of disagreement by applying proper performance measures to evaluate both annotatorsā agreement and modelsā quality. We further argue that annotator agreement poses intrinsic limits to the performance achievable by models. When comparing models and annotators, we observed that they achieve consistent levels of agreement across datasets. We reflect upon our results and propose some methodological and ethical considerations that can stimulate the ongoing discussion on hate speech modelling and classification with disagreement
Dynamics of online hate and misinformation
Online debates are often characterised by extreme polarisation and heated discussions among
users. The presence of hate speech online is becoming increasingly problematic, making necessary
the development of appropriate countermeasures. In this work, we perform hate speech detection
on a corpus of more than one million comments on YouTube videos through a machine learning
model, trained and fine-tuned on a large set of hand-annotated data. Our analysis shows that there
is no evidence of the presence of āpure hatersā, meant as active users posting exclusively hateful
comments. Moreover, coherently with the echo chamber hypothesis, we find that users skewed
towards one of the two categories of video channels (questionable, reliable) are more prone to use
inappropriate, violent, or hateful language within their opponentsā community. Interestingly, users
loyal to reliable sources use on average a more toxic language than their counterpart. Finally, we find
that the overall toxicity of the discussion increases with its length, measured both in terms of the
number of comments and time. Our results show that, coherently with Godwinās law, online debates
tend to degenerate towards increasingly toxic exchanges of views
Extraction of temporal networks from term co-occurrences in online textual sources
A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize relevant entities in them, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to estimate the significance of co-occurrences, and a benchmark model against which their robustness is evaluated. The approach is applied to a large set of financial news, collected over a period of two years. The entities we consider are 50 countries which issue sovereign bonds, and which are insured by Credit Default Swaps (CDS) in turn. We compare the country co-occurrence networks to the CDS networks constructed from the correlations between the CDS. The results show relatively small, but significant overlap between the networks extracted from the news and those from the CDS correlations
Developing a biomaterial interface based on poly(lactic acid) via plasma-assisted covalent anchorage of D-glucosamine and its potential for tissue regeneration
The aim of this study was to develop the potential tissue engineering applications of D-glucosamine (GlcN) immobilized onto the surface of a biodegradable matrix in order to induce a desired biological effect at biointerfaces. Thus, for sample preparation we used a novel multistep physicochemical approach. In the first step the poly(lactic acid) (PLA) films were exposed to a low pressure plasma in air atmosphere, followed by radical graft copolymerization with acrylic acid to yield a carboxyl-functionalized spacer layer on the PLA surface. The carboxyl groups were then coupled to GlcN molecules via the carbodiimide chemistry. The developed surfaces were characterized by X-ray Photoelectron Spectroscopy (XPS), Contact angle measurements and Atomic Force Microscopy (AFM). A preliminary study on the proliferation of fibroblasts on the developed surfaces was performed using the NIH/3T3 cell line. Ā© 2016 Elsevier B.V.ARRS, Slovenian Research AgencyMinistry of Education, Youth and Sports of the Czech Republic [LO1504]; Ministry of Education, Science, Research and Sport of the Slovak Republic; Slovak Academy of Sciences [2/0199/14]; Slovenian Research Agency [P2-0082
- ā¦