143 research outputs found

    News Cohesiveness: an Indicator of Systemic Risk in Financial Markets

    Get PDF
    Motivated by recent financial crises significant research efforts have been put into studying contagion effects and herding behaviour in financial markets. Much less has been said about influence of financial news on financial markets. We propose a novel measure of collective behaviour in financial news on the Web, News Cohesiveness Index (NCI), and show that it can be used as a systemic risk indicator. We evaluate the NCI on financial documents from large Web news sources on a daily basis from October 2011 to July 2013 and analyse the interplay between financial markets and financially related news. We hypothesized that strong cohesion in financial news reflects movements in the financial markets. Cohesiveness is more general and robust measure of systemic risk expressed in news, than measures based on simple occurrences of specific terms. Our results indicate that cohesiveness in the financial news is highly correlated with and driven by volatility on the financial markets

    Dynamics of online hate and misinformation

    Get PDF
    Online debates are often characterised by extreme polarisation and heated discussions among users. The presence of hate speech online is becoming increasingly problematic, making necessary the development of appropriate countermeasures. In this work, we perform hate speech detection on a corpus of more than one million comments on YouTube videos through a machine learning model, trained and fine-tuned on a large set of hand-annotated data. Our analysis shows that there is no evidence of the presence of ā€œpure hatersā€, meant as active users posting exclusively hateful comments. Moreover, coherently with the echo chamber hypothesis, we find that users skewed towards one of the two categories of video channels (questionable, reliable) are more prone to use inappropriate, violent, or hateful language within their opponentsā€™ community. Interestingly, users loyal to reliable sources use on average a more toxic language than their counterpart. Finally, we find that the overall toxicity of the discussion increases with its length, measured both in terms of the number of comments and time. Our results show that, coherently with Godwinā€™s law, online debates tend to degenerate towards increasingly toxic exchanges of views

    A Semantics-Based Measure of Emoji Similarity

    Get PDF
    Emoji have grown to become one of the most important forms of communication on the web. With its widespread use, measuring the similarity of emoji has become an important problem for contemporary text processing since it lies at the heart of sentiment analysis, search, and interface design tasks. This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding models that are learned over machine-readable emoji meanings in the EmojiNet knowledge base. Using emoji descriptions, emoji sense labels and emoji sense definitions, and with different training corpora obtained from Twitter and Google News, we develop and test multiple embedding models to measure emoji similarity. To evaluate our work, we create a new dataset called EmoSim508, which assigns human-annotated semantic similarity scores to a set of 508 carefully selected emoji pairs. After validation with EmoSim508, we present a real-world use-case of our emoji embedding models using a sentiment analysis task and show that our models outperform the previous best-performing emoji embedding model on this task. The EmoSim508 dataset and our emoji embedding models are publicly released with this paper and can be downloaded from http://emojinet.knoesis.org/.Comment: This paper is accepted at Web Intelligence 2017 as a full paper, In 2017 IEEE/WIC/ACM International Conference on Web Intelligence (WI). Leipzig, Germany: ACM, 201

    Handling Disagreement inĀ Hate Speech Modelling

    Get PDF
    Hate speech annotation for training machine learning models is an inherently ambiguous and subjective task. In this paper, we adopt a perspectivist approach to data annotation, model training and evaluation for hate speech classification. We first focus on the annotation process and argue that it drastically influences the final data quality. We then present three large hate speech datasets that incorporate annotator disagreement and use them to train and evaluate machine learning models. As the main point, we propose to evaluate machine learning models through the lens of disagreement by applying proper performance measures to evaluate both annotatorsā€™ agreement and modelsā€™ quality. We further argue that annotator agreement poses intrinsic limits to the performance achievable by models. When comparing models and annotators, we observed that they achieve consistent levels of agreement across datasets. We reflect upon our results and propose some methodological and ethical considerations that can stimulate the ongoing discussion on hate speech modelling and classification with disagreement

    Multimodal Emotion Classification

    Get PDF
    Most NLP and Computer Vision tasks are limited to scarcity of labelled data. In social media emotion classification and other related tasks, hashtags have been used as indicators to label data. With the rapid increase in emoji usage of social media, emojis are used as an additional feature for major social NLP tasks. However, this is less explored in case of multimedia posts on social media where posts are composed of both image and text. At the same time, w.e have seen a surge in the interest to incorporate domain knowledge to improve machine understanding of text. In this paper, we investigate whether domain knowledge for emoji can improve the accuracy of emotion classification task. We exploit the importance of different modalities from social media post for emotion classification task using state-of-the-art deep learning architectures. Our experiments demonstrate that the three modalities (text, emoji and images) encode different information to express emotion and therefore can complement each other. Our results also demonstrate that emoji sense depends on the textual context, and emoji combined with text encodes better information than considered separately. The highest accuracy of 71.98\% is achieved with a training data of 550k posts.Comment: Accepted at the 2nd Emoji Workshop co-located with The Web Conference 201

    Pojavnost alimentarnog mukozitisa kod pacijenata na klinici za onkologiju Kliničkog bolničkog centra Rebro - Zagreb The incidence of alimentary mucositis in patients at Clinic for Oncology in the Clinical Hospital Centre Rebro - Zagreb

    Get PDF
    Uvod: Mukozitis je upalna promjena sluznice koja može zahvatiti cijeli probavni trakt izazivajući bol, ulceracije u ustima, nadimanje, povraćanje, proljev i poremećaje peristaltike. Nastaje zbog citotoksičnog učinka medikamenata i/ili kao posljedica zračenja. Mukozitis je faktor rizika za povećanje incidencije lokalnih i sustavnih infekcija, smanjuje stupanj kvalitete života te je ograničavajući čimbenik u doziranju zračenja i provođenju kemoterapije. Svrha je članka utvrditi učestalost mukozitisa i njegov utjecaj na kvalitetu života pacijenata liječenih u Klinici za onkologiju Kliničkog bolničkog centra Zagreb. Ispitanici i metode: Istraživanje je provedeno na Klinici za onkologiju KBC-a Zagreb. Podatci su dobiveni izradom originalnog anonimnog anketnog upitnika koji se sastojao od 14 pitanja o prisutnosti simptoma mukozitisa. Ispitanici su anketirani između 20. 12. 2015. i 30. 4. 2016 godine. Upitnik je ispunilo 306 ispitanika koji su u tome razdoblju primali kemoterapiju i/ili radioterapiju. Rezultati: Iznalazi se da je 23,2% ispitanika imalo bolno i/ili otežano gutanje od blagog do ozbiljnog stupnja koje je povezano s terapijom. Nadalje, 43,6% ispitanika s navedenim simptomima liječeno je uporabom analgetika radi smanjenja bolnosti. Bol u grlu navodi 24,8% ispitanika, proljev 26,1% ispitanika, kod 18,6% ispitanika iznalazi se primjesa sluzi i/ili krvi u stolici, a 25,5% ispitanika uzima sredstva za smanjenje rijetkih stolica. Zaključak: Alimentarni mukozitis je komplikacija kemoterapije i/ili zračenja, koja smanjuje stupanj kvalitete života i to zbog pojačanja boli i otežanog gutanja kad promjene zahvaćaju cirkumferenciju jednjaka i cijeli gastrointestinalni trakt zbog pojave mučnine, povraćanja i proljeva
    • ā€¦
    corecore