35 research outputs found

    Fitting replicated multiple time series models

    Get PDF
    This paper shows that the interleaving of replicated multiple time series allows the estimation methods available in standard multiple time series packages to be applied simultaneously to each of the replicated series without loss of information. The methodology employs a non-trivial multivariate extension of an earlier univariate result involving interleaving. The interleaving approach is used to model more than sixty years of daily maximum and minimum temperatures for Perth, Western Australia

    Trump, Twitter, and news media responsiveness: a media systems approach

    Full text link
    How populists engage with media of various types, and are treated by those media, are questions of international interest. In the United States, Donald Trump stands out for both his populism-inflected campaign style and his success at attracting media attention. This article examines how interactions between candidate communications, social media, partisan media, and news media combined to shape attention to Trump, Clinton, Cruz, and Sanders during the 2015–2016 American presidential primary elections. We identify six major components of the American media system and measure candidates’ efforts to gain attention from them. Our results demonstrate that social media activity, in the form of retweets of candidate posts, provided a significant boost to news media coverage of Trump, but no comparable boost for other candidates. Furthermore, Trump tweeted more at times when he had recently garnered less of a relative advantage in news attention, suggesting he strategically used Twitter to trigger coverage.Accepted manuscrip

    Online geolocalized emotion across US cities during the COVID crisis: Universality, policy response, and connection with local mobility

    Full text link
    As the COVID-19 pandemic began to sweep across the US it elicited a wide spectrum of responses, both online and offline, across the population. To aid the development of effective spatially targeted interventions in the midst of this turmoil, it is important to understand the geolocalization of these online emotional responses, as well as their association with offline behavioral responses. Here, we analyze around 13 million geotagged tweets in 49 cities across the US from the first few months of the pandemic to assess regional dependence in online sentiments with respect to a few major topics, and how these sentiments correlate with policy development and human mobility. Surprisingly, we observe universal trends in overall and topic-based sentiments across cities over the time period studied, with variability primarily seen only in the immediate impact of federal guidelines and local lockdown policies. We also find that these local sentiments are highly correlated with and predictive of city-level mobility, while the correlations between sentiments and local cases and deaths are relatively weak. Our findings point to widespread commonalities in the online public emotional responses to COVID across the US, both temporally and relative to offline indicators, in contrast with the high variability seen in early local containment policies. This study also provides new insights into the use of social media data in crisis management by integrating offline data to gain an in-depth understanding of public emotional responses, policy development, and local mobility

    Analyzing cryptocurrency groups using topic modeling on Twitter posts

    Get PDF
    Cryptocurrencies are decentralized digital coins that use cryptographic protocols to provide more secure financial transactions. The world witnessed an impressive rise in the prices of these assets in the last few years, which stimulated a great interest regarding them. This thesis shifts the focus from the most notorious one, bitcoin, and from price aspects to concentrate on other cryptocoins and their technical features. A total of 25 cryptocoins were selected and then divided into 3 groups representing fundamental characteristics: Faster transactions, Smart Contracts and Privacy. Then, daily comments about these cryptocoins on Twitter were collected for 4 months. The main objective was to check whether the categorization fits well for each group, detect the prominent themes under discussion and perform a prediction task to see which ones may be discussed again in the future. Topic modeling, specifically Latent Dirichlet Allocation (LDA), was utilized to process the text data in order to find the topics that best represented each one of the groups. Coherence measures were applied to discover the optimal number of topics, which were later grouped into themes. Daily average probability distributions for topics, or topic weights, were treated as a time series data along with their theme representations. With that, it was possible to forecast theme weights using ARIMA and check the predictive ability of each theme by comparing mean squared error (MSE) of ARIMA and Naive methods. Overall, the cryptocoins seemed to be well represented since in every group there is at least one topic that directly refers to the meaning of the group. However, none of the previously mentioned topics was the most important in any of the groups. Faster transactions and Smart Contracts ended up being similar groups, having a Financial topic-group as the most notable theme, a similar organization of their remaining ones and low predictive ability, while the Privacy currency-group had different results, with a Mixed topic-group as the best-positioned theme and slightly better forecasting results

    Predicting influence on social networks with graph machine learning

    Get PDF

    Three Essays on Individuals’ Vulnerability to Security Attacks in Online Social Networks: Factors and Behaviors

    Get PDF
    With increasing reliance on the Internet, the use of online social networks (OSNs) for communication has grown rapidly. OSN platforms are used to share information and communicate with friends and family. However, these platforms can pose serious security threats to users. In spite of the extent of such security threats and resulting damages, little is known about factors associated with individuals’ vulnerability to online security attacks. We address this gap in the following three essays. Essay 1 draws on a synthesis of the epidemic theory in infectious disease epidemiology with the social capital theory to conceptualize factors that contribute to an individual’s role in security threat propagation in OSN. To test the model, we collected data and created a network of hacked individuals over three months from Twitter. The final hacked network consists of over 8000 individual users. Using this data set, we derived individual’s factors measuring threat propagation efficacy and threat vulnerability. The dependent variables were defined based on the concept of epidemic theory in disease propagation. The independent variables are measured based on the social capital theory. We use the regression method for data analysis. The results of this study uncover factors that have significant impact on threat propagation efficacy and threat vulnerability. We discuss the novel theoretical and managerial contributions of this work. Essay 2 explores the role of individuals’ interests in their threat vulnerability in OSNs. In OSNs, individuals follow social pages and post contents that can easily reveal their topics of interest. Prior studies show high exposure of individuals to topics of interest can decrease individuals’ ability to evaluate the risks associated with their interests. This gives attackers a chance to target people based on what they are interested in. However, interest-based vulnerability is not just a risk factor for individuals themselves. Research has reported that similar interests lead to friendship and individuals share similar interests with their friends. This similarity can increase trust among friends and makes individuals more vulnerable to security threat coming from their friends’ behaviors. Despite the potential importance of interest in the propagation of online security attacks online, the literature on this topic is scarce. To address this gap, we capture individuals’ interests in OSN and identify the association between individuals’ interests and their vulnerability to online security threats. The theoretical foundation of this work is a synthesis of dual-system theory and the theory of homophily. Communities of interest in OSN were detected using a known algorithm. We test our model using the data set and social network of hacked individuals from Essay 1. We used this network to collect additional data about individuals’ interests in OSN. The results determine communities of interests which were associated with individuals’ online threat vulnerability. Moreover, our findings reveal that similarities of interest among individuals and their friends play a role in individuals’ threat vulnerability in OSN. We discuss the novel theoretical and empirical contributions of this work. Essay 3 examines the role addiction to OSNs plays in individuals’ security perceptions and behaviors. Despite the prevalence of problematic use of OSNs and the possibility of addiction to these platforms, little is known about the functionalities of brain systems of users who suffer from OSN addiction and their online security perception and behaviors. In addressing these gaps, we have developed the Online addiction & security behaviors (OASB) theory by synthesizing dual-system theory and extended protection motivation theory (PMT). We collected data through an online survey. The results indicate that OSN addiction is rooted in the individual’s brain systems. For the OSN addicted, there is a strong cognitive-emotional preoccupation with using OSN. Our findings also reveal the positive and significant impact of OSN addiction on perceived susceptibility to and severity of online security threats. Moreover, our results show the negative association between OSN addiction and perceived self-efficacy. We discuss the theoretical and practical implications of this work

    Social media analytics with applications in disaster management and COVID-19 events

    Get PDF
    Social media such as Twitter offers a tremendous amount of data throughout an event or a disastrous situation. Leveraging social media data during a disaster is beneficial for effective and efficient disaster management. Information extraction, trend identification, and determining public reactions might help in the future disaster or even avert such an event. However, during a disaster situation, a robust system is required that can be deployed faster and process relevant information with satisfactory performance in real-time. This work outlines the research contributions toward developing such an effective system for disaster management, where it is paramount to develop automated machine-enabled methods that can provide appropriate tags or labels for further analysis for timely situation-awareness. In that direction, this work proposes machine learning models to identify the people who are seeking assistance using social media during a disaster and further demonstrates a prototype application that can collect and process Twitter data in real-time, identify the stranded people, and create rescue scheduling. In addition, to understand the people’s reactions to different trending topics, this work proposes a unique auxiliary feature-based deep learning model with adversarial sample generation for emotion detection using tweets related to COVID-19. This work also presents a custom Q&A-based RoBERTa model for extracting related phrases for emotions. Finally, with the aim of polarization detection, this research work proposes a deep learning pipeline for political ideology detection leveraging the tweet texts and the expressed emotions in the text. This work also studies and conducts the historical emotion and polarization analysis of the COVID-19 pandemic in the USA and several individual states using tweeter data --Abstract, page iv

    Understanding the Relationship between Online Discussions and Bitcoin Return and Volume: Topic Modeling and Sentiment Analysis

    Get PDF
    This thesis examines Bitcoin related discussions on Bitcointalk.com over the 2013-2022 period. Using Latent Dirichlet Allocation (LDA) topic modeling algorithm, we discover eight distinct topics: Mining, Regulation, Investment/trading, Public perception, Bitcoin’s nature, Wallet, Payment, and Other. Importantly, we find differences in relations between different topics’ sentiment, disagreement (proxy for uncertainty) and hype (proxy for attention) on one hand and Bitcoin return and trading volume on the other hand. Specifically, among all topics, only the sentiment and disagreement of Investment/trading topic have significant contemporaneous relation with Bitcoin return. In addition, sentiment and disagreement of several topics, such as Mining and Wallet, show significant relationships with Bitcoin return only on the tails of the return distribution (bullish and bearish markets). In contrast, sentiment, disagreement, and hype of each topic show significant relation with Bitcoin volume across the entire distribution. In addition, whereas hype has a positive relation with trading volume in a low-volume market, this relation becomes negative in a high-volume market
    corecore