431 research outputs found

    Sensing Social Media Signals for Cryptocurrency News

    Full text link
    The ability to track and monitor relevant and important news in real-time is of crucial interest in multiple industrial sectors. In this work, we focus on the set of cryptocurrency news, which recently became of emerging interest to the general and financial audience. In order to track relevant news in real-time, we (i) match news from the web with tweets from social media, (ii) track their intraday tweet activity and (iii) explore different machine learning models for predicting the number of the article mentions on Twitter within the first 24 hours after its publication. We compare several machine learning models, such as linear extrapolation, linear and random forest autoregressive models, and a sequence-to-sequence neural network. We find that the random forest autoregressive model behaves comparably to more complex models in the majority of tasks.Comment: full version of the paper, that is accepted at ACM WWW '19 Conference, MSM'19 Worksho

    A Phenomenological Study of 21-29-Year-Old Teachers\u27 Perceptions of Using Twitter for Professional Development

    Get PDF
    This transcendental phenomenological study explored the perception of 21-29-year-old public school teachers’ use of Twitter in their professional development. However, while teachers in the 21-29-year-old age range were part of the demographic dominating online social media use in general, they did not use online social networks for professional development purposes as much as their older peers (Carpenter & Krutka, 2014; Visser et al., 2014). While professional development was important for improving teachers’ classroom performance and student achievement (Coldwell, 2017), traditional professional development often was ineffective in changing classroom instruction (Carpenter & Krutka, 2014; Dingle, Brownwell, Leko, Boardman, & Haager, 2011; Harcourt & Jones, 2016; Visser, Evering, & Barrett, 2014). Also, lack of relative professional development was cited as a reason teachers leave the profession within a few years of joining (Barry & Shields, 2017). Using Twitter for professional development was used by older teachers effectively, but younger teachers did not use it for potentially helpful professional development (Carpenter & Krutka, 2014; Visser et al., 2014). The theoretical frameworks of this study include sociocultural learning (Vygotsky, 1978), social networking theory (Moreno, 1946), and communities of practice (Lave & Wenger, 1991). Participants selected were six licensed K-12 public school teachers aged 21-29. The setting was a South Carolina suburban public middle school. Data collection methods included interviews, focus groups, and observation of Twitter use after participation in a professional development session on using Twitter in education. Data analysis included horizonalization, reduction and elimination, clustering, and thematizing recommended by Moustakas (1994) to develop themes to understand how teachers age 21-29 perceive using online social networks for professional development

    Effective precursors for self-organization of complex systems into a critical state based on dynamic series data

    Get PDF
    Many different precursors are known, but not all of which are effective, i.e., giving enough time to take preventive measures and with a minimum number of false early warning signals. The study aims to select and study effective early warning measures from a set of measures directly related to critical slowing down as well as to the change in the structure of the reconstructed phase space in the neighborhood of the critical transition point of sand cellular automata. We obtained a dynamical series of the number of unstable nodes in automata with stochastic and deterministic vertex collapse rules, with different topological graph structure and probabilistic distribution law for pumping of automata. For these dynamical series we computed windowed early warning measures. We formulated the notion of an effective measure as the measure that has the smallest number of false signals and the longest early warning time among the set of early warning measures. We found that regardless of the rules, topological structure of graphs, and probabilistic distribution law for pumping of automata, the effective early warning measures are the embedding dimension, correlation dimension, and approximation entropy estimated using the false nearest neighbors algorithm. The variance has the smallest early warning time, and the largest Lyapunov exponent has the greatest number of false early warning signals. Autocorrelation at lag-1 and Welch’s estimate for the scaling exponent of power spectral density cannot be used as early warning measures for critical transitions in the automata. The efficiency definition we introduced can be used to search for and investigate new early warning measures. Embedding dimension, correlation dimension and approximation entropy can be used as effective real-time early warning measures for critical transitions in real-world systems isomorphic to sand cellular automata such as microblogging social network and stock exchange

    On-the-fly tracing for data-centric computing : parallelization, workflow and applications

    Get PDF
    As data-centric computing becomes the trend in science and engineering, more and more hardware systems, as well as middleware frameworks, are emerging to handle the intensive computations associated with big data. At the programming level, it is crucial to have corresponding programming paradigms for dealing with big data. Although MapReduce is now a known programming model for data-centric computing where parallelization is completely replaced by partitioning the computing task through data, not all programs particularly those using statistical computing and data mining algorithms with interdependence can be re-factorized in such a fashion. On the other hand, many traditional automatic parallelization methods put an emphasis on formalism and may not achieve optimal performance with the given limited computing resources. In this work we propose a cross-platform programming paradigm, called on-the-fly data tracing , to provide source-to-source transformation where the same framework also provides the functionality of workflow optimization on larger applications. Using a big-data approximation computations related to large-scale data input are identified in the code and workflow and a simplified core dependence graph is built based on the computational load taking in to account big data. The code can then be partitioned into sections for efficient parallelization; and at the workflow level, optimization can be performed by adjusting the scheduling for big-data considerations, including the I/O performance of the machine. Regarding each unit in both source code and workflow as a model, this framework enables model-based parallel programming that matches the available computing resources. The techniques used in model-based parallel programming as well as the design of the software framework for both parallelization and workflow optimization as well as its implementations with multiple programming languages are presented in the dissertation. Then, the following experiments are performed to validate the framework: i) the benchmarking of parallelization speed-up using typical examples in data analysis and machine learning (e.g. naive Bayes, k-means) and ii) three real-world applications in data-centric computing with the framework are also described to illustrate the efficiency: pattern detection from hurricane and storm surge simulations, road traffic flow prediction and text mining from social media data. In the applications, it illustrates how to build scalable workflows with the framework along with performance enhancements

    Understanding Bots on Social Media - An Application in Disaster Response

    Get PDF
    abstract: Social media has become a primary platform for real-time information sharing among users. News on social media spreads faster than traditional outlets and millions of users turn to this platform to receive the latest updates on major events especially disasters. Social media bridges the gap between the people who are affected by disasters, volunteers who offer contributions, and first responders. On the other hand, social media is a fertile ground for malicious users who purposefully disturb the relief processes facilitated on social media. These malicious users take advantage of social bots to overrun social media posts with fake images, rumors, and false information. This process causes distress and prevents actionable information from reaching the affected people. Social bots are automated accounts that are controlled by a malicious user and these bots have become prevalent on social media in recent years. In spite of existing efforts towards understanding and removing bots on social media, there are at least two drawbacks associated with the current bot detection algorithms: general-purpose bot detection methods are designed to be conservative and not label a user as a bot unless the algorithm is highly confident and they overlook the effect of users who are manipulated by bots and (unintentionally) spread their content. This study is trifold. First, I design a Machine Learning model that uses content and context of social media posts to detect actionable ones among them; it specifically focuses on tweets in which people ask for help after major disasters. Second, I focus on bots who can be a facilitator of malicious content spreading during disasters. I propose two methods for detecting bots on social media with a focus on the recall of the detection. Third, I study the characteristics of users who spread the content of malicious actors. These features have the potential to improve methods that detect malicious content such as fake news.Dissertation/ThesisDoctoral Dissertation Computer Science 201

    SYSTEMIC ANALYSIS OF ILLEGAL MASS MIGRATION IN THE CENTRAL MEDITERRANEAN REGION

    Get PDF
    This thesis explores the systemic behavior of illegal mass migration in the Central Mediterranean region and proposes strategic approaches to address the problem. We hypothesize that the illegal migration is a complex systemic problem, where parts of the system are interdependent and behavioral change of any element effects the behavior of the whole. This research applies a series of quantitative and qualitative analyses where each reveals different aspects and properties of the migration system as a whole. The systemic analysis highlights the interconnectedness of different parts and their impact of the system’s output. Also, it reveals the cognitive background as a unique aspect of this region: namely, the decision to migrate is based on biased perception and bounded rationality rather than rational choice. In conclusion, we claim that the system’s output (i.e. level of illegal migration) is characterized by the interrelated behavior of parts of the migration system; therefore, strategic planning requires the notion of the dominant feedback loops, self-organization, time delays, limitations, and non-linear relations. Also, we conclude that a skewed perception based on social influence and cognitive biases influences a large number of people in that region to migrate.Captain, Hungarian Defence ForceApproved for public release. Distribution is unlimited

    Control and Data Analysis of Complex Networks

    Get PDF
    abstract: This dissertation treats a number of related problems in control and data analysis of complex networks. First, in existing linear controllability frameworks, the ability to steer a network from any initiate state toward any desired state is measured by the minimum number of driver nodes. However, the associated optimal control energy can become unbearably large, preventing actual control from being realized. Here I develop a physical controllability framework and propose strategies to turn physically uncontrollable networks into physically controllable ones. I also discover that although full control can be guaranteed by the prevailing structural controllability theory, it is necessary to balance the number of driver nodes and control energy to achieve actual control, and my work provides a framework to address this issue. Second, in spite of recent progresses in linear controllability, controlling nonlinear dynamical networks remains an outstanding problem. Here I develop an experimentally feasible control framework for nonlinear dynamical networks that exhibit multistability. The control objective is to apply parameter perturbation to drive the system from one attractor to another. I introduce the concept of attractor network and formulate a quantifiable framework: a network is more controllable if the attractor network is more strongly connected. I test the control framework using examples from various models and demonstrate the beneficial role of noise in facilitating control. Third, I analyze large data sets from a diverse online social networking (OSN) systems and find that the growth dynamics of meme popularity exhibit characteristically different behaviors: linear, “S”-shape and exponential growths. Inspired by cell population growth model in microbial ecology, I construct a base growth model for meme popularity in OSNs. Then I incorporate human interest dynamics into the base model and propose a hybrid model which contains a small number of free parameters. The model successfully predicts the various distinct meme growth dynamics. At last, I propose a nonlinear dynamics model to characterize the controlling of WNT signaling pathway in the differentiation of neural progenitor cells. The model is able to predict experiment results and shed light on the understanding of WNT regulation mechanisms.Dissertation/ThesisDoctoral Dissertation Electrical Engineering 201
    • …
    corecore