98 research outputs found

    Twitter reciprocal reply networks exhibit assortativity with respect to happiness

    Full text link
    The advent of social media has provided an extraordinary, if imperfect, 'big data' window into the form and evolution of social networks. Based on nearly 40 million message pairs posted to Twitter between September 2008 and February 2009, we construct and examine the revealed social network structure and dynamics over the time scales of days, weeks, and months. At the level of user behavior, we employ our recently developed hedonometric analysis methods to investigate patterns of sentiment expression. We find users' average happiness scores to be positively and significantly correlated with those of users one, two, and three links away. We strengthen our analysis by proposing and using a null model to test the effect of network topology on the assortativity of happiness. We also find evidence that more well connected users write happier status updates, with a transition occurring around Dunbar's number. More generally, our work provides evidence of a social sub-network structure within Twitter and raises several methodological points of interest with regard to social network reconstructions.Comment: 22 pages, 21 figures, 5 tables, In press at the Journal of Computational Scienc

    Estimation of global network statistics from incomplete data

    Get PDF
    Complex networks underlie an enormous variety of social, biological, physical, and virtual systems. A profound complication for the science of complex networks is that in most cases, observing all nodes and all network interactions is impossible. Previous work addressing the impacts of partial network data is surprisingly limited, focuses primarily on missing nodes, and suggests that network statistics derived from subsampled data are not suitable estimators for the same network statistics describing the overall network topology. We generate scaling methods to predict true network statistics, including the degree distribution, from only partial knowledge of nodes, links, or weights. Our methods are transparent and do not assume a known generating process for the network, thus enabling prediction of network statistics for a wide variety of applications. We validate analytical results on four simulated network classes and empirical data sets of various sizes. We perform subsampling experiments by varying proportions of sampled data and demonstrate that our scaling methods can provide very good estimates of true network statistics while acknowledging limits. Lastly, we apply our techniques to a set of rich and evolving large-scale social networks, Twitter reply networks. Based on 100 million tweets, we use our scaling techniques to propose a statistical characterization of the Twitter Interactome from September 2008 to November 2008. Our treatment allows us to find support for Dunbar\u27s hypothesis in detecting an upper threshold for the number of active social contacts that individuals maintain over the course of one week

    Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter

    Get PDF
    Individual happiness is a fundamental societal metric. Normally measured through self-report, happiness has often been indirectly characterized and overshadowed by more readily quantifiable economic indicators such as gross domestic product. Here, we examine expressions made on the online, global microblog and social networking service Twitter, uncovering and explaining temporal variations in happiness and information levels over timescales ranging from hours to years. Our data set comprises over 46 billion words contained in nearly 4.6 billion expressions posted over a 33 month span by over 63 million unique users. In measuring happiness, we use a real-time, remote-sensing, non-invasive, text-based approach---a kind of hedonometer. In building our metric, made available with this paper, we conducted a survey to obtain happiness evaluations of over 10,000 individual words, representing a tenfold size improvement over similar existing word sets. Rather than being ad hoc, our word list is chosen solely by frequency of usage and we show how a highly robust metric can be constructed and defended.Comment: 27 pages, 17 figures, 3 tables. Supplementary Information: 1 table, 52 figure

    IS THE WHOLE GREATER THAN THE SUM OF ITS PARTS? A COMPARISON OF SMALL GROUP AND WHOLE CLASS DISCUSSION BOARD ACTIVITY IN ONLINE COURSES

    Get PDF
    Methods for characterizing asynchronous text-based discussions have received significant attention in the literature. In this study, we examine student and instructor posts made in seventeen undergraduate mathematics courses over the duration of a fifteen-week semester (n=6964 posts). We apply our previously developed multifactor discussion board metric to compare differences in student participation, quantities of student posts, quality of posts, extent of threading, and instructor presence in small group and whole class discussion board activities. Results from this study indicate that small group discussions contained greater levels of student participation, greater quantities of posts per student and greater numbers of educationally valuable (content-related) posts per student as compared to whole class discussions within these courses. Interestingly, small group discussions contained a greater proportion of less educationally valuable posts as compared to whole class discussions

    An Evolutionary Algorithm Approach to Link Prediction in Dynamic Social Networks

    Full text link
    Many real world, complex phenomena have underlying structures of evolving networks where nodes and links are added and removed over time. A central scientific challenge is the description and explanation of network dynamics, with a key test being the prediction of short and long term changes. For the problem of short-term link prediction, existing methods attempt to determine neighborhood metrics that correlate with the appearance of a link in the next observation period. Recent work has suggested that the incorporation of topological features and node attributes can improve link prediction. We provide an approach to predicting future links by applying the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to optimize weights which are used in a linear combination of sixteen neighborhood and node similarity indices. We examine a large dynamic social network with over 10610^6 nodes (Twitter reciprocal reply networks), both as a test of our general method and as a problem of scientific interest in itself. Our method exhibits fast convergence and high levels of precision for the top twenty predicted links. Based on our findings, we suggest possible factors which may be driving the evolution of Twitter reciprocal reply networks.Comment: 17 pages, 12 figures, 4 tables, Submitted to the Journal of Computational Scienc

    cMOOCs and Global Learning: An Authentic Alternative

    Get PDF
    Massive Open Online Courses (MOOCs) continue to attract press coverage as they change almost daily in their format, number of registrations and potential for credentialing. An enticing aspect of the MOOC is its global reach. In this paper, we will focus on a type of MOOC called a cMOOC, because it is based on the theory of connectivism and fits the definition of an Open Educational Resource (OER) identified for this special edition of JALN. We begin with a definition of the cMOOC and a discussion of the connectivism on which it is based. Definitions and a research review are followed with a description of two MOOCs offered by two of the authors. Research on one of these MOOCs completed by a third author is presented as well. Student comments that demonstrate the intercultural connections are shared. We end with reflections, lessons learned and recommendations

    Positivity of the English language

    Get PDF
    Over the last million years, human language has emerged and evolved as a fundamental instrument of social communication and semiotic representation. People use language in part to convey emotional information, leading to the central and contingent questions: (1) What is the emotional spectrum of natural language? and (2) Are natural languages neutrally, positively, or negatively biased? Here, we report that the human-perceived positivity of over 10,000 of the most frequently used English words exhibits a clear positive bias. More deeply, we characterize and quantify distributions of word positivity for four large and distinct corpora, demonstrating that their form is broadly invariant with respect to frequency of word use.Comment: Manuscript: 9 pages, 3 tables, 5 figures; Supplementary Information: 12 pages, 3 tables, 8 figure

    Extremely Long-Lived Stigmas Allow Extended Cross-Pollination Opportunities in a High Andean Plant

    Get PDF
    High-elevation ecosystems are traditionally viewed as environments in which predominantly autogamous breeding systems should be selected because of the limited pollinator availability. Chaetanthera renifolia (Asteraceae) is an endemic monocarpic triennial herb restricted to a narrow altitudinal range within the high Andes of central Chile (3300–3500 m a.s.l.), just below the vegetation limit. This species displays one of the larger capitulum within the genus. Under the reproductive assurance hypothesis, and considering its short longevity (monocarpic triennial), an autogamous breeding system and low levels of pollen limitation would be predicted for C. renifolia. In contrast, considering its large floral size, a xenogamous breeding system, and significant levels of pollen limitation could be expected. In addition, the increased pollination probability hypothesis predicts prolonged stigma longevity for high alpine plants. We tested these alternative predictions by performing experimental crossings in the field to establish the breeding system and to measure the magnitude of pollen limitation in two populations of C. renifolia. In addition, we measured the stigma longevity in unpollinated and open pollinated capitula, and pollinator visitation rates in the field. We found low levels of self-compatibility and significant levels of pollen limitation in C. renifolia. Pollinator visitation rates were moderate (0.047–0.079 visits per capitulum per 30 min). Although pollinator visitation rate significantly differed between populations, they were not translated into differences in achene output. Finally, C. renifolia stigma longevity of unpollinated plants was extremely long and significantly higher than that of open pollinated plants (26.3±2.8 days vs. 10.1±2.2, respectively), which gives support to the increased pollination probability hypothesis for high-elevation flowering plants. Our results add to a growing number of studies that show that xenogamous breeding systems and mechanisms to increase pollination opportunities can be selected in high-elevation ecosystems

    A subset of platinum-containing chemotherapeutic agents kills cells by inducing ribosome biogenesis stress

    Get PDF
    Cisplatin and its platinum analogs, carboplatin and oxaliplatin, are some of the most widely used cancer chemotherapeutics. Although cisplatin and carboplatin are used primarily in germ cell, breast and lung malignancies, oxaliplatin is instead used almost exclusively to treat colorectal and other gastrointestinal cancers. Here we utilize a unique, multi-platform genetic approach to study the mechanism of action of these clinically established platinum anti-cancer agents, as well as more recently developed cisplatin analogs. We show that oxaliplatin, unlike cisplatin and carboplatin, does not kill cells through the DNA-damage response. Rather, oxaliplatin kills cells by inducing ribosome biogenesis stress. This difference in drug mechanism explains the distinct clinical implementation of oxaliplatin relative to cisplatin, and it might enable mechanistically informed selection of distinct platinum drugs for distinct malignancies. These data highlight the functional diversity of core components of front-line cancer therapy and the potential benefits of applying a mechanism-based rationale to the use of our current arsenal of anti-cancer drugs

    An iconic language for the graphical representation of medical concepts

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many medication errors are encountered in drug prescriptions, which would not occur if practitioners could remember the drug properties. They can refer to drug monographs to find these properties, however drug monographs are long and tedious to read during consultation. We propose a two-step approach for facilitating access to drug monographs. The first step, presented here, is the design of a graphical language, called VCM.</p> <p>Methods</p> <p>The VCM graphical language was designed using a small number of graphical primitives and combinatory rules. VCM was evaluated over 11 volunteer general practitioners to assess if the language is easy to learn, to understand and to use. Evaluators were asked to register their VCM training time, to indicate the meaning of VCM icons and sentences, and to answer clinical questions related to randomly generated drug monograph-like documents, supplied in text or VCM format.</p> <p>Results</p> <p>VCM can represent the various signs, diseases, physiological states, life habits, drugs and tests described in drug monographs. Grammatical rules make it possible to generate many icons by combining a small number of primitives and reusing simple icons to build more complex ones. Icons can be organized into simple sentences to express drug recommendations. Evaluation showed that VCM was learnt in 2 to 7 hours, that physicians understood 89% of the tested VCM icons, and that they answered correctly to 94% of questions using VCM (versus 88% using text, <it>p </it>= 0.003) and 1.8 times faster (<it>p </it>< 0.001).</p> <p>Conclusion</p> <p>VCM can be learnt in a few hours and appears to be easy to read. It can now be used in a second step: the design of graphical interfaces facilitating access to drug monographs. It could also be used for broader applications, including the design of interfaces for consulting other types of medical document or medical data, or, very simply, to enrich medical texts.</p
    • …
    corecore