3,217 research outputs found

    Measuring internet activity: a (selective) review of methods and metrics

    Get PDF
    Two Decades after the birth of the World Wide Web, more than two billion people around the world are Internet users. The digital landscape is littered with hints that the affordances of digital communications are being leveraged to transform life in profound and important ways. The reach and influence of digitally mediated activity grow by the day and touch upon all aspects of life, from health, education, and commerce to religion and governance. This trend demands that we seek answers to the biggest questions about how digitally mediated communication changes society and the role of different policies in helping or hindering the beneficial aspects of these changes. Yet despite the profusion of data the digital age has brought upon us—we now have access to a flood of information about the movements, relationships, purchasing decisions, interests, and intimate thoughts of people around the world—the distance between the great questions of the digital age and our understanding of the impact of digital communications on society remains large. A number of ongoing policy questions have emerged that beg for better empirical data and analyses upon which to base wider and more insightful perspectives on the mechanics of social, economic, and political life online. This paper seeks to describe the conceptual and practical impediments to measuring and understanding digital activity and highlights a sample of the many efforts to fill the gap between our incomplete understanding of digital life and the formidable policy questions related to developing a vibrant and healthy Internet that serves the public interest and contributes to human wellbeing. Our primary focus is on efforts to measure Internet activity, as we believe obtaining robust, accurate data is a necessary and valuable first step that will lead us closer to answering the vitally important questions of the digital realm. Even this step is challenging: the Internet is difficult to measure and monitor, and there is no simple aggregate measure of Internet activity—no GDP, no HDI. In the following section we present a framework for assessing efforts to document digital activity. The next three sections offer a summary and description of many of the ongoing projects that document digital activity, with two final sections devoted to discussion and conclusions

    Network Formation in the Political Blogosphere. An Application of Agent Based Simulation and e-Research Tools

    Get PDF
    The political blogosphere has recently been the focus of attention for social network analysis and applications of network and graph theory. In a recent paper, Adamic and Glance (2005) report differences between the linking behavior of politically conservative vs. politically liberal Web bloggers. We construct a simple agent-based network formation model which shows that one such difference, demonstrating what we term ‘political homophily’, can be generated by connecting the blogosphere to the underlying population distribution of political preferences. The model is implemented as a web service in the e-tool VOSON (Virtual Observatory for the Study of Online Networks), and both model and tool serve to define a natural environment for research into link formation behavior with large numbers of heterogeneous network participants.Network formation, Social network analysis, Blogosphere, VOSON, Agentbased simulation

    Doing blog research: the computational turn

    Get PDF
    Blogs and other online platforms for personal writing such as LiveJournal have been of interest to researchers across the social sciences and humanities for a decade now. Although growth in the uptake of blogging has stalled somewhat since the heyday of blogs in the early 2000s, blogging continues to be a major genre of Internet-based communication. Indeed, at the same time that mass participation has moved on to Facebook, Twitter, and other more recent communication phenomena, what has been left behind by the wave of mass adoption is a slightly smaller but all the more solidly established blogosphere of engaged and committed participants. Blogs are now an accepted part of institutional, group, and personal communications strategies (Bruns and Jacobs, 2006); in style and substance, they are situated between the more static information provided by conventional Websites and Webpages and the continuous newsfeeds provided through Facebook and Twitter updates. Blogs provide a vehicle for authors (and their commenters) to think through given topics in the space of a few hundred to a few thousand words – expanding, perhaps, on shorter tweets, and possibly leading to the publication of more fully formed texts elsewhere. Additionally, they are also a very flexible medium: they readily provide the functionality to include images, audio, video, and other additional materials – as well as the fundamental tool of blogging, the hyperlink itself. This chapter appeared in the Sage collection Research Methods & Methodologies in Education edited by James Arthur, Michael Waring, Robert Coe, and Larry V. Hedges. This version is a pre-print edition of the chapter

    Towards a hyperlinked society : a critical review of link studies

    Full text link
    The hyperlink is a fundamental feature of the web. This paper investigates how hyperlinks have been used as research objects in social sciences. Reviewing a body of literature belonging to sociology, political sciences, information sciences, geography or media studies, it particularly reflects on the study of hyperlinks as indicators of other social phenomena. Why are links counted and hyperlink networks measured? How are links interpreted? The paper then focuses on barriers and limitations to the study of links. It addresses the issue of unobtrusiveness, the importance of interpreting links in context, and the possibilities of large-scale, automatic link studies. We finally argue that beyond the apparent diversity and ad hoc methodologies that the reviewed studies propose, a unified framework exists. It combines quantitative link counts, qualitative inquiries and valuation of field expertise to support link interpretation

    The case against the democratic influence of the internet on journalism

    Get PDF
    Book synopsis: Web Journalism: A New Form of Citizenship provides a much-needed analytical account of the implications of interactive participation in the construction of media content. Although web journalism is a fast-changing technology this book will have sustained appeal to an international readership by seeking to critically assess Internet news production. … With the rise of blogging and citizen journalism, it is a commonplace to observe that interactive participatory media are transforming the relationship between the traditional professional media and their audience. A current, popular, assumption is that the traditional flow of information from media to citizen is being reformed into a democratic dialogue between members of a community. The editors and contributors analyse and debate this assumption through international case studies that include the United Kingdom and United States. … While the text has been written and designed for undergraduate and postgraduate use, Web Journalism: A New Form of Citizenship? will be of use and of interest to all those engaged in the debate over Web reporting and citizen journalism

    BlogForever D2.6: Data Extraction Methodology

    Get PDF
    This report outlines an inquiry into the area of web data extraction, conducted within the context of blog preservation. The report reviews theoretical advances and practical developments for implementing data extraction. The inquiry is extended through an experiment that demonstrates the effectiveness and feasibility of implementing some of the suggested approaches. More specifically, the report discusses an approach based on unsupervised machine learning that employs the RSS feeds and HTML representations of blogs. It outlines the possibilities of extracting semantics available in blogs and demonstrates the benefits of exploiting available standards such as microformats and microdata. The report proceeds to propose a methodology for extracting and processing blog data to further inform the design and development of the BlogForever platform

    Link Prediction via Matrix Completion

    Full text link
    Inspired by practical importance of social networks, economic networks, biological networks and so on, studies on large and complex networks have attracted a surge of attentions in the recent years. Link prediction is a fundamental issue to understand the mechanisms by which new links are added to the networks. We introduce the method of robust principal component analysis (robust PCA) into link prediction, and estimate the missing entries of the adjacency matrix. On one hand, our algorithm is based on the sparsity and low rank property of the matrix, on the other hand, it also performs very well when the network is dense. This is because a relatively dense real network is also sparse in comparison to the complete graph. According to extensive experiments on real networks from disparate fields, when the target network is connected and sufficiently dense, whatever it is weighted or unweighted, our method is demonstrated to be very effective and with prediction accuracy being considerably improved comparing with many state-of-the-art algorithms
    corecore