1,355 research outputs found
A Wikipedia Literature Review
This paper was originally designed as a literature review for a doctoral
dissertation focusing on Wikipedia. This exposition gives the structure of
Wikipedia and the latest trends in Wikipedia research
Making Math Searchable in Wikipedia
Wikipedia, the world largest encyclopedia contains a lot of knowledge that is
expressed as formulae exclusively. Unfortunately, this knowledge is currently
not fully accessible by intelligent information retrieval systems. This immense
body of knowledge is hidden form value-added services, such as search. In this
paper, we present our MathSearch implementation for Wikipedia that enables
users to perform a combined text and fully unlock the potential benefits.Comment: 7 pages, 2 figures, Conference on Intelligent Computer Mathematics,
July 9-14 2012, Bremen, Germany. To be published in Lecture Notes, Artificial
Intelligence, Springe
WikiLinkGraphs: A Complete, Longitudinal and Multi-Language Dataset of the Wikipedia Link Networks
Wikipedia articles contain multiple links connecting a subject to other pages
of the encyclopedia. In Wikipedia parlance, these links are called internal
links or wikilinks. We present a complete dataset of the network of internal
Wikipedia links for the largest language editions. The dataset contains
yearly snapshots of the network and spans years, from the creation of
Wikipedia in 2001 to March 1st, 2018. While previous work has mostly focused on
the complete hyperlink graph which includes also links automatically generated
by templates, we parsed each revision of each article to track links appearing
in the main text. In this way we obtained a cleaner network, discarding more
than half of the links and representing all and only the links intentionally
added by editors. We describe in detail how the Wikipedia dumps have been
processed and the challenges we have encountered, including the need to handle
special pages such as redirects, i.e., alternative article titles. We present
descriptive statistics of several snapshots of this network. Finally, we
propose several research opportunities that can be explored using this new
dataset.Comment: 10 pages, 3 figures, 7 tables, LaTeX. Final camera-ready version
accepted at the 13TH International AAAI Conference on Web and Social Media
(ICWSM 2019) - Munich (Germany), 11-14 June 201
Bringing Web Time Travel to MediaWiki: An Assessment of the Memento MediaWiki Extension
We have implemented the Memento MediaWiki Extension Version 2.0, which brings
the Memento Protocol to MediaWiki, used by Wikipedia and the Wikimedia
Foundation. Test results show that the extension has a negligible impact on
performance. Two 302 status code datetime negotiation patterns, as defined by
Memento, have been examined for the extension: Pattern 1.1, which requires 2
requests, versus Pattern 2.1, which requires 3 requests. Our test results and
mathematical review find that, contrary to intuition, Pattern 2.1 performs
better than Pattern 1.1 due to idiosyncrasies in MediaWiki. In addition to
implementing Memento, Version 2.0 allows administrators to choose the optional
200-style datetime negotiation Pattern 1.2 instead of Pattern 2.1. It also
permits administrators the ability to have the Memento MediaWiki Extension
return full HTTP 400 and 500 status codes rather than using standard MediaWiki
error pages. Finally, version 2.0 permits administrators to turn off
recommended Memento headers if desired. Seeing as much of our work focuses on
producing the correct revision of a wiki page in response to a user's datetime
input, we also examine the problem of finding the correct revisions of the
embedded resources, including images, stylesheets, and JavaScript; identifying
the issues and discussing whether or not MediaWiki must be changed to support
this functionality.Comment: 23 pages, 18 figures, 9 tables, 17 listing
Building Collaborative Capacities in Learners: The M/cyclopedia Project Revisited
In this paper we trace the evolution of a project using a wiki-based learning environment in a tertiary education setting. The project has the pedagogical goal of building learners’ capacities to work effectively in the networked, collaborative, creative environments of the knowledge economy. The paper explores the four key characteristics of a ‘produsage’ environment and identifies four strategic capacities that need to be developed in learners to be effective ‘produsers’ (user-producers). A case study is presented of our experiences with the subject New Media Technologies, run at Queensland University of Technology, Brisbane, Australia. This progress report updates our observations made at the 2005 WikiSym conference
Bots in Wikipedia: Unfolding their duties
The success of crowdsourcing systems such as Wikipedia relies on people participating in these systems. However, in this research we reveal to what extent human and machine intelligence is combined to carry out semi-automatic workflows of complex tasks. In Wikipedia, bots are used to realize such combination of human-machine intelligence. We provide an extensive overview on various edit types bots carry out in this regard through the analysis of 1,639 approved task requests. We classify existing tasks by an action-object-pair structure and reveal existing differences in their probability of occurrence depending on the investigated work context. In the context of community services, bots mainly create reports, whereas in the area of guidelines or policies bots are mostly responsible for adding templates to pages. Moreover, the analysis of existing bot tasks revealed insights that suggest general reasons, why Wikipedia’s editor community uses bots as well as approaches, how they organize machine tasks to provide a sustainable service. We conclude by discussing how these insights can prepare the foundation for further research
Embedding R in the Mediawiki
Teaching statistics to students in our area of economics and educational science often brings about the problem that students have either forgottentheir statistical knowledge, or have taken different classes than the ones we offer in basic statistics. We therefore need some kind of statistical dictionary where we, as teachers, can refer to a common base and where students can look up specific terms. The Wikipedia - a general online encyclopaedia - compelled us to use a wiki for our dictionary. While the Wikipedia contains a large number of statistical terms, these are often too long and detailed to be visual displayed in lectures very well and some more specific terms are not included.R, wiki, Mediawiki
Connecting every bit of knowledge: The Structure of Wikipedia’s first link network
Apples, porcupines, and the most obscure Bob Dylan song\u27is every topic a few clicks from Philosophy? Within Wikipedia, the surprising answer is yes: nearly all paths lead to Philosophy. Wikipedia is the largest, most meticulously indexed collection of human knowledge ever amassed. More than information about a topic, Wikipedia is a web of naturally emerging relationships. By following the first link in each article, we algorithmically construct a directed network of all 4.7 million articles: Wikipedia\u27s First Link Network.
Here we study the English edition of Wikipedia\u27s First Link Network for insight into how the many inventions, places, people, objects, and events are related and organized. We traverse every path, measuring the accumulation of first links, path lengths, basins, cycles, and the influence each article exerts in shaping the network. We discover scale-free distributions describe path length, accumulation, and influence. Far from dispersed, first links disproportionately accumulate at a few articles\u27flowing from specific to general and culminating around fundamental notions such as Community, State, and Science. Philosophy shapes more paths than any other article by two orders of magnitude. Curiously, we also observe a gravitation towards topical articles such as Health Care and Fossil Fuel. These findings enrich our view of the connections and structure of Wikipedia\u27s ever growing store of knowledge
Recommended from our members
The afterlife of 'living deliverables': angels or zombies?
Within the STELLAR project, we provide the possibility to use living documents for the collaborative writing work on deliverables. Compared to 'normal' deliverables, 'living' deliverables come into existence much earlier than their delivery deadline and are expected to 'live on' after their official delivery to the European Commission. They are expected to foster collaboration. Within this contribution we investigate, how these deliverables have been used over the first 16 months of the project. We therefore propose a set of new analysis methods facilitating social network analysis on publicly available revision history data. With this instrumentarium, we critically look at whether the living deliverables have been successfully used for collaboration and whether their 'afterlife' beyond the contractual deadline had turned them into 'zombies' (still visible, but no or little live editing activities). The results show that the observed deliverables show signs of life, but often in connection with a topical change and in conjunction with changes in the pattern of collaboration
Assessing Educational Research -- An Information Service for Monitoring a Heterogeneous Research Field
The paper presents a web prototype that visualises different characteristics
of research projects in the heterogeneous domain of educational research. The
concept of the application derives from the project "Monitoring Educational
Research" (MoBi) that aims at identifying and implementing indicators that
adequately describe structural properties and dynamics of the research field.
The prototype enables users to visualise data regarding different indicators,
e.g. "research activity", "funding", "qualification project", "disciplinary
area". Since the application is based on Semantic MediaWikitechnology it
furthermore provides an easily accessible opportunity to collaboratively work
on a database of research projects. Users can jointly and in a semantically
controlled way enter metadata on research projects which are the basis for the
computation and visualisation of indicators.Comment: 8 pages, 10 figures, Libraries in the digital age (LIDA) 2014
conferenc
- …