8 research outputs found

    How clumpy is my image? Evaluating crowdsourced annotation tasks

    Get PDF
    13th UK Workshop on Computational Intelligence (UKCI), Guildford, UK, 9-11 September 2013This is the author accepted manuscript. The final version is available from the publisher via the DOI in this record.The use of citizen science to obtain annotations from multiple annotators has been shown to be an effective method for annotating datasets in which computational methods alone are not feasible. The way in which the annotations are obtained is an important consideration which affects the quality of the resulting consensus estimates. In this paper, we examine three separate approaches to obtaining scores for instances rather than merely classifications. To obtain a consensus score annotators were asked to make annotations in one of three paradigms: classification, scoring and ranking. A web-based citizen science experiment is described which implements the three approaches as crowdsourced annotation tasks. The tasks are evaluated in relation to the accuracy and agreement among the participants using both simulated and real-world data from the experiment. The results show a clear difference in performance between the three tasks, with the ranking task obtaining the highest accuracy and agreement among the participants. We show how a simple evolutionary optimiser may be used to improve the performance by reweighting the importance of annotators

    How clumpy is my image?: Scoring in crowdsourced annotation tasks

    Get PDF
    The use of citizen science to obtain annotations from multiple annotators has been shown to be an effective method for annotating datasets in which computational methods alone are not feasible. The way in which the annotations are obtained is an important consideration which affects the quality of the resulting consensus annotation. In this paper, we examine three separate approaches to obtaining consensus scores for instances rather than merely binary classifications. To obtain a consensus score, annotators were asked to make annotations in one of three paradigms: classification, scoring and ranking. A web-based citizen science experiment is described which implements the three approaches as crowdsourced annotation tasks. The tasks are evaluated in relation to the accuracy and agreement among the participants using both simulated and real-world data from the experiment. The results show a clear difference in performance between the three tasks, with the ranking task obtaining the highest accuracy and agreement among the participants. We show how a simple evolutionary optimiser may be used to improve the performance by reweighting the importance of annotators

    Task workflow design and its impact on performance and volunteers' subjective preference in virtual citizen science

    Get PDF
    Virtual citizen science platforms allow non-scientists to take part in scientific research across a range of disciplines. What they ask of volunteers varies considerably in terms of task type, variety, user judgement required and user freedom, which has received little direct investigation. A study was performed with the Planet Four: Craters project to investigate the effect of task workflow design on both volunteer experience and the scientific results they produce. Participants' feedback through questionnaire responses indicated a preference for interfaces providing greater autonomy and variety, with free-text responses suggesting that autonomy was the more important. This did not translate into improved performance however, with the most autonomous interface not resulting in significantly better performance in data volume, agreement or accuracy compared to other less autonomous interfaces. The interface with the least number of task types, variety and autonomy resulted in the greatest data coverage. Agreement, both between participants and with the expert equivalent, was significantly improved when the interface most directly afforded tasks that captured the required underlying data (i.e. crater position or diameter). The implications for the designers of virtual citizen science platforms is that they have a balancing act to perform, weighing up the importance of user satisfaction, the data needs of the science case and the resources that can be committed both in terms of time and data reduction

    Geoinformatics in Citizen Science

    Get PDF
    The book features contributions that report original research in the theoretical, technological, and social aspects of geoinformation methods, as applied to supporting citizen science. Specifically, the book focuses on the technological aspects of the field and their application toward the recruitment of volunteers and the collection, management, and analysis of geotagged information to support volunteer involvement in scientific projects. Internationally renowned research groups share research in three areas: First, the key methods of geoinformatics within citizen science initiatives to support scientists in discovering new knowledge in specific application domains or in performing relevant activities, such as reliable geodata filtering, management, analysis, synthesis, sharing, and visualization; second, the critical aspects of citizen science initiatives that call for emerging or novel approaches of geoinformatics to acquire and handle geoinformation; and third, novel geoinformatics research that could serve in support of citizen science

    Characterizing Novelty as a Motivator in Online Citizen Science

    Get PDF
    Citizen science projects rely on the voluntary contribution of nonscientists to take part in scientific research projects. Projects taking place exclusively over the Internet face significant challenges, chief among them is the attracting and keeping the critical mass of volunteers needed to conduct the work outlined by the science team. The extent to which platforms can design experiences that positively influence volunteers’ motivation can help address the contribution challenges. Consequently, project organizers need to develop strategies to attract new participants and keep existing ones. One strategy to encourage participation is implementing features, which re-enforce motives known to change people’s attitudes towards contributing positively. The literature in psychology noted that novelty is an attribute of objects and environments that occasion curiosity in humans leading to exploratory behaviors, e.g., prolonged engagement with the object or environment. This dissertation described the design, implementation, and evaluation of an experiment conducted in three online citizen science projects. Volunteers received novelty cues when they classified data objects that no other volunteer had previously seen. The hypothesis was that exposure to novelty cues while classifying data positively influences motivational attitudes leading to increased engagement in the classification task and increased retention. The experiments resulted in mixed results. In some projects, novelty cues were universally salient, and in other projects, novelty cues had no significant impact on volunteers’ contribution behaviors. The results, while mixed, are promising since differences in the observed behaviors arise because of individual personality differences and the unique attributes found in each project setting. This research contributes to empirically grounded studies on motivation in citizen science with analyses that produce new insights and questions into the functioning of novelty and its impact on volunteers’ behaviors

    Supporting Exploratory Search Tasks Through Alternative Representations of Information

    Get PDF
    Information seeking is a fundamental component of many of the complex tasks presented to us, and is often conducted through interactions with automated search systems such as Web search engines. Indeed, the ubiquity of Web search engines makes information so readily available that people now often turn to the Web for all manners of information seeking needs. Furthermore, as the range of online information seeking tasks grows, more complex and open-ended search activities have been identified. One type of complex search activities that is of increasing interest to researchers is exploratory search, where the goal involves "learning" or "investigating", rather than simply "looking-up". Given the massive increase in information availability and the use of online search for tasks beyond simply looking-up, researchers have noted that it becomes increasingly challenging for users to effectively leverage the available online information for complex and open-ended search activities. One of the main limitations of the current document retrieval paradigm offered by modern search engines is that it provides a ranked list of documents as a response to the searcher’s query with no further support for locating and synthesizing relevant information. Therefore, the searcher is left to find and make sense of useful information in a massive information space that lacks any overview or conceptual organization. This thesis explores the impact of alternative representations of search results on user behaviors and outcomes during exploratory search tasks. Our inquiry is inspired by the premise that exploratory search tasks require sensemaking, and that sensemaking involves constructing and interacting with representations of knowledge. As such, in order to provide the searchers with more support in performing exploratory activities, there is a need to move beyond the current document retrieval paradigm by extending the support for locating and externalizing semantic information from textual documents and by providing richer representations of the extracted information coupled with mechanisms for accessing and interacting with the information in ways that support exploration and sensemaking. This dissertation presents a series of discrete research endeavour to explore different aspects of providing information and presenting this information in ways that both extraction and assimilation of relevant information is supported. We first address the problem of extracting information – that is more granular than documents – as a response to a user's query by developing a novel information extraction system to represent documents as a series of entity-relationship tuples. Next, through a series of designing and evaluating alternative representations of search results, we examine how this extracted information can be represented such that it extends the document-based search framework's support for exploratory search tasks. Finally, we assess the ecological validity of this research by exploring error-prone representations of search results and how they impact a searcher's ability to leverage our representations to perform exploratory search tasks. Overall, this research contributes towards designing future search systems by providing insights into the efficacy of alternative representations of search results for supporting exploratory search activities, culminating in a novel hybrid representation called Hierarchical Knowledge Graphs (HKG). To this end we propose and develop a framework that enables a reliable investigation of the impact of different representations and how they are perceived and utilized by information seekers

    Crowdview: uma plataforma crowdsourcing para gerenciamento temporal de entidades

    Get PDF
    The web generates a lot of data about the same real-world object over time. In addition to this volume, the variety with which these data are presented grows substantially. In many cases this data is described in unstructured forms such as videos, images and texts. Algorithms for extracting data from unstructured forms are not yet accurate. In this case, it is appropriate to use the crowd to solve these tasks. In this context, crowdsourcing emerges as a paradigm shift in which the crowd, through open calls, provides solutions to specific problems. It can perform tasks distributed in different stages such as data collection, treatment, processing and analysis. These crowdsourcing systems produce a lot of data in a short amount of time. These initiatives also reduce time, operational costs and improve decision making. Given this scenario, the proposal in this doctoral thesis is the development of a crowdsourcing approach to extract and manage temporal characteristics of a real world object from unstructured information. This approach was implemented in a computer system called CrowdView. The case study is applied to the temporal management of urban forms. The analysis of the changes in characteristics in a chronological context can support decision making on the curation of urban forms within management of urban space.Também disponível on-line.A web gera uma grande quantidade de dados sobre um mesmo objeto do mundo real ao longo do tempo. Além deste volume, a variedade com que estes dados são apresentados cresce substancialmente. Em muitos casos estes dados são descritos em formas não estruturadas como vídeos, imagens e textos. Os algoritmos para extração de dados de formas não estruturadas ainda não são precisos. Neste caso é apropriado o uso da multidão para a resolução destas tarefas. Neste contexto, crowdsourcing surge como uma mudança de paradigma no qual a multidão, através de chamadas abertas, passa a prover soluções para problemas específicos. Sua participação é concretizada através da realização de tarefas distribuídas em diferentes etapas como coleta, tratamento, processamento e análise dos dados. Estes sistemas de crowdsourcing produzem uma grande quantidade de dados em um curto espaço de tempo. Estas iniciativas também reduzem tempo, custos operacionais e melhoram a tomada de decisões. Diante deste cenário, a proposta nesta tese de doutorado é o desenvolvimento de uma abordagem crowdsourcing para extrair e gerenciar características temporais de um objeto do mundo real a partir de informações não estruturadas. Esta abordagem foi implementada em um sistema computacional chamado CrowdView. O estudo de caso é aplicado ao gerenciamento temporal de formas urbanas. A análise das mudanças das características em um contexto cronológico pode apoiar a tomada de decisões sobre a curadoria dessas formas urbanas dentro da gestão do espaço urbano
    corecore