BUOCA: Budget-Optimized Crowd Worker Allocation
Due to concerns about human error in crowdsourcing, it is standard practice to collect labels for the same data point from multiple internet workers. Here we show that the resulting budget can be used more effectively with a flexible worker assignment strategy that asks fewer workers to analyze easy-to-label data and more workers to analyze data that requires extra scrutiny. Our main contribution is to show how the number of workers allocated to a task can be computed optimally based on task features alone, without using worker profiles. Our target tasks are delineating cells in microscopy images and analyzing the sentiment toward the 2016 U.S. presidential candidates in tweets. We first propose an algorithm that computes budget-optimized crowd worker allocation (BUOCA). We next train a machine learning system (BUOCA-ML) that predicts an optimal number of crowd workers needed to maximize the accuracy of the labeling. We show that the computed allocation can yield large savings in the crowdsourcing budget (up to 49 percentage points) while maintaining labeling accuracy. Finally, we envisage a human-machine system for performing budget-optimized data analysis at a scale beyond the feasibility of crowdsourcing.
Harnessing the power of the general public for crowdsourced business intelligence: a survey
Crowdsourced business intelligence (CrowdBI), which leverages crowdsourced user-generated data to extract useful knowledge about business and create marketing intelligence to excel in the business environment, has become a surging research topic in recent years. Compared with traditional business intelligence, which is based on firm-owned data and survey data, CrowdBI faces numerous unique issues, such as customer behavior analysis, brand tracking and product improvement, demand forecasting and trend analysis, competitive intelligence, business popularity analysis and site recommendation, and urban commercial analysis. This paper first characterizes the concept model and unique features of CrowdBI and presents a generic framework for it. It also investigates novel application areas as well as the key challenges and techniques of CrowdBI. Furthermore, we discuss future research directions for CrowdBI.