3,049 research outputs found
Crowdsourcing in Computer Vision
Computer vision systems require large amounts of manually annotated data to
properly learn challenging visual concepts. Crowdsourcing platforms offer an
inexpensive method to capture human knowledge and understanding, for a vast
number of visual perception tasks. In this survey, we describe the types of
annotations computer vision researchers have collected using crowdsourcing, and
how they have ensured that this data is of high quality while annotation effort
is minimized. We begin by discussing data collection on both classic (e.g.,
object recognition) and recent (e.g., visual story-telling) vision tasks. We
then summarize key design decisions for creating effective data collection
interfaces and workflows, and present strategies for intelligently selecting
the most important data instances to annotate. Finally, we conclude with some
thoughts on the future of crowdsourcing in computer vision.Comment: A 69-page meta review of the field, Foundations and Trends in
Computer Graphics and Vision, 201
Collective intelligence: aggregation of information from neighbors in a guessing game
Complex systems show the capacity to aggregate information and to display
coordinated activity. In the case of social systems the interaction of
different individuals leads to the emergence of norms, trends in political
positions, opinions, cultural traits, and even scientific progress. Examples of
collective behavior can be observed in activities like the Wikipedia and Linux,
where individuals aggregate their knowledge for the benefit of the community,
and citizen science, where the potential of collectives to solve complex
problems is exploited. Here, we conducted an online experiment to investigate
the performance of a collective when solving a guessing problem in which each
actor is endowed with partial information and placed as the nodes of an
interaction network. We measure the performance of the collective in terms of
the temporal evolution of the accuracy, finding no statistical difference in
the performance for two classes of networks, regular lattices and random
networks. We also determine that a Bayesian description captures the behavior
pattern the individuals follow in aggregating information from neighbors to
make decisions. In comparison with other simple decision models, the strategy
followed by the players reveals a suboptimal performance of the collective. Our
contribution provides the basis for the micro-macro connection between
individual based descriptions and collective phenomena.Comment: 9 pages, 9 figure
A hermeneutic inquiry into user-created personas in different Namibian locales
Persona is a tool broadly used in technology design to support communicational interactions between designers and users. Different Persona types and methods have evolved mostly in the Global North, and been partially deployed in the Global South every so often in its original User-Centred Design methodology. We postulate persona conceptualizations are expected to differ across cultures. We demonstrate this with an exploratory-case study on user-created persona co-designed with four Namibian ethnic groups: ovaHerero, Ovambo, ovaHimba and Khoisan. We follow a hermeneutic inquiry approach to discern cultural nuances from diverse human conducts. Findings reveal diverse self-representations whereby for each ethnic group results emerge in unalike fashions, viewpoints, recounts and storylines. This paper ultimately argues User-Created Persona as a potentially valid approach for pursuing cross-cultural depictions of personas that communicate cultural features and user experiences paramount to designing acceptable and gratifying technologies in dissimilar locales
Salient object subitizing
We study the problem of salient object subitizing, i.e. predicting the existence and the number of salient objects in an image using holistic cues. This task is inspired by the ability of people to quickly and accurately identify the number of items within the subitizing range (1–4). To this end, we present a salient object subitizing image dataset of about 14 K everyday images which are annotated using an online crowdsourcing marketplace. We show that using an end-to-end trained convolutional neural network (CNN) model, we achieve prediction accuracy comparable to human performance in identifying images with zero or one salient object. For images with multiple salient objects, our model also provides significantly better than chance performance without requiring any localization process. Moreover, we propose a method to improve the training of the CNN subitizing model by leveraging synthetic images. In experiments, we demonstrate the accuracy and generalizability of our CNN subitizing model and its applications in salient object detection and image retrieval.This research was supported in part by US NSF Grants 0910908 and 1029430, and gifts from Adobe and NVIDIA. (0910908 - US NSF; 1029430 - US NSF)https://arxiv.org/abs/1607.07525https://arxiv.org/pdf/1607.07525.pdfAccepted manuscrip
Mouse Behavior Recognition with The Wisdom of Crowd
In this thesis, we designed and implemented a crowdsourcing system to annotatemouse behaviors in videos; this involves the development of a novel clip-based video labeling tools, that is more efficient than traditional labeling tools in crowdsourcing platform, as well as the design of probabilistic inference algorithms that predict the true labels and the workers' expertise from multiple workers' responses. Our algorithms are shown to perform better than majority vote heuristic. We also carried out extensive experiments to determine the effectiveness of our labeling tool, inference algorithms and the overall system
- …