Search CORE

1,461 research outputs found

Evorus: A Crowd-powered Conversational Assistant Built to Automate Itself Over Time

Author: Banchs Rafael E
Carpenter Rollo
Constine Josh
Gasic Milica
Han Bo
Hempel Jessi
Huang Ting-Hao K.
Kamar Ece
Kamar Ece
Kenneth Huang Ting-Hao
Kenneth Huang Ting-Hao
Li Jiwei
Li Xuijun
Lynley Matthew
Newton Casey
Pennington Jeffrey
Sadun Erica
Sarma Akash Das
Ting-Hao
Wen Tsung-Hsien
Wikipedia Cleverbot
Wikipedia Tay
Witten Ian H
Zhao Tiancheng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/01/2018
Field of study

Crowd-powered conversational assistants have been shown to be more robust than automated systems, but do so at the cost of higher response latency and monetary costs. A promising direction is to combine the two approaches for high quality, low latency, and low cost solutions. In this paper, we introduce Evorus, a crowd-powered conversational assistant built to automate itself over time by (i) allowing new chatbots to be easily integrated to automate more scenarios, (ii) reusing prior crowd answers, and (iii) learning to automatically approve response candidates. Our 5-month-long deployment with 80 participants and 281 conversations shows that Evorus can automate itself without compromising conversation quality. Crowd-AI architectures have long been proposed as a way to reduce cost and latency for crowd-powered systems; Evorus demonstrates how automation can be introduced successfully in a deployed system. Its architecture allows future researchers to make further innovation on the underlying automated components in the context of a deployed open domain dialog system.Comment: 10 pages. To appear in the Proceedings of the Conference on Human Factors in Computing Systems 2018 (CHI'18

arXiv.org e-Print Archive

Crossref

When in doubt ask the crowd : leveraging collective intelligence for improving event detection and machine learning

Author: Georgescu Mihai
Publication venue: Hannover : Gottfried Wilhelm Leibniz Universität Hannover
Publication date: 01/01/2015
Field of study

[no abstract

Institutionelles Repositorium der Leibniz Universität Hannover

An Efficient Top-k Query Scheme Based on Multilayer Grouping

Author: Caixue Zhou
Guangyong Gao
Yu Gao
Zhuolin Mei
Zongda Wu
Zongmin Cui
Publication venue: 'Mechanical Engineering Faculty in Slavonski Brod'
Publication date: 01/01/2019
Field of study

The top-k query is to find the k data that has the highest scores from a candidate dataset. Sorting is a common method to find out top-k results. However, most of existing methods are not efficient enough. To remove this issue, we propose an efficient top-k query scheme based on multilayer grouping. First, we find the reference item by computing the average score of the candidate dataset. Second, we group the candidate dataset into three datasets: winner set, middle set and loser set based on the reference item. Third, we further group the winner set to the second-layer three datasets according to k value. And so on, until the data number of winner set is close to k value. Meanwhile, if k value is larger than the data number of winner set, we directly return the winner set to the user as a part of top-k results almost without sorting. In this case, we also return the top results with the highest scores from the middle set almost without sorting. Based on above innovations, we almost minimize the sorting. Experimental results show that our scheme significantly outperforms the current classical method on the performance of memory consumption and top-k query

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Answering skyline queries over incomplete data with crowdsourcing (Extended Abstract)

Author: Chen Lu
Gao Yunjun
Guo Su
Li Qing
Miao Xiaoye
Yin Jianwei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/04/2020
Field of study

VBN

Crowdsourcing in Computer Vision

Author: Fei-Fei Li
Grauman Kristen
Kovashka Adriana
Russakovsky Olga
Publication venue: 'Now Publishers'
Publication date: 01/01/2016
Field of study

Computer vision systems require large amounts of manually annotated data to properly learn challenging visual concepts. Crowdsourcing platforms offer an inexpensive method to capture human knowledge and understanding, for a vast number of visual perception tasks. In this survey, we describe the types of annotations computer vision researchers have collected using crowdsourcing, and how they have ensured that this data is of high quality while annotation effort is minimized. We begin by discussing data collection on both classic (e.g., object recognition) and recent (e.g., visual story-telling) vision tasks. We then summarize key design decisions for creating effective data collection interfaces and workflows, and present strategies for intelligently selecting the most important data instances to annotate. Finally, we conclude with some thoughts on the future of crowdsourcing in computer vision.Comment: A 69-page meta review of the field, Foundations and Trends in Computer Graphics and Vision, 201

arXiv.org e-Print Archive

Crossref

Retrieve-and-Read: Multi-task Learning of Information Retrieval and Reading Comprehension

Author: Asano Hisako
Nishida Kyosuke
Otsuka Atsushi
Saito Itsumi
Tomita Junji
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 31/08/2018
Field of study

This study considers the task of machine reading at scale (MRS) wherein, given a question, a system first performs the information retrieval (IR) task of finding relevant passages in a knowledge source and then carries out the reading comprehension (RC) task of extracting an answer span from the passages. Previous MRS studies, in which the IR component was trained without considering answer spans, struggled to accurately find a small number of relevant passages from a large set of passages. In this paper, we propose a simple and effective approach that incorporates the IR and RC tasks by using supervised multi-task learning in order that the IR component can be trained by considering answer spans. Experimental results on the standard benchmark, answering SQuAD questions using the full Wikipedia as the knowledge source, showed that our model achieved state-of-the-art performance. Moreover, we thoroughly evaluated the individual contributions of our model components with our new Japanese dataset and SQuAD. The results showed significant improvements in the IR task and provided a new perspective on IR for RC: it is effective to teach which part of the passage answers the question rather than to give only a relevance score to the whole passage.Comment: 10 pages, 6 figure. Accepted as a full paper at CIKM 201

arXiv.org e-Print Archive

Crossref

Recommended from our members

Robust Algorithms for Clustering with Applications to Data Integration

Author: Galhotra Sainyam
Publication venue: ScholarWorks@UMass Amherst
Publication date: 20/10/2021
Field of study

A growing number of data-based applications are used for decision-making that have far-reaching consequences and significant societal impact. Entity resolution, community detection and taxonomy construction are some of the building blocks of these applications and for these methods, clustering is the fundamental underlying concept. Therefore, the use of accurate, robust and scalable methods for clustering cannot be overstated. We tackle the various facets of clustering with a multi-pronged approach described below. 1. While identification of clusters that refer to different entities is challenging for automated strategies, it is relatively easy for humans. We study the robustness of clustering methods that leverage supervision through an oracle i.e an abstraction of crowdsourcing. Additionally, we focus on scalability to handle web-scale datasets. 2. In community detection applications, a common setback in evaluation of the quality of clustering techniques is the lack of ground truth data. We propose a generative model that considers dependent edge formation and devise techniques for efficient cluster recovery

ScholarWorks@UMass Amherst