On Cognitive Preferences and the Plausibility of Rule-based Models
It is conventional wisdom in machine learning and data mining that logical
models such as rule sets are more interpretable than other models, and that
among such rule-based models, simpler models are more interpretable than more
complex ones. In this position paper, we question this latter assumption by
focusing on one particular aspect of interpretability, namely the plausibility
of models. Roughly speaking, we equate the plausibility of a model with the
likelihood that a user accepts it as an explanation for a prediction. In
particular, we argue that, all other things being equal, longer explanations
may be more convincing than shorter ones, and that the predominant bias for
shorter models, which is typically necessary for learning powerful
discriminative models, may not be suitable when it comes to user acceptance of
the learned models. To that end, we first recapitulate evidence for and against
this postulate, and then report the results of an evaluation in a
crowd-sourcing study based on about 3,000 judgments. The results do not reveal
a strong preference for simple rules, whereas we can observe a weak preference
for longer rules in some domains. We then relate these results to well-known
cognitive biases such as the conjunction fallacy, the representativeness
heuristic, or the recognition heuristic, and investigate their relation to rule
length and plausibility.

Comment: V4: Another rewrite of section on interpretability to clarify focus
on plausibility and its relation to interpretability, comprehensibility, and
justifiability
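The conjunction fallacy discussed above has a simple probabilistic core: a conjunction of conditions can never be more probable than either condition alone, even though people often judge the more specific (longer) description as more plausible. A minimal numerical sketch, with probabilities invented purely for illustration:

```python
# Illustrative probabilities (invented for this sketch): think of a rule's
# conditions as events A and B over the instances it covers.
p_a = 0.4          # P(condition A holds)
p_b_given_a = 0.5  # P(condition B holds, given A holds)

# The conjunction rule of probability: P(A and B) = P(A) * P(B | A),
# which can never exceed P(A). Judging the longer rule "A and B" as
# more likely than "A" alone is exactly the conjunction fallacy.
p_a_and_b = p_a * p_b_given_a
assert p_a_and_b <= p_a  # 0.2 <= 0.4
```

The tension the abstract points to is that longer rules are mathematically less likely to hold, yet may still be judged more convincing by users.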
Crowdsourcing as a way to access external knowledge for innovation
This paper focuses on “crowdsourcing” as a significant trend in the new paradigm of open innovation (Chesbrough 2006; Chesbrough & Appleyard 2007). Crowdsourcing conveys the idea of opening R&D processes to “the crowd” through a web 2.0 infrastructure. Based on two case studies of crowdsourcing web startups (Wilogo and CrowdSpirit), the paper aims to build a framework to characterize and interpret the tension between value creation by a community and value capture by a private economic actor. Contributing to the discussions on “hybrid organizational forms” in organizational studies (Bruce & Jordan 2007), the analysis examines how these new models combine various forms of relationships and exchanges (market or non-market). It describes how crowdsourcing conveys new patterns of control, incentives and co-ordination mechanisms.

Keywords: community; crowdsourcing; innovation; hybrid organizational forms; platform; web 2.0
Genie: A Generator of Natural Language Semantic Parsers for Virtual Assistant Commands
To understand diverse natural language commands, virtual assistants today are
trained with numerous labor-intensive, manually annotated sentences. This paper
presents a methodology and the Genie toolkit that can handle new compound
commands with significantly less manual effort. We advocate formalizing the
capability of virtual assistants with a Virtual Assistant Programming Language
(VAPL) and using a neural semantic parser to translate natural language into
VAPL code. Genie needs only a small realistic set of input sentences for
validating the neural model. Developers write templates to synthesize data;
Genie uses crowdsourced paraphrases and data augmentation, along with the
synthesized data, to train a semantic parser. We also propose design principles
that make VAPL languages amenable to natural language translation. We apply
these principles to revise ThingTalk, the language used by the Almond virtual
assistant. We use Genie to build the first semantic parser that can support
compound virtual assistant commands with unquoted free-form parameters. Genie
achieves a 62% accuracy on realistic user inputs. We demonstrate Genie's
generality by showing a 19% and 31% improvement over the previous state of the
art on a music skill, aggregate functions, and access control.

Comment: To appear in PLDI 2019
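The template-driven synthesis step described in the abstract can be sketched as follows. This is a hypothetical illustration, not Genie's actual template grammar: the template patterns, slot values, and VAPL-like code strings below are invented, and real ThingTalk syntax differs.

```python
import itertools

# Hypothetical templates (invented for illustration): each pairs a
# natural-language pattern with a VAPL-like code skeleton.
TEMPLATES = [
    ("play {song} on {service}",
     'now => @{service}.play(song="{song}");'),
    ("remind me to {task} at {time}",
     'attimer(time={time}) => @builtin.remind(task="{task}");'),
]

# Invented slot fillers; a real system would draw these from parameter
# databases and crowdsourced paraphrases.
SLOT_VALUES = {
    "song": ["Hey Jude", "Yesterday"],
    "service": ["spotify"],
    "task": ["water the plants"],
    "time": ["9am"],
}

def synthesize(templates, slot_values):
    """Expand each template with every combination of slot values,
    yielding (sentence, code) pairs to train a semantic parser."""
    pairs = []
    for pattern, code in templates:
        slots = [s for s in slot_values if "{" + s + "}" in pattern]
        for combo in itertools.product(*(slot_values[s] for s in slots)):
            mapping = dict(zip(slots, combo))
            pairs.append((pattern.format(**mapping), code.format(**mapping)))
    return pairs

pairs = synthesize(TEMPLATES, SLOT_VALUES)
# e.g. ("play Hey Jude on spotify", 'now => @spotify.play(song="Hey Jude");')
```

The point of the sketch is only the shape of the pipeline: a few developer-written templates fan out combinatorially into many synthetic training pairs, which crowdsourced paraphrases and augmentation then diversify.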
Crowdsourced network measurements: Benefits and best practices
Network measurements are of high importance both for the operation of networks and for the design and evaluation of new management mechanisms. Therefore, several approaches exist for running network measurements, ranging from analyzing live traffic traces from campus or Internet Service Provider (ISP) networks to performing active measurements on distributed testbeds, e.g., PlanetLab, or involving volunteers. However, each method falls short, offering only a partial view of the network. For instance, the scope of passive traffic traces is limited to an ISP’s network and customers’ habits, whereas active measurements might be biased by the population or node location involved. To complement these techniques, we propose to use (commercial) crowdsourcing platforms for network measurements. They permit a controllable, diverse and realistic view of the Internet and provide better control than do measurements with voluntary participants. In this study, we compare crowdsourcing with traditional measurement techniques, describe possible pitfalls and limitations, and present best practices to overcome these issues. The contribution of this paper is a guideline for researchers to understand when and how to exploit crowdsourcing for network measurements.