Helping crisis responders find the informative needle in the tweet haystack
Crisis responders increasingly use social media and other digital data sources to build a situational understanding of a crisis and design an effective response. However, as the availability of such data grows, so does the challenge of identifying relevant information within it. This paper presents a successful automatic approach to this problem. Messages are first filtered for informativeness, based on a definition of the concept drawn from prior research and crisis response experts. Informative messages are then tagged for actionable data, such as people in need, threats to rescue efforts, and changes in the environment; eight categories of actionability are identified in all. The two components, informativeness classification and actionability classification, are packaged together as an openly available tool called Emina (Emergent Informativeness and Actionability).
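The two-stage design the abstract describes maps naturally onto a standard text-classification pipeline. The sketch below is illustrative only, not the authors' Emina implementation; the model choices, feature extraction, and function names are assumptions.

```python
# Illustrative two-stage design in the spirit of Emina (not the authors'
# implementation): stage 1 filters for informativeness, stage 2 assigns
# one or more actionability categories to the informative messages.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MultiLabelBinarizer

def train_two_stage(tweets, informative, action_labels):
    """tweets: list[str]; informative: 0/1 flag per tweet;
    action_labels: one set of category names per *informative* tweet,
    in the same order as the informative tweets appear."""
    # Stage 1: binary informativeness filter.
    stage1 = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                           LogisticRegression(max_iter=1000))
    stage1.fit(tweets, informative)

    # Stage 2: multi-label actionability tagger (e.g., "people in need",
    # "threats to rescue efforts", "changes in environment", ...).
    informative_tweets = [t for t, y in zip(tweets, informative) if y == 1]
    mlb = MultiLabelBinarizer()
    Y = mlb.fit_transform(action_labels)
    stage2 = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                           OneVsRestClassifier(LogisticRegression(max_iter=1000)))
    stage2.fit(informative_tweets, Y)

    def analyse(tweet):
        """Filter first; tag for actionability only if informative."""
        if stage1.predict([tweet])[0] != 1:
            return []
        return list(mlb.inverse_transform(stage2.predict([tweet]))[0])
    return analyse
```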
Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations
Post-hoc explanations of machine learning models are crucial for people to understand and act on algorithmic predictions. An intriguing class of explanations is through counterfactuals, hypothetical examples that show people how to obtain a different prediction. We posit that effective counterfactual explanations should satisfy two properties: feasibility of the counterfactual actions given user context and constraints, and diversity among the counterfactuals presented. To this end, we propose a framework for generating and evaluating a diverse set of counterfactual explanations based on determinantal point processes. To evaluate the actionability of counterfactuals, we provide metrics that enable comparison of counterfactual-based methods to other local explanation methods. We further address necessary tradeoffs and point to causal implications in optimizing for counterfactuals. Our experiments on four real-world datasets show that our framework can generate a set of counterfactuals that are diverse and well approximate local decision boundaries, outperforming prior approaches to generating diverse counterfactuals. We provide an implementation of the framework at https://github.com/microsoft/DiCE.
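The linked repository ships as the `dice-ml` Python package, so a brief usage sketch may help. The API shown reflects the package's documented interface at the time of writing but may differ across versions; `df` and `clf` are placeholders for the user's own data and model, and the feature and outcome names are assumptions.

```python
# Minimal usage sketch for the dice-ml package from the linked repository.
# `df` is a placeholder pandas DataFrame containing the outcome column, and
# `clf` a placeholder trained scikit-learn classifier; feature names are
# illustrative, not prescribed by the paper.
import dice_ml

d = dice_ml.Data(dataframe=df,
                 continuous_features=['age', 'hours_per_week'],
                 outcome_name='income')
m = dice_ml.Model(model=clf, backend='sklearn')
exp = dice_ml.Dice(d, m, method='genetic')   # other methods: 'random', 'kdtree'

# Generate four diverse counterfactuals that flip the predicted class
# for the first query instance.
cf = exp.generate_counterfactuals(df.drop(columns='income').iloc[[0]],
                                  total_CFs=4,
                                  desired_class='opposite')
cf.visualize_as_dataframe(show_only_changes=True)
```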
The FAIR Guiding Principles for scientific data management and stewardship
There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders, representing academia, industry, funding agencies, and scholarly publishers, have come together to design and jointly endorse a concise and measurable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles and includes the rationale behind them and some exemplar implementations in the community.
Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking
Machine-learned models are often described as "black boxes". In many real-world applications, however, models may have to sacrifice predictive power in favour of human interpretability. When this is the case, feature engineering becomes a crucial task, requiring significant and time-consuming human effort. Whilst some features are inherently static, representing properties that cannot be influenced (e.g., the age of an individual), others capture characteristics that could be adjusted (e.g., the daily amount of carbohydrates taken). Nonetheless, once a model is learned from the data, each prediction it makes on new instances is irreversible, as every instance is assumed to be a static point located in the chosen feature space. There are, however, many circumstances where it is important to understand (i) why a model outputs a certain prediction on a given instance, (ii) which adjustable features of that instance should be modified, and finally (iii) how to alter such a prediction when the mutated instance is input back to the model. In this paper, we present a technique that exploits the internals of a tree-based ensemble classifier to offer recommendations for transforming true negative instances into positively predicted ones. We demonstrate the validity of our approach using an online advertising application. First, we design a Random Forest classifier that effectively separates two types of ads: low-quality (negative) and high-quality (positive) ads (instances). Then, we introduce an algorithm that provides recommendations aimed at transforming a low-quality ad (negative instance) into a high-quality one (positive instance). Finally, we evaluate our approach on a subset of the active inventory of a large ad network, Yahoo Gemini.
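The tweaking procedure the abstract outlines can be sketched against scikit-learn's tree internals: enumerate root-to-leaf paths that end in a positively predicted leaf, build the instance that just satisfies each path, and keep the cheapest candidate that flips the whole ensemble. The following is a simplified illustration under assumed choices of the epsilon offset and cost function, not the paper's reference implementation.

```python
# Simplified sketch of feature tweaking for scikit-learn random forests
# (assumes binary labels {0, 1}); epsilon and the Euclidean cost are
# illustrative choices, not the paper's.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def positive_leaf_paths(tree, positive_class=1):
    """Yield (feature, threshold, direction) paths to positive leaves."""
    t = tree.tree_
    stack = [(0, [])]
    while stack:
        node, path = stack.pop()
        if t.children_left[node] == -1:  # leaf node
            if np.argmax(t.value[node][0]) == positive_class:
                yield path
            continue
        f, thr = t.feature[node], t.threshold[node]
        stack.append((t.children_left[node], path + [(f, thr, "<=")]))
        stack.append((t.children_right[node], path + [(f, thr, ">")]))

def feature_tweak(forest, x, epsilon=0.1, positive_class=1):
    """Return the lowest-cost tweaked copy of x the forest labels positive."""
    best, best_cost = None, np.inf
    for tree in forest.estimators_:
        for path in positive_leaf_paths(tree, positive_class):
            x_new = x.copy()
            for f, thr, d in path:  # nudge x just past each unmet threshold
                if d == "<=" and x_new[f] > thr:
                    x_new[f] = thr - epsilon
                elif d == ">" and x_new[f] <= thr:
                    x_new[f] = thr + epsilon
            if forest.predict(x_new.reshape(1, -1))[0] != positive_class:
                continue  # local fix did not flip the full ensemble
            cost = np.linalg.norm(x_new - x)  # one possible cost function
            if cost < best_cost:
                best, best_cost = x_new, cost
    return best
```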
From Classification Rules to Action Recommendations
Rule induction has attracted a great deal of attention in Machine Learning and Data Mining. However, generating rules is not an end in itself, because applying them is not straightforward, especially when the number of rules is large. Ideally, the user would ultimately like to use these rules to decide which actions to take. In the literature, this notion is usually referred to as actionability. The contribution of this paper is two-fold. First, we propose a survey of the main approaches developed to address actionability, a topic that has received growing attention in recent years; we present a classification of the main research in this area as well as a comparative study of the different approaches. Second, we propose a new framework to address actionability. Our goal is to lighten the burden of analyzing a large set of classification rules when the user is confronted with an "unsatisfactory situation" and needs help deciding what actions to take in order to remedy it. The method consists of comparing the situation to a set of classification rules, using a suitable distance that allows one to suggest action recommendations requiring minimal changes to improve the situation. We propose the DAKAR algorithm for learning action recommendations and present an application to environment protection. Our experiment shows the usefulness of our contribution for action recommendation, but also raises concerns about the impact of redundancy in a rule set on the quality of the learned action recommendations.
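The core mechanism the abstract describes, matching an unsatisfactory situation against rules via a distance and recommending the minimal changes needed to satisfy the nearest rule, can be illustrated with a small sketch. The interval rule representation and the distance below are assumptions for illustration, not DAKAR's actual definitions.

```python
# Illustrative sketch of distance-based action recommendation (not DAKAR
# itself): a rule is a set of numeric interval conditions that predict a
# satisfactory outcome; we recommend the nearest rule's unmet conditions.
import numpy as np

# A rule maps feature index -> (low, high); all conditions must hold.
rules = [
    {0: (0.0, 0.5), 2: (1.0, np.inf)},   # hypothetical "satisfactory" rules
    {1: (-np.inf, 3.0)},
]

def distance(x, rule):
    """Total movement needed for x to satisfy every condition of the rule."""
    d = 0.0
    for f, (lo, hi) in rule.items():
        if x[f] < lo:
            d += lo - x[f]
        elif x[f] > hi:
            d += x[f] - hi
    return d

def recommend(x, rules):
    """Pick the closest rule and return the minimal per-feature changes."""
    best = min(rules, key=lambda r: distance(x, r))
    actions = {}
    for f, (lo, hi) in best.items():
        if x[f] < lo:
            actions[f] = lo - x[f]       # increase feature f by this amount
        elif x[f] > hi:
            actions[f] = hi - x[f]       # decrease feature f by this amount
    return actions

print(recommend(np.array([0.8, 5.0, 0.2]), rules))  # ~{0: -0.3, 2: 0.8}
```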
