Search CORE

7 research outputs found

A Nested Genetic Algorithm for Explaining Classification Data Sets with Decision Rules

Author: Brajovic Danilo
Huber Marco F.
Matt Paul-Amaury
Roth Marco
Ziegler Rosina
Publication venue
Publication date: 23/08/2022
Field of study

Our goal in this paper is to automatically extract a set of decision rules (rule set) that best explains a classification data set. First, a large set of decision rules is extracted from a set of decision trees trained on the data set. The rule set should be concise, accurate, have a maximum coverage and minimum number of inconsistencies. This problem can be formalized as a modified version of the weighted budgeted maximum coverage problem, known to be NP-hard. To solve the combinatorial optimization problem efficiently, we introduce a nested genetic algorithm which we then use to derive explanations for ten public data sets

arXiv.org e-Print Archive

Model Reporting for Certifiable AI: A Proposal from Merging EU Regulation into AI Development

Author: Biller Martin
Brajovic Danilo
Fresz Benjamin
Goebels Vincent Philipp
Huber Marco F.
Klaeb Mara
Kutz Janika
Neuhuettler Jens
Renner Niclas
Wagner Philipp
Publication venue
Publication date: 21/07/2023
Field of study

Despite large progress in Explainable and Safe AI, practitioners suffer from a lack of regulation and standards for AI safety. In this work we merge recent regulation efforts by the European Union and first proposals for AI guidelines with recent trends in research: data and model cards. We propose the use of standardized cards to document AI applications throughout the development process. Our main contribution is the introduction of use-case and operation cards, along with updates for data and model cards to cope with regulatory requirements. We reference both recent research as well as the source of the regulation in our cards and provide references to additional support material and toolboxes whenever possible. The goal is to design cards that help practitioners develop safe AI systems throughout the development process, while enabling efficient third-party auditing of AI applications, being easy to understand, and building trust in the system. Our work incorporates insights from interviews with certification experts as well as developers and individuals working with the developed AI applications.Comment: 54 pages, 1 figure, to be submitte

arXiv.org e-Print Archive

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science. We chart a path forward as a collective effort to sustain the creation and maintenance of these datasets and methods towards positive scientific, societal and business impact.Comment: This editorial report accompanies the inaugural Data-centric Machine Learning Research (DMLR) Workshop that took place at ICML 2023 https://dmlr.ai

arXiv.org e-Print Archive