Search CORE

28,140 research outputs found

Enhancing Decision Tree based Interpretation of Deep Neural Networks through L1-Orthogonal Regularization

Author: Huber Marco F.
Maucher Johannes
Schaaf Nina
Publication venue
Publication date: 03/10/2019
Field of study

One obstacle that so far prevents the introduction of machine learning models primarily in critical areas is the lack of explainability. In this work, a practicable approach of gaining explainability of deep artificial neural networks (NN) using an interpretable surrogate model based on decision trees is presented. Simply fitting a decision tree to a trained NN usually leads to unsatisfactory results in terms of accuracy and fidelity. Using L1-orthogonal regularization during training, however, preserves the accuracy of the NN, while it can be closely approximated by small decision trees. Tests with different data sets confirm that L1-orthogonal regularization yields models of lower complexity and at the same time higher fidelity compared to other regularizers.Comment: 8 pages, 18th IEEE International Conference on Machine Learning and Applications (ICMLA) 201

arXiv.org e-Print Archive

Crossref

Fraunhofer-ePrints

Learning the structure of Bayesian Networks: A quantitative assessment of the effect of different algorithmic schemes

Author: Beretta Stefano
Castelli Mauro
Goncalves Ivo
Henriques Roberto
Ramazzotti Daniele
Publication venue
Publication date: 01/01/2018
Field of study

One of the most challenging tasks when adopting Bayesian Networks (BNs) is the one of learning their structure from data. This task is complicated by the huge search space of possible solutions, and by the fact that the problem is NP-hard. Hence, full enumeration of all the possible solutions is not always feasible and approximations are often required. However, to the best of our knowledge, a quantitative analysis of the performance and characteristics of the different heuristics to solve this problem has never been done before. For this reason, in this work, we provide a detailed comparison of many different state-of-the-arts methods for structural learning on simulated data considering both BNs with discrete and continuous variables, and with different rates of noise in the data. In particular, we investigate the performance of different widespread scores and algorithmic approaches proposed for the inference and the statistical pitfalls within them

arXiv.org e-Print Archive

Directory of Open Access Journals

Repositório da Universidade Nova de Lisboa

Estudo Geral