3,937 research outputs found

    Interpreting Embedding Models of Knowledge Bases: A Pedagogical Approach

    Full text link
    Knowledge bases are employed in a variety of applications from natural language processing to semantic web search; alas, in practice their usefulness is hurt by their incompleteness. Embedding models attain state-of-the-art accuracy in knowledge base completion, but their predictions are notoriously hard to interpret. In this paper, we adapt "pedagogical approaches" (from the literature on neural networks) so as to interpret embedding models by extracting weighted Horn rules from them. We show how pedagogical approaches have to be adapted to take upon the large-scale relational aspects of knowledge bases and show experimentally their strengths and weaknesses.Comment: presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Swede

    Analysis of Neural Networks in Terms of Domain Functions

    Get PDF
    Despite their success-story, artificial neural networks have one major disadvantage compared to other techniques: the inability to explain comprehensively how a trained neural network reaches its output; neural networks are not only (incorrectly) seen as a "magic tool" but possibly even more as a mysterious "black box". Although much research has already been done to "open the box," there is a notable hiatus in known publications on analysis of neural networks. So far, mainly sensitivity analysis and rule extraction methods have been used to analyze neural networks. However, these can only be applied in a limited subset of the problem domains where neural network solutions are encountered. In this paper we propose a wider applicable method which, for a given problem domain, involves identifying basic functions with which users in that domain are already familiar, and describing trained neural networks, or parts thereof, in terms of those basic functions. This will provide a comprehensible description of the neural network's function and, depending on the chosen base functions, it may also provide an insight into the neural network' s inner "reasoning." It could further be used to optimize neural network systems. An analysis in terms of base functions may even make clear how to (re)construct a superior system using those base functions, thus using the neural network as a construction advisor

    Building Credit-Risk Evaluation Expert Systems Using Neural Network Rule Extraction and Decision Tables.

    Get PDF
    In this paper, we evaluate and contrast four neural network rule extraction approaches for credit scoring. Experiments are carried out on three real life credit scoring data sets. Both the continuous and the discretised versions of all data sets are analysed. The rule extraction algorithms, Neurolinear, Neurorule, Trepan and Nefclass, have different characteristics with respect to their perception of the neural network and their way of representing the generated rules or knowledge. It is shown that Neurolinear, Neurorule and Trepan are able to extract very concise rule sets or trees with a high predictive accuracy when compared to classical decision tree (rule) induction algorithms like C4.5(rules). Especially Neurorule extracted easy to understand and powerful propositional ifthen rules for all discretised data sets. Hence, the Neurorule algorithm may offer a viable alternative for rule generation and knowledge discovery in the domain of credit scoring.Credit; Information systems; International; Systems;

    Comprehensible credit scoring models using rule extraction from support vector machines.

    Get PDF
    In recent years, Support Vector Machines (SVMs) were successfully applied to a wide range of applications. Their good performance is achieved by an implicit non-linear transformation of the original problem to a high-dimensional (possibly infinite) feature space in which a linear decision hyperplane is constructed that yields a nonlinear classifier in the input space. However, since the classifier is described as a complex mathematical function, it is rather incomprehensible for humans. This opacity property prevents them from being used in many real- life applications where both accuracy and comprehensibility are required, such as medical diagnosis and credit risk evaluation. To overcome this limitation, rules can be extracted from the trained SVM that are interpretable by humans and keep as much of the accuracy of the SVM as possible. In this paper, we will provide an overview of the recently proposed rule extraction techniques for SVMs and introduce two others taken from the artificial neural networks domain, being Trepan and G-REX. The described techniques are compared using publicly avail- able datasets, such as Ripley's synthetic dataset and the multi-class iris dataset. We will also look at medical diagnosis and credit scoring where comprehensibility is a key requirement and even a regulatory recommendation. Our experiments show that the SVM rule extraction techniques lose only a small percentage in performance compared to SVMs and therefore rank at the top of comprehensible classification techniques.Credit; Credit scoring; Models; Model; Applications; Performance; Space; Decision; Yield; Real life; Risk; Evaluation; Rules; Neural networks; Networks; Classification; Research;

    Rule Extraction and Insertion to Improve the Performance of a Dynamic Cell Structure Neural Network

    Get PDF
    Artificial Neural Networks are extremely useful machine learning tools. They are used for many purposes, such as prediction, classification, pattern recognition, etc. Although neural networks have been used for decades, they are still often not completely understood or trusted, especially in safety and mission critical situations. Typically, neural networks are trained on data sets that are representative of what needs to be learned. Sometimes training sets are constructed in order to train the neural network in a certain way, in order to embed appropriate knowledge. The purpose of this research is to determine if there is another method that can be used to embed specific knowledge in a neural network before training and if this improves the performance of a neural network. This research develops and tests a new method of embedding pre-knowledge into the Dynamic Cell Structure (DCS) neural network. The DCS is a type of self-organizing map neural network that has been used for many purposes, including classification. In the research presented here, the method used for embedding pre-knowledge into the neural network is to start by converting the knowledge to a set of IF/THEN rules, that can be easily understood and/or validated by a human expert. Once the rules are constructed and validated, then they are converted to a beginning neural network structure. This allows pre-knowledge to be embedded before training the neural network. This conversion and embedding process is called Rule Insertion. In order to determine whether this process improves performance, the neural network was trained with and without pre-knowledge embedded. After the training, the neural network structure was again converted to rules, Rule Extraction, and then the neural network accuracy and the rule accuracy were computed. Also, the agreement between the neural network and the extracted rules was computed. The findings of this research show that using Rule Insertion to embed pre-knowledge into a DCS neural network can increase the accuracy of the neural network. An expert can create the rules to be embedded and can also examine and validate the rules extracted to give more confidence in what the neural network has learned during training. The extracted rules are also a refinement of the inserted rules, meaning the neural network was able to improve upon the expert knowledge based on the data presented

    Hybrid rule-extraction from support vector machines

    Get PDF
    Rule-extraction from artificial neural networks(ANNs) as well as support vector machines (SVMs) provide explanations for the decisions made by these systems. This explanation capability is very important in applications such as medical diagnosis. Over the last decade, a multitude of algorithms for rule-extraction from ANNs have been developed. However, rule-extraction from SVMs is not widely available yet.In this paper, a hybrid approach for rule-extraction from SVMs is outlined. This approach has two basic components: (1) data reduction using a logistic regression model and (2) learning based rule-extraction. The quality of the extracted rules is then evaluated in terms of fidelity, accuracy, consistency and comprehensibility. The rules are also verified against the available knowledge from the domain problem (diabetes) to assure correctness and validity
    corecore