
725 research outputs found

Generation of Explicit Knowledge from Empirical Data through Pruning of Trainable Neural Networks

Author: Gorban A. N.
Mirkes Eu. M.
Tsaregorodtsev V. G.
Publication date: 06/08/2002

This paper presents a generalized technology for extracting explicit knowledge from data. The main ideas are 1) maximal reduction of network complexity (not only removal of neurons or synapses, but removal of all unnecessary elements and signals and reduction of the complexity of elements), 2) use of an adjustable and flexible pruning process (the pruning sequence should not be predetermined; the user should be able to prune the network in their own way in order to achieve a network structure suited to extracting rules of the desired type and form), and 3) extraction of rules not in a predetermined form but in any desired form. Some considerations and notes about network architecture, the training process, and the applicability of currently developed pruning techniques and rule extraction algorithms are discussed. This technology, which we have been developing for more than 10 years, has allowed us to create dozens of knowledge-based expert systems. In this paper we present a generalized three-step technology of extraction of explicit knowledge from empirical data. Comment: 9 pages. The talk was given at IJCNN '99 (Washington DC, July 1999).

arXiv.org e-Print Archive

Leicester Research Archive

Theoretical Interpretations and Applications of Radial Basis Function Networks

Author: Blanzieri Enrico
Publication date: 01/05/2003

Medical applications have usually used Radial Basis Function Networks (RBFNs) simply as Artificial Neural Networks. However, RBFNs are Knowledge-Based Networks that can be interpreted in several ways: as Artificial Neural Networks, Regularization Networks, Support Vector Machines, Wavelet Networks, Fuzzy Controllers, Kernel Estimators, and Instance-Based Learners. A survey of these interpretations and of their corresponding learning algorithms is provided, as well as a brief survey of dynamic learning algorithms. The interpretations of RBFNs can suggest applications that are particularly interesting in medical domains.

Unitn-eprints Research

Rule-Extraction Methods From Feedforward Neural Networks: A Systematic Literature Review

Author: Benabbou Loubna
Berrado Abdelaziz
Mekkaoui Sara El
Publication date: 20/12/2023

Motivated by the interpretability question in ML models as a crucial element for the successful deployment of AI systems, this paper focuses on rule extraction as a means of achieving neural network interpretability. Through a systematic literature review, different approaches for extracting rules from feedforward neural networks, an important building block in deep learning models, are identified and explored. The findings reveal a range of methods developed over more than two decades, mostly suitable for shallow neural networks, with recent developments addressing the challenges of deep learning models. Rules offer a transparent and intuitive means of explaining neural networks, making this study a comprehensive introduction for researchers interested in the field. While the study specifically addresses feedforward networks with supervised learning and crisp rules, future work can extend to other network types, machine learning methods, and fuzzy rule extraction.

arXiv.org e-Print Archive

Learning Language from a Large (Unannotated) Corpus

Author: Goertzel Ben
Vepstas Linas
Publication date: 14/01/2014

A novel approach to the fully automated, unsupervised extraction of dependency grammars and associated syntax-to-semantic-relationship mappings from large text corpora is described. The suggested approach builds on the authors' prior work with the Link Grammar, RelEx and OpenCog systems, as well as on a number of prior papers and approaches from the statistical language learning literature. If successful, this approach would enable the mining of all the information needed to power a natural language comprehension and generation system, directly from a large, unannotated corpus. Comment: 29 pages, 5 figures, research proposal

arXiv.org e-Print Archive

CiteSeerX

Humanoid Introspection: A Practical Approach

Author: Filippo Vella
Giovanni Pilato
Ignazio Infantino
Riccardo Rizzo
Publication date: 01/01/2013

We describe an approach to robot introspection based on self-observation and communication. Self-observation is what the robot should do in order to build, represent and understand its internal state. The state representation must then be translated into a suitable input to an ontology that supplies the meaning of the internal state. The ontology supports the linguistic level that is used to communicate information about the robot state to the human user.

Directory of Open Access Journals

Open Access Repository

Human-in-the-Loop Learning From Crowdsourcing and Social Media

Author: Liu Tong
Publication venue: RIT Scholar Works
Publication date: 01/06/2020

Computational social studies using public social media data have become increasingly popular because of the large amount of user-generated data available. The richness of social media data, coupled with noise and subjectivity, raises significant challenges for computationally studying social issues in a feasible and scalable manner. Machine learning problems are, as a result, often subjective or ambiguous when humans are involved. That is, humans solving the same problems might come to legitimate but completely different conclusions, based on their personal experiences and beliefs. When building supervised learning models, particularly when using crowdsourced training data, multiple annotations per data item are usually reduced to a single label representing ground truth. This inevitably hides a rich source of diversity and subjectivity of opinions about the labels. Label distribution learning associates with each data item a probability distribution over the labels for that item, and can thus preserve the diversity of opinions, beliefs, etc. that conventional learning hides or ignores. We propose a humans-in-the-loop learning framework to model and study large volumes of unlabeled subjective social media data with less human effort. We study various annotation tasks given to crowdsourced annotators and methods for aggregating their contributions in a manner that preserves subjectivity and disagreement. We introduce a strategy for learning label distributions with only five to ten labels per item by aggregating human-annotated labels over multiple, semantically related data items. We conduct experiments using our learning framework on data related to two subjective social issues (work and employment, and suicide prevention) that touch many people worldwide. Our methods can be applied to a broad variety of problems, particularly social problems. Our experimental results suggest that specific label aggregation methods can help provide reliable representative semantics at the population level.
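
The contrast the abstract draws between a single "ground truth" label and a label distribution can be illustrated with a minimal sketch. This is not the thesis's method: the function names, the label set, and the example votes are all hypothetical, and pooling votes across items assumes the related items have already been grouped by some means the abstract does not specify.

```python
from collections import Counter

def label_distribution(annotations, labels):
    """Turn the crowd labels for one item into a probability distribution."""
    if not annotations:
        raise ValueError("need at least one annotation")
    counts = Counter(annotations)
    total = len(annotations)
    return {label: counts[label] / total for label in labels}

def majority_label(annotations):
    """Conventional reduction: keep only the most frequent label."""
    return Counter(annotations).most_common(1)[0][0]

def pooled_distribution(related_items, labels):
    """Pool votes over several items assumed to be semantically related."""
    pooled = [vote for item in related_items for vote in item]
    return label_distribution(pooled, labels)

# Hypothetical label set and votes, for illustration only.
LABELS = ["positive", "neutral", "negative"]
votes = ["positive", "negative", "positive", "neutral", "positive"]

dist = label_distribution(votes, LABELS)
single = majority_label(votes)
pooled = pooled_distribution([votes, ["negative", "negative", "neutral"]], LABELS)
```

Here `single` keeps only the majority vote, while `dist` retains the minority opinions that the majority label discards.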

RIT Scholar Works

An Emotion Type Informed Multi-Task Model for Emotion Cause Pair Extraction

Author: Chen Zhe
Feng Ying
Palade Vasile
Wang Liya
Zhang Junchi
Zhang Ming
Publication date: 01/02/2024

Emotion-Cause Pair Extraction (ECPE) aims to jointly extract emotion clauses and the corresponding cause clauses from a document, which is important for user evaluation or public opinion analysis. Existing research addresses the ECPE task through a two-step or an end-to-end approach. Although previous work shows promising performance, it suffers from two limitations: 1) it fails to take full advantage of emotion type information, which helps in modelling the dependencies between emotion and cause clauses from a semantic perspective; 2) it ignores the interaction between local and global information, which is important for ECPE. To address these issues, we propose an ECPE Pair Generator (ECPE-PG), with a Clause-Encoder layer, a Pre-Output layer and an Information Interaction-based Pair Generation (IIPG) Module embedded. This model first encodes clauses into vector representations through the Clause-Encoder layer and then performs emotion clause extraction (EE), cause clause extraction (CE) and emotion type extraction (ETE), respectively, through the Pre-Output layer, on the basis of which the IIPG module analyzes the complex emotional logic of relationships between clauses and estimates the candidate pairs based on the interaction of global and local information. It should be noted that emotion type information is regarded as a crucial indication in the IIPG module to assist the identification of emotion-cause pairs. Experimental results show that our method outperforms the state-of-the-art methods on benchmark datasets.

Coventry University Pure Portal

Exploiting extensible background knowledge for clustering-based automatic keyphrase extraction

Author: Alrehamy Hassan
Walker Coral
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2018

Keyphrases are single- or multi-word phrases that are used to describe the essential content of a document. An external knowledge source such as WordNet is often used in keyphrase extraction methods to obtain relation information about terms and thus improve the result, but the drawback is that a sole knowledge source is often limited. This problem is identified as the coverage limitation problem. In this paper, we introduce SemCluster, a clustering-based unsupervised keyphrase extraction method that addresses the coverage limitation problem by using an extensible approach that integrates an internal ontology (i.e., WordNet) with other knowledge sources to gain wider background knowledge. SemCluster is evaluated against three unsupervised methods, TextRank, ExpandRank, and KeyCluster, under the F1-measure metric. The evaluation results demonstrate that SemCluster has better accuracy and computational efficiency and is more robust when dealing with documents from different domains.
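
For readers unfamiliar with the F1-measure mentioned in the abstract, the sketch below shows how it is commonly computed for a set of extracted keyphrases. It assumes exact string matching after lowercasing; the paper's own matching rule (for example, stemming) is not stated in the abstract, and the function name and example phrases are hypothetical.

```python
def keyphrase_f1(predicted, gold):
    """F1 of predicted keyphrases against gold keyphrases (exact match, case-insensitive)."""
    pred = {p.lower() for p in predicted}
    ref = {g.lower() for g in gold}
    true_positives = len(pred & ref)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(pred)  # share of predictions that are correct
    recall = true_positives / len(ref)      # share of gold phrases that were found
    return 2 * precision * recall / (precision + recall)

# Hypothetical example: 2 of 3 predictions are correct, 2 of 4 gold phrases are found.
predicted = ["neural network", "rule extraction", "pruning"]
gold = ["rule extraction", "pruning", "expert system", "knowledge"]
score = keyphrase_f1(predicted, gold)
```

With precision 2/3 and recall 1/2, the harmonic mean gives a score of 4/7.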

Online Research @ Cardiff

TC-GAT: Graph Attention Network for Temporal Causality Discovery

Author: Chen Ke
Yuan Xiaosong
Zhang Yijia
Zuo Wanli
Publication date: 20/04/2023

The present study explores the intricacies of causal relationship extraction, a vital component in the pursuit of causality knowledge. Causality is frequently intertwined with temporal elements, as the progression from cause to effect is not instantaneous but unfolds over time. Thus, the extraction of temporal causality holds paramount significance in the field. In light of this, we propose a method for extracting causality from text that integrates both temporal and causal relations, with a particular focus on the time aspect. To this end, we first compile a dataset that encompasses temporal relationships. Subsequently, we present a novel model, TC-GAT, which employs a graph attention mechanism to assign weights to the temporal relationships and leverages a causal knowledge graph to determine the adjacency matrix. Additionally, we implement an equilibrium mechanism to regulate the interplay between temporal and causal relations. Our experiments demonstrate that our proposed method significantly surpasses baseline models in the task of causality extraction. Comment: Accepted by IJCNN 202

arXiv.org e-Print Archive