Causation generalization through the identification of equivalent nodes in causal sparse graphs constructed from text using node similarity strategies

Drury, Brett Mylo; Lopes, Alneu de Andrade; Valverde-Rebaza, Jorge Carlos

research

Causation generalization through the identification of equivalent nodes in causal sparse graphs constructed from text using node similarity strategies

Authors: Brett Mylo Drury
Alneu de Andrade Lopes
Jorge Carlos Valverde-Rebaza
Publication date
Publisher: Cusco

Abstract

Causal Bayesian Graphs can be constructed from causal information in text. These graphs can be sparse because the cause or effect event can be expressed in various ways to represent the same information. This sparseness can corrupt inferences made on the graph. This paper proposes to reduce sparseness by merging: equivalent nodes and their edges. This paper presents a number of experiments that evaluates the applicability of node similarity techniques to detect equivalent nodes. The experiments found that techniques that rely upon combination of node contents and structural information are the most accurate strategies, specifically we have employed: 1. node name similarity and 2. combination of node name similarity and common neighbours (SMCN). In addition, the SMCN returns ”better” equivalent nodes than the string matching strategy.São Paulo Research Foundation (FAPESP) (grants 2013/12191-5, 2011/22749-8 and 2011/20451-1

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)

oai:www.producao.usp.br:BDPI/4...

Last time updated on 27/05/2016