1,299 research outputs found

    Discriminating word senses with tourist walks in complex networks

    Full text link
    Patterns of topological arrangement are widely used for both animal and human brains in the learning process. Nevertheless, automatic learning techniques frequently overlook these patterns. In this paper, we apply a learning technique based on the structural organization of the data in the attribute space to the problem of discriminating the senses of 10 polysemous words. Using two types of characterization of meanings, namely semantical and topological approaches, we have observed significative accuracy rates in identifying the suitable meanings in both techniques. Most importantly, we have found that the characterization based on the deterministic tourist walk improves the disambiguation process when one compares with the discrimination achieved with traditional complex networks measurements such as assortativity and clustering coefficient. To our knowledge, this is the first time that such deterministic walk has been applied to such a kind of problem. Therefore, our finding suggests that the tourist walk characterization may be useful in other related applications

    Distance to the scaling law: a useful approach for unveiling relationships between crime and urban metrics

    Get PDF
    We report on a quantitative analysis of relationships between the number of homicides, population size and other ten urban metrics. By using data from Brazilian cities, we show that well defined average scaling laws with the population size emerge when investigating the relations between population and number of homicides as well as population and urban metrics. We also show that the fluctuations around the scaling laws are log-normally distributed, which enabled us to model these scaling laws by a stochastic-like equation driven by a multiplicative and log-normally distributed noise. Because of the scaling laws, we argue that it is better to employ logarithms in order to describe the number of homicides in function of the urban metrics via regression analysis. In addition to the regression analysis, we propose an approach to correlate crime and urban metrics via the evaluation of the distance between the actual value of the number of homicides (as well as the value of the urban metrics) and the value that is expected by the scaling law with the population size. This approach have proved to be robust and useful for unveiling relationships/behaviors that were not properly carried out by the regression analysis, such as i) the non-explanatory potential of the elderly population when the number of homicides is much above or much below the scaling law, ii) the fact that unemployment has explanatory potential only when the number of homicides is considerably larger than the expected by the power law, and iii) a gender difference in number of homicides, where cities with female population below the scaling law are characterized by a number of homicides above the power law.Comment: Accepted for publication in PLoS ON

    A complex network approach to stylometry

    Get PDF
    Statistical methods have been widely employed to study the fundamental properties of language. In recent years, methods from complex and dynamical systems proved useful to create several language models. Despite the large amount of studies devoted to represent texts with physical models, only a limited number of studies have shown how the properties of the underlying physical systems can be employed to improve the performance of natural language processing tasks. In this paper, I address this problem by devising complex networks methods that are able to improve the performance of current statistical methods. Using a fuzzy classification strategy, I show that the topological properties extracted from texts complement the traditional textual description. In several cases, the performance obtained with hybrid approaches outperformed the results obtained when only traditional or networked methods were used. Because the proposed model is generic, the framework devised here could be straightforwardly used to study similar textual applications where the topology plays a pivotal role in the description of the interacting agents.Comment: PLoS ONE, 2015 (to appear

    Word sense ranking based on semantic similarity and graph entropy

    Get PDF
    In this paper we propose a system for the recommendation of tagged pictures obtained from the Web. The system, driven by user feedback, executes an abductive reasoning (based on WordNet synset semantic relations) that is able to iteratively lead to new concepts which progressively represent the cognitive creative user state. Furthermore we design a selection mechanism to pick the most relevant abductive inferences by mixing a topological graph analysis together with a semantic similitude measure.Preprin
    corecore