243 research outputs found

    A Frame-Based System for Automatic Classification of Semi- Structured Data

    Get PDF
    The problem of data classification goes back to the definition oftaxonomies covering knowledge areas. With the advent of the Web, the amount of data available increased several orders of magnitude, making manual data classification impossible. This work presents a tool to automatically classify semi-structured data, represented by frames, without any previous knowledge about structured classes. The tool uses a variation of the K-Medoid algorithm and organizes a set of frames into classes, structured as a strict hierarchy

    No balanço do tédio: Heidegger e o tédio como tonalidade afetiva fática

    Get PDF
    The aim of the present paper is to follow the reasons for which Heidegger, in his lectures from 1929/30 The Fundamental Concepts of Metaphysics (world – finitude – solitude) repeats the project of Being and Time of an analysis of the human existence and of a description of his existential crises as central for the possibilities of a historical mobilization of the world now no longer from a simply ontological mood, but rather from a so called factical mood, i.e., a mood that possesses a structural connection with our historical world. From that, the paper analyses the figures of boredom going through the most superficial one to the deep boredom, while also exposing what once more hinders Heidegger’s attempt to think in a consistent way about the fundamental event of singularization as a key path into the temporality of Being as such, which implies a new failure of the project of fundamental ontology in its essential liaison with existential analytics.O intuito do presente texto é acompanhar as razões pelas quais Heidegger retoma em sua preleção de 1929/30 Os conceitos fundamentais da metafísica (mundo – finitude – solidão) o projeto de Ser e tempo de uma análise do existente humano e de uma descrição de suas crises existenciais como decisivas para as possibilidades de mobilização histórica do mundo, agora não mais a partir de uma tonalidade afetiva de matiz ontológica, a angústia, mas sim a partir de uma tonalidade afetiva fática, ou seja, uma tonalidade que possui um vínculo estrutural com o mundo histórico que é o nosso. Partindo daí, o texto analisa o aprofundamento das diversas figuras do tédio até o tédio dito profundo, assim como expõe o problema que impede uma vez mais que Heidegger consiga pensar de maneira consistente o acontecimento fundamental da singularização como via de acesso à temporialidade propriamente dita do ser, ou seja, a ontologia fundamental em sua ligação com a analítica existencial

    Interlinking documents based on semantic graphs

    Get PDF
    Connectivity and relatedness of Web resources are two concepts that define to what extent different parts are connected or related to one another. Measuring connectivity and relatedness between Web resources is a growing field of research, often the starting point of recommender systems. Although relatedness is liable to subjective interpretations, connectivity is not. Given the Semantic Web's ability of linking Web resources, connectivity can be measured by exploiting the links between entities. Further, these connections can be exploited to uncover relationships between Web resources. In this paper, we apply and expand a relationship assessment methodology from social network theory to measure the connectivity between documents. The connectivity measures are used to identify connected and related Web resources. Our approach is able to expose relations that traditional text-based approaches fail to identify. We validate and assess our proposed approaches through an evaluation on a real world dataset, where results show that the proposed techniques outperform state of the art approaches.CAPESEU/FP7/2007-2013CNPFAPER

    Análise de risco creditício, proposta do modelo credit scoring

    Get PDF
    This work is applied in a company dedicated to the production, commercialization and distribution of asphalt products in the south of Chile. The aforementioned company has preferred not to disclose its corporate name, for this purpose we have called this, Fantasy S.A. During the last few years Fantasy has experienced a significant growth in its sales and with it, a decrease in its level of liquidity and quality of its accounts receivable. However, this increase in accounts receivable is associated with a greater risk assumed of collection, given its policy of deregulating accounts receivable. Moreover, Fantasy S.A., does not have an objective credit management system that allows an adequate evaluation of the quality and credit capacity of its current and potential clients. Therefore, in this article, a credit assessment model for its current and potential clients adjusted and weighted to its reality, which allows to reduce the credit risk or uncollectible is proposed to Fantasy. The present work considers, a description of the models of evaluation of credits and in specific of the models of credit scoring. Through interviews with experts, quantitative and qualitative variables critical to be considered in a credit management process were defined. Regarding the quality of the proposed credit assessment model, this shows that 81.82% of the loans granted to its clients have exceeded the minimum level of evaluation or limit of approval by the companyEl presente trabajo aplica en una empresa dedicada a la producción, comercialización y distribución de productos derivados del asfalto en la zona sur Chile. La empresa referida, ha preferido no revelar su razón social, para tal efecto hemos denominado a esta, Fantasía S.A. Durante los últimos años Fantasía ha experimentado un crecimiento significativo en sus ventas y con ello, una disminución de su nivel de liquidez y calidad de sus cuentas por cobrar. Sin embargo, este incremento en cuentas por cobrar está asociado a un mayor riesgo asumido de cobro, dada su política liberalizadora de cuentas por cobrar. Más aún, Fantasía S.A., no dispone de un sistema de gestión de crédito objetivo que permita una evaluación adecuada de la calidad y capacidad crediticia de sus clientes actuales y potenciales. Por tanto, en este artículo se propone a Fantasía un modelo de evaluación crediticia a sus clientes actuales y potenciales ajustado y ponderado a su realidad, que permite disminuir el riesgo de crédito o incobrables. El presente trabajo considera, una descripción de los modelos de evaluación de créditos y en específico de los modelos de credit scoring. A través de entrevistas a expertos, se definieron variables cuantitativas y cualitativas críticas a considerar en un proceso de gestión de créditos. Respecto de la calidad del modelo de evaluación crediticia propuesto, este muestra que un 81,82% de los créditos otorgados a sus clientes han superado el nivel mínino de evaluación o límite de aprobación por la empresaO presente trabalho se aplica a uma empresa dedicada à produção, comercialização e distribuição de produtos derivados do asfalto na zona sul Chile. A empresa referida tem preferido não revelar sua razão social, e para isto foi denominada Fantasia S.A. Durante os últimos anos Fantasia tem experimentado um crescimento significativo em suas vendas e com isso, uma diminuição de seu nível de liquidez e qualidade de suas contas por cobrar. No entanto, este incremento em contas por cobrar está associado a um maior risco assumido de cobrança, dada sua política liberadora de contas por cobrar. Mais ainda, Fantasia S.A., não dispõe de um sistema de gerenciamento de crédito objetivo que permita uma avaliação adequada da qualidade e capacidade creditícia de seus clientes atuais e potenciais. Portanto, neste artigo propõe-se a Fantasia um modelo de avaliação creditícia a seus clientes atuais e potenciais ajustado e ponderado a sua realidade, que permite diminuir o risco de crédito ou incobráveis. O presente trabalho considera, uma descrição dos modelos de avaliação de créditos e especificamente dos modelos de credit scoring. Através de entrevistas a peritos, definiram-se variáveis quantitativas e qualitativas críticas a considerar em um processo de gerenciamento de créditos. A respeito da qualidade do modelo de avaliação creditícia proposto, este mostra que 81,82% dos créditos outorgados a seus clientes têm superado o nível mínimo de avaliação ou limite de aprovação pela empres

    Identifying candidate datasets for data interlinking

    Get PDF
    One of the design principles that can stimulate the growth and increase the usefulness of the Web of data is URIs linkage. However, the related URIs are typically in different datasets managed by different publishers. Hence, the designer of a new dataset must be aware of the existing datasets and inspect their content to define sameAs links. This paper proposes a technique based on probabilistic classifiers that, given a datasets S to be published and a set T of known published datasets, ranks each Ti ∈ T according to the probability that links between S and Ti can be found by inspecting the most relevant datasets. Results from our technique show that the search space can be reduced up to 85%, thereby greatly decreasing the computational effort. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-39200-9_29

    Combining a co-occurrence-based and a semantic measure for entity linking

    Get PDF
    One key feature of the Semantic Web lies in the ability to link related Web resources. However, while relations within particular datasets are often well-defined, links between disparate datasets and corpora of Web resources are rare. The increasingly widespread use of cross-domain reference datasets, such as Freebase and DBpedia for annotating and enriching datasets as well as documents, opens up opportunities to exploit their inherent semantic relationships to align disparate Web resources. In this paper, we present a combined approach to uncover relationships between disparate entities which exploits (a) graph analysis of reference datasets together with (b) entity co-occurrence on the Web with the help of search engines. In (a), we introduce a novel approach adopted and applied from social network theory to measure the connectivity between given entities in reference datasets. The connectivity measures are used to identify connected Web resources. Finally, we present a thorough evaluation of our approach using a publicly available dataset and introduce a comparison with established measures in the field. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-38288-8_37

    Recommending tripleset interlinking through a social network approach

    Get PDF
    Tripleset interlinking is one of the main principles of Linked Data. However, the discovery of existing triplesets relevant to be linked with a new tripleset is a non-trivial task in the publishing process. Without prior knowledge about the entire Web of Data, a data publisher must perform an exploratory search, which demands substantial effort and may become impracticable, with the growth and dissemination of Linked Data. Aiming at alleviating this problem, this paper proposes a recommendation approach for this scenario, using a Social Network perspective. The experimental results show that the proposed approach obtains high levels of recall and reduces in up to 90% the number of triplesets to be further inspected for establishing appropriate links. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-41230-1_13.CNPq/160326/2012-5CNPq/301497/2006-0CNPq/475717/2011-2CNPq/57128/2009-9FAPERJ/E-26/170028/2008FAPERJ/E-26/103.070/2011CAPES/PROCAD/NF 1128/201

    Answering confucius: The reason why we complicate

    Get PDF
    Learning is a level-progressing process. In any field of study, one must master basic concepts to understand more complex ones. Thus, it is important that during the learning process learners are presented and challenged with knowledge which they are able to comprehend (not a level below, not a level too high). In this work we focus on language learners. By gradually improving (complicating) texts, readers are challenged to learn new vocabulary. To achieve such goals, in this paper we propose and evaluate the 'complicator' that translates given sentences to a chosen level of higher degree of difficulty. The 'complicator' is based on natural language processing and information retrieval approaches that perform lexical replacements. 30 native English speakers participated in a user study evaluating our methods on an expert-tailored dataset of children books. Results show that our tool can be of great utility for language learners who are willing to improve their vocabulary. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-40814-4_45.TERENCEEC/FP

    Two approaches to the dataset interlinking recommendation problem

    Get PDF
    Whenever a dataset t is published on the Web of Data, an exploratory search over existing datasets must be performed to identify those datasets that are potential candidates to be interlinked with t. This paper introduces and compares two approaches to address the dataset interlinking recommendation problem, respectively based on Bayesian classifiers and on Social Network Analysis techniques. Both approaches define rank score functions that explore the vocabularies, classes and properties that the datasets use, in addition to the known dataset links. After extensive experiments using real-world datasets, the results show that the rank score functions achieve a mean average precision of around 60%. Intuitively, this means that the exploratory search for datasets to be interlinked with t might be limited to just the top-ranked datasets, reducing the cost of the dataset interlinking process. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-11749-2_25.EC/FP7/LinkedUpCNPq/160326/2012-5CNPq/303332/2013-1CNPq/557128/2009-9FAPERJ/E-26/170028/2008FAPERJ/E-26/103.070/2011FAPERJ/E-26/101.382/2014CAPES/141082
    • …
    corecore