6 research outputs found

    Using entropy-based local weighting to improve similarity assessment

    Get PDF
    This paper enhances and analyses the power of local weighted similarity measures. The paper proposes a new entropy-based local weighting algorithm to be used in similarity assessment to improve the performance of the CBR retrieval task. It has been carried out a comparative analysis of the performance of unweighted similarity measures, global weighted similarity measures, and local weighting similarity measures. The testing has been done using several similarity measures, and some data sets from the UCI Machine Learning Database Repository and other environmental databases.Postprint (published version

    Fractal-based re-design

    Get PDF
    Engineering conceptual design is a knowledge-intensive process that generates solutions to a product specification. It is a process that can benefit from past experience of similar designs. In reality however, designers often have limited time to build up the necessary experience and are, in any event, unlikely to become experts in all relevant fields. Hence there is a need to capture, store and reuse valuable knowledge. Currently available conventional CAD systems offer limited possibilities for the re-use of existing designs. Techniques from the field of Artificial Intelligence (Al) may be applied to aid the conceptual design phase, which is known as the area of intelligent computer-aided design. The aim of this work is to identify and externalise design knowledge using a fractal-like model, to understand the role of design knowledge in conceptual design and to use design knowledge as a guide for every stage of concept development. This research provides a framework for supporting conceptual design, which uses the techniques of Case-Based Reasoning (CBR) and fractal theory, for reasoning about the design and development of computer-based design aids. The framework is comprised of three parts. The first is case representation. This research proposes a new representation technique, Fractal-like Design Modelling (FDM), which integrates design knowledge in a graph-based form and has fractal-specific characteristics. The second is case retrieval. Based on FDM, the similarity between a new design and the existing designs is assessed by concurrently applying a feature-based similarity measure and a structure-based similarity measure. The third is case adaptation. With the help of fractal characteristics, an approach of adaptive design is developed by performance revision and by goal-oriented substitution. These three parts work together to achieve an automated, case-based, conceptual design method: Fractal-Based Re-design.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Fractal-based re-design.

    Get PDF
    Engineering conceptual design is a knowledge-intensive process that generates solutions to a product specification. It is a process that can benefit from past experience of similar designs. In reality however, designers often have limited time to build up the necessary experience and are, in any event, unlikely to become experts in all relevant fields. Hence there is a need to capture, store and reuse valuable knowledge. Currently available conventional CAD systems offer limited possibilities for the re-use of existing designs. Techniques from the field of Artificial Intelligence (Al) may be applied to aid the conceptual design phase, which is known as the area of intelligent computer-aided design. The aim of this work is to identify and externalise design knowledge using a fractal-like model, to understand the role of design knowledge in conceptual design and to use design knowledge as a guide for every stage of concept development. This research provides a framework for supporting conceptual design, which uses the techniques of Case-Based Reasoning (CBR) and fractal theory, for reasoning about the design and development of computer-based design aids. The framework is comprised of three parts. The first is case representation. This research proposes a new representation technique, Fractal-like Design Modelling (FDM), which integrates design knowledge in a graph-based form and has fractal-specific characteristics. The second is case retrieval. Based on FDM, the similarity between a new design and the existing designs is assessed by concurrently applying a feature-based similarity measure and a structure-based similarity measure. The third is case adaptation. With the help of fractal characteristics, an approach of adaptive design is developed by performance revision and by goal-oriented substitution. These three parts work together to achieve an automated, case-based, conceptual design method: Fractal-Based Re-design

    Método de agrupamiento no supervisado para el procesamiento del lenguaje natural utilizando medidas de similitud asimétricas y propiedades paradigmáticas

    Get PDF
    Una de las tareas más comunes para el ser humano, pero de con una alta complejidad es la agrupación y clasificación. Por otro lado, la debilidad del ser humano es la capacidad de procesar altas cantidades de datos y de forma rápida, característica propia de los computadores. Hoy en día se generan grandes cantidades de datos en el Internet, datos de distintos tipos y con diferentes objetivos. Para esto se necesitan de algoritmos de agrupación que nos permitan identificar los distintos grupos y características de estos grupos, de forma automática sin conocimiento previo. Por otro lado, es importante definir con claridad qué medida de similitud se utilizará en el proceso de agrupación, la gran mayoría de las medidas de agrupación se enfocan en un aspecto simétrico. En la presente tesis se propone una novedosa medida de similitud asimétrica, Coeficiente d Similitud Unilateral Jaccard (uJaccard), similitud no es igual entre dos objetos uJaccard(a,b) ≠ uJaccard(b,a). Así también se presenta una similitud asimétrica con pesos Coeficiente Ponderado de Similitud Unilateral Jaccard, la cual mide el nivel de incertidumbre entre dos objetos. Así también en esta tesis se propone una nueva propiedad de grafos, la propiedad paradigmática la cual considera la equivalencia regular como característica fundamental y por último se propone un algoritmo de agrupación PaC, por sus siglas en inglés Paradigmatic Clustering, el cual incorpora la uJaccard y la propiedad paradigmática. Se ha realizado evaluaciones extensivas con datos pequeños, reales, sintéticos y se ha procesado 3 grandes corpus. Se ha demostrado que PaC es un algoritmo que sobre pasa los resultados de algoritmos de agrupación del estado del arte. Más aun PaC es un algoritmo capas de ser ejecutado de forma paralela, distribuida, incremental y en flujo, características que se necesitan para el procedimiento de grandes cantidades de datos y de constante generación de dato

    Exploiting Wikipedia Semantics for Computing Word Associations

    No full text
    Semantic association computation is the process of automatically quantifying the strength of a semantic connection between two textual units based on various lexical and semantic relations such as hyponymy (car and vehicle) and functional associations (bank and manager). Humans have can infer implicit relationships between two textual units based on their knowledge about the world and their ability to reason about that knowledge. Automatically imitating this behavior is limited by restricted knowledge and poor ability to infer hidden relations. Various factors affect the performance of automated approaches to computing semantic association strength. One critical factor is the selection of a suitable knowledge source for extracting knowledge about the implicit semantic relations. In the past few years, semantic association computation approaches have started to exploit web-originated resources as substitutes for conventional lexical semantic resources such as thesauri, machine readable dictionaries and lexical databases. These conventional knowledge sources suffer from limitations such as coverage issues, high construction and maintenance costs and limited availability. To overcome these issues one solution is to use the wisdom of crowds in the form of collaboratively constructed knowledge sources. An excellent example of such knowledge sources is Wikipedia which stores detailed information not only about the concepts themselves but also about various aspects of the relations among concepts. The overall goal of this thesis is to demonstrate that using Wikipedia for computing word association strength yields better estimates of humans' associations than the approaches based on other structured and unstructured knowledge sources. There are two key challenges to achieve this goal: first, to exploit various semantic association models based on different aspects of Wikipedia in developing new measures of semantic associations; and second, to evaluate these measures compared to human performance in a range of tasks. The focus of the thesis is on exploring two aspects of Wikipedia: as a formal knowledge source, and as an informal text corpus. The first contribution of the work included in the thesis is that it effectively exploited the knowledge source aspect of Wikipedia by developing new measures of semantic associations based on Wikipedia hyperlink structure, informative-content of articles and combinations of both elements. It was found that Wikipedia can be effectively used for computing noun-noun similarity. It was also found that a model based on hybrid combinations of Wikipedia structure and informative-content based features performs better than those based on individual features. It was also found that the structure based measures outperformed the informative content based measures on both semantic similarity and semantic relatedness computation tasks. The second contribution of the research work in the thesis is that it effectively exploited the corpus aspect of Wikipedia by developing a new measure of semantic association based on asymmetric word associations. The thesis introduced the concept of asymmetric associations based measure using the idea of directional context inspired by the free word association task. The underlying assumption was that the association strength can change with the changing context. It was found that the asymmetric association based measure performed better than the symmetric measures on semantic association computation, relatedness based word choice and causality detection tasks. However, asymmetric-associations based measures have no advantage for synonymy-based word choice tasks. It was also found that Wikipedia is not a good knowledge source for capturing verb-relations due to its focus on encyclopedic concepts specially nouns. It is hoped that future research will build on the experiments and discussions presented in this thesis to explore new avenues using Wikipedia for finding deeper and semantically more meaningful associations in a wide range of application areas based on humans' estimates of word associations

    Defining and combining symmetric and asymmetric similarity measures

    No full text
    corecore