325 research outputs found
An incremental tri-partite approach to ontology learning
In this paper we present a new approach to ontology learning. Its basis lies in a dynamic and iterative view of knowledge acquisition for ontologies. The Abraxas approach is founded on three resources, a set of texts, a set of learning patterns and a set of ontological triples, each of which must remain in equilibrium. As events occur which disturb this equilibrium various actions are triggered to re-establish a balance between the resources. Such events include acquisition of a further text from external resources such as the Web or the addition of ontological triples to the ontology. We develop the concept of a knowledge gap between the coverage of an ontology and the corpus of texts as a measure triggering actions. We present an overview of the algorithm and its functionalities
Tag-Aware Recommender Systems: A State-of-the-art Survey
In the past decade, Social Tagging Systems have attracted increasing
attention from both physical and computer science communities. Besides the
underlying structure and dynamics of tagging systems, many efforts have been
addressed to unify tagging information to reveal user behaviors and
preferences, extract the latent semantic relations among items, make
recommendations, and so on. Specifically, this article summarizes recent
progress about tag-aware recommender systems, emphasizing on the contributions
from three mainstream perspectives and approaches: network-based methods,
tensor-based methods, and the topic-based methods. Finally, we outline some
other tag-related works and future challenges of tag-aware recommendation
algorithms.Comment: 19 pages, 3 figure
An interactive visualization tool to explore the biophysical properties of amino acids and their contribution to substitution matrices
BACKGROUND: Quantitative descriptions of amino acid similarity, expressed as probabilistic models of evolutionary interchangeability, are central to many mainstream bioinformatic procedures such as sequence alignment, homology searching, and protein structural prediction. Here we present a web-based, user-friendly analysis tool that allows any researcher to quickly and easily visualize relationships between these bioinformatic metrics and to explore their relationships to underlying indices of amino acid molecular descriptors. RESULTS: We demonstrate the three fundamental types of question that our software can address by taking as a specific example the connections between 49 measures of amino acid biophysical properties (e.g., size, charge and hydrophobicity), a generalized model of amino acid substitution (as represented by the PAM74-100 matrix), and the mutational distance that separates amino acids within the standard genetic code (i.e., the number of point mutations required for interconversion during protein evolution). We show that our software allows a user to recapture the insights from several key publications on these topics in just a few minutes. CONCLUSION: Our software facilitates rapid, interactive exploration of three interconnected topics: (i) the multidimensional molecular descriptors of the twenty proteinaceous amino acids, (ii) the correlation of these biophysical measurements with observed patterns of amino acid substitution, and (iii) the causal basis for differences between any two observed patterns of amino acid substitution. This software acts as an intuitive bioinformatic exploration tool that can guide more comprehensive statistical analyses relating to a diverse array of specific research questions
Linking Folksonomies and Ontologies for Supporting Knowledge Sharing: a State of the Art
Deliverable of ISICIL ANR-funded projectSocial tagging systems have recently become very popular as a means to classify large sets of resources shared among on-line communities over the social Web. However, the folksonomies resulting from the use of these systems revealed limitations: tags are ambiguous and their spelling may vary, and folksonomies are difficult to exploit in order to retrieve or exchange information. This report compares the recent attempts to overcome these limitations and to support the use of folksonomies with formal languages and ontologies from the Semantic Web
Domain ontology learning from the web
El Aprendizaje de OntologÃas se define como el conjunto de métodos utilizados para construir, enriquecer o adaptar una ontologÃa existente de forma semiautomática, utilizando fuentes de información heterogéneas. En este proceso se emplea texto, diccionarios electrónicos, ontologÃas lingüÃsticas e información estructurada y semiestructurada para extraer conocimiento. Recientemente, gracias al enorme crecimiento de la Sociedad de la Información, la Web se ha convertido en una valiosa fuente de información para casi cualquier dominio. Esto ha provocado que los investigadores empiecen a considerar a la Web como un repositorio válido para Recuperar Información y Adquirir Conocimiento. No obstante, la Web presenta algunos problemas que no se observan en repositorios de información clásicos: presentación orientada al usuario, ruido, fuentes no confiables, alta dinamicidad y tamaño abrumador. Pese a ello, también presenta algunas caracterÃsticas que pueden ser interesantes para la adquisición de conocimiento: debido a su enorme tamaño y heterogeneidad, se asume que la Web aproxima la distribución real de la información a nivel global. Este trabajo describe una aproximación novedosa para el aprendizaje de ontologÃas, presentando nuevos métodos para adquirir conocimiento de la Web. La propuesta se distingue de otros trabajos previos principalmente en la particular adaptación de algunas técnicas clásicas de aprendizaje al corpus Web y en la explotación de las caracterÃsticas interesantes del entorno Web para componer una aproximación automática, no supervisada e independiente del dominio. Con respecto al proceso de construcción de la ontologÃas, se han desarrollado los siguientes métodos: i) extracción y selección de términos relacionados con el dominio, organizándolos de forma taxonómica; ii) descubrimiento y etiquetado de relaciones no taxonómicas entre los conceptos; iii) métodos adicionales para mejorar la estructura final, incluyendo la detección de entidades con nombre, atributos, herencia múltiple e incluso un cierto grado de desambiguación semántica. La metodologÃa de aprendizaje al completo se ha implementado mediante un sistema distribuido basado en agentes, proporcionando una solución escalable. También se ha evaluado para varios dominios de conocimiento bien diferenciados, obteniendo resultados de buena calidad. Finalmente, se han desarrollado varias aplicaciones referentes a la estructuración automática de librerÃas digitales y recursos Web, y la recuperación de información basada en ontologÃas.Ontology Learning is defined as the set of methods used for building from scratch, enriching or adapting an existing ontology in a semi-automatic fashion using heterogeneous information sources. This data-driven procedure uses text, electronic dictionaries, linguistic ontologies and structured and semi-structured information to acquire knowledge. Recently, with the enormous growth of the Information Society, the Web has become a valuable source of information for almost every possible domain of knowledge. This has motivated researchers to start considering the Web as a valid repository for Information Retrieval and Knowledge Acquisition. However, the Web suffers from problems that are not typically observed in classical information repositories: human oriented presentation, noise, untrusted sources, high dynamicity and overwhelming size. Even though, it also presents characteristics that can be interesting for knowledge acquisition: due to its huge size and heterogeneity it has been assumed that the Web approximates the real distribution of the information in humankind. The present work introduces a novel approach for ontology learning, introducing new methods for knowledge acquisition from the Web. The adaptation of several well known learning techniques to the web corpus and the exploitation of particular characteristics of the Web environment composing an automatic, unsupervised and domain independent approach distinguishes the present proposal from previous works.With respect to the ontology building process, the following methods have been developed: i) extraction and selection of domain related terms, organising them in a taxonomical way; ii) discovery and label of non-taxonomical relationships between concepts; iii) additional methods for improving the final structure, including the detection of named entities, class features, multiple inheritance and also a certain degree of semantic disambiguation. The full learning methodology has been implemented in a distributed agent-based fashion, providing a scalable solution. It has been evaluated for several well distinguished domains of knowledge, obtaining good quality results. Finally, several direct applications have been developed, including automatic structuring of digital libraries and web resources, and ontology-based Web Information Retrieval
A feminist political ecology of food justice in Iowa
In this dissertation, I study strategies for local and alternative food initiatives to advance more just and equitable approaches to food insecurity. My research focuses on one emerging type of local food initiative designed to address food insecurity, community donation gardening. I engaged in a three-year feminist and ethnographic study of a community-run and USDA- and Cooperative Extension-sponsored donation gardening and food rescue program, Growing Together Iowa. In Growing Together, community gardeners grow or glean food to distribute directly to community members experiencing food insecurity or alternatively to donate to partnering emergency food organizations, such as area food pantries. I focus on the institutional and community gardening partners in Growing Together in an effort to better understand the ways in which solutions to food insecurity emerge from social relations and socio-historical contexts. Framing community donation gardening as sites of political-ecological negotiation and struggle, I show how the politics of food insecurity unfolds in everyday life in Growing Together. I also explore how the social justice praxis of marginalized communities and community-engaged scholar-activists can play a role in that unfolding. Drawing upon feminist theories and methodologies to examine these sites, I demonstrate how uneven power relations permeate even well-intended efforts to address food insecurity. Through community-engaged scholar-activism and co-authorship, this dissertation also identifies possibilities for developing solutions to food insecurity that include but extend beyond immediate food needs. These efforts reveal opportunities for developing new subjectivities and practices by identifying different modes of connecting to food and community and by contesting the rhetoric of personal responsibility and poor food choices attributed to food insecurity. Engaging with food justice and a feminist political ecology of food insecurity that is material, structural, and discursive, this research works to reconfigure power relations in local food initiatives by working with and for those who are most marginalized in our food system
Addressing the cold start problem in tag-based recommender systems
Folksonomies have become a powerful tool to describe, discover, search, and navigate
online resources (e.g., pictures, videos, blogs) on the Social Web. Unlike taxonomies and
ontologies, which impose a hierarchical categorisation on content, folksonomies directly
allow end users to freely create and choose the categories (in this case, tags) that best
describe a piece of information. However, the freedom aafforded to users comes at a cost:
as tags are defined informally, the retrieval of information becomes more challenging.
Different solutions have been proposed to help users discover content in this highly dynamic
setting. However, they have proved to be effective only for users who have already heavily
used the system (active users) and who are interested in popular items (i.e., items tagged
by many other users).
In this thesis we explore principles to help both active users and more importantly new or
inactive users (cold starters) to find content they are interested in even when this content
falls into the long tail of medium-to-low popularity items (cold start items). We investigate
the tagging behaviour of users on content and show how the similarities between users and
tags can be used to produce better recommendations. We then analyse how users create
new content on social tagging websites and show how preferences of only a small portion
of active users (leaders), responsible for the vast majority of the tagged content, can be
used to improve the recommender system's scalability. We also investigate the growth of
the number of users, items and tags in the system over time. We then show how this
information can be used to decide whether the benefits of an update of the data structures
modelling the system outweigh the corresponding cost.
In this work we formalize the ideas introduced above and we describe their implementation.
To demonstrate the improvements of our proposal in recommendation efficacy and
efficiency, we report the results of an extensive evaluation conducted on three different
social tagging websites: CiteULike, Bibsonomy and MovieLens. Our results demonstrate
that our approach achieves higher accuracy than state-of-the-art systems for cold start
users and for users searching for cold start items. Moreover, while accuracy of our technique
is comparable to other techniques for active users, the computational cost that it
requires is much smaller. In other words our approach is more scalable and thus more
suitable for large and quickly growing settings
New Fundamental Technologies in Data Mining
The progress of data mining technology and large public popularity establish a need for a comprehensive text on the subject. The series of books entitled by "Data Mining" address the need by presenting in-depth description of novel mining algorithms and many useful applications. In addition to understanding each section deeply, the two books present useful hints and strategies to solving problems in the following chapters. The contributing authors have highlighted many future research directions that will foster multi-disciplinary collaborations and hence will lead to significant development in the field of data mining
- …