18 research outputs found

    Data mining and database systems: integrating conceptual clustering with a relational database management system.

    Get PDF
    Many clustering algorithms have been developed and improved over the years to cater for large scale data clustering. However, much of this work has been in developing numeric based algorithms that use efficient summarisations to scale to large data sets. There is a growing need for scalable categorical clustering algorithms as, although numeric based algorithms can be adapted to categorical data, they do not always produce good results. This thesis presents a categorical conceptual clustering algorithm that can scale to large data sets using appropriate data summarisations. Data mining is distinguished from machine learning by the use of larger data sets that are often stored in database management systems (DBMSs). Many clustering algorithms require data to be extracted from the DBMS and reformatted for input to the algorithm. This thesis presents an approach that integrates conceptual clustering with a DBMS. The presented approach makes the algorithm main memory independent and supports on-line data mining

    Approches environnement-centrées pour la simulation de systèmes multi-agents: Pour un déplacement de la complexité des agents vers l'environnement

    Get PDF
    This habilitation thesis synthesizes research works which are mainly related to the field of Multi-Agent Based Simulation (MABS). MABS is a general framework for modeling and experimenting with systems in which the dynamics emerges from local interactions among individuals (autonomous agents). Examples of use range from the study of natural systems (e.g. ant colonies, crowds or traffic jams) to the engineering of artificial ones (e.g., collective robotics, distributed artificial intelligence-based softwares). To this end, MABS modeling represents the behavior of individuals, their environment and interactions, so that global dynamics can be computed and studied from the bottom up. In this context, we have been investigating research on the theory and practice of MABS from two different perspectives : (1) the design of generic abstractions dedicated to the modeling of multi-agent dynamics (e.g., the IRM4S model) and (2) the engineering of MABS (MaDKit and TurtleKit platforms). Besides, we have been experimenting with MABS in different application domains such as image processing, video games, and collective robotics. Contrary to approaches that put the emphasis on the agent behaviors, all these works have been done by considering the environment of the agents as a first order abstraction. In this thesis, we first reflect upon the research we have conducted according to this perspective. Next, we show how we actually use this perspective to propose an original approach for using General-Purpose processing on Graphics Processing Units (GPGPU) within MABS, and then present the research perspectives related to our positioning.Les travaux de recherche synthétisés dans ce mémoire s’inscrivent principalement dans le domaine de la modélisation et de la simulation de systèmes multi-agents (SMA). La simulation multi-agents met en œuvre des modèles où les individus, leur environnement et leurs interactions sont directement représentés. Dans ces modèles, chaque individu –agent autonome– possède son propre comportement et produit ses actions en fonction d’une perception locale de son environnement. Ainsi, la simulation multi-agents est utilisée pour étudier des systèmes naturels comme les colonies de fourmis, les dynamiques de foules ou le trafic urbain, mais aussi pour concevoir des systèmes artificiels, par exemple dans le cadre de la robotique collective ou le développement de logiciels basés sur de l’intelligence artificielle distribuée. Dans ce cadre, nos recherches ont porté sur des problématiques liées à la modélisation de simulations multi-agents, avec la proposition de modèles formels et conceptuels (e.g. le modèle IRM4S) et d’outils logiciels génériques (plates-formes MaDKit et TurtleKit), et sur leur utilisation dans divers domaines tels que le jeu vidéo, le traitement numérique de l’image ou la robotique collective. Contrairement aux approches centrées sur la conception des comportements individuels, dans ces travaux l’environnement des agents est considéré comme une abstraction de premier ordre. Dans ce mémoire, nous dressons tout d’abord un bilan de nos recherches en argumentant l’intérêt d’une telle démarche pour les modèles multi-agents. Nous montrons ensuite comment celle-ci nous a récemment permis de proposer une approche originale dans le cadre de l’utilisation du calcul haute performance sur carte graphique (GPGPU) pour la simulation de SMA, avant de présenter les perspectives de recherche associées à notre positionnement

    Políticas de Copyright de Publicações Científicas em Repositórios Institucionais: O Caso do INESC TEC

    Get PDF
    A progressiva transformação das práticas científicas, impulsionada pelo desenvolvimento das novas Tecnologias de Informação e Comunicação (TIC), têm possibilitado aumentar o acesso à informação, caminhando gradualmente para uma abertura do ciclo de pesquisa. Isto permitirá resolver a longo prazo uma adversidade que se tem colocado aos investigadores, que passa pela existência de barreiras que limitam as condições de acesso, sejam estas geográficas ou financeiras. Apesar da produção científica ser dominada, maioritariamente, por grandes editoras comerciais, estando sujeita às regras por estas impostas, o Movimento do Acesso Aberto cuja primeira declaração pública, a Declaração de Budapeste (BOAI), é de 2002, vem propor alterações significativas que beneficiam os autores e os leitores. Este Movimento vem a ganhar importância em Portugal desde 2003, com a constituição do primeiro repositório institucional a nível nacional. Os repositórios institucionais surgiram como uma ferramenta de divulgação da produção científica de uma instituição, com o intuito de permitir abrir aos resultados da investigação, quer antes da publicação e do próprio processo de arbitragem (preprint), quer depois (postprint), e, consequentemente, aumentar a visibilidade do trabalho desenvolvido por um investigador e a respetiva instituição. O estudo apresentado, que passou por uma análise das políticas de copyright das publicações científicas mais relevantes do INESC TEC, permitiu não só perceber que as editoras adotam cada vez mais políticas que possibilitam o auto-arquivo das publicações em repositórios institucionais, como também que existe todo um trabalho de sensibilização a percorrer, não só para os investigadores, como para a instituição e toda a sociedade. A produção de um conjunto de recomendações, que passam pela implementação de uma política institucional que incentive o auto-arquivo das publicações desenvolvidas no âmbito institucional no repositório, serve como mote para uma maior valorização da produção científica do INESC TEC.The progressive transformation of scientific practices, driven by the development of new Information and Communication Technologies (ICT), which made it possible to increase access to information, gradually moving towards an opening of the research cycle. This opening makes it possible to resolve, in the long term, the adversity that has been placed on researchers, which involves the existence of barriers that limit access conditions, whether geographical or financial. Although large commercial publishers predominantly dominate scientific production and subject it to the rules imposed by them, the Open Access movement whose first public declaration, the Budapest Declaration (BOAI), was in 2002, proposes significant changes that benefit the authors and the readers. This Movement has gained importance in Portugal since 2003, with the constitution of the first institutional repository at the national level. Institutional repositories have emerged as a tool for disseminating the scientific production of an institution to open the results of the research, both before publication and the preprint process and postprint, increase the visibility of work done by an investigator and his or her institution. The present study, which underwent an analysis of the copyright policies of INESC TEC most relevant scientific publications, allowed not only to realize that publishers are increasingly adopting policies that make it possible to self-archive publications in institutional repositories, all the work of raising awareness, not only for researchers but also for the institution and the whole society. The production of a set of recommendations, which go through the implementation of an institutional policy that encourages the self-archiving of the publications developed in the institutional scope in the repository, serves as a motto for a greater appreciation of the scientific production of INESC TEC

    Intelligent techniques for context-aware systems

    Get PDF
    Nowadays, with advances in communication technologies, researches are focused in the fields of designing new devices with increasing capabilities, implanting software frameworks or middleware to make these devices interoperable. Building better human interfaces is a challenging task and the adoption of Artificial Intelligence (AI) techniques to the process help associating semantic meaning to devices which makes possible the gesture recognition and voice recognition. This thesis is mainly concerned with the open problem in context-aware systems: the evaluation of these systems in Ambient Intelligence (AmI) environments. With regard to this issue, we argue that due to highly dynamic properties of the AmI environments, it should exist a methodology for evaluating these systems taking into account the type of scenarios. However in order to support with a solid ground for that discussion, some elements are to be discussed as well. In particular, we: • use a commercial platform that allows us to design and manage the contextual information of context- aware systems by means of a context manager included in the architecture; • analyze the formal representation of this contextual information by means of a knowledge based system (KBS); • discuss the possible methodologies to be used for modelling knowledge in KBS and our approach; • give reasons why intelligent agents is a valid technique to be applied to systems in AmI environments; • propose a generic multi-agent system (MAS) architecture that can be applied to a large class of envisaged AmI applications; • propose a multimodal user interface and its integration with our MAS; • propose an evaluation methodology for context-aware systems in AmI scenarios. The formulation of the above mentioned elements became necessary as this thesis was developed. The lack of an evaluation methodology for context-aware systems in AmI environments, where so many issues to be covered, took us to the main objective of this thesis. In this regard: • we provide an updated and exhaustive state-of-the-art of this matter; • examine the properties and characteristics of AmI scenarios; • put forward an evaluation methodology and experimentally test our methodology in AmI scenarios. ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------La Inteligencia Ambiental y los entornos inteligentes hacen hincapié en una mayor facilidad de uso, soporte de servicios más eficientes, el apoderamiento de los usuarios, y el apoyo a las interacciones humanas. En esta visión, las personas estarán rodeadas de interfaces inteligentes e intuitivas incrustados en objetos cotidianos que nos rodean y los sistemas desarrollados para este ambiente deberán reconocer y responder a la presencia de individuos de una manera invisible y transparente a ellos. Esta tesis se centra principalmente en el problema abierto en los sistemas sensibles al contexto: la evaluación de estos sistemas en los entornos de Inteligencia Ambiental. Con respecto a este tema, se argumenta que debido a las propiedades altamente dinámica de los entornos de inteligencia ambiental, debería existir una metodología para la evaluación de estos sistemas, teniendo en cuenta el tipo de escenarios. Sin embargo, con el fin de apoyar con una base sólida para la discusión, algunos elementos deben ser discutidos también. En particular, nosotros: • Usamos una plataforma comercial que nos permite diseñar y gestionar la información contextual de los sistemas sensibles al contexto a través de un gestor de contexto incluido en la arquitectura; • Analizamos la representación formal de esta información contextual a través de un sistema basado en el conocimiento (SBC); • Discutimos las posibles metodologías que se utilizarán para el modelado del conocimiento en SBC y nuestra aproximación y propuesta; • Discutimos las razones del por qué los agentes inteligentes son una técnica válida para ser aplicada a los sistemas en entornos inteligencia ambiental; • Proponemos un sistema multi-agente (SMA), con una arquitectura genérica que se puede aplicar a una gran clase de aplicaciones de inteligencia ambiental; • Proponemos una interfaz de usuario multimodales y su integración con nuestro SMA; • Proponemos una metodología de evaluación de los sistemas sensibles al contexto en los escenarios de inteligencia ambiental. La formulación de los elementos antes mencionados se hizo necesaria en la medida que esta tesis se ha desarrollado. La falta de una metodología de evaluación de los sistemas sensibles al contexto en entornos de inteligencia ambiental, donde existen tantos temas a tratar, nos llevó al objetivo principal de esta tesis. En este sentido, en esta tesis: • Proporcionamos un estado del arte actualizado y exhaustivo de este asunto; • Examinamos las propiedades y características de los escenarios de inteligencia ambiental; • Proponemos una metodología de evaluación para este tipo de sistemas y experimentalmente probamos nuestra metodología en diversos escenarios de inteligencia ambiental

    Data mining and database systems : integrating conceptual clustering with a relational database management system

    Get PDF
    Many clustering algorithms have been developed and improved over the years to cater for large scale data clustering. However, much of this work has been in developing numeric based algorithms that use efficient summarisations to scale to large data sets. There is a growing need for scalable categorical clustering algorithms as, although numeric based algorithms can be adapted to categorical data, they do not always produce good results. This thesis presents a categorical conceptual clustering algorithm that can scale to large data sets using appropriate data summarisations. Data mining is distinguished from machine learning by the use of larger data sets that are often stored in database management systems (DBMSs). Many clustering algorithms require data to be extracted from the DBMS and reformatted for input to the algorithm. This thesis presents an approach that integrates conceptual clustering with a DBMS. The presented approach makes the algorithm main memory independent and supports on-line data mining.EThOS - Electronic Theses Online ServiceGBUnited Kingdo

    Linguistics parameters for zero anaphora resolution

    Get PDF
    Dissertação de mest., Natural Language Processing and Human Language Technology, Univ. do Algarve, 2009This dissertation describes and proposes a set of linguistically motivated rules for zero anaphora resolution in the context of a natural language processing chain developed for Portuguese. Some languages, like Portuguese, allow noun phrase (NP) deletion (or zeroing) in several syntactic contexts in order to avoid the redundancy that would result from repetition of previously mentioned words. The co-reference relation between the zeroed element and its antecedent (or previous mention) in the discourse is here called zero anaphora (Mitkov, 2002). In Computational Linguistics, zero anaphora resolution may be viewed as a subtask of anaphora resolution and has an essential role in various Natural Language Processing applications such as information extraction, automatic abstracting, dialog systems, machine translation and question answering. The main goal of this dissertation is to describe the grammatical rules imposing subject NP deletion and referential constraints in the Brazilian Portuguese, in order to allow a correct identification of the antecedent of the deleted subject NP. Some of these rules were then formalized into the Xerox Incremental Parser or XIP (Ait-Mokhtar et al., 2002: 121-144) in order to constitute a module of the Portuguese grammar (Mamede et al. 2010) developed at Spoken Language Laboratory (L2F). Using this rule-based approach we expected to improve the performance of the Portuguese grammar namely by producing better dependency structures with (reconstructed) zeroed NPs for the syntactic-semantic interface. Because of the complexity of the task, the scope of this dissertation had to be limited: (a) subject NP deletion; b) within sentence boundaries and (c) with an explicit antecedent; besides, (d) rules were formalized based solely on the results of the shallow parser (or chunks), that is, with minimal syntactic (and no semantic) knowledge. A corpus of different text genres was manually annotated for zero anaphors and other zero-shaped, usually indefinite, subjects. The rule-based approached is evaluated and results are presented and discussed

    Proceedings of The Multi-Agent Logics, Languages, and Organisations Federated Workshops (MALLOW 2010)

    Get PDF
    http://ceur-ws.org/Vol-627/allproceedings.pdfInternational audienceMALLOW-2010 is a third edition of a series initiated in 2007 in Durham, and pursued in 2009 in Turin. The objective, as initially stated, is to "provide a venue where: the cost of participation was minimum; participants were able to attend various workshops, so fostering collaboration and cross-fertilization; there was a friendly atmosphere and plenty of time for networking, by maximizing the time participants spent together"
    corecore