8 research outputs found

    Epistemic logic for metadata modelling from scientific papers on Covid-19

    Get PDF
    The field of epistemic logic developed into an interdisciplinary area focused on explicating epistemic issues in, for example, artificial intelligence, computer security, game theory, economics, multiagent systems and the social sciences. Inspired, in part, by issues in these different ‘application’ areas, in this paper I propose an epistemic logic T for metadata extracted from scientific papers on COVID-19. More in details, I introduce a structure S to syntactically and semantically modelling metadata extracted with systems for extracting structured metadata from scientific articles in a born-digital form. These systems will be considered, in the logical model created, as ‘Metadata extraction agents’ (MEA). In this case MEA taken into consideration are CERMINE and TeamBeam. In an increasingly data-driven world, modelling data or metadata means to help systematise existing information and support the research community in building solutions to the COVID-19 pandemic

    Towards a Modular Recommender System for Research Papers written in Albanian

    Get PDF
    In the recent years there has been an increase in scientific papers publications in Albania and its neighboring countries that have large communities of Albanian speaking researchers. Many of these papers are written in Albanian. It is a very time consuming task to find papers related to the researchers' work, because there is no concrete system that facilitates this process. In this paper we present the design of a modular intelligent search system for articles written in Albanian. The main part of it is the recommender module that facilitates searching by providing relevant articles to the users (in comparison with a given one). We used a cosine similarity based heuristics that differentiates the importance of term frequencies based on their location in the article. We did not notice big differences on the recommendation results when using different combinations of the importance factors of the keywords, title, abstract and body. We got similar results when using only the title and abstract in comparison with the other combinations. Because we got fairly good results in this initial approach, we believe that similar recommender systems for documents written in Albanian can be build also in contexts not related to scientific publishing.Comment: 8 page

    Doctor of Philosophy

    Get PDF
    dissertationMedical knowledge learned in medical school can become quickly outdated given the tremendous growth of the biomedical literature. It is the responsibility of medical practitioners to continuously update their knowledge with recent, best available clinical evidence to make informed decisions about patient care. However, clinicians often have little time to spend on reading the primary literature even within their narrow specialty. As a result, they often rely on systematic evidence reviews developed by medical experts to fulfill their information needs. At the present, systematic reviews of clinical research are manually created and updated, which is expensive, slow, and unable to keep up with the rapidly growing pace of medical literature. This dissertation research aims to enhance the traditional systematic review development process using computer-aided solutions. The first study investigates query expansion and scientific quality ranking approaches to enhance literature search on clinical guideline topics. The study showed that unsupervised methods can improve retrieval performance of a popular biomedical search engine (PubMed). The proposed methods improve the comprehensiveness of literature search and increase the ratio of finding relevant studies with reduced screening effort. The second and third studies aim to enhance the traditional manual data extraction process. The second study developed a framework to extract and classify texts from PDF reports. This study demonstrated that a rule-based multipass sieve approach is more effective than a machine-learning approach in categorizing document-level structures and iv that classifying and filtering publication metadata and semistructured texts enhances the performance of an information extraction system. The proposed method could serve as a document processing step in any text mining research on PDF documents. The third study proposed a solution for the computer-aided data extraction by recommending relevant sentences and key phrases extracted from publication reports. This study demonstrated that using a machine-learning classifier to prioritize sentences for specific data elements performs equally or better than an abstract screening approach, and might save time and reduce errors in the full-text screening process. In summary, this dissertation showed that there are promising opportunities for technology enhancement to assist in the development of systematic reviews. In this modern age when computing resources are getting cheaper and more powerful, the failure to apply computer technologies to assist and optimize the manual processes is a lost opportunity to improve the timeliness of systematic reviews. This research provides methodologies and tests hypotheses, which can serve as the basis for further large-scale software engineering projects aimed at fully realizing the prospect of computer-aided systematic reviews

    Extracción automática de metadatos para la administración del Repositorio Institucional de la Universidad Nacional del Altiplano Puno

    Get PDF
    Los Repositorios Institucionales permiten organizar y preservar la producción científica de una Institución, la presente investigación tiene como finalidad optimizar la extracción de metadatos y publicación de documentos de investigación procesos fundamentales para la administración de Repositorios Institucionales que requieren de tiempo, mediante la implementación del software “E-MeRI”, cuya población se compone por 1518 documentos de investigación. Para el desarrollo del sistema se utilizó la programación por capas y para el contraste de la hipótesis se utilizó prueba t para muestras relacionadas. Con respecto a la extracción automática se elaboró un algoritmo mediante técnicas de procesamiento de lenguaje natural, al cual se determinó la complejidad algorítmica lineal O(n) y demostró ser eficiente en comparación a otras herramientas extractoras. A la misma vez se determinó el nivel de precisión entre 96% y 99% de resultados correctos en base a las métricas Precisión y Recall. De la diferencia del tiempo de extracción, el sistema logra reducir en 5 minutos y 21 segundos por documento y permitió extraer en un minuto 4 documentos. Se concluye que la extracción automática de metadatos y la publicación de documentos de investigación mejoran la administración del Repositorio Institucional de la Universidad Nacional del Altiplano, reduciendo el tiempo de extracción y publicación de forma significativa con un valor p (0.000)< α=0.05, además la evaluación del software basado en la norma ISO 25000 obtuvo un valor de 8.93 de calidad total, logrando un nivel cumple con los requisitos y un grado muy satisfactorio
    corecore