1,429 research outputs found

    BiblioDAP'21: The 1st Workshop on Bibliographic Data Analysis and Processing

    Get PDF
    Automatic processing of bibliographic data becomes very important in digital libraries, data science and machine learning due to its importance in keeping pace with the significant increase of published papers every year from one side and to the inherent challenges from the other side. This processing has several aspects including but not limited to I) Automatic extraction of references from PDF documents, II) Building an accurate citation graph, III) Author name disambiguation, etc. Bibliographic data is heterogeneous by nature and occurs in both structured (e.g. citation graph) and unstructured (e.g. publications) formats. Therefore, it requires data science and machine learning techniques to be processed and analysed. Here we introduce BiblioDAP'21: The 1st Workshop on Bibliographic Data Analysis and Processing

    TechMiner: Extracting Technologies from Academic Publications

    Get PDF
    In recent years we have seen the emergence of a variety of scholarly datasets. Typically these capture ‘standard’ scholarly entities and their connections, such as authors, affiliations, venues, publications, citations, and others. However, as the repositories grow and the technology improves, researchers are adding new entities to these repositories to develop a richer model of the scholarly domain. In this paper, we introduce TechMiner, a new approach, which combines NLP, machine learning and semantic technologies, for mining technologies from research publications and generating an OWL ontology describing their relationships with other research entities. The resulting knowledge base can support a number of tasks, such as: richer semantic search, which can exploit the technology dimension to support better retrieval of publications; richer expert search; monitoring the emergence and impact of new technologies, both within and across scientific fields; studying the scholarly dynamics associated with the emergence of new technologies; and others. TechMiner was evaluated on a manually annotated gold standard and the results indicate that it significantly outperforms alternative NLP approaches and that its semantic features improve performance significantly with respect to both recall and precision

    The practice of self-citations: a longitudinal study

    Get PDF
    In this article, we discuss the outcomes of an experiment where we analysed whether and to what extent the introduction, in 2012, of the new research assessment exercise in Italy (a.k.a. Italian Scientific Habilitation) affected self-citation behaviours in the Italian research community. The Italian Scientific Habilitation attests to the scientific maturity of researchers and in Italy, as in many other countries, is a requirement for accessing to a professorship. To this end, we obtained from ScienceDirect 35,673 articles published from 1957 to 2016 by the participants to the 2012 Italian Scientific Habilitation, that resulted in the extraction of 1,379,050 citations retrieved through Semantic Publishing technologies. Our analysis showed an overall increment in author self-citations (i.e. where the citing article and the cited article share at least one author) in several of the 24 academic disciplines considered. However, we depicted a stronger causal relation between such increment and the rules introduced by the 2012 Italian Scientific Habilitation in 10 out of 24 disciplines analysed

    Predicting the results of evaluation procedures of academics

    Get PDF
    Background. The 2010 reform of the Italian university system introduced the National Scientific Habilitation (ASN) as a requirement for applying to permanent professor positions. Since the CVs of the 59,149 candidates and the results of their assessments have been made publicly available, the ASN constitutes an opportunity to perform analyses about a nation-wide evaluation process. Objective. The main goals of this paper are: (i) predicting the ASN results using the information contained in the candidates’ CVs; (ii) identifying a small set of quantitative indicators that can be used to perform accurate predictions. Approach. Semantic technologies are used to extract, systematize and enrich the information contained in the applicants’ CVs, and machine learning methods are used to predict the ASN results and to identify a subset of relevant predictors. Results. For predicting the success in the role of associate professor, our best models using all and the top 15 predictors make accurate predictions (F-measure values higher than 0.6) in 88% and 88.6% of the cases, respectively. Similar results have been achieved for the role of full professor. Evaluation. The proposed approach outperforms the other models developed to predict the results of researchers’ evaluation procedures. Conclusions. Such results allow the development of an automated system for supporting both candidates and committees in the future ASN sessions and other scholars’ evaluation procedures

    Perioperative latex allergy.

    Get PDF
    The prevalence of latex allergy varies greatly depending on the population studied and the methods used to detect sensitization. Subjects considered to be at high risk for latex allergy are rubber industry workers, children with spina bifida and urological abnormalities, children undergoing multiple surgical procedures and with urinary catheterization, health care workers and people with food allergy (latex fruit syndrome). In this paper we report a review of latex proteins, the symptoms of latex allergy, diagnosis and management in subjects with latex allergy

    Impact of delivery mode on the colostrum microbiota composition.

    Get PDF
    BACKGROUND: Breast milk is a rich nutrient with a temporally dynamic nature. In particular, numerous alterations in the nutritional, immunological and microbiological content occur during the transition from colostrum to mature milk. The objective of our study was to evaluate the potential impact of delivery mode on the microbiota of colostrum, at both the quantitative and qualitative levels (bacterial abundance and microbiota network). METHODS: Twenty-nine Italian mothers (15 vaginal deliveries vs 14 Cesarean sections) were enrolled in the study. The microbiota of colostrum samples was analyzed by next generation sequencing (Ion Torrent Personal Genome Machine). The colostrum microbiota network associated with Cesarean section and vaginal delivery was evaluated by means of the Auto Contractive Map (AutoCM), a mathematical methodology based on Artificial Neural Network (ANN) architecture. RESULTS: Numerous differences between Cesarean section and vaginal delivery colostrum were observed. Vaginal delivery colostrum had a significant lower abundance of Pseudomonas spp., Staphylococcus spp. and Prevotella spp. when compared to Cesarean section colostrum samples. Furthermore, the mode of delivery had a strong influence on the microbiota network, as Cesarean section colostrum showed a higher number of bacterial hubs if compared to vaginal delivery, sharing only 5 hubs. Interestingly, the colostrum of mothers who had a Cesarean section was richer in environmental bacteria than mothers who underwent vaginal delivery. Finally, both Cesarean section and vaginal delivery colostrum contained a greater number of anaerobic bacteria genera. CONCLUSIONS: The mode of delivery had a large impact on the microbiota composition of colostrum. Further studies are needed to better define the meaning of the differences we observed between Cesarean section and vaginal delivery colostrum microbiota

    Klink-2: integrating multiple web sources to generate semantic topic networks

    Get PDF
    The amount of scholarly data available on the web is steadily increasing, enabling different types of analytics which can provide important insights into the research activity. In order to make sense of and explore this large-scale body of knowledge we need an accurate, comprehensive and up-to-date ontology of research topics. Unfortunately, human crafted classifications do not satisfy these criteria, as they evolve too slowly and tend to be too coarse-grained. Current automated methods for generating ontologies of research areas also present a number of limitations, such as: i) they do not consider the rich amount of indirect statistical and semantic relationships, which can help to understand the relation between two topics – e.g., the fact that two research areas are associated with a similar set of venues or technologies; ii) they do not distinguish between different kinds of hierarchical relationships; and iii) they are not able to handle effectively ambiguous topics characterized by a noisy set of relationships. In this paper we present Klink-2, a novel approach which improves on our earlier work on automatic generation of semantic topic networks and addresses the aforementioned limitations by taking advantage of a variety of knowledge sources available on the web. In particular, Klink-2 analyses networks of research entities (including papers, authors, venues, and technologies) to infer three kinds of semantic relationships between topics. It also identifies ambiguous keywords (e.g., “ontology”) and separates them into the appropriate distinct topics – e.g., “ontology/philosophy” vs. “ontology/semantic web”. Our experimental evaluation shows that the ability of Klink-2 to integrate a high number of data sources and to generate topics with accurate contextual meaning yields significant improvements over other algorithms in terms of both precision and recall

    A Systematic Review on the Effects of Botanicals on Skeletal Muscle Health in Order to Prevent Sarcopenia

    Get PDF
    We performed a systematic review to evaluate the evidence-based medicine regarding the main botanical extracts and their nutraceutical compounds correlated to skeletal muscle health in order to identify novel strategies that effectively attenuate skeletal muscle loss and enhance muscle function and to improve the quality of life of older subjects. This review contains all eligible studies from 2010 to 2015 and included 57 publications. We focused our attention on effects of botanical extracts on growth and health of muscle and divided these effects into five categories: anti-inflammation, muscle damage prevention, antifatigue, muscle atrophy prevention, and muscle regeneration and differentiation
    corecore