1,429 research outputs found
BiblioDAP'21: The 1st Workshop on Bibliographic Data Analysis and Processing
Automatic processing of bibliographic data becomes very important in digital libraries, data science and machine learning due to its importance in keeping pace with the significant increase of published papers every year from one side and to the inherent challenges from the other side. This processing has several aspects including but not limited to I) Automatic extraction of references from PDF documents, II) Building an accurate citation graph, III) Author name disambiguation, etc. Bibliographic data is heterogeneous by nature and occurs in both structured (e.g. citation graph) and unstructured (e.g. publications) formats. Therefore, it requires data science and machine learning techniques to be processed and analysed. Here we introduce BiblioDAP'21: The 1st Workshop on Bibliographic Data Analysis and Processing
TechMiner: Extracting Technologies from Academic Publications
In recent years we have seen the emergence of a variety of scholarly datasets. Typically these capture ‘standard’ scholarly entities and their connections, such as authors, affiliations, venues, publications, citations, and others. However, as the repositories grow and the technology improves, researchers are adding new entities to these repositories to develop a richer model of the scholarly domain. In this paper, we introduce TechMiner, a new approach, which combines NLP, machine learning and semantic technologies, for mining technologies from research publications and generating an OWL ontology describing their relationships with other research entities. The resulting knowledge base can support a number of tasks, such as: richer semantic search, which can exploit the technology dimension to support better retrieval of publications; richer expert search; monitoring the emergence and impact of new technologies, both within and across scientific fields; studying the scholarly dynamics associated with the emergence of new technologies; and others. TechMiner was evaluated on a manually annotated gold standard and the results indicate that it significantly outperforms alternative NLP approaches and that its semantic features improve performance significantly with respect to both recall and precision
The practice of self-citations: a longitudinal study
In this article, we discuss the outcomes of an experiment where we analysed whether and to what extent the introduction, in 2012, of the new research assessment exercise in Italy (a.k.a. Italian Scientific Habilitation) affected self-citation behaviours in the Italian research community. The Italian Scientific Habilitation attests to the scientific maturity of researchers and in Italy, as in many other countries, is a requirement for accessing to a professorship. To this end, we obtained from ScienceDirect 35,673 articles published from 1957 to 2016 by the participants to the 2012 Italian Scientific Habilitation, that resulted in the extraction of 1,379,050 citations retrieved through Semantic Publishing technologies. Our analysis showed an overall increment in author self-citations (i.e. where the citing article and the cited article share at least one author) in several of the 24 academic disciplines considered. However, we depicted a stronger causal relation between such increment and the rules introduced by the 2012 Italian Scientific Habilitation in 10 out of 24 disciplines analysed
Predicting the results of evaluation procedures of academics
Background. The 2010 reform of the Italian university system introduced the National Scientific Habilitation (ASN) as a requirement for applying to permanent professor positions. Since the CVs of the 59,149 candidates and the results of their assessments have been made publicly available, the ASN constitutes an opportunity to perform analyses about a nation-wide evaluation process. Objective. The main goals of this paper are: (i) predicting the ASN results using the information contained in the candidates’ CVs; (ii) identifying a small set of quantitative indicators that can be used to perform accurate predictions. Approach. Semantic technologies are used to extract, systematize and enrich the information contained in the applicants’ CVs, and machine learning methods are used to predict the ASN results and to identify a subset of relevant predictors. Results. For predicting the success in the role of associate professor, our best models using all and the top 15 predictors make accurate predictions (F-measure values higher than 0.6) in 88% and 88.6% of the cases, respectively. Similar results have been achieved for the role of full professor. Evaluation. The proposed approach outperforms the other models developed to predict the results of researchers’ evaluation procedures. Conclusions. Such results allow the development of an automated system for supporting both candidates and committees in the future ASN sessions and other scholars’ evaluation procedures
Perioperative latex allergy.
The prevalence of latex allergy varies greatly depending on the population studied and the methods used to detect sensitization. Subjects considered to be at high risk for latex allergy are rubber industry workers, children with spina bifida and urological abnormalities, children undergoing multiple surgical procedures and with urinary catheterization, health care workers and people with food allergy (latex fruit syndrome). In this paper we report a review of latex proteins, the symptoms of latex allergy, diagnosis and management in subjects with latex allergy
Impact of delivery mode on the colostrum microbiota composition.
BACKGROUND: Breast milk is a rich nutrient with a temporally dynamic nature. In
particular, numerous alterations in the nutritional, immunological and
microbiological content occur during the transition from colostrum to mature
milk. The objective of our study was to evaluate the potential impact of delivery
mode on the microbiota of colostrum, at both the quantitative and qualitative
levels (bacterial abundance and microbiota network).
METHODS: Twenty-nine Italian mothers (15 vaginal deliveries vs 14 Cesarean
sections) were enrolled in the study. The microbiota of colostrum samples was
analyzed by next generation sequencing (Ion Torrent Personal Genome Machine). The
colostrum microbiota network associated with Cesarean section and vaginal
delivery was evaluated by means of the Auto Contractive Map (AutoCM), a
mathematical methodology based on Artificial Neural Network (ANN) architecture.
RESULTS: Numerous differences between Cesarean section and vaginal delivery
colostrum were observed. Vaginal delivery colostrum had a significant lower
abundance of Pseudomonas spp., Staphylococcus spp. and Prevotella spp. when
compared to Cesarean section colostrum samples. Furthermore, the mode of delivery
had a strong influence on the microbiota network, as Cesarean section colostrum
showed a higher number of bacterial hubs if compared to vaginal delivery, sharing
only 5 hubs. Interestingly, the colostrum of mothers who had a Cesarean section
was richer in environmental bacteria than mothers who underwent vaginal delivery.
Finally, both Cesarean section and vaginal delivery colostrum contained a greater
number of anaerobic bacteria genera.
CONCLUSIONS: The mode of delivery had a large impact on the microbiota
composition of colostrum. Further studies are needed to better define the meaning
of the differences we observed between Cesarean section and vaginal delivery
colostrum microbiota
Klink-2: integrating multiple web sources to generate semantic topic networks
The amount of scholarly data available on the web is steadily increasing, enabling different types of analytics which can provide important insights into the research activity. In order to make sense of and explore this large-scale body of knowledge we need an accurate, comprehensive and up-to-date ontology of research topics. Unfortunately, human crafted classifications do not satisfy these criteria, as they evolve too slowly and tend to be too coarse-grained. Current automated methods for generating ontologies of research areas also present a number of limitations, such as: i) they do not consider the rich amount of indirect statistical and semantic relationships, which can help to understand the relation between two topics – e.g., the fact that two research areas are associated with a similar set of venues or technologies; ii) they do not distinguish between different kinds of hierarchical relationships; and iii) they are not able to handle effectively ambiguous topics characterized by a noisy set of relationships. In this paper we present Klink-2, a novel approach which improves on our earlier work on automatic generation of semantic topic networks and addresses the aforementioned limitations by taking advantage of a variety of knowledge sources available on the web. In particular, Klink-2 analyses networks of research entities (including papers, authors, venues, and technologies) to infer three kinds of semantic relationships between topics. It also identifies ambiguous keywords (e.g., “ontology”) and separates them into the appropriate distinct topics – e.g., “ontology/philosophy” vs. “ontology/semantic web”. Our experimental evaluation shows that the ability of Klink-2 to integrate a high number of data sources and to generate topics with accurate contextual meaning yields significant improvements over other algorithms in terms of both precision and recall
A Systematic Review on the Effects of Botanicals on Skeletal Muscle Health in Order to Prevent Sarcopenia
We performed a systematic review to evaluate the evidence-based medicine regarding the main botanical extracts and their nutraceutical compounds correlated to skeletal muscle health in order to identify novel strategies that effectively attenuate skeletal muscle loss and enhance muscle function and to improve the quality of life of older subjects. This review contains all eligible studies from 2010 to 2015 and included 57 publications. We focused our attention on effects of botanical extracts on growth and health of muscle and divided these effects into five categories: anti-inflammation, muscle damage prevention, antifatigue, muscle atrophy prevention, and muscle regeneration and differentiation
- …