15 research outputs found

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article(s). The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency–Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that each of these methods tends to produce a distinct collection of recommended articles, suggesting that a hybrid method may be required to capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for downloading annotation data and for blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of powerful new techniques for title and title/abstract-based search engines for relevant articles in biomedical research.

    Epigenome-wide association study of serum folate in maternal peripheral blood leukocytes

    Aim: To perform an epigenome-wide association study (EWAS) of serum folate in maternal blood. Methods: Cross-ancestry (Europeans = 302, South Asians = 161) and ancestry-specific EWAS in the EPIPREG cohort were performed, followed by methyl quantitative trait loci analysis and association with cardiometabolic phenotypes. Replication was attempted using maternal folate intake and blood methylation data from the MoBa study, and the findings were checked for significance in a previous EWAS of maternal serum folate in cord blood. Results & conclusion: cg19888088 (cross-ancestry) in EBF3, cg01952260 (Europeans) and cg07077240 (South Asians) in HERC3 were associated with serum folate. cg19888088 and cg01952260 were associated with diastolic blood pressure. cg07077240 was associated with variants in CASC15. The findings were not replicated and were not significant in cord blood.

    Heterogeneity of glycaemic phenotypes in type 1 diabetes

    Aims/hypothesis: Our study aims to uncover glycaemic phenotype heterogeneity in type 1 diabetes. Methods: In the Study of the French-speaking Society of Type 1 Diabetes (SFDT1), we characterised glycaemic heterogeneity using a set of complementary metrics: HbA1c, time in range (TIR), time below range (TBR), CV, Gold score and glycaemia risk index (GRI). Applying the Discriminative Dimensionality Reduction with Trees (DDRTree) algorithm, we created a phenotypic tree, i.e. a 2D visual mapping. We also carried out a clustering analysis for comparison. Results: We included 618 participants with type 1 diabetes (52.9% men, mean age 40.6 years [SD 14.1]). Our phenotypic tree identified seven glycaemic phenotypes. The 2D phenotypic tree comprised a main branch in the proximal region and glycaemic phenotypes in the distal areas. Dimension 1, the horizontal dimension, was positively associated with GRI (coefficient [95% CI]) (0.54 [0.52, 0.57]), HbA1c (0.39 [0.35, 0.42]), CV (0.24 [0.19, 0.28]) and TBR (0.11 [0.06, 0.15]), and negatively with TIR (-0.52 [-0.54, -0.49]). The vertical dimension was positively associated with TBR (0.41 [0.38, 0.44]), CV (0.40 [0.37, 0.43]), TIR (0.16 [0.12, 0.20]), Gold score (0.10 [0.06, 0.15]) and GRI (0.06 [0.02, 0.11]), and negatively with HbA1c (-0.21 [-0.25, -0.17]). Notably, socioeconomic factors, cardiovascular risk indicators, retinopathy and treatment strategy were significant determinants of glycaemic phenotype diversity. The phenotypic tree enabled more granularity than traditional clustering in revealing clinically relevant subgroups of people with type 1 diabetes. Conclusions/interpretation: Our study advances the current understanding of the complex glycaemic profile in people with type 1 diabetes and suggests that strategies based on isolated glycaemic metrics might not capture the complexity of the glycaemic phenotypes in real life. Relying on these phenotypes could improve patient stratification in type 1 diabetes care and personalise disease management.

    Influence of Nucleoshuttling of the ATM Protein in the Healthy Tissues Response to Radiation Therapy: Toward a Molecular Classification of Human Radiosensitivity

    PURPOSE: Whereas post-radiation therapy overreactions (OR) represent a clinical and societal issue, there is still no consensual radiobiological endpoint to predict clinical radiosensitivity. Since 2003, skin biopsy specimens have been collected from patients treated by radiation therapy against different tumor localizations and showing a wide range of OR. Here, we aimed to establish quantitative links between radiobiological factors and OR severity grades that would be relevant to radioresistant and genetic hyperradiosensitive cases. METHODS AND MATERIALS: Immunofluorescence experiments were performed on a collection of skin fibroblasts from 12 radioresistant, 5 hyperradiosensitive, and 100 OR patients irradiated at 2 Gy. The numbers of micronuclei, γH2AX, and pATM foci that reflect different steps of DNA double-strand breaks (DSB) recognition and repair were assessed from 10 minutes to 24 hours after irradiation and plotted against the severity grades established by the Common Terminology Criteria for Adverse Events and the Radiation Therapy Oncology Group. RESULTS: OR patients did not necessarily show a gross DSB repair defect but a systematic delay in the nucleoshuttling of the ATM protein required for complete DSB recognition. Among the radiobiological factors, the maximal number of pATM foci provided the best discrimination among OR patients and a significant correlation with each OR severity grade, independently of tumor localization and of the early or late nature of reactions. CONCLUSIONS: Our results are consistent with a general classification of human radiosensitivity based on 3 groups: radioresistance (group I); moderate radiosensitivity caused by delay of nucleoshuttling of ATM, which includes OR patients (group II); and hyperradiosensitivity caused by a gross DSB repair defect, which includes fatal cases (group III).

    ESC core curriculum for the cardiologist

    Roadmap for cardiovascular education across the European Society of Cardiology: inspiring better knowledge and skills, now and for the future
