8 research outputs found

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    Synthesis of biologically decomposable terpolymer nanocapsules and higher-order nanoassemblies using RCMP-PISA

    No full text
    Terpolymers are used to generate bio-decomposable self-assemblies. PEG-PVL-PGMA terpolymers are synthesized via a combination of organocatalyzed living radical polymerization (RCMP) with polymerization-induced self-assembly (PISA), generating vesicle, multicompartment micelle (MCM), and core-compartmentalized worm (CCW), where PEG, PVL, and PGMA are poly(ethylene glycol) monomethyl ether, poly(δ-valerolactone), and poly(glycidyl methacrylate) and PGMA is grown during the PISA. MCM and CCW are uniquely obtainable using terpolymers. The PVL segment is biodegradable, and the obtained assemblies are bio-decomposable. Heavy metal-free and sulfur-free synthesis and bio-decomposability of the assemblies are attractive features.National Research Foundation (NRF)Accepted versionThis work was supported by the National Research Foundation (NRF) Investigatorship in Singapore (NRF-NRFI05-2019-0001)

    Multi-center evaluation of artificial intelligent imaging and clinical models for predicting neoadjuvant chemotherapy response in breast cancer

    No full text
    Background: Neoadjuvant chemotherapy (NAC) plays an important role in the management of locally advanced breast cancer. It allows for downstaging of tumors, potentially allowing for breast conservation. NAC also allows for in-vivo testing of the tumors’ response to chemotherapy and provides important prognostic information. There are currently no clearly defined clinical models that incorporate imaging with clinical data to predict response to NAC. Thus, the aim of this work is to develop a predictive AI model based on routine CT imaging and clinical parameters to predict response to NAC. Methods: The CT scans of 324 patients with NAC from multiple centers in Singapore were used in this study. Four different radiomics models were built for predicting pathological complete response (pCR): first two were based on textural features extracted from peri-tumoral and tumoral regions, the third model based on novel space-resolved radiomics which extract feature maps using voxel-based radiomics and the fourth model based on deep learning (DL). Clinical parameters were included to build a final prognostic model. Results: The best performing models were based on space-resolved and DL approaches. Space-resolved radiomics improves the clinical AUCs of pCR prediction from 0.743 (0.650 to 0.831) to 0.775 (0.685 to 0.860) and our DL model improved it from 0.743 (0.650 to 0.831) to 0.772 (0.685 to 0.853). The tumoral radiomics model performs the worst with no improvement of the AUC from the clinical model. The peri-tumoral combined model gives moderate performance with an AUC of 0.765 (0.671 to 0.855). Conclusions: Radiomics features extracted from diagnostic CT augment the predictive ability of pCR when combined with clinical features. The novel space-resolved radiomics and DL radiomics approaches outperformed conventional radiomics techniques.W.L.N. is supported by the National Medical Research Council Fellowship (NMRC/MOH-000166-00)

    Pectobacterium and Dickeya: Environment to Disease Development

    No full text
    The soft rot Pectobacteriaceae (SRP) infect a wide range of plants worldwide and cause economic damage to crops and ornamentals but can also colonize other plants as part of their natural life cycle. They are found in a variety of environmental niches, including water, soil and insects, where they may spread to susceptible plants and cause disease. In this chapter, we look in detail at the plants colonized and infected by these pathogens and at the diseases and symptoms they cause. We also focus on where in the environment these organisms are found and their ability to survive and thrive there. Finally, we present evidence that SRP may assist the colonization of human enteric pathogens on plants, potentially implicating them in aspects of human/animal as well as plant health

    Nine loci for ocular axial length identified through genome-wide association studies, including shared loci with refractive error

    Get PDF
    10.1016/j.ajhg.2013.06.016American Journal of Human Genetics932264-277AJHG

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    No full text

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    No full text
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical science. © The Author(s) 2019. Published by Oxford University Press

    1997 Amerasia Journal

    No full text
    corecore