21 research outputs found

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    From gut dysbiosis to altered brain function and mental illness: mechanisms and pathways

    Get PDF
    The human body hosts an enormous abundance and diversity of microbes, which perform a range of essential and beneficial functions. Our appreciation of the importance of these microbial communities to many aspects of human physiology has grown dramatically in recent years. We know, for example, that animals raised in a germ-free environment exhibit substantially altered immune and metabolic function, while the disruption of commensal microbiota in humans is associated with the development of a growing number of diseases. Evidence is now emerging that, through interactions with the gut-brain axis, the bidirectional communication system between the central nervous system and the gastrointestinal tract, the gut microbiome can also influence neural development, cognition and behaviour, with recent evidence that changes in behaviour alter gut microbiota composition, while modifications of the microbiome can induce depressive-like behaviours. Although an association between enteropathy and certain psychiatric conditions has long been recognized, it now appears that gut microbes represent direct mediators of psychopathology. Here, we examine roles of gut microbiome in shaping brain development and neurological function, and the mechanisms by which it can contribute to mental illness. Further, we discuss how the insight provided by this new and exciting field of research can inform care and provide a basis for the design of novel, microbiota-targeted, therapies.GB Rogers, DJ Keating, RL Young, M-L Wong, J Licinio, and S Wesseling

    Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

    No full text
    Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts
    corecore