3 research outputs found

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    Factors Associated with Revision Surgery after Internal Fixation of Hip Fractures

    Get PDF
    Background: Femoral neck fractures are associated with high rates of revision surgery after management with internal fixation. Using data from the Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trial evaluating methods of internal fixation in patients with femoral neck fractures, we investigated associations between baseline and surgical factors and the need for revision surgery to promote healing, relieve pain, treat infection or improve function over 24 months postsurgery. Additionally, we investigated factors associated with (1) hardware removal and (2) implant exchange from cancellous screws (CS) or sliding hip screw (SHS) to total hip arthroplasty, hemiarthroplasty, or another internal fixation device. Methods: We identified 15 potential factors a priori that may be associated with revision surgery, 7 with hardware removal, and 14 with implant exchange. We used multivariable Cox proportional hazards analyses in our investigation. Results: Factors associated with increased risk of revision surgery included: female sex, [hazard ratio (HR) 1.79, 95% confidence interval (CI) 1.25-2.50; P = 0.001], higher body mass index (fo

    Overlapping sense and antisense transcription units in Trypanosoma brucei

    No full text
    Procyclins are the major surface glycoproteins of insect-form Trypanosoma brucei. The procyclin expression sites are polycistronic and are transcribed by an alpha-amanitin-resistant polymerase, probably RNA polymerase I (Pol I). The expression sites are flanked by transcription units that are sensitive to alpha-amanitin, which is a hallmark of Pol II-driven transcription. We have analysed a region of 9.5 kb connecting the EP/PAG2 expression site with the downstream transcription unit. The procyclin expression site is longer than was previously realized and contains an additional gene, procyclin-associated gene 4 (PAG4), and a region of unknown function, the T region, that gives rise to trans-spliced, polyadenylated RNAs containing small open reading frames (ORFs). Two new genes, GU1 and GU2, were identified in the downstream transcription unit on the opposite strand. Unexpectedly, the 3' untranslated region of GU2 and the complementary T transcripts overlap by several hundred base pairs. Replacement of GU2 by a unique tag confirmed that sense and antisense transcription occurred from a single chromosomal locus. Overlapping transcription is stage specific and may extend > or = 10 kb in insect-form trypanosomes. The nucleotide composition of the T. brucei genome is such that antisense ORFs occur frequently. If stable mRNAs can be derived from both strands, the coding potential of the genome may be substantially larger than has previously been suspected.Journal ArticleResearch Support, Non-U.S. Gov'tFLWINinfo:eu-repo/semantics/publishe
    corecore