316 research outputs found

    The effect of word sense disambiguation accuracy on literature based discovery

    Get PDF
    Background The volume of research published in the biomedical domain has increasingly lead to researchers focussing on specific areas of interest and connections between findings being missed. Literature based discovery (LBD) attempts to address this problem by searching for previously unnoticed connections between published information (also known as “hidden knowledge”). A common approach is to identify hidden knowledge via shared linking terms. However, biomedical documents are highly ambiguous which can lead LBD systems to over generate hidden knowledge by hypothesising connections through different meanings of linking terms. Word Sense Disambiguation (WSD) aims to resolve ambiguities in text by identifying the meaning of ambiguous terms. This study explores the effect of WSD accuracy on LBD performance. Methods An existing LBD system is employed and four approaches to WSD of biomedical documents integrated with it. The accuracy of each WSD approach is determined by comparing its output against a standard benchmark. Evaluation of the LBD output is carried out using timeslicing approach, where hidden knowledge is generated from articles published prior to a certain cutoff date and a gold standard extracted from publications after the cutoff date. Results WSD accuracy varies depending on the approach used. The connection between the performance of the LBD and WSD systems are analysed to reveal a correlation between WSD accuracy and LBD performance. Conclusion This study reveals that LBD performance is sensitive to WSD accuracy. It is therefore concluded that WSD has the potential to improve the output of LBD systems by reducing the amount of spurious hidden knowledge that is generated. It is also suggested that further improvements in WSD accuracy have the potential to improve LBD accuracy

    Constructing a semantic predication gold standard from the biomedical literature

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Semantic relations increasingly underpin biomedical text mining and knowledge discovery applications. The success of such practical applications crucially depends on the quality of extracted relations, which can be assessed against a gold standard reference. Most such references in biomedical text mining focus on narrow subdomains and adopt different semantic representations, rendering them difficult to use for benchmarking independently developed relation extraction systems. In this article, we present a multi-phase gold standard annotation study, in which we annotated 500 sentences randomly selected from MEDLINE abstracts on a wide range of biomedical topics with 1371 semantic predications. The UMLS Metathesaurus served as the main source for conceptual information and the UMLS Semantic Network for relational information. We measured interannotator agreement and analyzed the annotations closely to identify some of the challenges in annotating biomedical text with relations based on an ontology or a terminology.</p> <p>Results</p> <p>We obtain fair to moderate interannotator agreement in the practice phase (0.378-0.475). With improved guidelines and additional semantic equivalence criteria, the agreement increases by 12% (0.415 to 0.536) in the main annotation phase. In addition, we find that agreement increases to 0.688 when the agreement calculation is limited to those predications that are based only on the explicitly provided UMLS concepts and relations.</p> <p>Conclusions</p> <p>While interannotator agreement in the practice phase confirms that conceptual annotation is a challenging task, the increasing agreement in the main annotation phase points out that an acceptable level of agreement can be achieved in multiple iterations, by setting stricter guidelines and establishing semantic equivalence criteria. Mapping text to ontological concepts emerges as the main challenge in conceptual annotation. Annotating predications involving biomolecular entities and processes is particularly challenging. While the resulting gold standard is mainly intended to serve as a test collection for our semantic interpreter, we believe that the lessons learned are applicable generally.</p

    Precise measurement of the W-boson mass with the CDF II detector

    Get PDF
    We have measured the W-boson mass MW using data corresponding to 2.2/fb of integrated luminosity collected in proton-antiproton collisions at 1.96 TeV with the CDF II detector at the Fermilab Tevatron collider. Samples consisting of 470126 W->enu candidates and 624708 W->munu candidates yield the measurement MW = 80387 +- 12 (stat) +- 15 (syst) = 80387 +- 19 MeV. This is the most precise measurement of the W-boson mass to date and significantly exceeds the precision of all previous measurements combined

    Cardiovascular risk assessment scores for people with diabetes: a systematic review

    Get PDF
    People with type 2 diabetes have an increased risk of cardiovascular disease (CVD). Multivariate cardiovascular risk scores have been used in many countries to identify individuals who are at high risk of CVD. These risk scores include those originally developed in individuals with diabetes and those developed in a general population. This article reviews the published evidence for the performance of CVD risk scores in diabetic patients by: (1) examining the overall rationale for using risk scores; (2) systematically reviewing the literature on available scores; and (3) exploring methodological issues surrounding the development, validation and comparison of risk scores. The predictive performance of cardiovascular risk scores varies substantially between different populations. There is little evidence to suggest that risk scores developed in individuals with diabetes estimate cardiovascular risk more accurately than those developed in the general population. The inconsistency in the methods used in evaluation studies makes it difficult to compare and summarise the predictive ability of risk scores. Overall, CVD risk scores rank individuals reasonably accurately and are therefore useful in the management of diabetes with regard to targeting therapy to patients at highest risk. However, due to the uncertainty in estimation of true risk, care is needed when using scores to communicate absolute CVD risk to individuals

    X-ray emission from the Sombrero galaxy: discrete sources

    Get PDF
    We present a study of discrete X-ray sources in and around the bulge-dominated, massive Sa galaxy, Sombrero (M104), based on new and archival Chandra observations with a total exposure of ~200 ks. With a detection limit of L_X = 1E37 erg/s and a field of view covering a galactocentric radius of ~30 kpc (11.5 arcminute), 383 sources are detected. Cross-correlation with Spitler et al.'s catalogue of Sombrero globular clusters (GCs) identified from HST/ACS observations reveals 41 X-rays sources in GCs, presumably low-mass X-ray binaries (LMXBs). We quantify the differential luminosity functions (LFs) for both the detected GC and field LMXBs, whose power-low indices (~1.1 for the GC-LF and ~1.6 for field-LF) are consistent with previous studies for elliptical galaxies. With precise sky positions of the GCs without a detected X-ray source, we further quantify, through a fluctuation analysis, the GC LF at fainter luminosities down to 1E35 erg/s. The derived index rules out a faint-end slope flatter than 1.1 at a 2 sigma significance, contrary to recent findings in several elliptical galaxies and the bulge of M31. On the other hand, the 2-6 keV unresolved emission places a tight constraint on the field LF, implying a flattened index of ~1.0 below 1E37 erg/s. We also detect 101 sources in the halo of Sombrero. The presence of these sources cannot be interpreted as galactic LMXBs whose spatial distribution empirically follows the starlight. Their number is also higher than the expected number of cosmic AGNs (52+/-11 [1 sigma]) whose surface density is constrained by deep X-ray surveys. We suggest that either the cosmic X-ray background is unusually high in the direction of Sombrero, or a distinct population of X-ray sources is present in the halo of Sombrero.Comment: 11 figures, 5 tables, ApJ in pres

    Measurement of the Z/gamma* + b-jet cross section in pp collisions at 7 TeV

    Get PDF
    The production of b jets in association with a Z/gamma* boson is studied using proton-proton collisions delivered by the LHC at a centre-of-mass energy of 7 TeV and recorded by the CMS detector. The inclusive cross section for Z/gamma* + b-jet production is measured in a sample corresponding to an integrated luminosity of 2.2 inverse femtobarns. The Z/gamma* + b-jet cross section with Z/gamma* to ll (where ll = ee or mu mu) for events with the invariant mass 60 < M(ll) < 120 GeV, at least one b jet at the hadron level with pT > 25 GeV and abs(eta) < 2.1, and a separation between the leptons and the jets of Delta R > 0.5 is found to be 5.84 +/- 0.08 (stat.) +/- 0.72 (syst.) +(0.25)/-(0.55) (theory) pb. The kinematic properties of the events are also studied and found to be in agreement with the predictions made by the MadGraph event generator with the parton shower and the hadronisation performed by PYTHIA.Comment: Submitted to the Journal of High Energy Physic

    Performance of the CMS Cathode Strip Chambers with Cosmic Rays

    Get PDF
    The Cathode Strip Chambers (CSCs) constitute the primary muon tracking device in the CMS endcaps. Their performance has been evaluated using data taken during a cosmic ray run in fall 2008. Measured noise levels are low, with the number of noisy channels well below 1%. Coordinate resolution was measured for all types of chambers, and fall in the range 47 microns to 243 microns. The efficiencies for local charged track triggers, for hit and for segments reconstruction were measured, and are above 99%. The timing resolution per layer is approximately 5 ns

    How do care-provider and home exercise program characteristics affect patient adherence in chronic neck and back pain: a qualitative study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The aim of this study is to explore perceptions of people with chronic neck or low back pain about how characteristics of home exercise programs and care-provider style during clinical encounters may affect adherence to exercises.</p> <p>Methods</p> <p>This is a qualitative study consisting of seven focus groups, with a total of 34 participants presenting chronic neck or low back pain. The subjects were included if they were receiving physiotherapy treatment and were prescribed home-based exercises.</p> <p>Results</p> <p>Two themes emerged: home-based exercise programme conditions and care provider's style. In the first theme, the participants described their positive and negative experiences regarding time consumption, complexity and effects of prescribed exercises. In the second theme, participants perceived more bonding to prescribed exercises when their care provider presented knowledge about the disease, promoted feedback and motivation during exercise instruction, gave them reminders to exercise, or monitored their results and adherence to exercises.</p> <p>Conclusions</p> <p>Our experiential findings indicate that patient's adherence to home-based exercise is more likely to happen when care providers' style and the content of exercise programme are positively experienced. These findings provide additional information to health care providers, by showing which issues should be considered when delivering health care to patients presenting chronic neck or back pain.</p
    corecore