218 research outputs found

    Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation

    Background: Evaluation of Word Sense Disambiguation (WSD) methods in the biomedical domain is difficult because the available resources are either too small or too focused on specific types of entities (e.g. diseases or genes). We present a method that can be used to automatically develop a WSD test collection using the Unified Medical Language System (UMLS) Metathesaurus and the manual MeSH indexing of MEDLINE. We demonstrate the use of this method by developing such a data set, called MSH WSD. Methods: In our method, the Metathesaurus is first screened to identify ambiguous terms whose possible senses consist of two or more MeSH headings. We then use each ambiguous term and its corresponding MeSH heading to extract MEDLINE citations where the term and only one of the MeSH headings co-occur. The term found in the MEDLINE citation is automatically assigned the UMLS CUI linked to the MeSH heading. Each instance has been assigned a UMLS Concept Unique Identifier (CUI). We compare the characteristics of the MSH WSD data set to the previously existing NLM WSD data set. Results: The resulting MSH WSD data set consists of 106 ambiguous abbreviations, 88 ambiguous terms and 9 which are a combination of both, for a total of 203 ambiguous entities. For each ambiguous term/abbreviation, the data set contains a maximum of 100 instances per sense obtained from MEDLINE. We evaluated the reliability of the MSH WSD data set using existing knowledge-based methods and compared their performance to that of the results previously obtained by these algorithms on the pre-existing data set, NLM WSD. We show that the knowledge-based methods achieve different results but keep their relative performance except for the Journal Descriptor Indexing (JDI) method, whose performance is below the other methods. Conclusions: The MSH WSD data set allows the evaluation of WSD algorithms in the biomedical domain. Compared to previously existing data sets, MSH WSD contains a larger number of biomedical terms/abbreviations and covers the largest set of UMLS Semantic Types. Furthermore, the MSH WSD data set has been generated automatically reusing already existing annotations and, therefore, can be regenerated from subsequent UMLS versions.
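
    The selection rule described in the Methods of this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name, the citation record layout (`pmid`, `text`, `mesh`), and the sense identifiers `C_TEMP` and `C_ILLNESS` are hypothetical placeholders, and whole-word matching of the term is a simplifying assumption. A citation is kept only when the ambiguous term appears in its text and exactly one of the candidate MeSH headings is among its indexing terms, with at most 100 instances retained per sense.

```python
import re
from collections import defaultdict


def build_wsd_instances(term, sense_to_heading, citations, max_per_sense=100):
    """Label citations for one ambiguous term.

    sense_to_heading: dict mapping a sense identifier (a CUI in the paper;
        hypothetical placeholders here) to its MeSH heading.
    citations: iterable of dicts with keys 'pmid', 'text' and 'mesh'
        (assumed record layout, not a real MEDLINE schema).
    Returns a dict mapping sense identifier -> list of PMIDs.
    """
    heading_to_sense = {h: s for s, h in sense_to_heading.items()}
    candidate_headings = set(heading_to_sense)
    # Whole-word, case-insensitive match of the term (a simplification).
    pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)

    instances = defaultdict(list)
    for citation in citations:
        if not pattern.search(citation["text"]):
            continue  # the ambiguous term does not occur in the text
        matched = candidate_headings & set(citation["mesh"])
        if len(matched) != 1:
            continue  # zero or several candidate headings: ambiguous, skip
        sense = heading_to_sense[next(iter(matched))]
        if len(instances[sense]) < max_per_sense:
            instances[sense].append(citation["pmid"])
    return dict(instances)


# Toy data; the sense identifiers are placeholders, not real CUIs.
senses = {"C_TEMP": "Cold Temperature", "C_ILLNESS": "Common Cold"}
toy_citations = [
    {"pmid": 1, "text": "Exposure to cold alters metabolism.",
     "mesh": {"Cold Temperature", "Mice"}},
    {"pmid": 2, "text": "Zinc for the common cold.",
     "mesh": {"Common Cold"}},
    {"pmid": 3, "text": "Cold weather and the common cold.",
     "mesh": {"Cold Temperature", "Common Cold"}},
    {"pmid": 4, "text": "Rhinovirus infection in adults.",
     "mesh": {"Common Cold"}},
]
labelled = build_wsd_instances("cold", senses, toy_citations)
```

    In this toy run, citation 3 is discarded because both candidate headings are present, and citation 4 because the term itself does not occur in the text.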

    Impact of spouse caregiving on health behaviors and physical and mental health status

    The impact of caring for a spouse with a progressive dementia on caregivers' health behaviors and health status was examined. Data collected from 44 spouse caregivers indicate that: • Providing full-time care interferes with preventive health behaviors (eating nutritiously, exercising) and contributes to high-risk behaviors (overeating, alcohol and substance use); • Health behaviors are frequently used as coping strategies; • Caregivers rated their own health as poorer than their spouse's health; and • Disabling (arthritis, cardiac and back problems) and stress-related health problems (migraines, colitis) are a consequence of and interfere with care provision. Peer Reviewed. http://deepblue.lib.umich.edu/bitstream/2027.42/68204/2/10.1177_153331759400900105.pd

    Production of phi mesons at mid-rapidity in sqrt(s_NN) = 200 GeV Au+Au collisions at RHIC

    We present the first results of phi meson production in the K^+K^- decay channel from Au+Au collisions at sqrt(s_NN) = 200 GeV as measured at mid-rapidity by the PHENIX detector at RHIC. Precision resonance centroid and width values are extracted as a function of collision centrality. No significant variation from the PDG accepted values is observed. The transverse mass spectra are fitted with a linear exponential function for which the derived inverse slope parameter is seen to be constant as a function of centrality. These data are also fitted by a hydrodynamic model with the result that the freeze-out temperature and the expansion velocity values are consistent with the values previously derived from fitting single hadron inclusive data. As a function of transverse momentum, the collision-scaled peripheral-to-central yield ratio R_CP for the phi is comparable to that of pions rather than that of protons. This result lends support to theoretical models which distinguish between baryons and mesons instead of particle mass for explaining the anomalous proton yield. Comment: 326 authors, 24 pages text, 23 figures, 6 tables, RevTeX 4. To be submitted to Physical Review C as a regular article. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.htm

    External validation of risk prediction models for incident colorectal cancer using UK Biobank.

    BACKGROUND: This study aimed to compare and externally validate risk scores developed to predict incident colorectal cancer (CRC) that include variables routinely available or easily obtainable via self-completed questionnaire. METHODS: External validation of fourteen risk models from a previous systematic review in 373 112 men and women within the UK Biobank cohort with 5-year follow-up, no prior history of CRC and data for incidence of CRC through linkage to national cancer registries. RESULTS: There were 1719 (0.46%) cases of incident CRC. The performance of the risk models varied substantially. In men, the QCancer10 model and models by Tao, Driver and Ma all had an area under the receiver operating characteristic curve (AUC) between 0.67 and 0.70. Discrimination was lower in women: the QCancer10, Wells, Tao, Guesmi and Ma models were the best performing with AUCs between 0.63 and 0.66. Assessment of calibration was possible for six models in men and women. All would require country-specific recalibration if estimates of absolute risks were to be given to individuals. CONCLUSIONS: Several risk models based on easily obtainable data have relatively good discrimination in a UK population. Modelling studies are now required to estimate the potential health benefits and cost-effectiveness of implementing stratified risk-based CRC screening.

    OpenDMAP: An open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression

    Background: Information extraction (IE) efforts are widely acknowledged to be important in harnessing the rapid advance of biomedical knowledge, particularly in areas where important factual information is published in a diverse literature. Here we report on the design, implementation and several evaluations of OpenDMAP, an ontology-driven, integrated concept analysis system. It significantly advances the state of the art in information extraction by leveraging knowledge in ontological resources, integrating diverse text processing applications, and using an expanded pattern language that allows the mixing of syntactic and semantic elements and variable ordering. Results: OpenDMAP information extraction systems were produced for extracting protein transport assertions (transport), protein-protein interaction assertions (interaction) and assertions that a gene is expressed in a cell type (expression). Evaluations were performed on each system, resulting in F-scores ranging from .26 – .72 (precision .39 – .85, recall .16 – .85). Additionally, each of these systems was run over all abstracts in MEDLINE, producing a total of 72,460 transport instances, 265,795 interaction instances and 176,153 expression instances. Conclusion: OpenDMAP advances the performance standards for extracting protein-protein interaction predications from the full texts of biomedical research articles. Furthermore, this level of performance appears to generalize to other information extraction tasks, including extracting information about predicates of more than two arguments. The output of the information extraction system is always constructed from elements of an ontology, ensuring that the knowledge representation is grounded with respect to a carefully constructed model of reality. The results of these efforts can be used to increase the efficiency of manual curation efforts and to provide additional features in systems that integrate multiple sources for information extraction. The open source OpenDMAP code library is freely available at http://bionlp.sourceforge.net/

    A study protocol for the evaluation of occupational mutagenic/carcinogenic risks in subjects exposed to antineoplastic drugs: a multicentric project

    Background: Some industrial hygiene studies have assessed occupational exposure to antineoplastic drugs; other epidemiological investigations have detected various toxicological effects in exposure groups labeled with the job title. In no research has the same population been studied both environmentally and epidemiologically. The protocol of the epidemiological study presented here uses an integrated environmental and biological monitoring approach. The aim is to assess in hospital nurses preparing and/or administering therapy to cancer patients the current level of occupational exposure to antineoplastic drugs, DNA and chromosome damage as cancer predictive effects, and the association between the two. Methods/Design: About 80 healthy non-smoking female nurses, whose job it is to prepare or handle antineoplastic drugs, and a reference group of about 80 healthy non-smoking female nurses not occupationally exposed to chemicals will be examined simultaneously in a cross-sectional study. All the workers will be recruited from five hospitals in northern and central Italy after their informed consent has been obtained. Surface contamination and dermal exposure to antineoplastic drugs will be assessed by determining cyclophosphamide on selected surfaces (wipes) and on the exposed nurses' clothes (pads). The concentration of unmetabolized cyclophosphamide as a biomarker of internal dose will be measured in end-shift urine samples from exposed nurses. Biomarkers of effect and susceptibility will be assessed in exposed and unexposed nurses: urinary concentration of 8-hydroxy-2-deoxyguanosine; DNA damage detected using the single-cell microgel electrophoresis (comet) assay in peripheral white blood cells; micronuclei and chromosome aberrations in peripheral blood lymphocytes. Genetic polymorphisms for enzymes involved in metabolic detoxification (i.e. glutathione S-transferases) will also be analysed. Using standardized questionnaires, occupational exposure will be determined in exposed nurses only, whereas potential confounders (medicine consumption, lifestyle habits, diet and other non-occupational exposures) will be assessed in both groups of hospital workers. Statistical analysis will be performed to ascertain the association between occupational exposure to antineoplastic drugs and biomarkers of DNA and chromosome damage, after taking into account the effects of individual genetic susceptibility and the presence of confounding exposures. Discussion: The findings of the study will be useful in updating prevention procedures for handling antineoplastic drugs.