121 research outputs found

    Search for the standard model Higgs boson at LEP

    Extraction of pharmacokinetic evidence of drug-drug interactions from the literature

    Drug-drug interaction (DDI) is a major cause of morbidity and mortality and a subject of intense scientific interest. Biomedical literature mining can aid DDI research by extracting evidence for large numbers of potential interactions from published literature and clinical databases. Though DDI is investigated in domains ranging in scale from intracellular biochemistry to human populations, literature mining has not been used to extract specific types of experimental evidence, which are reported differently for distinct experimental goals. We focus on pharmacokinetic evidence for DDI, essential for identifying causal mechanisms of putative interactions and as input for further pharmacological and pharmacoepidemiology investigations. We used manually curated corpora of PubMed abstracts and annotated sentences to evaluate the efficacy of literature mining on two tasks: first, identifying PubMed abstracts containing pharmacokinetic evidence of DDIs; second, extracting sentences containing such evidence from abstracts. We implemented a text mining pipeline and evaluated it using several linear classifiers and a variety of feature transforms. The most important textual features in the abstract and sentence classification tasks were analyzed. We also investigated the performance benefits of using features derived from PubMed metadata fields, various publicly available named entity recognizers, and pharmacokinetic dictionaries. Several classifiers performed very well in distinguishing relevant and irrelevant abstracts (reaching F1 ≈ 0.93, MCC ≈ 0.74, iAUC ≈ 0.99) and sentences (F1 ≈ 0.76, MCC ≈ 0.65, iAUC ≈ 0.83). We found that word bigram features were important for achieving optimal classifier performance and that features derived from Medical Subject Headings (MeSH) terms significantly improved abstract classification.
We also found that some drug-related named entity recognition tools and dictionaries led to slight but significant improvements, especially in classification of evidence sentences. Based on our thorough analysis of classifiers and feature transforms and the high classification performance achieved, we demonstrate that literature mining can aid DDI discovery by supporting automatic extraction of specific types of experimental evidence.
National Institutes of Health, National Library of Medicine Program, grant 01LM011945-01 "BLR: Evidence-based Drug-Interaction Discovery: In-Vivo, In-Vitro and Clinical," a grant from the Indiana University Collaborative Research Program 2013, "Drug-Drug Interaction Prediction from Large-scale Mining of Literature and Patient Records," as well as a grant from the joint program between the Fundação Luso-Americana para o Desenvolvimento (Portugal) and National Science Foundation (USA), 2012-2014, "Network Mining For Gene Regulation And Biochemical Signaling."

    More Than 1,001 Problems with Protein Domain Databases: Transmembrane Regions, Signal Peptides and the Issue of Sequence Homology

    Large-scale genome sequencing gained general importance for life science because functional annotation of otherwise experimentally uncharacterized sequences is made possible by the theory of biomolecular sequence homology. Historically, the paradigm of similarity of protein sequences implying common structure, function and ancestry was generalized based on studies of globular domains. Having the same fold imposes strict conditions over the packing in the hydrophobic core requiring similarity of hydrophobic patterns. The implications of sequence similarity among non-globular protein segments have not been studied to the same extent; nevertheless, homology considerations are silently extended for them. This appears especially detrimental in the case of transmembrane helices (TMs) and signal peptides (SPs) where sequence similarity is necessarily a consequence of physical requirements rather than common ancestry. Thus, matching of SPs/TMs creates the illusion of matching hydrophobic cores. Therefore, inclusion of SPs/TMs into domain models can give rise to wrong annotations. More than 1001 domains among the 10,340 models of Pfam release 23 and 18 domains of SMART version 6 (out of 809) contain SP/TM regions. As expected, fragment-mode HMM searches generate promiscuous hits limited to solely the SP/TM part among clearly unrelated proteins. More worryingly, we show explicit examples that the scores of clearly false-positive hits, even in global-mode searches, can be elevated into the significance range just by matching the hydrophobic runs. In the PIR iProClass database v3.74 using conservative criteria, we find that at least between 2.1% and 13.6% of its annotated Pfam hits appear unjustified for a set of validated domain models. Thus, false-positive domain hits enforced by SP/TM regions can lead to dramatic annotation errors where the hit has nothing in common with the problematic domain model except the SP/TM region itself. 
We suggest a workflow of flagging problematic hits arising from SP/TM-containing models for critical reconsideration by annotation users.

    First measurement of the B_s meson mass

    If simplified, every information retrieval problem can be solved once the information need implied by its expression has been identified. We are interested in the criteria for expressing an information retrieval problem well. We first derived these criteria by applying principles and maxims that originally characterized communication between two persons. We chose the Gricean maxims because they are the most favoured for this type of situation. Secondly, we tried to identify other principles that can be used to express an information retrieval problem well. Grice's principles cannot resolve all forms of error associated with this particular form of communication. In our work, we defined three other principles: the adhesion principle, the reformulation principle, and the memorization principle. We give some examples of situations where the principles we have formulated are not applicable, and the consequences. We present the possible applications of our new model, MIRABEL, which can help in the description of information retrieval problem from. It also implicitly compels its user to apply the essential principles of good expression.

    Search for particles with unexpected mass and charge in Z decays

    Update of electroweak parameters from Z decays
