269 research outputs found

    Examples of SAR-centric patent mining using open resources

    Get PDF

    Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Since the classic Hopkins and Groom druggable genome review in 2002, there have been a number of publications updating both the hypothetical and successful human drug target statistics. However, listings of research targets that define the area between these two extremes are sparse because of the challenges of collating published information at the necessary scale. We have addressed this by interrogating databases, populated by expert curation, of bioactivity data extracted from patents and journal papers over the last 30 years.</p> <p>Results</p> <p>From a subset of just over 27,000 documents we have extracted a set of compound-to-target relationships for biochemical <it>in vitro </it>binding-type assay data for 1,736 human proteins and 1,654 gene identifiers. These are linked to 1,671,951 compound records derived from 823,179 unique chemical structures. The distribution showed a compounds-per-target average of 964 with a maximum of 42,869 (Factor Xa). The list includes non-targets, failed targets and cross-screening targets. The top-278 most actively pursued targets cover 90% of the compounds. We further investigated target ranking by determining the number of molecular frameworks and scaffolds. These were compared to the compound counts as alternative measures of chemical diversity on a per-target basis.</p> <p>Conclusions</p> <p>The compounds-per-protein listing generated in this work (provided as a supplementary file) represents the major proportion of the human drug target landscape defined by published data. We supplemented the simple ranking by the number of compounds assayed with additional rankings by molecular topology. These showed significant differences and provide complementary assessments of chemical tractability.</p

    The IUPHAR/BPS Guide to PHARMACOLOGY in 2016: towards curated quantitative interactions between 1300 protein targets and 6000 ligands

    Get PDF
    The IUPHAR/BPS Guide to PHARMACOLOGY (GtoPdb, http://www.guidetopharmacology.org) provides expert-curated molecular interactions between successful and potential drugs and their targets in the human genome. Developed by the International Union of Basic and Clinical Pharmacology (IUPHAR) and the British Pharmacological Society (BPS), this resource, and its earlier incarnation as IUPHAR-DB, is described in our 2014 publication. This update incorporates changes over the intervening seven database releases. The unique model of content capture is based on established and new target class subcommittees collaborating with in-house curators. Most information comes from journal articles, but we now also index kinase cross-screening panels. Targets are specified by UniProtKB IDs. Small molecules are defined by PubChem Compound Identifiers (CIDs); ligand capture also includes peptides and clinical antibodies. We have extended the capture of ligands and targets linked via published quantitative binding data (e.g. Ki, IC50 or Kd). The resulting pharmacological relationship network now defines a data-supported druggable genome encompassing 7% of human proteins. The database also provides an expanded substrate for the biennially published compendium, the Concise Guide to PHARMACOLOGY. This article covers content increase, entity analysis, revised curation strategies, new website features and expanded download options

    Penalized likelihood for sparse contingency tables with an application to full-length cDNA libraries

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The joint analysis of several categorical variables is a common task in many areas of biology, and is becoming central to systems biology investigations whose goal is to identify potentially complex interaction among variables belonging to a network. Interactions of arbitrary complexity are traditionally modeled in statistics by log-linear models. It is challenging to extend these to the high dimensional and potentially sparse data arising in computational biology. An important example, which provides the motivation for this article, is the analysis of so-called full-length cDNA libraries of alternatively spliced genes, where we investigate relationships among the presence of various exons in transcript species.</p> <p>Results</p> <p>We develop methods to perform model selection and parameter estimation in log-linear models for the analysis of sparse contingency tables, to study the interaction of two or more factors. Maximum Likelihood estimation of log-linear model coefficients might not be appropriate because of the presence of zeros in the table's cells, and new methods are required. We propose a computationally efficient ℓ<sub>1</sub>-penalization approach extending the Lasso algorithm to this context, and compare it to other procedures in a simulation study. We then illustrate these algorithms on contingency tables arising from full-length cDNA libraries.</p> <p>Conclusion</p> <p>We propose regularization methods that can be used successfully to detect complex interaction patterns among categorical variables in a broad range of biological problems involving categorical variables.</p

    Electrophysiological Properties of Embryonic Stem Cell-Derived Neurons

    Get PDF
    In vitro generation of functional neurons from embryonic stem (ES) cells and induced pluripotent stem cells offers exciting opportunities for dissecting gene function, disease modelling, and therapeutic drug screening. To realize the potential of stem cells in these biomedical applications, a complete understanding of the cell models of interest is required. While rapid advances have been made in developing the technologies for directed induction of defined neuronal subtypes, most published works focus on the molecular characterization of the derived neural cultures. To characterize the functional properties of these neural cultures, we utilized an ES cell model that gave rise to neurons expressing the green fluorescent protein (GFP) and conducted targeted whole-cell electrophysiological recordings from ES cell-derived neurons. Current-clamp recordings revealed that most neurons could fire single overshooting action potentials; in some cases multiple action potentials could be evoked by depolarization, or occurred spontaneously. Voltage-clamp recordings revealed that neurons exhibited neuronal-like currents, including an outward current typical of a delayed rectifier potassium conductance and a fast-activating, fast-inactivating inward current, typical of a sodium conductance. Taken together, these results indicate that ES cell-derived GFP+ neurons in culture display functional neuronal properties even at early stages of differentiation

    The Concise Guide to PHARMACOLOGY 2015/16: G protein-coupled receptors

    Get PDF
    The Concise Guide to PHARMACOLOGY 2015/16 provides concise overviews of the key properties of over 1750 human drug targets with their pharmacology, plus links to an open access knowledgebase of drug targets and their ligands (www.guidetopharmacology.org), which provides more detailed views of target and ligand properties. The full contents can be found at http://onlinelibrary.wiley.com/doi/10.1111/bph.13348/full. G protein-coupled receptors are one of the eight major pharmacological targets into which the Guide is divided, with the others being: ligand-gated ion channels, voltage-gated ion channels, other ion channels, nuclear hormone receptors, catalytic receptors, enzymes and transporters. These are presented with nomenclature guidance and summary information on the best available pharmacological tools, alongside key references and suggestions for further reading. The Concise Guide is published in landscape format in order to facilitate comparison of related targets. It is a condensed version of material contemporary to late 2015, which is presented in greater detail and constantly updated on the website www.guidetopharmacology.org, superseding data presented in the previous Guides to Receptors & Channels and the Concise Guide to PHARMACOLOGY 2013/14. It is produced in conjunction with NC-IUPHAR and provides the official IUPHAR classification and nomenclature for human drug targets, where appropriate. It consolidates information previously curated and displayed separately in IUPHAR-DB and GRAC and provides a permanent, citable, point-in-time record that will survive database updates

    The concise guide to pharmacology 2019/20: Ion channels

    Get PDF
    The Concise Guide to PHARMACOLOGY 2019/20 is the fourth in this series of biennial publications. The Concise Guide provides concise overviews of the key properties of nearly 1800 human drug targets with an emphasis on selective pharmacology (where available), plus links to the open access knowledgebase source of drug targets and their ligands (www.guidetopharmacology.org), which provides more detailed views of target and ligand properties. Although the Concise Guide represents approximately 400 pages, the material presented is substantially reduced compared to information and links presented on the website. It provides a permanent, citable, point‐in‐time record that will survive database updates. The full contents of this section can be found at http://onlinelibrary.wiley.com/doi/10.1111/bph.14749. Ion channels are one of the six major pharmacological targets into which the Guide is divided, with the others being: G protein‐coupled receptors, nuclear hormone receptors, catalytic receptors, enzymes and transporters. These are presented with nomenclature guidance and summary information on the best available pharmacological tools, alongside key references and suggestions for further reading. The landscape format of the Concise Guide is designed to facilitate comparison of related targets from material contemporary to mid‐2019, and supersedes data presented in the 2017/18, 2015/16 and 2013/14 Concise Guides and previous Guides to Receptors and Channels. It is produced in close conjunction with the International Union of Basic and Clinical Pharmacology Committee on Receptor Nomenclature and Drug Classification (NC‐IUPHAR), therefore, providing official IUPHAR classification and nomenclature for human drug targets, where appropriate
    • …
    corecore