
University of Dundee Online Publications

Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions

Author: Ansgar Schuffenhauer
DE Clark
J Gasteiger
JC Baber
K Boda
K Boda
MS Lajiness
P Ertl
P Ertl
P Ertl
P Ertl
P Kündig
Peter Ertl
S Kanemasa
VJ Gillet
W Bremser
XQ Lewell
Y Takaoka
Publication venue: BioMed Central
Publication date: 01/01/2009

Abstract. Background: A method to estimate ease of synthesis (synthetic accessibility) of drug-like molecules is needed in many areas of the drug discovery process. The development and validation of such a method that is able to characterize molecule synthetic accessibility as a score between 1 (easy to make) and 10 (very difficult to make) is described in this article. Results: The method for estimation of the synthetic accessibility score (SAscore) described here is based on a combination of fragment contributions and a complexity penalty. Fragment contributions have been calculated based on the analysis of one million representative molecules from PubChem and therefore one can say that they capture historical synthetic knowledge stored in this database. The molecular complexity score takes into account the presence of non-standard structural features, such as large rings, non-standard ring fusions, stereocomplexity and molecule size. The method has been validated by comparing calculated SAscores with ease of synthesis as estimated by experienced medicinal chemists for a set of 40 molecules. The agreement between calculated and manually estimated synthetic accessibility is very good with r² = 0.89. Conclusion: A novel method to estimate synthetic accessibility of molecules has been developed. This method uses historical synthetic knowledge obtained by analyzing information from millions of already synthesized chemicals and considers also molecule complexity. The method is sufficiently fast and provides results consistent with estimation of ease of synthesis by experienced medicinal chemists. The calculated SAscore may be used to support various drug discovery processes where a large number of molecules needs to be ranked based on their synthetic accessibility, for example when purchasing samples for screening, selecting hits from high-throughput screening for follow-up, or ranking molecules generated by various de novo design approaches.
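
The idea in this abstract, a fragment-contribution term combined with a complexity penalty and reported on a scale from 1 (easy to make) to 10 (very difficult to make), can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the function name `toy_sa_score`, the averaging of fragment contributions, the subtraction of the penalty, and the bounds `raw_min` and `raw_max` are hypothetical and are not taken from the paper.

```python
def toy_sa_score(fragment_contributions, complexity_penalty,
                 raw_min=-5.0, raw_max=1.0):
    """Toy synthetic-accessibility score on a 1 (easy) to 10 (hard) scale.

    fragment_contributions: per-fragment weights (hypothetical values;
        higher means the fragment is more common in known molecules).
    complexity_penalty: non-negative penalty for unusual structural
        features (hypothetical value).
    raw_min, raw_max: assumed bounds used only for rescaling.
    """
    if not fragment_contributions:
        raise ValueError("need at least one fragment contribution")

    # Average fragment contribution, then subtract the complexity penalty.
    fragment_score = sum(fragment_contributions) / len(fragment_contributions)
    raw = fragment_score - complexity_penalty

    # Clamp to the assumed range so the rescaled score stays within 1..10.
    raw = max(raw_min, min(raw_max, raw))

    # Linear rescaling: raw_max maps to 1 (easy), raw_min maps to 10 (hard).
    return 1.0 + 9.0 * (raw_max - raw) / (raw_max - raw_min)


if __name__ == "__main__":
    # Common fragments, no penalty: lands at the easy end of the scale.
    print(toy_sa_score([1.0, 1.0], 0.0))
    # Neutral fragments with a complexity penalty: lands mid-scale.
    print(toy_sa_score([0.0], 2.0))
```

Ranking a set of molecules then amounts to sorting them by this score, which is the use case the abstract names for screening purchases and de novo design output.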

The Novartis Repository

Analysis of in vitro bioactivity data extracted from drug discovery literature and patents: Ranking 1654 human protein targets by assayed compounds and molecular scaffolds

Author: A Monge
A Schuffenhauer
AL Hopkins
AL Hopkins
B Fabio
C Southan
C Southan
C Tyrchan
Christopher Southan
CP Cannon
D Maglott
DS Wishart
E Ryberg
F Lovering
FNB Edfeldt
GV Paolini
GW Bemis
H Chen
H Ye
J Scheiber
JL Jenkins
JP Overington
K Mackie
Kiran Boppana
L Harland
MR Bowlby
PD Leeson
Q Li
R Christensen
S Devidas
S Muresan
S Wetzel
Sarma ARP Jagarlapudi
SARP Jagarlapudi
SJ Campbell
Sorel Muresan
T Joy
T Liu
TH Keller
X Chen
Y Wang
Y Yang
Y Yasuda
YJ Xu
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Since the classic Hopkins and Groom druggable genome review in 2002, there have been a number of publications updating both the hypothetical and successful human drug target statistics. However, listings of research targets that define the area between these two extremes are sparse because of the challenges of collating published information at the necessary scale. We have addressed this by interrogating databases, populated by expert curation, of bioactivity data extracted from patents and journal papers over the last 30 years. Results: From a subset of just over 27,000 documents we have extracted a set of compound-to-target relationships for biochemical in vitro binding-type assay data for 1,736 human proteins and 1,654 gene identifiers. These are linked to 1,671,951 compound records derived from 823,179 unique chemical structures. The distribution showed a compounds-per-target average of 964 with a maximum of 42,869 (Factor Xa). The list includes non-targets, failed targets and cross-screening targets. The top-278 most actively pursued targets cover 90% of the compounds. We further investigated target ranking by determining the number of molecular frameworks and scaffolds. These were compared to the compound counts as alternative measures of chemical diversity on a per-target basis. Conclusions: The compounds-per-protein listing generated in this work (provided as a supplementary file) represents the major proportion of the human drug target landscape defined by published data. We supplemented the simple ranking by the number of compounds assayed with additional rankings by molecular topology. These showed significant differences and provide complementary assessments of chemical tractability.
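
A statement such as "the top-278 targets cover 90% of the compounds" corresponds to a cumulative-coverage calculation over a ranked list. The sketch below shows one way such a figure could be computed. It is an assumption-laden illustration, not the authors' procedure: the function name `targets_needed_for_coverage` is hypothetical, the input counts are made up, and summing per-target counts treats compound records as additive, which ignores compounds assayed against more than one target.

```python
def targets_needed_for_coverage(compound_counts, fraction=0.9):
    """Return how many top-ranked targets are needed to reach `fraction`
    of the total compound records.

    compound_counts: iterable of compounds-per-target counts
        (hypothetical data, one number per target).
    fraction: share of all compound records to cover, between 0 and 1.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")

    # Rank targets from most to least assayed.
    ranked = sorted(compound_counts, reverse=True)
    total = sum(ranked)
    if total <= 0:
        raise ValueError("need at least one compound record")

    # Walk down the ranking until the running total reaches the threshold.
    running = 0
    for rank, count in enumerate(ranked, start=1):
        running += count
        if running >= fraction * total:
            return rank
    return len(ranked)


if __name__ == "__main__":
    # Made-up counts for four targets.
    counts = [50, 30, 15, 5]
    print(targets_needed_for_coverage(counts, fraction=0.9))
```

The same function can be applied to framework or scaffold counts per target to compare rankings by compound count against rankings by molecular topology.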

Association of a de novo 16q copy number variant with a phenotype that overlaps with Lenz microphthalmia and Townes-Brocks syndromes

Author: A Male
A Matthaei
A Piton
Adele S Schneider
B Kallen
BC Ballif
C Miceli-Richard
CA Graham
CM Krauss
D Morrison
D Ng
David Ng
DF Callen
E Ferda Percin
E Hilton
E Hilton
EB Blau
EM Botzenhart
F Fraser
FF Elder
GM Pastores
GM Shaw
GR Bignell
H Dolk
H Knoblauch
J Kohlhase
Jennifer J Johnston
JJ Hoo
JP Hugot
KL Jones
Leslie G Biesecker
M Clementi
N Scheinfeld
NK Ragge
P Bakrania
RJ Gorlin
S Schuffenhauer
SJ Li
T Glaser
Tanya M Bardakjian
VA Voronina
W Borozdin
Publication venue: BioMed Central
Publication date: 01/01/2009

Abstract. Background: Anophthalmia and microphthalmia are etiologically and clinically heterogeneous. Lenz microphthalmia is a syndromic form that is typically inherited in an X-linked pattern, though the causative gene mutation is unknown. Townes-Brocks syndrome manifests thumb anomalies, imperforate anus, and ear anomalies. We present a 13-year-old boy with a syndromic microphthalmia phenotype and a clinical diagnosis of Lenz microphthalmia syndrome. Case Presentation: The patient was subjected to clinical and molecular evaluation, including array CGH analysis. The clinical features included left clinical anophthalmia, right microphthalmia, anteriorly placed anus with fistula, chordee, ventriculoseptal defect, patent ductus arteriosus, posteriorly rotated ears, hypotonia, growth retardation with delayed bone age, and mental retardation. The patient was found to have an approximately 5.6 Mb deletion of 16q11.2q12.1 by microarray-based comparative genomic hybridization, which includes the SALL1 gene, which causes Townes-Brocks syndrome. Conclusions: Deletions of 16q11.2q12.2 have been reported in several individuals, although those prior reports did not note microphthalmia or anophthalmia. This region includes SALL1, which causes Townes-Brocks syndrome. In retrospect, this child has a number of features that can be explained by the SALL1 deletion, although it is not clear if the microphthalmia is a rare feature of Townes-Brocks syndrome or caused by other mechanisms. These data suggest that rare copy number changes may be a cause of syndromic microphthalmia, allowing a personalized genomic medicine approach to the care of patients with these aberrations.

Shaping a screening file for maximal lead discovery efficiency and effectiveness: elimination of molecular redundancy

Author: Andrew S. Bell
Baell J. B.
Bickerton G. R.
Blagg J.
Brenk R.
Chen T.
Coyne A. G.
Crisman T. J.
David Hepworth
Dempster A. P.
Dobson C. M.
Dorr P.
Everett J. R.
Frye S.
Gardiner E. J.
Gillet V. J.
Gregory A. Bakken
Hajduk P.
Hajduk P. J.
Hann M.
Harper G.
Hopkins A. L.
Huggins D.
Huser J.
Huth J. R.
Jacoby E.
Jacquelyn L. Klug-McLeod
Jadhav A.
Janzen W. P.
Jens Loesel
Jeremy Lanfear
Jeremy R. Everett
John Mathias
Kainkaryam R. M.
Lajiness M.
Larsson A.
Leach A. R.
Leeson P. D.
Lipinski C.
Lipinski C. A.
Lipkin M. J.
Macarron R.
Markus Boehm
Mayr L. M.
Milne G. M.
Muchmore S. W.
Mullard A.
Munos B.
Nadin A.
Nilakantan R.
Overington J. P.
Pammoli F.
Patterson D. E.
Pearce B. C.
Pearlman R. S.
Pereira D. A.
Rosalia Gonzales
Scannell J. W.
Schmid E. F.
Schneider G.
Schuffenhauer A.
Sink R.
Stepan A. F.
Sukuru S. C.
Terence P. Wood
Ursa O.
Willett P.
Willett P.
Xi H.
Yeap S. K.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 31/10/2012

High Throughput Screening (HTS) is a successful strategy for finding hits and leads that have the opportunity to be converted into drugs. In this paper we highlight novel computational methods used to select compounds to build a new screening file at Pfizer and the analytical methods we used to assess their quality. We also introduce the novel concept of molecular redundancy to help decide on the density of compounds required in any region of chemical space in order to be confident of running successful HTS campaigns.

Greenwich Academic Literature Archive

A Mapping of Drug Space from the Viewpoint of Small Molecule Metabolism

Author: A Ciulli
A Clements
A Schuffenhauer
AC Cheng
AE Cleves
AL Hopkins
AM Nicasio
AP Russ
C James
C Zhang
CM Dobson
D Tondi
DB Kell
DC Chan
Deok-Sun Lee
DJ Payne
DS Lee
DS Lee
E Avdic
E Chu
EC Moore
F Ciruela
F Petit
GV Paolini
Henry F. Chambers
HM Faessel
J Drews
J Hert
JA Kramer
James Corey Adams
JJ McGuire
JP Powell
K Mackay
KF Tipton
KI Goh
LF Shyur
Li Basuino
M Johnson
MA Bogoyevitch
MA Yildirim
MB Navarro
Michael J. Keiser
MJ Keiser
MP Costi
MV Dias
NC Meisner
Olaf G. Wiest
P Imming
P Romero
P Shannon
P Willett
Patricia C. Babbitt
Philip E. Bourne
PJ Hajduk
R Caspi
R Martin
RL Kisliuk
S Ekins
S Rochfort
SF Altschul
SM Watkins
W Lewis
W Lewis
WH Gmeiner
Y Cho
Y Yamanishi
Publication venue: Public Library of Science
Publication date: 01/01/2009

Small molecule drugs target many core metabolic enzymes in humans and pathogens, often mimicking endogenous ligands. The effects may be therapeutic or toxic, but are frequently unexpected. A large-scale mapping of the intersection between drugs and metabolism is needed to better guide drug discovery. To map the intersection between drugs and metabolism, we have grouped drugs and metabolites by their associated targets and enzymes using ligand-based set signatures created to quantify their degree of similarity in chemical space. The results reveal the chemical space that has been explored for metabolic targets, where successful drugs have been found, and what novel territory remains. To aid other researchers in their drug discovery efforts, we have created an online resource of interactive maps linking drugs to metabolism. These maps predict the “effect space” comprising likely target enzymes for each of the 246 MDDR drug classes in humans. The online resource also provides species-specific interactive drug-metabolism maps for each of the 385 model organisms and pathogens in the BioCyc database collection. Chemical similarity links between drugs and metabolites predict potential toxicity, suggest routes of metabolism, and reveal drug polypharmacology. The metabolic maps enable interactive navigation of the vast biological data on potential metabolic drug targets and the drug chemistry currently available to prosecute those targets. Thus, this work provides a large-scale approach to ligand-based prediction of drug action in small molecule metabolism.

CiteSeerX

Public Library of Science (PLOS)

Repository for Publications and Research Data

Structure-based classification and ontology in chemistry

Abstract. Background: Recent years have seen an explosion in the availability of data in the chemistry domain. With this information explosion, however, retrieving relevant results from the available information, and organising those results, become even harder problems. Computational processing is essential to filter and organise the available resources so as to better facilitate the work of scientists. Ontologies encode expert domain knowledge in a hierarchically organised machine-processable format. One such ontology for the chemical domain is ChEBI. ChEBI provides a classification of chemicals based on their structural features and a role or activity-based classification. An example of a structure-based class is 'pentacyclic compound' (compounds containing five-ring structures), while an example of a role-based class is 'analgesic', since many different chemicals can act as analgesics without sharing structural features. Structure-based classification in chemistry exploits elegant regularities and symmetries in the underlying chemical domain. As yet, there has been neither a systematic analysis of the types of structural classification in use in chemistry nor a comparison to the capabilities of available technologies. Results: We analyze the different categories of structural classes in chemistry, presenting a list of patterns for features found in class definitions. We compare these patterns of class definition to tools which allow for automation of hierarchy construction within cheminformatics and within logic-based ontology technology, going into detail in the latter case with respect to the expressive capabilities of the Web Ontology Language and recent extensions for modelling structured objects. Finally we discuss the relationships and interactions between cheminformatics approaches and logic-based approaches.
Conclusion: Systems that perform intelligent reasoning tasks on chemistry data require a diverse set of underlying computational utilities including algorithmic, statistical and logic-based tools. For the task of automatic structure-based classification of chemical entities, essential to managing the vast swathes of chemical data being brought online, systems which are capable of hybrid reasoning combining several different approaches are crucial. We provide a thorough review of the available tools and methodologies, and identify areas of open research.

Oxford University Research Archive

The University of Manchester - Institutional Repository

Novel FOXG1 mutations in Chinese patients with Rett syndrome or Rett-like mental retardation

Author: AM Kerr
B Diebold
B Hagberg
C Bruyn De
C Florian
C Philippe
DB Murphy
E Scala
F Ariani
F Kortüm
F Mari
IS Fetahu
Jiaping Wang
Jiarui Li
JL Neul
KQ McMahon
LE Seltzer
Liping Wei
N Aa Van der
N Bahi-Buisson
N Brunetti-Pierri
Qingping Zhang
S Rolando
S Schuffenhauer
S Xuan
X Zhang
Xiaoying Zhang
Xinhua Bao
Xiru Wu
Y Zhao
Ying Zhao
Publication venue: 'Springer Science and Business Media LLC'

ICF, An Immunodeficiency Syndrome: DNA Methyltransferase 3B Involvement, Chromosome Anomalies, and Gene Dysregulation

Author: Alcobia I
Alexiadis V
Ausio J
Bachman KE
Bai S
Barry F
Blanco-Betancourt CE
Bohnhorst JO
Borst J
Bourc'his D
Brouard S
Brown DC
Burette A
Cadieux B
Cecilia Sanchez
Chen CC
Chunbo Shao
Collins CS
Day RN
Dodge JE
Dromard M
Eden A
Ehrlich M
Ehrlich M
Ehrlich M
Ehrlich M
Ehrlich M
Ehrlich M
Ehrlich M
Ehrlich M
Ehrlich M
Enwright JF
Eris JM
Fasth A
Feng J
Franceschini P
Gama-Sosa MA
Gasser SM
Ge YZ
Geiman TM
Geiman TM
Gemma C
Gill G
Gimelli G
Gisselsson D
Goldmit M
Govin J
Gowher H
Haaf T
Hagleitner M
Hansen RS
Hansen RS
Hansen RS
Hartsfield CL
Hassan KM
Hathcock KS
Hernandez R
Heun P
Horn PJ
Howard PJ
Howlett SK
Hulten M
Husain Z
Irie S
Itoh T
Ivanov VN
Jacquot S
Jeanpierre M
Jeltsch A
Ji W
Jiang YL
John Kehrl
Jolly C
Jolly C
Kang ES
Kareta MS
Kato Y
Ke N
Kim GD
Kim HP
Kloeckener-Gruissem B
Kondo T
Kress C
Kubo M
Kubota T
Lamkanfi M
Li JY
Li T
Liu X
Lucas PC
Luciani JJ
Luciani JJ
Lusser A
Mack KD
Maekawa K
Maraschio P
Maraschio P
Margot JB
Masso-Welch PA
Mathieu O
Melanie Ehrlich
Miled C
Miniou P
Miniou P
Montoliu C
Morrow TA
Nakagawa T
Nakai Y
Narayan A
Narayan A
Naumann U
Netzer C
Niedbala W
Nishiyama R
Okano M
Okano M
Partridge JF
Peters T
Pezzolo A
Pham TN
Piwien Pilipuk G
Qu G
Quan T
Rie Nishiyama
Rork Kuick
Sabbattini P
Samir M. Hanash
Sanford J
Sanford JP
Saras J
Sawyer J
Sawyer JR
Schuetz C
Schuffenhauer S
Seidemann K
Shestakova EA
Shirohzu H
Smeets DFCM
Soares MP
Stacey M
Suetake I
Suetake I
Sumner AT
Takata Y
Takeo Kubota
Takeshima H
Tesz GJ
Tiepolo L
Tsien F
Tsien F
Tuck-Muller CM
Turek-Plewa J
Turleau C
Ueda Y
Ugarkovic D
Uht RM
Vilain A
Weisenberger DJ
Wen J
Wijmenga C
Wong N
Xie S
Xie ZH
Xiong Z
Xu G
Yamada Y
Yamashita K
Yamashita Y
Yamazaki T
Yao Z
Zabel U
Zhao Y
Zhou YW
Publication venue: Informa Healthcare

The immunodeficiency, centromeric region instability, and facial anomalies syndrome (ICF) is the only disease known to result from a mutated DNA methyltransferase gene, namely, DNMT3B. Characteristic of this recessive disease are decreases in serum immunoglobulins despite the presence of B cells and, in the juxtacentromeric heterochromatin of chromosomes 1 and 16, chromatin decondensation, distinctive rearrangements, and satellite DNA hypomethylation. Although DNMT3B is involved in specific associations with histone deacetylases, HP1, other DNMTs, chromatin remodelling proteins, condensin, and other nuclear proteins, it is probably the partial loss of catalytic activity that is responsible for the disease. In microarray experiments and real-time RT-PCR assays, we observed significant differences in RNA levels from ICF vs. control lymphoblasts for pro- and anti-apoptotic genes (BCL2L10, CASP1, and PTPN13); nitrous oxide, carbon monoxide, NF-κB, and TNFα signalling pathway genes (PRKCH, GUCY1A3, GUCY1B3, MAPK13; HMOX1, and MAP4K4); and transcription control genes (NR2F2 and SMARCA2). This gene dysregulation could contribute to the immunodeficiency and other symptoms of ICF and might result from the limited losses of DNA methylation, although ICF-related promoter hypomethylation was not observed for six of the above examined genes. We propose that hypomethylation of satellite 2 at 1qh and 16qh might provoke this dysregulation of gene expression by trans effects from altered sequestration of transcription factors, changes in nuclear architecture, or expression of noncoding RNAs.

Closed-cage clusters in the gaseous and condensed phases derived from sonochemically synthesized MoS2 nanoflakes