Search CORE

76 research outputs found

Blind trials of computer-assisted structure elucidation software

Author: AA Stierle
Antony J Williams
Arvin Moser
C Steinbeck
CA Shelley
JB Lambert
Joseph C DiMartino
KA Blinov
KA Blinov
Kirill A Blinov
L Dong
LA Baker
M Tsuda
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Zain
Mikhail E Elyashberg
PM Joyner
RH Cichewicz
S Pilgrim
W Bremser
WH Houssen
Y Wang
YD Smurnyy
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background One of the largest challenges in chemistry today remains that of efficiently mining through vast amounts of data in order to elucidate the chemical structure for an unknown compound. The elucidated candidate compound must be fully consistent with the data and any other competing candidates efficiently eliminated without doubt by using additional data if necessary. It has become increasingly necessary to incorporate an <it>in silico </it>structure generation and verification tool to facilitate this elucidation process. An effective structure elucidation software technology aims to mimic the skills of a human in interpreting the complex nature of spectral data while producing a solution within a reasonable amount of time. This type of software is known as computer-assisted structure elucidation or CASE software. A systematic trial of the ACD/Structure Elucidator CASE software was conducted over an extended period of time by analysing a set of single and double-blind trials submitted by a global audience of scientists. The purpose of the blind trials was to reduce subjective bias. Double-blind trials comprised of data where the candidate compound was unknown to both the submitting scientist and the analyst. The level of expertise of the submitting scientist ranged from novice to expert structure elucidation specialists with experience in pharmaceutical, industrial, government and academic environments. Results Beginning in 2003, and for the following nine years, the algorithms and software technology contained within ACD/Structure Elucidator have been tested against 112 data sets; many of these were unique challenges. Of these challenges 9% were double-blind trials. The results of eighteen of the single-blind trials were investigated in detail and included problems of a diverse nature with many of the specific challenges associated with algorithmic structure elucidation such as deficiency in protons, structure symmetry, a large number of heteroatoms and poor quality spectral data. Conclusion When applied to a complex set of blind trials, ACD/Structure Elucidator was shown to be a very useful tool in advancing the computer's contribution to elucidating a candidate structure from a set of spectral data (NMR and MS) for an unknown. The synergistic interaction between humans and computers can be highly beneficial in terms of less biased approaches to elucidation as well as dramatic improvements in speed and throughput. In those cases where multiple candidate structures exist, ACD/Structure Elucidator is equipped to validate the correct structure and eliminate inconsistent candidates. Full elucidation can generally be performed in less than two hours; this includes the average spectral data processing time and data input.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Computer-assisted methods for molecular structure elucidation: realizing a spectroscopist's dream

Crossref

Springer - Publisher Connector

PubMed Central

Pseudochelin A, a siderophore of Pseudoalteromonas piscicida S2040

Author: Abergel
Abergel
Ali
Allred
Bergeron
Berti
Bluhm
Brandel
Bruce F. Milne
Buysens
Caridad Díaz
Cass
Daniëlle Copmans
De Voss
Dhungana
Elyashberg
Espen Hansen
Eva C. Sonnenschein
Floriane M. Vabre
Gram
Griffiths
Hai Deng
Hamdan
Hoette
Hoette
Homann
Ito
Jeanette Hammer Andersen
Jioji N. Tabudravu
José R. Tormo
Kine Østnes Hanssen
Leonie Pellissier
Li
Li
Lind
Lone Gram
Marc Stierhof
Marcel Jaspars
Martín
Mawji
Mercedes de la Cruz
Miethke
Miller
Neilands
Peter de Witte
Rainer Ebel
Raymond
Sandy
Schieferdecker
Schwyn
Shigemori
Smith
Speitling
Stephan Goralczyk
Venke Kristoffersen
Vinatier
Vinayavekhin
Wuest
Zhang
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

A new siderophore containing a 4,5-dihydroimidazole moiety was isolated from Pseudoalteromonas piscicida S2040 together with myxochelins A and B, alteramide A and its cycloaddition product, and bromo- and dibromoalterochromides. The structure of pseudochelin A was established by spectroscopic techniques including 2D NMR and MS/MS fragmentation data. In bioassays selected fractions of the crude extract of S2040 inhibited the opportunistic pathogen Pseudomonas aeruginosa. Pseudochelin A displayed siderophore activity in the chrome azurol S assay at concentrations higher than 50 μM, and showed weak activity against the fungus Aspergillus fumigatus, but did not display antibacterial, anti-inflammatory or anticonvulsant activity

Aberdeen University Research

CLoK

Crossref

University of the South Pacific Electronic Research Repository

Online Research Database In Technology

Fondo Bibliográfico Digital Institucional

NMReDATA, a standard to report the NMR assignment and parameters of organic compounds

Author: Adam C
Argyropoulos D
Butts C
Claridge TDW
Dashti H
Eghbalnia HR
Elyashberg M
Erdelyi M
Farès C
Gil RR
Giraudeau P
Jeannerat D
Kessler P
Kuhn S
Mikhova B
Moriaud F
Nuzillard JM
Pupier M
Pérez M
Robien W
Schlörer NE
Steinbeck C
Trevorrow P
Williams AJ
Wist J
Publication venue: 'Wiley'
Publication date: 01/01/2018
Field of study

The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link. Open access articleEven though NMR has found countless applications in the field of small molecule characterization, there is no standard file for the NMR data relevant to structure characterization of small molecules. A file format is introduced to associate the NMR parameters extracted from 1D and 2D spectra of organic compounds to the assigned chemical structure. These NMR parameters, which we shall call NMReDATA, include chemical shift values, signal integrals, intensities, multiplicities, scalar coupling constants, lists of 2D correlations, relaxation times and diffusion rates. The file format is an extension of the existing SDF (Structure Data Format), which is compatible with the commonly used MOL format. The association of an NMReDATA file with the raw and spectral data from which it originates constitutes an NMR record. This format is easily readable by humans and computers and provides a simple and efficient way for disseminating results of structural chemistry investigations, automating the verification of published result, and for assisting the constitution of highly needed open-source structural databases

Crossref

Kölner UniversitätsPublikationsServer

Oxford University Research Archive

De Montfort University Open Research Archive

MPG.PuRe

Hal-Diderot

Explore Bristol Research

Archive ouverte UNIGE

Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry

Author: A Makarov
AA Pontet
AJ Dempster
AL Rockwood
AM Richard
AW Jensen
B Seebass
BG Buchanan
C Djerassi
C Steinbeck
C Steinbeck
DA Laws
DL Olson
DL Wheeler
DR Scott
DS Wishart
F Csizmadia
H Budzikiewicz
HE Dayringer
J Braun
J Chen
J Lederberg
JC Lindon
JF Zhang
JJ Irwin
JK Senior
JL Faulon
JM Halket
JR De Laeter
L Sleno
M Badertscher
MD Soffer
ME Elyashberg
MP Balogh
N Huang
O Fiehn
O Fiehn
Oliver Fiehn
P Murray-Rust
QY Wu
RG Dromey
S Heuerding
S Noury
S Omura
SE Stein
SR Heller
SR Heller
T Fink
T Kind
T Morikawa
Tobias Kind
V Wray
W Windig
WD Ihlenfeldt
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Structure elucidation of unknown small molecules by mass spectrometry is a challenge despite advances in instrumentation. The first crucial step is to obtain correct elemental compositions. In order to automatically constrain the thousands of possible candidate structures, rules need to be developed to select the most likely and chemically correct molecular formulas. RESULTS: An algorithm for filtering molecular formulas is derived from seven heuristic rules: (1) restrictions for the number of elements, (2) LEWIS and SENIOR chemical rules, (3) isotopic patterns, (4) hydrogen/carbon ratios, (5) element ratio of nitrogen, oxygen, phosphor, and sulphur versus carbon, (6) element ratio probabilities and (7) presence of trimethylsilylated compounds. Formulas are ranked according to their isotopic patterns and subsequently constrained by presence in public chemical databases. The seven rules were developed on 68,237 existing molecular formulas and were validated in four experiments. First, 432,968 formulas covering five million PubChem database entries were checked for consistency. Only 0.6% of these compounds did not pass all rules. Next, the rules were shown to effectively reducing the complement all eight billion theoretically possible C, H, N, S, O, P-formulas up to 2000 Da to only 623 million most probable elemental compositions. Thirdly 6,000 pharmaceutical, toxic and natural compounds were selected from DrugBank, TSCA and DNP databases. The correct formulas were retrieved as top hit at 80–99% probability when assuming data acquisition with complete resolution of unique compounds and 5% absolute isotope ratio deviation and 3 ppm mass accuracy. Last, some exemplary compounds were analyzed by Fourier transform ion cyclotron resonance mass spectrometry and by gas chromatography-time of flight mass spectrometry. In each case, the correct formula was ranked as top hit when combining the seven rules with database queries. CONCLUSION: The seven rules enable an automatic exclusion of molecular formulas which are either wrong or which contain unlikely high or low number of elements. The correct molecular formula is assigned with a probability of 98% if the formula exists in a compound database. For truly novel compounds that are not present in databases, the correct formula is found in the first three hits with a probability of 65–81%. Corresponding software and supplemental data are available for downloads from the authors' website

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Advances in structure elucidation of small molecules using mass spectrometry

Author: A Alexander
A Broersen
A Castro
A Cuadros-Inostroza
A Droit
A Fredenhagen
A Gordin
A Kameyama
A Kameyama
A Kerber
A Koulman
A Luedemann
A Makarov
A Makarov
A Makarov
A Mitch
A Nordstrom
A Pelander
A Ramos-Fernandez
A Schmidt
A Schreiber
A Serb
A Zhang
A-EF Nassar
AC Lee
AD Hegeman
AD Southam
AG Marshall
AG Marshall
AG Pereira-Medrano
AH Grange
AH Grange
AH Grange
AH Payne
AI Nepomuceno
AJ Alexander
AJ Richard
AJ Williams
AK Vrkic
AL Heaton
AL Piccinelli
AL Rockwood
AL Rockwood
AL Rockwood
AM Jennifer
AN Lane
AV Xianmei Cai
AW Hill
AWT Bristow
B Christensen
B Fan
B Portet
B Wen
BD Nourse
BL Ackermann
BL Milman
BO Keller
BP Koch
BS Mitrevski
BY Renard
C Birkemeyer
C Brunnée
C Hopley
C Pan
C Prakash
C Seger
C Tuniz
C Vafiadi
C Wittmann
C Zhou
CA Marchant
CA Mueller
CA Smith
CE Wujcik
CW Klampfl
D Eric
D Kuehl
D Ryan
D Schwudke
D Sorensen
D Strapoc
DB Robb
DB Robb
DD Stranz
DE Garcia
DF Hochstrasser
DJ Ashline
DJ Weston
DJ Weston
DK Williams Jr
DM Drexler
DM Good
DM Hawkins
DM Horn
DQ Liu
DQ Liu
DR Albaugh
DS Cornett
DS Wishart
DS Wishart
DS Wishart
DW Hill
DW Hill
E Allard
E Dudley
E Gelpí
E Gelpí
E Gorlach
E Hoffmann De
E Pittenauer
E Rijke de
E Rosenberg
E Skoczynska
E Ventola
E Werner
EA Kapp
EA Syrstad
EC Tatsis
ECM Chen
EL Schymanski
EL Schymanski
EL Schymanski
EM Thurman
EM Thurman
EM Thurman
EP Go
ER Wickremsinhe
EW Deutsch
EW Taylor
F Cuyckens
F Cuyckens
F Kuhn
F Matsuda
F Matsuda
F Milletti
F Pont
F Sacher
F Steiner
F Xu
FF Hsu
FF Hsu
FF Hsu
FW McLafferty
FW McLafferty
FW McLafferty
FW McLafferty
G Bouchoux
G Bringmann
G Chen
G Hopfgartner
G Miliauskas
G Schlotterbeck
G Yan
GB Ge
GE Hofmeister
GJ Berkel Van
GJ Dear
GL Gauthier
GL Glish
GS Frysinger
GS Gorman
H Budzikiewicz
H Chen
H Chen
H Choi
H Gallart-Ayala
H Hayen
H Hayen
H Hong
H Horai
H Kaspar
H Lu
H Neuweger
H Oberacher
H Oberacher
H Rodriguez
H Song
H Zhang
H Zhang
H Zhang
H Zhang
HA Clark
HF Sturt
HJ Cooper
HJ Sterling
HK Lim
HK Lim
I Ferrer
I Francois
I Marchi
I Molnár-Perl
IA Kaltashov
ID Wilson
IG Zenkevich
IM Lazar
J Dalluge
J Delaney
J Diana
J Downing
J Draper
J Han
J Hummel
J Hummel
J Meija
J Schiller
J Schmidt
J Segura
J Somuramasami
J Souady
J Zhang
J Zhang
J-L Faulon
JA Falkner
JA Falkner
JB Fenn
JC Bradley
JC Dickens
JC Fjeldsted
JC Hannis
JC Schwartz
JCL Erve
JD Williams
JE Biller
JE Elias
JEP Syka
JG Stroh
JH Futrell
JH Gross
JH Zhu
JH Zhu
JI Haleem
JK Baker
JK Wolken
JL Holmes
JL Little
JL Wolfender
JM Halket
JM Kirk
JM Phalp
JR Wickens
JS Brodbelt
JS Forrester
JS Sinninghe Damsté
JS Splitter
JSB Vlieger de
JT Watson
K Akiyama
K Biemann
K Dettmer
K Dreisewerd
K Guo
K Heberger
K Hobby
K Horvath
K Kandasamy
K Katerina
K Laniewski
K Levsen
K Levsen
K Miyamoto
K Qian
K Schug
K Varmuza
K Yang
KG Lloyd
KP Bateman
KR Jonscher
KW Cheng
KX Wan
L Calcagnile
L Dinan
L Feldberg
L Karsten
L Leclercq
L Li
L Li
L Mondello
L Ramaley
L Sleno
L Sleno
L Yang
L Zhang
LA McDonnell
LC Short
LM Fell
M Adahchour
M Badertscher
M Bedair
M Bogusz
M Brown
M Eggink
M Emmerling
M Fernandes-Whaley
M Gergov
M Gfrerer
M Gu
M Hamacher
M Heinonen
M Holcapek
M Ibanez
M Jalali-Heravi
M Karas
M Karelson
M Kellmann
M Kiffe
M Krauss
M Krummen
M Lehane
M Mann
M Okamoto
M Palit
M Pavlic
M Pulfer
M Scheurell
M Scholz
M Trunzer
M Wind
M Wind
M Yao
M Zhu
MA Eash
ME Elyashberg
ME Hansen
MG Zampolli
ML Bandu
MM Savitski
MM Siegel
MM Yao
MP Balogh
MP Balogh
MP Balogh
MP Washburn
MR Anari
MR Anari
MS Bereman
MS Molchanova
MT Olson
MT Rodgers
MT Sheldon
N Hertkorn
N Huang
N Jaitly
N Ohashi
N Reig
NB Cech
NE Manicke
O Corcoran
O David Sparkman
O Fiehn
O Fiehn
O Pelkonen
OM Saad
OV Krokhin
P Ausloos
P Calza
P Dwivedi
P Fontana
P Giavalisco
P Kiousi
P Lampen
P Marriott
P McCormack
P Mendes
P Murray-Rust
P Schmitt-Kopplin
P Zhu
PA Sutton
PB Lukka
PC Carvalho
PE Adams
PE Sauer
PGA Pedrioli
Q Li
Q Li
Q Xiong
R Almeida
R Baigorri
R Harkewicz
R Hellborg
R Kaliszan
R Knochenmuss
R Kostiainen
R Li
R Mylonas
R Nakabayashi
R Ramanathan
R Samudrala
R Schiewek
R Wu
R Zenobi
RA Scheltema
RA Shellie
RA Zubarev
RA Zubarev
RB Cody
RD Loss
RF Staack
RG Cooks
RG Cooks
RG Dromey
RH Perry
RJ Beynon
RJ Mortishire-Smith
RJ Mortishire-Smith
RK Snider
RM Smith
RM Smith
RP Lattimer
RS Plumb
RS Plumb
RT Kelly
S Bocker
S Bocker
S Borth
S Bourcier
S Buckingham
S Christophoridou
S Dresen
S Dua
S Ekins
S Jarussophon
S Kim
S Kothari
S Ma
S Nojima
S Ojanpera
S Rogers
S Sang
S Su
S Trimpin
S Urayama
S Wolf
SA McLuckey
SC Bell
SC Habicht
SE Ong
SE Scheppele
SE Stein
SE Stein
SE Stein
SE Stein
SF Anabel
SG Roussis
SG Villas-Bôas
SJ Bos
SJ Gaskell
SJ Rochfort
SJ Valentine
SS Ebada
SS Rubakhin
SY Ow
T Alon
T Alon
T Beier
T Chen
T Kind
T Kind
T Kind
T Kind
T Kind
T Lynch
T Reemtsma
T Shinkawa
TA Lydic
TA Ternes
TA Ternes
TG Payne
TJ Kauppila
TM Kertesz
TM Kertesz
TM Schaub
TR Covey
TR Northen
TR Sana
TRI Cataldi
V Exarchou
V Kovácik
V Sanz-Nebot
V Vukics
V Zaikin
VA Petyuk
VI Babushok
VV Mihaleva
W Timm
W Windig
W Zhong
W Zou
WC Byrdwell
WC Byrdwell
WC Yang
WF Smyth
WMA Niessen
WTB Anthony
X Feng
X Han
X Liang
X-J Li
XY Zhu
Y Cai
Y Chen
Y Chen
Y Duan
Y Konishi
Y Lin
Y Liu
Y Park
Y Sawada
Y Shinbo
Y Wang
Y Wang
YA Jeilani
YK Wang
YR Luo
Z Tozuka
Z Yeping
ZP Yao
Publication venue: Springer Vienna
Publication date: 01/01/2010
Field of study

The structural elucidation of small molecules using mass spectrometry plays an important role in modern life sciences and bioanalytical approaches. This review covers different soft and hard ionization techniques and figures of merit for modern mass spectrometers, such as mass resolving power, mass accuracy, isotopic abundance accuracy, accurate mass multiple-stage MS(n) capability, as well as hybrid mass spectrometric and orthogonal chromatographic approaches. The latter part discusses mass spectral data handling strategies, which includes background and noise subtraction, adduct formation and detection, charge state determination, accurate mass measurements, elemental composition determinations, and complex data-dependent setups with ion maps and ion trees. The importance of mass spectral library search algorithms for tandem mass spectra and multiple-stage MS(n) mass spectra as well as mass spectral tree libraries that combine multiple-stage mass spectra are outlined. The successive chapter discusses mass spectral fragmentation pathways, biotransformation reactions and drug metabolism studies, the mass spectral simulation and generation of in silico mass spectra, expert systems for mass spectral interpretation, and the use of computational chemistry to explain gas-phase phenomena. A single chapter discusses data handling for hyphenated approaches including mass spectral deconvolution for clean mass spectra, cheminformatics approaches and structure retention relationships, and retention index predictions for gas and liquid chromatography. The last section reviews the current state of electronic data sharing of mass spectra and discusses the importance of software development for the advancement of structure elucidation of small molecules

CiteSeerX

Crossref

Springer - Publisher Connector

PubMed Central

Structure Revision of Asperjinone Using Computer-Assisted Structure Elucidation Methods

Author: Antony J. Williams
Codina A.
Elyashberg M. E.
Elyashberg M. E.
Elyashberg M. E.
Elyashberg M. E.
Elyashberg M. E.
Kirill Blinov
Liao W.-Y.
Mikhail Elyashberg
Sergey Molodtsov
Smurnyy Y. D.
Steinbeck C.
Williams A. J.
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref

Fundamentals of Structure Elucidator System

Author: A Tarantola
D Neuhaus
DB Nelson
E Pretsch
EG Paul
J Lederberg
J Zupan
J-T Clerc
JM Seco
KA Blinov
KA Blinov
KA Blinov
KA Blinov
KC Nicolaou
LA Gribov
LA Gribov
LA Gribov
M Elyashberg
M Mitchell
M Reichenbächer
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Elyashberg
ME Munk
MW Lodewyk
NAB Gray
RT Weavers
S Berger
SG Molodtsov
SI Sasaki
W Bremser
YD Smurnyy
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Metabolite Structure Assignment Using In Silico NMR Techniques

Author: Arthur S. Edison
Elyashberg M.
Kenneth M. Merz
Susanta Das
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/06/2020
Field of study

A major challenge for metabolomic analysis is to obtain an unambiguous identification of the metabolites detected in a sample. Among metabolomics techniques, NMR spectroscopy is a sophisticated, powerful, and generally applicable spectroscopic tool that can be used to ascertain the correct structure of newly isolated biogenic molecules. However, accurate structure prediction using computational NMR techniques depends on how much of the relevant conformational space of a particular compound is considered. It is intrinsically challenging to calculate NMR chemical shifts using high-level DFT when the conformational space of a metabolite is extensive. In this work, we developed NMR chemical shift calculation protocols using a machine learning model in conjunction with standard DFT methods. The pipeline encompasses the following steps: (1) conformation generation using a force field (FF)-based method, (2) filtering the FF generated conformations using the ASE-ANI machine learning model, (3) clustering of the optimized conformations based on structural similarity to identify chemically unique conformations, (4) DFT structural optimization of the unique conformations, and (5) DFT NMR chemical shift calculation. This protocol can calculate the NMR chemical shifts of a set of molecules using any available combination of DFT theory, solvent model, and NMR-active nuclei, using both user-selected reference compounds and/or linear regression methods. Our protocol reduces the overall computational time by 2 orders of magnitude over methods that optimize the conformations using fully ab initio methods, while still producing good agreement with experimental observations. The complete protocol is designed in such a manner that makes the computation of chemical shifts tractable for a large number of conformationally flexible metabolites

Crossref

Kettering University

Computer-assisted structure elucidation of natural products with limited 2D NMR data: application of the StrucEluc system

Author: Ablordeppey
Bax
Bax
Blinov
Bodenhausen
Bremser
Capon
Christie
Crouch
Crouch
Crouch
Crouch
Elyashberg
Elyashberg
Elyashberg
Elyashberg
Elyashberg
Funatsu
Hadden
Jayasuriya
K�ck
Lindel
Marquez
Martin
Martin
Martin
Martin
Martin
Martin
Martin
Martin
Martin
Martin
Martin
Munk
M�ller
Nuzillard
Peng
Russell
Schlotterbeck
Sharaf
Sharaf
Sharaf
Sharaf
Spitzer
Steinbeck
Steinbeck
Tackie
Will
Yuan
Publication venue: 'Wiley'
Publication date: 01/01/2003
Field of study

Crossref