Search CORE

12 research outputs found

Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry

Author: A Makarov
AA Pontet
AJ Dempster
AL Rockwood
AM Richard
AW Jensen
B Seebass
BG Buchanan
C Djerassi
C Steinbeck
C Steinbeck
DA Laws
DL Olson
DL Wheeler
DR Scott
DS Wishart
F Csizmadia
H Budzikiewicz
HE Dayringer
J Braun
J Chen
J Lederberg
JC Lindon
JF Zhang
JJ Irwin
JK Senior
JL Faulon
JM Halket
JR De Laeter
L Sleno
M Badertscher
MD Soffer
ME Elyashberg
MP Balogh
N Huang
O Fiehn
O Fiehn
Oliver Fiehn
P Murray-Rust
QY Wu
RG Dromey
S Heuerding
S Noury
S Omura
SE Stein
SR Heller
SR Heller
T Fink
T Kind
T Morikawa
Tobias Kind
V Wray
W Windig
WD Ihlenfeldt
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Structure elucidation of unknown small molecules by mass spectrometry is a challenge despite advances in instrumentation. The first crucial step is to obtain correct elemental compositions. In order to automatically constrain the thousands of possible candidate structures, rules need to be developed to select the most likely and chemically correct molecular formulas. RESULTS: An algorithm for filtering molecular formulas is derived from seven heuristic rules: (1) restrictions for the number of elements, (2) LEWIS and SENIOR chemical rules, (3) isotopic patterns, (4) hydrogen/carbon ratios, (5) element ratio of nitrogen, oxygen, phosphor, and sulphur versus carbon, (6) element ratio probabilities and (7) presence of trimethylsilylated compounds. Formulas are ranked according to their isotopic patterns and subsequently constrained by presence in public chemical databases. The seven rules were developed on 68,237 existing molecular formulas and were validated in four experiments. First, 432,968 formulas covering five million PubChem database entries were checked for consistency. Only 0.6% of these compounds did not pass all rules. Next, the rules were shown to effectively reducing the complement all eight billion theoretically possible C, H, N, S, O, P-formulas up to 2000 Da to only 623 million most probable elemental compositions. Thirdly 6,000 pharmaceutical, toxic and natural compounds were selected from DrugBank, TSCA and DNP databases. The correct formulas were retrieved as top hit at 80–99% probability when assuming data acquisition with complete resolution of unique compounds and 5% absolute isotope ratio deviation and 3 ppm mass accuracy. Last, some exemplary compounds were analyzed by Fourier transform ion cyclotron resonance mass spectrometry and by gas chromatography-time of flight mass spectrometry. In each case, the correct formula was ranked as top hit when combining the seven rules with database queries. CONCLUSION: The seven rules enable an automatic exclusion of molecular formulas which are either wrong or which contain unlikely high or low number of elements. The correct molecular formula is assigned with a probability of 98% if the formula exists in a compound database. For truly novel compounds that are not present in databases, the correct formula is found in the first three hits with a probability of 65–81%. Corresponding software and supplemental data are available for downloads from the authors' website

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Advances in structure elucidation of small molecules using mass spectrometry

Author: A Alexander
A Broersen
A Castro
A Cuadros-Inostroza
A Droit
A Fredenhagen
A Gordin
A Kameyama
A Kameyama
A Kerber
A Koulman
A Luedemann
A Makarov
A Makarov
A Makarov
A Mitch
A Nordstrom
A Pelander
A Ramos-Fernandez
A Schmidt
A Schreiber
A Serb
A Zhang
A-EF Nassar
AC Lee
AD Hegeman
AD Southam
AG Marshall
AG Marshall
AG Pereira-Medrano
AH Grange
AH Grange
AH Grange
AH Payne
AI Nepomuceno
AJ Alexander
AJ Richard
AJ Williams
AK Vrkic
AL Heaton
AL Piccinelli
AL Rockwood
AL Rockwood
AL Rockwood
AM Jennifer
AN Lane
AV Xianmei Cai
AW Hill
AWT Bristow
B Christensen
B Fan
B Portet
B Wen
BD Nourse
BL Ackermann
BL Milman
BO Keller
BP Koch
BS Mitrevski
BY Renard
C Birkemeyer
C Brunnée
C Hopley
C Pan
C Prakash
C Seger
C Tuniz
C Vafiadi
C Wittmann
C Zhou
CA Marchant
CA Mueller
CA Smith
CE Wujcik
CW Klampfl
D Eric
D Kuehl
D Ryan
D Schwudke
D Sorensen
D Strapoc
DB Robb
DB Robb
DD Stranz
DE Garcia
DF Hochstrasser
DJ Ashline
DJ Weston
DJ Weston
DK Williams Jr
DM Drexler
DM Good
DM Hawkins
DM Horn
DQ Liu
DQ Liu
DR Albaugh
DS Cornett
DS Wishart
DS Wishart
DS Wishart
DW Hill
DW Hill
E Allard
E Dudley
E Gelpí
E Gelpí
E Gorlach
E Hoffmann De
E Pittenauer
E Rijke de
E Rosenberg
E Skoczynska
E Ventola
E Werner
EA Kapp
EA Syrstad
EC Tatsis
ECM Chen
EL Schymanski
EL Schymanski
EL Schymanski
EM Thurman
EM Thurman
EM Thurman
EP Go
ER Wickremsinhe
EW Deutsch
EW Taylor
F Cuyckens
F Cuyckens
F Kuhn
F Matsuda
F Matsuda
F Milletti
F Pont
F Sacher
F Steiner
F Xu
FF Hsu
FF Hsu
FF Hsu
FW McLafferty
FW McLafferty
FW McLafferty
FW McLafferty
G Bouchoux
G Bringmann
G Chen
G Hopfgartner
G Miliauskas
G Schlotterbeck
G Yan
GB Ge
GE Hofmeister
GJ Berkel Van
GJ Dear
GL Gauthier
GL Glish
GS Frysinger
GS Gorman
H Budzikiewicz
H Chen
H Chen
H Choi
H Gallart-Ayala
H Hayen
H Hayen
H Hong
H Horai
H Kaspar
H Lu
H Neuweger
H Oberacher
H Oberacher
H Rodriguez
H Song
H Zhang
H Zhang
H Zhang
H Zhang
HA Clark
HF Sturt
HJ Cooper
HJ Sterling
HK Lim
HK Lim
I Ferrer
I Francois
I Marchi
I Molnár-Perl
IA Kaltashov
ID Wilson
IG Zenkevich
IM Lazar
J Dalluge
J Delaney
J Diana
J Downing
J Draper
J Han
J Hummel
J Hummel
J Meija
J Schiller
J Schmidt
J Segura
J Somuramasami
J Souady
J Zhang
J Zhang
J-L Faulon
JA Falkner
JA Falkner
JB Fenn
JC Bradley
JC Dickens
JC Fjeldsted
JC Hannis
JC Schwartz
JCL Erve
JD Williams
JE Biller
JE Elias
JEP Syka
JG Stroh
JH Futrell
JH Gross
JH Zhu
JH Zhu
JI Haleem
JK Baker
JK Wolken
JL Holmes
JL Little
JL Wolfender
JM Halket
JM Kirk
JM Phalp
JR Wickens
JS Brodbelt
JS Forrester
JS Sinninghe Damsté
JS Splitter
JSB Vlieger de
JT Watson
K Akiyama
K Biemann
K Dettmer
K Dreisewerd
K Guo
K Heberger
K Hobby
K Horvath
K Kandasamy
K Katerina
K Laniewski
K Levsen
K Levsen
K Miyamoto
K Qian
K Schug
K Varmuza
K Yang
KG Lloyd
KP Bateman
KR Jonscher
KW Cheng
KX Wan
L Calcagnile
L Dinan
L Feldberg
L Karsten
L Leclercq
L Li
L Li
L Mondello
L Ramaley
L Sleno
L Sleno
L Yang
L Zhang
LA McDonnell
LC Short
LM Fell
M Adahchour
M Badertscher
M Bedair
M Bogusz
M Brown
M Eggink
M Emmerling
M Fernandes-Whaley
M Gergov
M Gfrerer
M Gu
M Hamacher
M Heinonen
M Holcapek
M Ibanez
M Jalali-Heravi
M Karas
M Karelson
M Kellmann
M Kiffe
M Krauss
M Krummen
M Lehane
M Mann
M Okamoto
M Palit
M Pavlic
M Pulfer
M Scheurell
M Scholz
M Trunzer
M Wind
M Wind
M Yao
M Zhu
MA Eash
ME Elyashberg
ME Hansen
MG Zampolli
ML Bandu
MM Savitski
MM Siegel
MM Yao
MP Balogh
MP Balogh
MP Balogh
MP Washburn
MR Anari
MR Anari
MS Bereman
MS Molchanova
MT Olson
MT Rodgers
MT Sheldon
N Hertkorn
N Huang
N Jaitly
N Ohashi
N Reig
NB Cech
NE Manicke
O Corcoran
O David Sparkman
O Fiehn
O Fiehn
O Pelkonen
OM Saad
OV Krokhin
P Ausloos
P Calza
P Dwivedi
P Fontana
P Giavalisco
P Kiousi
P Lampen
P Marriott
P McCormack
P Mendes
P Murray-Rust
P Schmitt-Kopplin
P Zhu
PA Sutton
PB Lukka
PC Carvalho
PE Adams
PE Sauer
PGA Pedrioli
Q Li
Q Li
Q Xiong
R Almeida
R Baigorri
R Harkewicz
R Hellborg
R Kaliszan
R Knochenmuss
R Kostiainen
R Li
R Mylonas
R Nakabayashi
R Ramanathan
R Samudrala
R Schiewek
R Wu
R Zenobi
RA Scheltema
RA Shellie
RA Zubarev
RA Zubarev
RB Cody
RD Loss
RF Staack
RG Cooks
RG Cooks
RG Dromey
RH Perry
RJ Beynon
RJ Mortishire-Smith
RJ Mortishire-Smith
RK Snider
RM Smith
RM Smith
RP Lattimer
RS Plumb
RS Plumb
RT Kelly
S Bocker
S Bocker
S Borth
S Bourcier
S Buckingham
S Christophoridou
S Dresen
S Dua
S Ekins
S Jarussophon
S Kim
S Kothari
S Ma
S Nojima
S Ojanpera
S Rogers
S Sang
S Su
S Trimpin
S Urayama
S Wolf
SA McLuckey
SC Bell
SC Habicht
SE Ong
SE Scheppele
SE Stein
SE Stein
SE Stein
SE Stein
SF Anabel
SG Roussis
SG Villas-Bôas
SJ Bos
SJ Gaskell
SJ Rochfort
SJ Valentine
SS Ebada
SS Rubakhin
SY Ow
T Alon
T Alon
T Beier
T Chen
T Kind
T Kind
T Kind
T Kind
T Kind
T Lynch
T Reemtsma
T Shinkawa
TA Lydic
TA Ternes
TA Ternes
TG Payne
TJ Kauppila
TM Kertesz
TM Kertesz
TM Schaub
TR Covey
TR Northen
TR Sana
TRI Cataldi
V Exarchou
V Kovácik
V Sanz-Nebot
V Vukics
V Zaikin
VA Petyuk
VI Babushok
VV Mihaleva
W Timm
W Windig
W Zhong
W Zou
WC Byrdwell
WC Byrdwell
WC Yang
WF Smyth
WMA Niessen
WTB Anthony
X Feng
X Han
X Liang
X-J Li
XY Zhu
Y Cai
Y Chen
Y Chen
Y Duan
Y Konishi
Y Lin
Y Liu
Y Park
Y Sawada
Y Shinbo
Y Wang
Y Wang
YA Jeilani
YK Wang
YR Luo
Z Tozuka
Z Yeping
ZP Yao
Publication venue: Springer Vienna
Publication date: 01/01/2010
Field of study

The structural elucidation of small molecules using mass spectrometry plays an important role in modern life sciences and bioanalytical approaches. This review covers different soft and hard ionization techniques and figures of merit for modern mass spectrometers, such as mass resolving power, mass accuracy, isotopic abundance accuracy, accurate mass multiple-stage MS(n) capability, as well as hybrid mass spectrometric and orthogonal chromatographic approaches. The latter part discusses mass spectral data handling strategies, which includes background and noise subtraction, adduct formation and detection, charge state determination, accurate mass measurements, elemental composition determinations, and complex data-dependent setups with ion maps and ion trees. The importance of mass spectral library search algorithms for tandem mass spectra and multiple-stage MS(n) mass spectra as well as mass spectral tree libraries that combine multiple-stage mass spectra are outlined. The successive chapter discusses mass spectral fragmentation pathways, biotransformation reactions and drug metabolism studies, the mass spectral simulation and generation of in silico mass spectra, expert systems for mass spectral interpretation, and the use of computational chemistry to explain gas-phase phenomena. A single chapter discusses data handling for hyphenated approaches including mass spectral deconvolution for clean mass spectra, cheminformatics approaches and structure retention relationships, and retention index predictions for gas and liquid chromatography. The last section reviews the current state of electronic data sharing of mass spectra and discusses the importance of software development for the advancement of structure elucidation of small molecules

CiteSeerX

Crossref

Springer - Publisher Connector

PubMed Central

Conformal Predictors for Compound Activity Prediction

Author: A Gammerman
AN Jain
Chih-Chung Chang
DC Weis
EY Chang
G Shafer
HP Graf
J-L Faulon Jr
K Woodsend
L Bottou
Lars Carlsson
M McCool
T Gärtner
V Monve
V Vovk
Y Wang
Y You
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Towards agile large-scale predictive modelling in drug discovery with flow-based programming design principles

Author: AA Hunter
AV Deursen
B Giardine
C Hansch
C Sloggett
D Blankenberg
D Spinellis
F Pérez
J Alvarsson
J Alvarsson
J Alvarsson
J Goecks
J Köster
J-L Faulon
Jonathan Alvarsson
JP Morrison
L Goodstadt
LG Valerio Jr
MP Mazanetz
O Spjuth
O Spjuth
Ola Spjuth
P Gedeck
PD Tommaso
R-E Fan
S Spycher
Samuel Lampa
SP Sadedin
T Kosar
T White
U Norinder
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Using product kernels to predict protein interactions

Author: A Ben-Hur
A Ben-Hur
AH Tong
AJ Enright
AJ Walhout
B Rost
BA Shoemaker
BA Shoemaker
C Burges
C Leslie
CA Orengo
CS Goh
CZ Cai
D Przybylski
DS Wishart
E Sprinzak
EC Webb
EG Hutchinson
F Pazos
FB Sheinerman
G Rigaut
GP Smith
GR Smith
HM Berman
I Xenarios
J Shawe-Taylor
J Shawe-Taylor
JA Siepen
JC Rain
JL Faulon
JL Faulon
JL Faulon
JR Bock
K Bennet
KM Borgwardt
L Ralaivola
M Deng
M Kanehisa
M Pellegrini
M Rinaldis de
MB Eisen
N Nagamine
P Aloy
P Aloy
P Mahe
P Uetz
P Ye
R Apweiler
R Jansen
R Karlsson
R Overbeek
RB Jones
RD King
RE Steward
S Martin
SJ Swamidass
SM Zaremba
T Dandekar
T Ito
V Vapnik
VA Simossis
W Baumeister
WM Brown
Y Yan
Y Yang
Publication venue: Springer Nature
Publication date: 01/01/2008
Field of study

There is a wide variety of experimental methods for the identification of protein interactions. This variety has in turn spurred the development of numerous different computational approaches for modeling and predicting protein interactions. These methods range from detailed structure-based methods capable of operating on only a single pair of proteins at a time to approximate statistical methods capable of making predictions on multiple proteomes simultaneously. In this chapter, we provide a brief discussion of the relative merits of different experimental and computational methods available for identifying protein interactions. Then we focus on the application of our particular (computational) method using Support Vector Machine product kernels. We describe our method in detail and discuss the application of the method for predicting protein-protein interactions, beta-strand interactions, and protein-chemical interactions

Crossref

The University of Manchester - Institutional Repository

Molecular structures enumeration and virtual screening in the chemical space with RetroPath2.0

Author: AM Virshup
B McKay
CJ Churchwell
CS Henry
D Hoksza
D Thiagarajan
DP Visco Jr
DS Wishart
G Schneider
H Yim
HL Rost
I Ugi
J Mellor
J-L Faulon
JC Gerdeen
JD Orth
JE Peironcely
Jean-Loup Faulon
JG Jeffryes
JL Durant
JM Gally
K Kawai
L Ruddigkeit
M Liu
Mathilde Koch
MJ Yu
MM Jaghoori
N Hadadi
N Hadadi
P Carbonell
P Carbonell
P Kiefer
P Setny
P Willett
Pablo Carbonell
R Deursen van
S Martin
S Moretti
S Szymkuc
SA Rahman
T Feher
T Rodrigues
Thomas Duigou
WA Warr
WM Brown
Y Moriya
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref