Search CORE

153 research outputs found

Is EC class predictable from reaction mechanism?

Author: A Statnikov
AG McDonald
BV Dasarathy
BW Matthews
C Andreini
C Andreini
DARS Latino
DE Almonacid
DE Almonacid
GL Holliday
GL Holliday
GL Holliday
GL Holliday
GL Holliday
GL Holliday
GL Holliday
GL Holliday
IUBMB
J Gorodkin
J Menke
John BO Mitchell
JW Torrance
K Astikainen
KM Borgwardt
L Breiman
L De Ferrari
LD Hughes
M Aizerman
M Kanehisa
M Leber
N Furnham
N Nagano
N Nagano
Neetika Nath
NM O'Boyle
O Sacher
PC Babbitt
PD Dobson
R Lowe
RD Uriarte
SA Rahman
SCH Pegg
SCH Pegg
T Bray
T Bylander
V Egelhofer
VN Vapnik
WS Noble
X Hu
Y Yamanishi
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

We thank the Scottish Universities Life Sciences Alliance (SULSA) and the Scottish Overseas Research Student Awards Scheme of the Scottish Funding Council (SFC) for financial support.Background: We investigate the relationships between the EC (Enzyme Commission) class, the associated chemical reaction, and the reaction mechanism by building predictive models using Support Vector Machine (SVM), Random Forest (RF) and k-Nearest Neighbours (kNN). We consider two ways of encoding the reaction mechanism in descriptors, and also three approaches that encode only the overall chemical reaction. Both cross-validation and also an external test set are used. Results: The three descriptor sets encoding overall chemical transformation perform better than the two descriptions of mechanism. SVM and RF models perform comparably well; kNN is less successful. Oxidoreductases and hydrolases are relatively well predicted by all types of descriptor; isomerases are well predicted by overall reaction descriptors but not by mechanistic ones. Conclusions: Our results suggest that pairs of similar enzyme reactions tend to proceed by different mechanisms. Oxidoreductases, hydrolases, and to some extent isomerases and ligases, have clear chemical signatures, making them easier to predict than transferases and lyases. We find evidence that isomerases as a class are notably mechanistically diverse and that their one shared property, of substrate and product being isomers, can arise in various unrelated ways. The performance of the different machine learning algorithms is in line with many cheminformatics applications, with SVM and RF being roughly equally effective. kNN is less successful, given the role that non-local information plays in successful classification. We note also that, despite a lack of clarity in the literature, EC number prediction is not a single problem; the challenge of predicting protein function from available sequence data is quite different from assigning an EC classification from a cheminformatics representation of a reaction.Publisher PDFPeer reviewe

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of St. Andrews - Pure

St Andrews Research Repository

Enzyme mechanism prediction: a template matching problem on InterPro signature subspaces

Author: CZ Cai
CZ Cai
E Spyromitros
G Tsoumakas
GL Holliday
GL Holliday
GL Holliday
Hamse Y. Mussa
John B. O. Mitchell
L Ferrari De
L Ferrari De
Luna De Ferrari
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies

Author: A Andreeva
AE Todd
AL Cuff
Alison L. Cuff
AU Tamuri
BE Engelhardt
BH Dessailly
C Chothia
CA Orengo
Christine A. Orengo
DA Benson
DE Almonacid
DM Schmidt
DS Tawfik
G Caetano-Anolles
GA Reeves
Gemma L. Holliday
GJ Bartlett
GJ Binford
GL Holliday
GL Holliday
GL Holliday
HS Park
I Nobeli
Ian Sillitoe
J Ruan
J Shi
Janet M. Thornton
JP Overington
K Katoh
LH Greene
M Bashton
M Groll
M Xu
ME Glasner
MT Murakami
N Furnham
N Gallastegui
Nicholas Furnham
NJ Mulder
O Khersonsky
PF Gherardini
PJ O'Brien
Roman A. Laskowski
SC Pegg
SD Brown
SF Altschul
W Heinemeyer
WS Valdar
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

In order to understand the evolution of enzyme reactions and to gain an overview of biological catalysis we have combined sequence and structural data to generate phylogenetic trees in an analysis of 276 structurally defined enzyme superfamilies, and used these to study how enzyme functions have evolved. We describe in detail the analysis of two superfamilies to illustrate different paradigms of enzyme evolution. Gathering together data from all the superfamilies supports and develops the observation that they have all evolved to act on a diverse set of substrates, whilst the evolution of new chemistry is much less common. Despite that, by bringing together so much data, we can provide a comprehensive overview of the most common and rare types of changes in function. Our analysis demonstrates on a larger scale than previously studied, that modifications in overall chemistry still occur, with all possible changes at the primary level of the Enzyme Commission (E.C.) classification observed to a greater or lesser extent. The phylogenetic trees map out the evolutionary route taken within a superfamily, as well as all the possible changes within a superfamily. This has been used to generate a matrix of observed exchanges from one enzyme function to another, revealing the scale and nature of enzyme evolution and that some types of exchanges between and within E.C. classes are more prevalent than others. Surprisingly a large proportion (71%) of all known enzyme functions are performed by this relatively small set of 276 superfamilies. This reinforces the hypothesis that relatively few ancient enzymatic domain superfamilies were progenitors for most of the chemistry required for life

Public Library of Science (PLOS)

CiteSeerX

Crossref

LSHTM Research Online

Directory of Open Access Journals

PubMed Central

UCL Discovery

The Francis Crick Institute

EnzyMiner: automatic identification of protein level mutations and their impact on target enzymes from PubMed abstracts

Author: A Bairoch
A Chang
A Fleischmann
AK McCallum
C Cleverdon
CJO Baker
CJO Baker
CT Porter
D Hanisch
D Rebholz-Schuhmann
F Horn
F Sebastiani
G Szarvas
GL Holliday
J Barthelmes
JG Caporaso
JT Chang
K Hult
K Rajaraman
LC Lee
LS Larkey
M Erdogmus
N Gövert
N Nagano
O Zamir
R Caspi
R Witte
R Witte
R Witte
RN Goldberg
SK Dwivedi
Süveyda Yeniterzi
T Karopka
Uğur Sezerman
V Renugopalakrishnan
Y Tsuruoka
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

BACKGROUND: A better understanding of the mechanisms of an enzyme's functionality and stability, as well as knowledge and impact of mutations is crucial for researchers working with enzymes. Though, several of the enzymes' databases are currently available, scientific literature still remains at large for up-to-date source of learning the effects of a mutation on an enzyme. However, going through vast amounts of scientific documents to extract the information on desired mutation has always been a time consuming process. In this paper, therefore, we describe an unique method, termed as EnzyMiner, which automatically identifies the PubMed abstracts that contain information on the impact of a protein level mutation on the stability and/or the activity of a given enzyme. RESULTS: We present an automated system which identifies the abstracts that contain an amino-acid-level mutation and then classifies them according to the mutation's effect on the enzyme. In the case of mutation identification, MuGeX, an automated mutation-gene extraction system has an accuracy of 93.1% with a 91.5 F-measure. For impact analysis, document classification is performed to identify the abstracts that contain a change in enzyme's stability or activity resulting from the mutation. The system was trained on lipases and tested on amylases with an accuracy of 85%. CONCLUSION: EnzyMiner identifies the abstracts that contain a protein mutation for a given enzyme and checks whether the abstract is related to a disease with the help of information extraction and machine learning techniques. For disease related abstracts, the mutation list and direct links to the abstracts are retrieved from the system and displayed on the Web. For those abstracts that are related to non-diseases, in addition to having the mutation list, the abstracts are also categorized into two groups. These two groups determine whether the mutation has an effect on the enzyme's stability or functionality followed by displaying these on the web

Crossref

Springer - Publisher Connector

PubMed Central

Sabanci University Research Database

Cellular Radiosensitivity: How much better do we understand it?

Author: Agrawal A
Ahel I
Ahnesorg P
Ali A
Anderson RM
Awa AA
Azzam E.I.
Bailey SM
Bakkenist CJ
Beamish H
Bedford JS
Bedford JS
Bender MA
Berkovich E
Biedermann KA
Blunt T
Booth C
Bosma GC
Breslin C
Brown GL
Brown JM
Buck D
Carney JP
Cerosaletti K
Cerosaletti K
Chan DW
Chan DW
Chapman JD
Chen P.C.
Clements PM
Cornforth MN
Cornforth MN
Cornforth MN
Curtis SB
Davis ME
de Jager M
Deans B
Dianor II
Ding Q
Dizdaroglu M
Doil C
El-Khamisy SF
El-Khamisy SF
Emmerson PT
Falck J
Falck J
Foray N
Game JC
Gao Y
Gatti RA
Georgakilas AG
Germain M
Godthelp BC
Goffman TE
Goldman M
Goldstein L
Goodarzi AA
Goodarzi AA
Goodhead DT
Grawunder U
Gueven N
Gueven N
Gueven N
Hirano M
Ho KS
Ho KY
Holliday R
Hopfner KP
Houldsworth J
Huen MS
Iliakis G
Iliakis G
Iliakis G
Jackson SP
Jazayeri A
Jeggo PA
Jeggo PA
Jeggo PA
Jeggo PA
Joiner MC
Joiner MC
Kankura T
Kastan MB
Kato TA
Kemp LM
Kemp LM
Khanna KK
Kitagawa R
Kolas NK
Kozlov SV
Krueger SA
Kurz EU
Lavin MF
Lavin MF
Lehmann AR
Li Z
Lindahl T
Lisker R
Little JB
Liu N
Liu Y
Lobrich M
Lockart RZ
Loken MK
Macklis RM
Mailand N
Marples B
Marples B
Martin F. Lavin
Maser RS
Miller RW
Mirzoeva OK
Morgan WF
Morgan WF
Moshous D
Mothersill C
Moynahan ME
Muhlmann-Diaz MC
Muller B
Nakamura J
Neel JV
Nelms BE
Nicolas N
Nikjoo H
O'Driscoll M
O'Driscoll M
Olive PL
Olive PL
Otake M
Ouyang H
Painter RB
Pandita TK
Parsons PA
Paull TT
Paull TT
Paull TT
Penny Jeggo
Peters LJ
Potten CS
Pourquier P
Pourquier P
Purdy JA
Radford IR
Radford JR
Ratnayake RK
Resnick MA
Riballo E
Robins HI
Rothkamm K
Rothkamm K
Rothkamm K
Royal HD
Samson L
Savitsky K
Sax K
Sedgwick RP
Shinohara A
Shreeram S
Singh NP
Spivak G
Stamato TD
Stamato TD
Stewart GS
Stewart GS
Stewart GS
Strzelczyk JJ
Sun Y
Sun Y.
Sutherland JC
Taccioli GE
Takashima H
Tapio S
Taylor AM
Taylor AMR
Teoule R
Terasima T
Thacker J
Thacker J
Thacker J
Thierry-Chef I
Thompson LH
Thompson LH
Uziel T
van den Bosch M
van der Schans GP
Varon R
von Sonntag C
Walker JR
Wall BF
Wallach DF
Waltes R
Ward IM
Ward JF
Ward JF
Weemaes CMR
Whitmore GF
Willetts NS
Williams RS
Wolff S
Wolff S
Wolff S
Wolff S
Woo SY
Wood RD
Wykes SM
Wyman C
Yoshimaru H
Yoshimoto Y
You Z
Zdzienicka MZ
Zdzienicka MZ
Zdzienicka MZ
Zhao W
Zhou T
Ziv Y
Publication venue: 'Informa UK Limited'
Publication date: 03/12/2009
Field of study

Purpose: Ionizing radiation exposure gives rise to a variety of lesions in DNA that result in genetic instability and potentially tumorigenesis or cell death. Radiation extends its effects on DNA by direct interaction or by radiolysis of H2O that generates free radicals or aqueous electrons capable of interacting with and causing indirect damage to DNA. While the various lesions arising in DNA after radiation exposure can contribute to the mutagenising effects of this agent, the potentially most damaging lesion is the DNA double strand break (DSB) that contributes to genome instability and/or cell death. Thus in many cases failure to recognise and/or repair this lesion determines the radiosensitivity status of the cell. DNA repair mechanisms including homologous recombination (HR) and non-homologous end-joining (NHEJ) have evolved to protect cells against DNA DSB. Mutations in proteins that constitute these repair pathways are characterised by radiosensitivity and genome instability. Defects in a number of these proteins also give rise to genetic disorders that feature not only genetic instability but also immunodeficiency, cancer predisposition, neurodegeneration and other pathologies. Conclusions: In the past fifty years our understanding of the cellular response to radiation damage has advanced enormously with insight being gained from a wide range of approaches extending from more basic early studies to the sophisticated approaches used today. In this review we discuss our current understanding of the impact of radiation on the cell and the organism gained from the array of past and present studies and attempt to provide an explanation for what it is that determines the response to radiation

Crossref

Sussex Research Online

Abnormal motor activity during anaesthesia in a dog: a case report

Author: Andreas Lervik
AO Rossetti
AR Mair
B Walder
BS Martin
CH Tan
D Plumb
F Steffen
GL Covey-Crump
Henning A Haga
HJ Naish
HW Jeon
I Constant
I Kathmann
J Logothetis
J McConnell
J Wilson
JL Harrison
K Chandler
K Fukunaga
KM Tobias
LE Smedile
LJ Voss
LJ Voss
LK Cullen
LM Matsubara
LW Hall
Max Becker
MC Veronesi
MH Court
MS Scheller
S Koyama
S Meyer
S Serrano
SC Haskins
SM Mirsattari
T Duke
TA Holliday
V Andreoni
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Seizures or convulsions that occur during anaesthesia in veterinary patients are infrequently reported in the literature. Consequently, the incidence of such events is unknown. Several drugs commonly used in clinical veterinary anaesthesia have been shown to induce epileptiform activity in both human clinical patients and experimental candidates. The present case report describes convulsions in a four-year old male Bernese mountain dog during maintenance of anaesthesia with isoflurane after premedication with acepromazine and methadone followed by co-induction with propofol and ketamine. The dog had no history of previous convulsions. The use of several sedative and anaesthetic drugs makes it difficult to find one single causative pharmaceutical

Crossref

Springer - Publisher Connector

PubMed Central

XMPP for cloud computing in bioinformatics supporting discovery and invocation of asynchronous web services

Author: A Labarga
AR Jones
B Wallner
BioMoby Consortium
C Steinbeck
C Steinbeck
D Smedley
E Jain
E Willighagen
Egon L Willighagen
EW Sayers
GL Holliday
H Stockinger
H Sugawara
Jarl ES Wikberg
Johannes Wagener
L Stein
LM Vaquero
M Hucka
M Lapins
MA Larkin
MD Wilkinson
MWEJ Fiers
N Adams
O Spjuth
Ola Spjuth
P Fisher
P Murray-Rust
PBT Neerincx
R Kottmann
RD Dowell
S Hoon
S Hunter
S Kaarthik
S Kerrien
S Kuhn
S Miyazaki
T Oinn
UniProt Consortium
X Dong
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Background: Life sciences make heavily use of the web for both data provision and analysis. However, the increasing amount of available data and the diversity of analysis tools call for machine accessible interfaces in order to be effective. HTTP-based Web service technologies, like the Simple Object Access Protocol (SOAP) and REpresentational State Transfer (REST) services, are today the most common technologies for this in bioinformatics. However, these methods have severe drawbacks, including lack of discoverability, and the inability for services to send status notifications. Several complementary workarounds have been proposed, but the results are ad-hoc solutions of varying quality that can be difficult to use. Results: We present a novel approach based on the open standard Extensible Messaging and Presence Protocol (XMPP), consisting of an extension (IO Data) to comprise discovery, asynchronous invocation, and definition of data types in the service. That XMPP cloud services are capable of asynchronous communication implies that clients do not have to poll repetitively for status, but the service sends the results back to the client upon completion. Implementations for Bioclipse and Taverna are presented, as are various XMPP cloud services in bio- and cheminformatics. Conclusion: XMPP with its extensions is a powerful protocol for cloud services that demonstrate several advantages over traditional HTTP-based Web services: 1) services are discoverable without the need of an external registry, 2) asynchronous invocation eliminates the need for ad-hoc solutions like polling, and 3) input and output types defined in the service allows for generation of clients on the fly without the need of an external semantics description. The many advantages over existing technologies make XMPP a highly interesting candidate for next generation online services in bioinformatics

Maastricht University Research Portal

Crossref

Springer - Publisher Connector

PubMed Central

Open Access LMU ( Ludwig-Maximilians-Univ. München)

Quantitative global studies of reactomes and metabolomes using a vectorial representation of reactions and chemical compounds

Author: A Andreeva
A Bairoch
A Beloqui
AG Fraser
BA Shoemaker
C Chothia
C Perez-Iratxeta
D Devos
DS Wishart
F Pearl
F Yates
Florencio Pazos
G Yona
GL Holliday
H Satoh
I Letunic
I Nobeli
I Nobeli
I Schomburg
IH Witten
JM Chandonia
Juan C Triviño
L Chen
L Holm
M Kanehisa
M Karthikeyan
M Schena
M Vendruscolo
MJ Gomez
NJ Mulder
NM O'Boyle
P Bork
P Willett
QY Zhang
RA Fisher
RJ Serfling
RL Marsden
SC Schuster
Y Zhang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Global studies of the protein repertories of organisms are providing important information on the characteristics of the protein space. Many of these studies entail classification of the protein repertory on the basis of structure and/or sequence similarities. The situation is different for metabolism. Because there is no good way of measuring similarities between chemical reactions, there is a barrier to the development of global classifications of "metabolic space" and subsequent studies comparable to those done for protein sequences and structures. Results In this work, we propose a vectorial representation of chemical reactions, which allows them to be compared and classified. In this representation, chemical compounds, reactions and pathways may be represented in the same vectorial space. We show that the representation of chemical compounds reflects their physicochemical properties and can be used for predictive purposes. We use the vectorial representations of reactions to perform a global classification of the reactome of the model organism <it>E. coli</it>. Conclusions We show that this unsupervised clustering results in groups of enzymes more coherent in biological terms than equivalent groupings obtained from the EC hierarchy. This hierarchical clustering produces an optimal set of 21 groups which we analyzed for their biological meaning.</p

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digital.CSIC

InterPro in 2017-beyond protein family and domain annotations

Author: Attwood TK
Babbitt PC
Bateman A
Bork P
Bridge AJ
Chang HY
Dosztányi Z
El-Gebali S
Finn RD
Fraser M
Gough J
Haft D
Holliday GL
Huang H
Huang X
Letunic I
Lopez R
Lu S
Marchler-Bauer A
Mi H
Mistry J
Mitchell AL
Natale DA
Necci M
Nuka G
Orengo CA
Park Y
Pesseat S
Piovesan D
Potter SC
Rawlings ND
Redaschi N
Richardson L
Rivoire C
Sangrador-Vegas A
Sigrist C
Sillitoe I
Smithers B
Squizzato S
Sutton G
Thanki N
Thomas PD
Tosatto SC
Wu CH
Xenarios I
Yeh LS
Young SY
Publication venue
Publication date: 29/11/2016
Field of study

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences

UCL Discovery

Catalytic residues in hydrolases: analysis of methods designed for ligand-binding site prediction

Author: A Armon
A Bhinge
A Eichinger
A Gutteridge
A Pingoud
A Shulman-Peleg
A Stark
A Stark
A Stark
AA Bliznyuk
AC Stuart
AC Wallace
AH Elcock
AJ Chalk
ATR Laurie
ATR Laurie
B Huang
B Lee
B Zhang
C Taroni
CA Orengo
CM Seibert
CT Porter
D Pantoja-Uceda
DG Levitt
DJ Vocadlo
DT-H Chang
E Kellenberger
E Youn
FX Gomis-Rüth
G Nimrod
G Pugalenthi
GG Hammes
GJ Bartlett
GJ Kleywegt
GL Holliday
GL Holliday
GP Brady
H Yao
HM Berman
I Botos
Irena Roterman
J An
J An
J Dundas
J Liang
J Teyra
J Weigelt
J-M Chandonia
JA Barker
JM Yon
K Henrick
K Katayanagi
K Kinoshita
K Kinoshita
K Stummeyer
K Zhang
KA Snyder
Katarzyna Prymula
KP Peters
M Bryliński
M Grabowski
M Hendlich
M Jambon
M Jambon
M Kanehisa
M Landau
M Levitt
M Stahl
MA Kurowski
MJ Ondrechen
MP Liang
MR Landon
N Kallenbach
O Gileadi
O Goldenberg
O Lichtarge
O Lichtarge
P Aloy
P Baldi
P Reis
PJ Hajduk
PJ Hajduk
PP Wangikar
R Landgraf
RA Laskowski
RA Laskowski
RA Laskowski
RV Spriggs
S Madabushi
S Vajda
SE Brenner
T Fawcett
T Kortvelyesi
T Pupko
T Tadokoro
T Zhang
TA Binkowski
Tomasz Jadczyk
UniProt Consortium The Universal Protein Resource (UniProt)
V Siksnys
W Kabsch
Y Dou
Y Oda
Y Tsunaka
Y-R Tang
Publication venue: Springer Netherlands
Publication date: 01/01/2010
Field of study

The comparison of eight tools applicable to ligand-binding site prediction is presented. The methods examined cover three types of approaches: the geometrical (CASTp, PASS, Pocket-Finder), the physicochemical (Q-SiteFinder, FOD) and the knowledge-based (ConSurf, SuMo, WebFEATURE). The accuracy of predictions was measured in reference to the catalytic residues documented in the Catalytic Site Atlas. The test was performed on a set comprising selected chains of hydrolases. The results were analysed with regard to size, polarity, secondary structure, accessible solvent area of predicted sites as well as parameters commonly used in machine learning (F-measure, MCC). The relative accuracies of predictions are presented in the ROC space, allowing determination of the optimal methods by means of the ROC convex hull. Additionally the minimum expected cost analysis was performed. Both advantages and disadvantages of the eight methods are presented. Characterization of protein chains in respect to the level of difficulty in the active site prediction is introduced. The main reasons for failures are discussed. Overall, the best performance offers SuMo followed by FOD, while Pocket-Finder is the best method among the geometrical approaches

Crossref

Springer - Publisher Connector

PubMed Central

Jagiellonian Univeristy Repository