
eScholarship - University of California

The InterPro protein families and domains database: 20 years on

Author: Bateman A
Blum M
Bork P
Bridge A
Chang H-Y
Chuguransky S
Finn RD
Gough J
Grego T
Haft DH
Kandasaamy S
Letunic I
Marchler-Bauer A
Mi H
Mitchell A
Natale DA
Necci M
Nuka G
Orengo CA
Pandurangan AP
Paysan-Lafosse T
Qureshi M
Raj S
Richardson L
Rivoire C
Salazar GA
Sigrist CJA
Sillitoe I
Thanki N
Thomas PD
Tosatto SCE
Williams L
Wu CH
Publication date: 06/11/2020

The InterPro database (https://www.ebi.ac.uk/interpro/) provides an integrative classification of protein sequences into families, and identifies functionally important domains and conserved sites. InterProScan is the underlying software that allows protein and nucleic acid sequences to be searched against InterPro's signatures. Signatures are predictive models which describe protein families, domains or sites, and are provided by multiple databases. InterPro combines signatures representing equivalent families, domains or sites, and provides additional information such as descriptions, literature references and Gene Ontology (GO) terms, to produce a comprehensive resource for protein classification. Founded in 1999, InterPro has become one of the most widely used resources for protein family annotation. Here, we report the status of InterPro (version 81.0) in its 20th year of operation, and its associated software, including updates to database content, the release of a new website and REST API, and performance improvements in InterProScan

UCL Discovery

ComPath: comparative enzyme analysis and annotation in pathway/subsystem contexts

Author: A Andreeva
A Bateman
A Marchler-Bauer
A Osterman
AL Barabási
Gene Ontology Consortium
The UniProt Consortium
CJA Sigrist
CM Zmasek
DA Benson
DH Haft
HM Berman
HW Ma
J Wu
K Choi
Kwangmin Choi
L Pireddu
M Kanehisa
M Kanehisa
M Madera
N Hulo
P Stothard
PC Babbitt
PD Karp
R Caspi
R Overbeek
RA George
S Kim
S Kim
S Kim
S Kim
SCH Pegg
SF Altschul
Sun Kim
V Batagelj
VM Markowitz
W Thompson
WR Pearson
Y Ye
Y Zheng
YI Wolf
Publication venue: BioMed Central
Publication date: 01/03/2008

Abstract. Background: Once a new genome is sequenced, one of the important questions is which biological pathways are present and which are absent. Analysis of biological pathways in a genome is a complicated task, since a number of biological entities are involved in pathways and the pathways in different organisms are not identical. Computational pathway identification and analysis thus involves a number of computational tools and databases and is typically done in comparison with pathways in other organisms. This computational requirement is far beyond the capability of biologists, so information systems for reconstructing, annotating, and analyzing biological pathways are much needed. We introduce a new comparative pathway analysis workbench, ComPath, which integrates various resources and computational tools using an interactive spreadsheet-style web interface for reliable pathway analyses. Results: ComPath allows users to compare biological pathways in multiple genomes using a spreadsheet-style web interface in which various sequence-based analyses can be performed to compare enzymes (e.g. sequence clustering) and pathways (e.g. pathway hole identification), to search a genome for de novo prediction of enzymes, or to annotate a genome in comparison with reference genomes of choice. To fill in pathway holes or make de novo enzyme predictions, multiple computational methods such as FASTA, Whole-HMM, CSR-HMM (a method of our own introduced in this paper), and PDB-domain search are integrated in ComPath. Our experiments show that the FASTA and CSR-HMM search methods generally outperform the Whole-HMM and PDB-domain search methods in terms of sensitivity, but FASTA search performs poorly in terms of specificity, detecting more false positives as the E-value cutoff increases. Overall, the CSR-HMM search method performs best in terms of both sensitivity and specificity. Gene neighborhood and pathway neighborhood (global network) visualization tools can be used to obtain context information that is complementary to the conventional KEGG map representation. Conclusion: ComPath is an interactive workbench for pathway reconstruction, annotation, and analysis in which experts can perform various sequence, domain, and context analyses using an intuitive and interactive spreadsheet-style interface.
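The sensitivity and specificity comparison in the abstract above can be illustrated with a small sketch. It uses the standard textbook definitions (sensitivity = TP / (TP + FN), specificity = TN / (TN + FP)); the abstract does not say how ComPath computed them, so the function names, the data layout and the toy E-values below are assumptions made purely for illustration.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true enzymes recovered by the search: TP / (TP + FN)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def specificity(tn: int, fp: int) -> float:
    """Fraction of non-enzymes correctly rejected: TN / (TN + FP)."""
    return tn / (tn + fp) if (tn + fp) else 0.0


def evaluate(hits: dict, positives: set, candidates: set, cutoff: float):
    """Score one search method at one E-value cutoff.

    hits       -- hypothetical mapping of sequence id -> best E-value
    positives  -- ids known to be true members of the enzyme family
    candidates -- every id that was searched (positives and negatives)
    """
    # A sequence is predicted positive when its E-value is at or below the cutoff.
    predicted = {seq_id for seq_id, e_value in hits.items() if e_value <= cutoff}
    tp = len(predicted & positives)
    fp = len(predicted - positives)
    fn = len(positives - predicted)
    tn = len(candidates - predicted - positives)
    return sensitivity(tp, fn), specificity(tn, fp)


# Toy data (invented): two true enzymes, three unrelated sequences.
hits = {"a": 1e-10, "b": 1e-3, "c": 0.5, "d": 5.0}
positives = {"a", "b"}
candidates = {"a", "b", "c", "d", "e"}

for cutoff in (1e-5, 1e-2, 1.0):
    sens, spec = evaluate(hits, positives, candidates, cutoff)
    print(f"cutoff={cutoff:g} sensitivity={sens:.2f} specificity={spec:.2f}")
```

In this toy data, loosening the cutoff from 1e-2 to 1.0 admits one false positive and lowers specificity, which is the kind of trade-off the abstract reports for FASTA.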

RASOnD - A comprehensive resource and search tool for RAS superfamily oncogenes from various species

Author: A Koike
A Kouranov
A Malumbres
A Moon
A Srinivasan
A Wittinghofer
CJ Bult
CJ Tabin
CJA Sigrist
DA Benson
DR Lowy
DS Goodsell
E Sahai
G Oxford
GK Abou-Alfa
GW Reuther
H Prenen
HJ Andreyev
IG Macara
JG Donaldson
JJ Harvey
K Wennerberg
KA Rauen
L Samantha
LD Stein
M Barbacid
M Chen
M Kanehisa
M Kaur
M Malumbres
M Malumbres
M Safran
M Trahey
MA Larkin
N Mitin
P Hupé
PA Konstantinopoulos
PJ Roberts
Punit Kaur
R Levy
RDM Page
S Chiosea
S Kuersten
S Twigger
SA Forbes
SF Altschul
Sujata Sharma
Tej P Singh
Umay Kulsum
UniProt Consortium
Vishwadeep Singh
WH Kirsten
WH Su
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: The Ras superfamily plays an important role in the control of cell signalling and division. Mutations in the Ras genes convert them into active oncogenes. The Ras oncogenes form a major thrust of global cancer research, as they are involved in the development and progression of tumors. This has resulted in exponential growth of data on the Ras superfamily across different public databases and in the literature. However, no dedicated public resource is currently available for data mining and analysis on this family. The present database was developed to facilitate straightforward access, retrieval and analysis of information available on Ras oncogenes from a single site. Description: We have developed the RAS Oncogene Database (RASOnD) as a comprehensive knowledgebase that provides integrated and curated information on a single platform for oncogenes of the Ras superfamily. RASOnD encompasses exhaustive genomics and proteomics data existing across diverse publicly accessible databases. This resource presently includes a total of 199,046 entries from 101 different species. It provides a search tool to generate information about their nucleotide and amino acid sequences, single nucleotide polymorphisms, chromosome positions, orthologies, motifs, structures, related pathways and associated diseases. We have implemented a number of user-friendly search interfaces and sequence analysis tools. At present the user can (i) browse the data, (ii) search any field through a simple or advanced search interface, and (iii) perform a BLAST search and subsequently a CLUSTALW multiple sequence alignment by selecting sequences of Ras oncogenes. The generic genome browser GBrowse, JMOL for structural visualization and TREEVIEW for phylograms have been integrated for clear perception of retrieved data. External links to related databases have been included in RASOnD. Conclusions: This database is a resource and search tool dedicated to Ras oncogenes. It is useful to cancer biologists and cell and molecular biologists as a ready source for research, identification and elucidation of the role of these oncogenes. The data generated can be used for understanding the relationship between the Ras oncogenes and their association with cancer. The database, updated monthly, is freely accessible online at http://202.141.47.181/rasond/ and http://www.aiims.edu/RAS.html.

Public Library of Science (PLOS)

Calbindin-D32k Is Localized to a Subpopulation of Neurons in the Nervous System of the Sea Cucumber Holothuria glaberrima (Echinodermata)

Members of the calbindin subfamily serve as markers of subpopulations of neurons within the vertebrate nervous system. Although markers of these proteins are widely available and used, their application to invertebrate nervous systems has been very limited. In this study we investigated the presence and distribution of members of the calbindin subfamily in the sea cucumber Holothuria glaberrima (Selenka, 1867). Immunohistological experiments with antibodies made against rat calbindin 1, parvalbumin, and calbindin 2, showed that these antibodies labeled cells and fibers within the nervous system of H. glaberrima. Most of the cells and fibers were co-labeled with the neural-specific marker RN1, showing their neural specificity. These were distributed throughout all of the nervous structures, including the connective tissue plexi of the body wall and podia. Bioinformatics analyses of the possible antigen recognized by these markers showed that a calbindin 2-like protein present in the sea urchin Strongylocentrotus purpuratus, corresponded to the calbindin-D32k previously identified in other invertebrates. Western blots with anti-calbindin 1 and anti-parvalbumin showed that these markers recognized an antigen of approximately 32 kDa in homogenates of radial nerve cords of H. glaberrima and Lytechinus variegatus. Furthermore, immunoreactivity with anti-calbindin 1 and anti-parvalbumin was obtained to a fragment of calbindin-D32k of H. glaberrima. Our findings suggest that calbindin-D32k is present in invertebrates and its sequence is more similar to the vertebrate calbindin 2 than to calbindin 1. Thus, characterization of calbindin-D32k in echinoderms provides an important view of the evolution of this protein family and represents a valuable marker to study the nervous system of invertebrates

CiteSeerX

Plasmodium falciparum Hep1 is required to prevent the self aggregation of PfHsp70-3

The majority of mitochondrial proteins are encoded in the nucleus and need to be imported from the cytosol into the mitochondria, and molecular chaperones play a key role in the efficient translocation and proper folding of these proteins in the matrix. One such molecular chaperone is the eukaryotic mitochondrial heat shock protein 70 (Hsp70); however, it is prone to self-aggregation and requires the presence of an essential zinc-finger protein, Hsp70-escort protein 1 (Hep1), to maintain its structure and function. PfHsp70-3, the only Hsp70 predicted to localize in the mitochondria of P. falciparum, may also rely on a Hep1 orthologue to prevent self-aggregation. In this study, we identified a putative Hep1 orthologue in P. falciparum, and co-expression of PfHsp70-3 and PfHep1 enhanced the solubility of PfHsp70-3. PfHep1 suppressed the thermally induced aggregation of PfHsp70-3 but not the aggregation of malate dehydrogenase or citrate synthase, thus showing specificity for PfHsp70-3. Zinc ions were indeed essential for maintaining the function of PfHep1, as EDTA chelation abrogated its ability to suppress the aggregation of PfHsp70-3. Soluble and functional PfHsp70-3, acquired by co-expression with PfHep1, will facilitate the biochemical characterisation of this particular Hsp70 protein and its evaluation as a drug target for the treatment of malaria.

ResearchOnline@ND

Victoria University Eprints Repository

South East Academic Libraries System (SEALS)

AXY3 encodes a α-xylosidase that impacts the structure and accessibility of the hemicellulose xyloglucan in Arabidopsis plant cell walls

Author: C Coutu
C Fanutti
CJA Sigrist
CP Bonin
D Berger
D Weigel
DJ Cosgrove
DM Cavalier
DM Gibeaut
E Zablackis
E Zablackis
G Mouille
GF Vanzin
HV Scheller
J Borevitz
J Puhlmann
J Sampedro
J Sampedro
JD Monroe
JD Tedman-Jones
JP Vincken
K Nishitani
K Tamura
K Vissenberg
M Madson
M Pauly
M Pauly
M Pauly
M Pauly
MA O’Neill
Markus Günl
Markus Pauly
N Obel
NM Steele
O Lerouxel
O Lerouxel
OA Zabotina
Q Zhou
R Guillen
R Kaida
R Louvet
RA O’Neill
RC Smith
RM Perrin
RR Selvendran
SC Fry
SC Fry
T Hayashi
T Hayashi
T Koyama
T Murashige
T Takeda
WD Bauer
WD Reiter
WD Reiter
WR Scheible
WS York
WS York
Y Osato
YF Guan
Publication venue: Springer-Verlag
Publication date: 01/01/2010

Xyloglucan is the most abundant hemicellulose in the walls of dicots such as Arabidopsis. It is part of the load-bearing structure of a plant cell and its metabolism is thought to play a major role in cell elongation. However, the molecular mechanism by which xyloglucan carries out this and other functions in planta is not well understood. We performed a forward genetic screen utilizing xyloglucan oligosaccharide mass profiling on chemically mutagenized Arabidopsis seedlings to identify mutants with altered xyloglucan structures, termed axy mutants. One of the identified mutants, axy3.1, contains xyloglucan with a higher proportion of non-fucosylated xyloglucan subunits. Mapping revealed that axy3.1 contains a point mutation in XYLOSIDASE1 (XYL1), known to encode an apoplastic glycoside hydrolase that releases xylosyl residues from xyloglucan oligosaccharides at the non-reducing end. The data support the hypothesis that AXY3/XYL1 is an essential component of the apoplastic xyloglucan degradation machinery and that the lack of function in the various axy3 alleles leads not only to an altered xyloglucan structure but also to a xyloglucan that is less tightly associated with other wall components. However, the plant can cope with the excess xyloglucan relatively well, as the mutant does not display any visible growth or morphological phenotypes, with the notable exception of shorter siliques and reduced fitness. Taken together, these results demonstrate that plant apoplastic hydrolases have a larger impact on wall polymer structure and function than previously thought.

Characterization of a Novel Binding Protein for Fortilin/TCTP — Component of a Defense Mechanism against Viral Infection in Penaeus monodon

Fortilin (also known as TCTP) in Penaeus monodon (PmFortilin) and Fortilin Binding Protein 1 (FBP1) have recently been shown to interact and to offer protection against the widespread White Spot Syndrome Virus infection. However, the mechanism is as yet unknown. We investigated this interaction in detail with a number of in silico and in vitro analyses, including prediction of a binding site between PmFortilin and FBP1 and docking simulations. The basis of the modeling analyses was well-conserved PmFortilin orthologs, containing a Ca2+-binding domain at residues 76–110 representing a section of the helical domain, and the translationally controlled tumor protein signatures 1 and 2 (TCTP_1, TCTP_2) at residues 45–55 and 123–145, respectively. We found that Cys59 and Cys76 form a disulfide bond in the C-terminus of FBP1, which is a common structural feature in many exported proteins, and we identified the “x–G–K–K” pattern of the amidation site at the end of the C-terminus. This coincided with our previous work, in which we found the “x–P–P–x” patterns of an antiviral peptide also to be located in the C-terminus of FBP1. The combined bioinformatics and in vitro results indicate that FBP1 is a transmembrane protein and that FBP1 interacts with the N-terminal region of PmFortilin.

CiteSeerX

Whole genome sequence and manual annotation of Clostridium autoethanogenum, an industrially relevant bacterium

Author: Alexander Goesman
Alexander T. Wichlacz
Anne M. Henstra
B Boeckmann
Bart Pander
C Claudel-Renard
Charlie Hodgman
Christopher M. Humphreys
CJA Sigrist
Craig Woods
D Hyatt
David Barrett
E Stackebrandt
EB Fichot
EJ Richardson
F Meyer
Florence J. Annan
H Ogata
H Tae
HN Abubackar
I Schomburg
J Abrini
J Eid
J Marmur
JL Cotter
JL Cotter
JM Bruno-Barcena
Jochen Blom
K Lagesen
KD Pruitt
Klaus Winzer
M Köpke
M Köpke
M Köpke
M Monot
M Pagni
M Scheer
MA Quail
MG Ross
MY Galperin
N Chowdhary
Neil R. Thomas
Nigel P. Minton
O Tirado-Acevedo
P Jones
Pawel Piatek
Peter Rowe
PF Levy
R Mazzoli
R Sims
RD Finn
Ronja Breitkopf
RS Tanner
Rupert Norman
S Koren
S Kurtz
Samantha McLean
Sarah Schatschneider
SD Brown
SF Altschul
SM Utturkar
T Tatusova
The UniProt Consortium
Thomas Millat
TJ Treangen
TM Lowe
Y Feng
Y Guo
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Clostridium autoethanogenum is an acetogenic bacterium capable of producing high value commodity chemicals and biofuels from the C1 gases present in synthesis gas. This common industrial waste gas can act as the sole energy and carbon source for the bacterium that converts the low value gaseous components into cellular building blocks and industrially relevant products via the action of the reductive acetyl-CoA (Wood-Ljungdahl) pathway. Current research efforts are focused on the enhancement and extension of product formation in this organism via synthetic biology approaches. However, crucial to metabolic modelling and directed pathway engineering is a reliable and comprehensively annotated genome sequence

Nottingham ePrints

Nottingham eTheses

Repository@Nottingham

Nottingham Trent Institutional Repository (IRep)

Public Library of Science (PLOS)

Characterization of Profilin Polymorphism in Pollen with a Focus on Multifunctionality

Author: A Bateman
A Di Nardo
A Lambrechts
A Lambrechts
A Limmongkon
AA Fedorov
AD Sharrocks
Antonio J. Castro
B Honoré
B Vemuri
BC Gibbon
BG McLean
BG McLean
BM Jockusch
BM Jockusch
BN Snowman
C Butler-Cole
C Radauer
CE Schutt
CJ Sigrist
CJ Staiger
CJ Staiger
CJ Staiger
CJ Staiger
CJA Sigrist
CR Mehta
D Chalkia
D Polet
D Volkmann
DE Wilke
Dieter Volkmann
DR Kovar
DR Kovar
DR Kovar
E de Castro
E Gasteiger
E Gasteiger
F Baluska
F Chevenet
G Guillen
G Mazzotti
H Larsson
H Levene
H-P Rihs
H-Y Wang
HP Rihs
HY Ren
I Lassing
I Mittermann
J Kyte
JA Asturias
JA Asturias
JA Asturias
JD Thompson
JM McDowell
JM McDowell
Jose C. Jimenez-Lopez
JR Pierce
Juan de D. Alché
K Giehl
K Guruprasad
K Sathish
K Schlüter
K Schlüter
K Schlüter
KD Pruitt
KS Thorn
L Rallo
L Vidali
LM Machesky
M Binder
M Clamp
M Haugwitz
M Vantard
M von Witsch
María I. Rodríguez-García
MJ Deeks
MK Kandasamy
MK Kandasamy
MW Hess
N Blom
N Blom
N Saitou
N Wopfner
NM Mahoney
P Skare
PA Games
PJ Lu
R Aparicio-Fabre
R Blasco
R Karlsson
R Valenta
RB Meagher
RH Sohn
S Fischer
S Henikoff
S Luan
S von Braun
SF Altschul
Sonia Morales
SR Huang
SS Shapiro
TA Hall
TD Pollard
TD Schneider
Vladimir N. Uversky
W Witke
Y-M Jeong
ZS Gao
Publication venue: Public Library of Science
Publication date: 01/01/2012

Profilin, a multigene family involved in actin dynamics, is a protein that interacts with multiple partners, as shown by the presence of at least three binding domains encompassing actin, phosphoinositide lipid, and poly-L-proline interacting patches. In addition, pollen profilins are important allergens in several species such as Olea europaea L. (Ole e 2), Betula pendula (Bet v 2), Phleum pratense (Phl p 12), Zea mays (Zea m 12) and Corylus avellana (Cor a 2). In spite of the biological and clinical importance of these molecules, variability in pollen profilin sequences has received little attention until now. In this work, a relatively high number of pollen profilin sequences have been cloned, with the aim of carrying out an extensive characterization of their polymorphism among 24 olive cultivars and the above-mentioned plant species. Our results indicate a high level of variability in the sequences analyzed. Quantitative intra-specific/varietal polymorphism was higher than that found in inter-specific/cultivar comparisons. Multi-optional posttranslational modifications (e.g. phosphorylation sites), physicochemical properties, and partner-interacting functional residues have been shown to be affected by profilin polymorphism. As a result of this variability, profilins yielded a clear taxonomic separation between the five plant species. Profilin family multifunctionality might be inferred from natural variation through profilin isovariants generated among olive germplasm as a result of polymorphism. The high variability might result in both differential profilin properties and differences in the regulation of the interaction with natural partners, affecting the mechanisms underlying the transmission of signals throughout signaling pathways in response to different stress environments. Moreover, elucidating the effect of profilin polymorphism on adaptive responses such as actin dynamics and cellular behavior represents an exciting research goal for the future.

CiteSeerX