826 research outputs found

Predicting Secondary Structures, Contact Numbers, and Residue-wise Contact Orders of Native Protein Structure from Amino Acid Sequence by Critical Random Networks

Author: Altschul S. F., Madden, T. L., Sch
Baldi P., Brunak, S., Frasconi, P.
CHANDONIA J-M
Crooks G. E.
Kinjo A. R.
Kinjo A. R., Horimoto, K.
Lee B.
Li W., Jaroszewski, L.
Nishikawa K.
Pollastri G., Baldi, P., Fariselli
Rost B.
TATENO Y
Publication venue: Biophysical Society of Japan
Publication date: 01/01/2005

Prediction of one-dimensional protein structures such as secondary structures and contact numbers is useful for three-dimensional structure prediction and important for understanding the sequence-structure relationship. Here we present a new machine-learning method, critical random networks (CRNs), for predicting one-dimensional structures, and apply it, with position-specific scoring matrices, to the prediction of secondary structures (SS), contact numbers (CN), and residue-wise contact orders (RWCO). The present method achieves, on average, a Q_3 accuracy of 77.8% for SS and correlation coefficients of 0.726 and 0.601 for CN and RWCO, respectively. The accuracy of the SS prediction is comparable to other state-of-the-art methods, and that of the CN prediction is a significant improvement over previous methods. We give a detailed formulation of the critical random networks-based prediction scheme, and examine the context-dependence of prediction accuracies. In order to study the nonlinear and multi-body effects, we compare the CRNs-based method with a purely linear method based on position-specific scoring matrices. Although not superior to the CRNs-based method, the surprisingly good accuracy achieved by the linear method highlights the difficulty in extracting structural features of higher order from amino acid sequence beyond that provided by the position-specific scoring matrices.

Comment: 20 pages, 1 figure, 5 tables; minor revision; accepted for publication in BIOPHYSIC
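The abstract above reports a Q_3 accuracy for secondary structure and correlation coefficients for CN and RWCO. As a minimal sketch of how such measures are commonly computed (not the authors' evaluation code), the functions below assume three-state labels given as strings and real-valued profiles given as sequences; the function names and the toy inputs are hypothetical.

```python
import numpy as np

def q3_accuracy(predicted, observed):
    """Fraction of residues whose three-state label (e.g. H/E/C) is predicted correctly."""
    if len(predicted) != len(observed):
        raise ValueError("sequences must have equal length")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)

def pearson_cc(predicted, observed):
    """Pearson correlation coefficient between predicted and observed values."""
    return float(np.corrcoef(predicted, observed)[0, 1])

# Toy inputs (made up for illustration only)
print(q3_accuracy("HHEEC", "HHECC"))
print(pearson_cc([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
```

Q_3 is a per-residue classification score, while the correlation coefficient compares real-valued profiles, which is why the abstract quotes a percentage for SS and coefficients for CN and RWCO.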

arXiv.org e-Print Archive

Crossref

Wang-Landau molecular dynamics technique to search for low-energy conformational space of proteins

Author: Akira R. Kinjo
B. A. Berg
J. W. Neidigh
K. Morikami
Ken Nishikawa
P. Kollman
Takashi Mitsui
Takehiro Nagasima
Publication venue: American Physical Society (APS)
Publication date: 17/05/2007

Multicanonical molecular dynamics (MD) is a powerful technique for sampling conformations on rugged potential surfaces such as those of proteins. However, it is notoriously difficult to estimate the multicanonical temperature effectively. Wang and Landau developed a convenient method for estimating the density of states based on a multicanonical Monte Carlo method. In their method, the density of states is calculated autonomously during a simulation. In this paper we develop a set of techniques to effectively apply the Wang-Landau method to MD simulations. In multicanonical MD, the estimation of the derivative of the density of states is critical. In order to estimate it accurately, we devise two original improvements. First, the correction for the density of states is made smooth by using the Gaussian distribution obtained by a short canonical simulation. Second, an approximation is applied to the derivative, which is based on the Gaussian distribution and the multiple weighted histogram technique. A test of this method was performed with small polypeptides, Met-enkephalin and Trp-cage, and it is demonstrated that Wang-Landau MD is consistent with replica exchange MD but can sample a much larger conformational space.

Comment: 8 pages, 7 figures, accepted for publication in Physical Review

arXiv.org e-Print Archive

Crossref

SPRITE and ASSAM: web servers for side chain 3D-motif searching in protein structures

Author: Artymiuk
Berman
DeLano
E. J. Gardiner
Forst
F l p
Holm
Kinjo
Kleywegt
Laskowski
M. Firdaus-Raih
N. Nadzirin
Nuel
P. J. Artymiuk
P. Willett
POIRRETTE
Porter
Sayle
Spriggs
Stark
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2012

Similarities in the 3D patterns of amino acid side chains can provide insights into their function despite the absence of any detectable sequence or fold similarities. Search for protein sites (SPRITE) and amino acid pattern search for substructures and motifs (ASSAM) are graph theoretical programs that can search for 3D amino acid side chain matches in protein structures by representing the amino acid side chains as pseudo-atoms. The geometric relationship of the pseudo-atoms to each other as a pattern can be represented as a labeled graph where the pseudo-atoms are the graph's nodes while the edges are the inter-pseudo-atomic distances. Both programs require the input file to be in the PDB format. The objective of using SPRITE is to identify matches of side chains in a query structure to patterns with characterized function. In contrast, a 3D pattern of interest can be searched for existing occurrences in available PDB structures using ASSAM. Both programs are freely accessible without any login requirement. SPRITE is available at http://mfrlab.org/grafss/sprite/ while ASSAM can be accessed at http://mfrlab.org/grafss/assam/
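The labeled-graph representation described in this abstract can be sketched as follows. This is only an illustration of the data structure (nodes are pseudo-atoms, edges carry inter-pseudo-atom distances), not the SPRITE or ASSAM implementation; the function name, residue labels, and coordinates are hypothetical.

```python
from itertools import combinations
import math

def pattern_graph(pseudo_atoms):
    """Build a labeled graph from a list of (label, (x, y, z)) pseudo-atoms.

    Returns the node labels and a dict mapping each node-index pair (i, j),
    with i < j, to the Euclidean distance between the two pseudo-atoms.
    """
    nodes = [label for label, _ in pseudo_atoms]
    edges = {}
    for (i, (_, a)), (j, (_, b)) in combinations(enumerate(pseudo_atoms), 2):
        edges[(i, j)] = math.dist(a, b)
    return nodes, edges

# Made-up three-residue pattern; coordinates in angstroms are illustrative only
pattern = [
    ("HIS", (0.0, 0.0, 0.0)),
    ("ASP", (3.0, 4.0, 0.0)),
    ("SER", (0.0, 0.0, 6.0)),
]
nodes, edges = pattern_graph(pattern)
print(nodes)
print(edges)
```

A search would then compare such a graph against graphs built from other structures, accepting node pairings whose labels agree and whose edge distances match within some tolerance; the matching step is not shown here.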

CiteSeerX

Crossref

PubMed Central

White Rose Research Online

Unique Interplay between Sugar and Lipid in Determining the Antigenic Potency of Bacterial Antigens for NKT Cells

Author: A Bendelac
A. A Vagin
A. G Leslie
A. P Lawton
Bo Pei
C McCarthy
D Wu
D. G Pellicci
D. M Zajonc
Dirk M. Zajonc
E Tupin
Enrico Girardi
Esther Dawen Yu
G. J Kleywegt
J Mattner
J Rauch
J Wang
J. P Scott-Browne
Jing Wang
K. O Yu
K. S Wun
L. C Wu
M Brigl
M. D Winn
Mitchell Kronenberg
N. A Borg
Norihito Tarumoto
P Emsley
Petr Illarionov
Philippa Marrack
S Sidobre
S. C Lovell
T Kawano
V Sriram
Y Kinjo
Y Li
Yali Li
Yuki Kinjo
Publication venue: Public Library of Science
Publication date: 01/11/2011

Structural and biophysical studies reveal the induced-fit mechanism underlying the stringent specificity of invariant natural killer T cells for unique glycolipid antigens from the pathogen Streptococcus pneumoniae

Crossref

Directory of Open Access Journals

PubMed Central

Composite structural motifs of binding sites for delineating biological functions of proteins

Author: A Bairoch
A Fiorillo
A Rausell
A Stark
AC Joerger
AC Wallace
AG Murzin
Akira R. Kinjo
AM Schnoes
AR Kinjo
B Bollobás
B Dasgupta
B Louie
B Rost
BH Dessailly
C Branden
C Winter
CV Robinson
D Petrey
DJ Schuller
DM Chipman
E Krissinel
E Toyota
FP Davis
GM Santos
H Berman
H Kettenberger
Haruki Nakamura
I Friedberg
J Janin
J Shi
J Westbrook
JI Yeh
K Chen
K Henrick
K Kinoshita
K Okazaki
K Stenberg
L Xie
M Bashton
M Brylinski
M Kitayner
M Levitt
M Moertl
M Nardini
M Tyagi
M Yang
N Nagano
N Tuncbag
N Zhao
ND Gold
O Keskin
OC Redfern
Ozlem Keskin
P Cramer
P Shannon
PD Pawelek
R Koike
R Rentzsch
R Sinha
RR Thangudu
S Kadono
SF Altschul
T Amemiya
T Kawabata
TA Holland
TC Terwilliger
Y Loewenstein
Z Aung
ZX Xia
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2011

Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs which represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures.

Comment: 34 pages, 7 figure

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Predicting residue-wise contact orders in proteins by support vector regression

Author: A Bairoch
AG Murzin
AR Kinjo
B Rost
CH Tsai
D Kihara
D Sarda
DT Jones
G Pollastri
GP Raghava
HM Berman
J Song
J Wang
Jiangning Song
JM Chandonia
Kevin Burrage
KW Plaxco
M Punta
MPS Brown
NP Prabhu
S Ahmad
S Hua
V Vapnik
W Kabsch
W Liu
X Wang
Z Yuan
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: The residue-wise contact order (RWCO) describes the sequence separations between a residue of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable information for reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships.

RESULTS: We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance: local sequence in the form of PSI-BLAST profiles; local sequence plus amino acid composition; local sequence plus molecular weight; local sequence plus secondary structure predicted by PSIPRED; local sequence plus molecular weight and amino acid composition; local sequence plus molecular weight and predicted secondary structure; and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55 and a root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance, with a CC of 0.57 and an RMSE of 0.79. In addition, combining the secondary structure predicted by PSIPRED was found to significantly improve the prediction performance and yielded the best prediction accuracy, with a CC of 0.60 and an RMSE of 0.78, which is at least comparable to the performance of other existing methods.

CONCLUSION: The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces that support vector regression is a powerful tool for extracting the protein sequence-structure relationship and for estimating protein structural profiles from amino acid sequences.
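To make the RWCO definition in this abstract concrete, here is a minimal sketch that sums the sequence separations of each residue's spatial contacts. The abstract does not give the exact formula, so the distance cutoff, the minimum sequence separation, the normalization by chain length, and the use of one representative coordinate per residue are all assumptions, and the function name is hypothetical.

```python
import numpy as np

def residue_wise_contact_order(coords, cutoff=12.0, min_sep=3):
    """Sum of sequence separations |i - j| over contacting residues j, per residue i.

    coords  : (N, 3) array with one representative atom per residue (assumed)
    cutoff  : distance threshold defining a contact, in angstroms (assumed)
    min_sep : contacts closer than this in sequence are ignored (assumed)
    The sum is divided by the chain length N (assumed normalization).
    """
    coords = np.asarray(coords, dtype=float)
    n = len(coords)
    # Pairwise Euclidean distances between residues
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    idx = np.arange(n)
    sep = np.abs(idx[:, None] - idx[None, :])
    contact = (dist < cutoff) & (sep >= min_sep)
    return (sep * contact).sum(axis=1) / n

# Toy chain of five residues spaced 4 angstroms apart on a line (made-up data)
toy = [[4.0 * i, 0.0, 0.0] for i in range(5)]
print(residue_wise_contact_order(toy, cutoff=13.0))
```

Residues whose contacts lie far away in sequence receive large values, which is why the abstract describes RWCO as representing the extent of long-range contacts.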

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Queensland University of Technology ePrints Archive

University of Queensland eSpace

Application of Advanced High-temperature Superconductors for Fusion Plasma Experimental Devices

Author: D. Sakata
E. Yatsuka
G. Bansal
J. Morikawa
K. Kinjo
N. Yanagi
T. Mito
Y. Ogawa

National Institute for Fusion Science (NIFS-Repository)

The influence of contour fragmentation on recognition memory: An event-related potential study

Author: Boucart
Brodeur
Doniger
Duarte
Emmanuelle Dionne-Dostie
Foley
Friedman
Hess
J. Bruno Debruille
Kinjo
Lisa Buchy
Louis Renoult
Marie Prévost
Martin Lepage
Mathes
Mathieu B. Brodeur
Otten
Paller
Pietrowsky
Sanquist
Schendan
Sehatpour
Snodgrass
Stuss
Van Petten
Viggiano
Voss
Publication venue: Elsevier BV
Publication date: 30/04/2011

Crossref

University of East Anglia digital repository

Unveiling exotic magnetic phase diagram of a non-Heisenberg quasicrystal approximant

Author: Fujii Takenori
Inagaki Kazuki
Ishikawa Asuka
Kinjo Katsuki
Labib Farid
Nawa Kazuhiro
Sato Taku J.
Suzuki Shintaro
Tamura Ryuji
Wu Hung-Cheng
Publication date: 22/10/2023

A magnetic phase diagram of the non-Heisenberg Tsai-type 1/1 Au-Ga-Tb approximant crystal (AC) has been established across a wide electron-per-atom (e/a) range via magnetization and powder neutron diffraction measurements. The diagram revealed exotic ferromagnetic (FM) and antiferromagnetic (AFM) orders that originate from the unique local spin icosahedron common to icosahedral quasicrystals (iQCs) and ACs. The noncoplanar whirling AFM order is stabilized as the ground state at an e/a of 1.72 or less, whereas a noncoplanar whirling FM order was found at the larger e/a of 1.80, with magnetic moments tangential to the Tb icosahedron in both cases. Moreover, the FM/AFM phase selection rule was unveiled in terms of the nearest neighbour (J1) and next nearest neighbour (J2) interactions by numerical calculations on a non-Heisenberg single icosahedron. The present findings will pave the way for understanding the intriguing magnetic orders of not only non-Heisenberg FM/AFM ACs but also non-Heisenberg FM/AFM iQCs, the latter of which are yet to be discovered.

arXiv.org e-Print Archive

Amino acid "little Big Bang": Representing amino acid substitution matrices as dot products of Euclidian vectors

Author: A Kidera
A Kinjo
A Marin
DK Agrafiotis
E Ollivier
F Fogolari
G Golub
J Méndez
Jean-François Gibrat
K Tomii
Karel Zimmermann
M Dayhoff
M Delorme
M Wall
MO Delorme
O Alter
O Bastien
R Durbin
R Swanson
S Altschul
S Gu
S Henikoff
S Kawashima
S Maetschke
V Biou
W Press
W Xu
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices.

Results: We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices.

Conclusions: This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acids is useful when intrinsic properties of amino acids are necessary, for instance when building sequence profiles or finding consensus sequences using machine learning algorithms such as support vector machines and neural networks.
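The idea of recovering vectors whose dot products reproduce a matrix can be sketched with NumPy's singular value decomposition. This is an illustration on a made-up 4x4 symmetric matrix that is positive semidefinite by construction, not the paper's procedure or a real PAM/BLOSUM matrix; real substitution matrices can have negative eigenvalues, and how the paper treats that case is not stated in the abstract. The function name and tolerance are hypothetical.

```python
import numpy as np

# Toy symmetric matrix built as A @ A.T so that an exact dot-product
# representation exists (values are made up for illustration)
A = np.array([[1.0, 0.0],
              [0.8, 0.6],
              [-0.5, 1.0],
              [0.0, -1.0]])
M = A @ A.T

def vectors_from_matrix(matrix, tol=1e-10):
    """Return one row vector per matrix row such that X @ X.T approximates matrix.

    Assumes the matrix is symmetric positive semidefinite; singular values
    below tol are treated as zero and the corresponding dimensions dropped.
    """
    U, s, _ = np.linalg.svd(matrix)
    keep = s > tol
    # Scale each retained singular direction by the square root of its singular value
    return U[:, keep] * np.sqrt(s[keep])

X = vectors_from_matrix(M)
print(X.shape)                  # number of items by number of retained dimensions
print(np.allclose(X @ X.T, M))  # dot products reproduce the matrix entries
```

The recovered vectors generally differ from the rows of A by a rotation or reflection, which leaves all dot products, and hence the matrix entries, unchanged.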

Crossref

Springer - Publisher Connector

Directory of Open Access Journals