
Digitala Vetenskapliga Arkivet - Academic Archive On-line

FAAST: Flow-space Assisted Alignment Search Tool

Author: Fredrik Lysholm
Björn Andersson
Bengt Persson
M Margulies
M Droege
SB Needleman
TF Smith
O Gotoh
DJ Lipman
WR Pearson
SF Altschul
SF Altschul
MO Dayhoff
V Vacic
R Kofler
S Balzer
J Jerlström-Hultqvist
Z Ning
Publication venue: BioMed Central
Publication date: 01/01/2011

Publikationer från Linköpings universitet

Aston Publications Explorer

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Effects of HMGN variants on the cellular transcription profile

Author: Ashburner
Berman
Bianchi
Birger
Birger
Bolstad
Bradbury
Bustin
Bustin
Bustin
Bustin
D. Landsman
Ding
Dunker
Fan
Fan
Garner
Gautier
Gentleman
Hock
I. Ovcharenko
Kim
L. Taher
Lee
Li
Lu
M. Bustin
M. Rochman
Paranjape
Postnikov
Postnikov
Rochman
Romero
Romero
S. Cherukuri
Sancho
Shirakawa
T. Kurahashi
Tompa
Tompa
V. N. Uversky
Vacic
Vavouri
Woodcock
Publication venue: Oxford University Press
Publication date: 01/01/2011

High mobility group N (HMGN) is a family of intrinsically disordered nuclear proteins that bind to nucleosomes, alter the structure of chromatin and affect transcription. A major unresolved question is the extent of functional specificity, or redundancy, between the various members of the HMGN protein family. Here, we analyze the transcriptional profile of cells in which the expression of various HMGN proteins has been either deleted or doubled. We find that both up- and downregulation of HMGN expression altered the cellular transcription profile. Most, but not all, of the changes were variant specific, suggesting limited redundancy in transcriptional regulation. Analysis of point and swap HMGN mutants revealed that the transcriptional specificity is determined by a unique combination of a functional nucleosome-binding domain and C-terminal domain. Doubling the amount of HMGN had a significantly larger effect on the transcription profile than total deletion, suggesting that the intrinsically disordered structure of HMGN proteins plays an important role in their function. The results reveal an HMGN-variant-specific effect on the fidelity of the cellular transcription profile, indicating that functionally the various HMGN subtypes are not fully redundant.

USFSP Digital Archive

Scholar Commons - University of South Florida

Prediction of prognostic biomarkers for Interferon-based therapy to Hepatitis C Virus patients: a metaanalysis of the NS5A protein in subtypes 1a, 1b, and 3a

Author: A El-Shamy
A Macdonald
A Wohnsland
B Korber
B Liu
C Kuiken
C Sarrazin
D Wang
E Baralis
ea El-Hefnawi Mahmoud
GR Reyes
Iman A El-Azab
J Cohen
J Felsenstein
J Nousbaum
J Pei
J Song
JM Pawlotsky
K Tamura
M Clamp
M Torres-Puente
M Wistrand
Mahmoud M ElHefnawi
MM El Hefnawi
N Pavio
P Farci
RD Finn
SR Eddy
Suher Zada
TA Hall
U Mihm
V Vacic
V Wagner
WLaP Jiawei
Publication venue: BioMed Central
Publication date: 01/01/2010

PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity

Author: C Burge
Cheng-Tsung Lu
DM Shien
E Huala
F Diella
F Gnad
FF Zhou
GE Crooks
H Steen
HD Huang
HD Huang
J Gao
J Gao
JC Obenauer
JL Heazlewood
JM Stone
KC Chou
LM Iakoucheva
M Schneider
M Steffen
MJ Hubbard
N Blom
N Blom
Neil Arvin Bretaña
P Diolez
PV Hornbeck
R Aebersold
S Luan
SC Huber
SR Eddy
TD Schneider
TY Lee
TY Lee
TY Lee
Tzong-Yi Lee
V Vacic
Y Xue
Y Xue
YH Wong
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Protein phosphorylation catalyzed by kinases plays crucial regulatory roles in intracellular signal transduction. Because high-throughput mass spectrometry-based experiments are difficult to perform, there is a desire to predict phosphorylation sites using computational methods. However, previous studies of in silico prediction of plant phosphorylation sites have not considered kinase-specific phosphorylation data. Thus, we are motivated to propose a new method that investigates different substrate specificities in plant phosphorylation sites.
Results: Experimentally verified phosphorylation data were extracted from TAIR9, a protein database containing 3006 phosphorylation data from the plant species Arabidopsis thaliana. To investigate the various substrate motifs in plant phosphorylation, maximal dependence decomposition (MDD) is employed to cluster a large set of phosphorylation data into subgroups containing significantly conserved motifs. A profile hidden Markov model (HMM) is then applied to learn a predictive model for each subgroup. Cross-validation evaluation of the MDD-clustered HMMs yields an average accuracy of 82.4% for serine, 78.6% for threonine, and 89.0% for tyrosine models. Moreover, independent test results using Arabidopsis thaliana phosphorylation data from UniProtKB/Swiss-Prot show that the proposed models correctly predict 81.4% of phosphoserine, 77.1% of phosphothreonine, and 83.7% of phosphotyrosine sites. Interestingly, several MDD-clustered subgroups are observed to have amino acid conservation similar to the substrate motifs of well-known kinases from Phospho.ELM, a database containing kinase-specific phosphorylation data from multiple organisms.
Conclusions: This work presents a novel method for identifying plant phosphorylation sites with various substrate motifs. Based on cross-validation and independent testing, results show that the MDD-clustered models outperform models trained without using MDD. The proposed method has been implemented as a web-based plant phosphorylation prediction tool, PlantPhos (http://csb.cse.yzu.edu.tw/PlantPhos/). Additionally, two case studies have been demonstrated to further evaluate the effectiveness of PlantPhos.

Public Library of Science (PLOS)

Incorporating Distant Sequence Features and Radial Basis Function Networks to Identify Ubiquitin Conjugation Sites

Author: A Catic
A Hershko
A Zanzoni
AL Chernorudskiy
B Boeckmann
C Chothia
C-J Lin
CN Pang
CT Su
CW Tung
CY Ou
D Xie
DM Shien
DT Jones
GE Crooks
GZ Zhang
HM Berman
Hsin-Yi Hung
J Peng
JL Fauchere
K Bryson
K Ron
L Hicke
LJ McGuffin
M Charton
P Radivojac
R Grantham
S Ahmad
SA Chen
SF Altschul
SF Altschul
Shu-An Chen
T Gilon
TA Tatusova
TD Schneider
TL Bailey
Tzong-Yi Lee
V Vacic
Vladimir Uversky
Y-Y Ou
Yu-Yen Ou
YY Ou
Z Hu
ZR Yang
Publication venue: Public Library of Science
Publication date: 09/03/2011

Ubiquitin (Ub) is a small protein that consists of 76 amino acids and has a mass of about 8.5 kDa. In ubiquitin conjugation, ubiquitin is mainly conjugated to a lysine residue of the target protein by Ub-ligating (E3) enzymes. Three major enzymes participate in ubiquitin conjugation: E1, E2 and E3, which are responsible for activating, conjugating and ligating ubiquitin, respectively. Ubiquitin conjugation in eukaryotes is an important mechanism for the proteasome-mediated degradation of proteins and for regulating the activity of transcription factors. Motivated by the importance of ubiquitin conjugation in biological processes, this investigation develops a method, UbSite, which uses an efficient radial basis function (RBF) network to identify protein ubiquitin conjugation (ubiquitylation) sites. This work investigates not only the amino acid composition but also the structural characteristics, physicochemical properties, and evolutionary information of amino acids around ubiquitylation (Ub) sites. With reference to the pathway of ubiquitin conjugation, the substrate sites for E3 recognition, which are distant from ubiquitylation sites, are investigated. The measurement of F-score in a large window size (−20 to +20) revealed a statistically significant amino acid composition and position-specific scoring matrix (evolutionary information), which are mainly located distant from Ub sites. The distant information can be used effectively to differentiate Ub sites from non-Ub sites. As determined by five-fold cross-validation, the model that was trained using the combination of amino acid composition and evolutionary information performs best in identifying ubiquitin conjugation sites. The prediction sensitivity, specificity, and accuracy are 65.5%, 74.8%, and 74.5%, respectively. Although the amino acid sequences around the ubiquitin conjugation sites do not contain conserved motifs, the cross-validation result indicates that the integration of distant sequence features of Ub sites can improve predictive performance. Additionally, the independent test demonstrates that the proposed method can outperform other ubiquitylation prediction tools.
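
For readers unfamiliar with the three metrics quoted in the abstract above, the following is a minimal sketch of how sensitivity, specificity, and accuracy are conventionally derived from a binary confusion matrix. The function name and the counts are hypothetical and chosen only for illustration; they are not taken from the UbSite paper and do not reproduce its reported figures.

```python
def classification_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) as fractions.

    tp/fn: positive sites predicted correctly / missed.
    tn/fp: negative sites predicted correctly / wrongly flagged.
    """
    sensitivity = tp / (tp + fn)               # true positive rate
    specificity = tn / (tn + fp)               # true negative rate
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Hypothetical counts, for illustration only.
sens, spec, acc = classification_metrics(tp=80, fn=20, tn=150, fp=50)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} accuracy={acc:.3f}")
```

Because accuracy is a weighted average of sensitivity and specificity, with weights given by the class proportions, an accuracy close to the specificity (as in the abstract) is what one would expect when negative sites greatly outnumber positive ones.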

Supervised multivariate analysis of sequence groups to identify specificity determining residues

Author: A Carro
A del Sol Mesa
AC Culhane
AC Culhane
AR Fersht
CD Livingstone
CL Tucker
D Charif
Desmond G Higgins
DG Higgins
DH Morgan
E Beitz
F Pazos
G Casari
G Zhang
H Yao
HM Wilks
Iain M Wallace
J Thioulouse
JC Gower
JD Thompson
JG Henikoff
KM Mayer
L Yuan
LA Mirny
M Clamp
N Saitou
O Lichtarge
OV Kalinina
OV Kalinina
RC Gentleman
RD Finn
RJ Edwards
S Dolédec
S Henikoff
SJ Hubbard
SS Hannenhalli
TD Schneider
V Vacic
W Pirovano
WR Atchley
X Gu
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able to identify the residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specificities using multiple sequence alignments.
Results: We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases, consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids.
Conclusion: The overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples in this paper we used only 2 or 3 classes for demonstration purposes, but any number can be used and visualised.

Interpreting the role of de novo protein-coding mutations in neuropsychiatric disease

Author: A Goriely
A Hodgkinson
A Hodgkinson
A Kiezun
A Kong
AC Need
AC Need
AV Dharmadhikari
B Xu
B Xu
BJ O'Roak
BJ O'Roak
BM Neale
Bryan J Mowry
CJ Bell
DG MacArthur
E Vassos
GV Kryukov
H Najmabadi
HV Firth
I Iossifov
JA Tennessen
JA Veltman
Jacob Gratten
JM McClellan
JR Vermeesch
K Wang
LELM Vissers
MJ Bamshad
MR Nelson
MW State
Naomi R Wray
P Awadalla
P Green
P Lichtenstein
P Lichtenstein
Peter M Visscher
PF Sullivan
PF Sullivan
PM Krawitz
R Luo
RE Amir
SJ Sanders
SJ Sanders
SL Girard
T Klassen
V Vacic
X Zhao
Y Kim
Y Li
Publication venue: Springer Science and Business Media LLC
Publication date: 01/03/2013

Pedigree, linkage and association studies are consistent with heritable variation for complex disease due to the segregation of genetic factors in families and in the population. In contrast, de novo mutations make only minor contributions to heritability estimates for complex traits. Nonetheless, some de novo variants are known to be important in disease etiology. The identification of risk-conferring de novo variants will contribute to the discovery of etiologically relevant genes and pathways and may help in genetic counseling. There is considerable interest in the role of such mutations in complex neuropsychiatric disease, largely driven by new genotyping and sequencing technologies. An important role for large de novo copy number variations has been established. Recently, whole-exome sequencing has been used to extend the investigation of de novo variation to point mutations in protein-coding regions. Here, we consider several challenges for the interpretation of such mutations in the context of their role in neuropsychiatric disease.

University of Queensland eSpace

Rosetta FlexPepDock ab-initio: Simultaneous Folding, Docking and Refinement of Peptides onto Their Receptors

Author: A Stein
B Kuhlman
B Raveh
Barak Raveh
BR Chapados
C Hetenyi
C Katz
C Wang
CA Rohl
D Frishman
DM Fowler
DT Jones
E Petsalaki
E Petsalaki
G Moncalian
HM Berman
I Antes
I Buch
J Audie
J Guhaniyogi
JG Mandell
JJ Gray
K Abe
K Gehmlich
KL Morrison
L Parthasarathi
Lior Zimmerman
M Belitsky
M Burnier
M Hashemzadeh
M Rubinstein
MY Niv
N London
N London
Nir London
Ora Schueler-Furman
P Molek
P Vanhee
P Vanhee
P Vanhee
P Vlieghe
PA Prasad
PE Wright
R Brenke
R Das
RC Ladner
RL Dunbrack Jr
S Dutta
SA Gai
SS Sidhu
SW Crawley
T Kondo
T Pawson
U Zachariae
V Neduva
V Vacic
Vladimir N. Uversky
Y Li
YJ Im
Z Li
Publication venue: Public Library of Science
Publication date: 01/01/2011

Flexible peptides that fold upon binding to another protein molecule mediate a large number of regulatory interactions in the living cell and may provide highly specific recognition modules. We present Rosetta FlexPepDock ab-initio, a protocol for simultaneous docking and de-novo folding of peptides, starting from an approximate specification of the peptide binding site. Using the Rosetta fragments library and a coarse-grained structural representation of the peptide and the receptor, FlexPepDock ab-initio efficiently and simultaneously samples the space of possible peptide backbone conformations and rigid-body orientations over the receptor surface of a given binding site. The subsequent all-atom refinement of the coarse-grained models includes full side-chain modeling of both the receptor and the peptide, resulting in high-resolution models in which key side-chain interactions are recapitulated. The protocol was applied to a benchmark in which peptides were modeled over receptors in either their bound backbone conformations or in their free, unbound form. Near-native peptide conformations were identified in 18/26 of the bound cases and 7/14 of the unbound cases. The protocol performs well on peptides from various classes of secondary structures, including coiled peptides with unusual turns and kinks. The results presented here significantly extend the scope of state-of-the-art methods for high-resolution peptide modeling, which can now be applied to a wide variety of peptide-protein interactions where no prior information about the peptide backbone conformation is available, enabling detailed structure-based studies and manipulation of those interactions.

CiteSeerX

Public Library of Science (PLOS)