Search CORE

586 research outputs found

Composition Profiler: a tool for discovery and visualization of amino acid composition differences

Author: Dunker A Keith
Lonardi Stefano
Uversky Vladimir N
Vacic Vladimir
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Composition Profiler is a web-based tool for semi-automatic discovery of enrichment or depletion of amino acids, either individually or grouped by their physico-chemical or structural properties. Results The program takes two samples of amino acids as input: a query sample and a reference sample. The latter provides a suitable background amino acid distribution, and should be chosen according to the nature of the query sample, for example, a standard protein database (e.g. SwissProt, PDB), a representative sample of proteins from the organism under study, or a group of proteins with a contrasting functional annotation. The results of the analysis of amino acid composition differences are summarized in textual and graphical form. Conclusion As an exploratory data mining tool, our software can be used to guide feature selection for protein function or structure predictors. For classes of proteins with significant differences in frequencies of amino acids having particular physico-chemical (e.g. hydrophobicity or charge) or structural (e.g. α helix propensity) properties, Composition Profiler can be used as a rough, light-weight visual classifier.</p

Crossref

USFSP Digital Archive

IUPUIScholarWorks

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Scholar Commons - University of South Florida

Disease-Associated Mutations Disrupt Functionally Important Regions of Intrinsic Protein Disorder

Author: Haynes C.
Iakoucheva L. M.
Markwick P. R. L.
Oldfield C. J.
Uversky V. N.
Vacic V.
Zhao X.
Publication venue
Publication date: 01/01/2012
Field of study

The effects of disease mutations on protein structure and function have been extensively investigated, and many predictors of the functional impact of single amino acid substitutions are publicly available. The majority of these predictors are based on protein structure and evolutionary conservation, following the assumption that disease mutations predominantly affect folded and conserved protein regions. However, the prevalence of the intrinsically disordered proteins (IDPs) and regions (IDRs) in the human proteome together with their lack of fixed structure and low sequence conservation raise a question about the impact of disease mutations in IDRs. Here, we investigate annotated missense disease mutations and show that 21.7% of them are located within such intrinsically disordered regions. We further demonstrate that 20% of disease mutations in IDRs cause local disorder-to-order transitions, which represents a 1.7–2.7 fold increase compared to annotated polymorphisms and neutral evolutionary substitutions, respectively. Secondary structure predictions show elevated rates of transition from helices and strands into loops and vice versa in the disease mutations dataset. Disease disorder-to-order mutations also influence predicted molecular recognition features (MoRFs) more often than the control mutations. The repertoire of disorder-to-order transition mutations is limited, with five most frequent mutations (R→W, R→C, E→K, R→H, R→Q) collectively accounting for 44% of all deleterious disorder-to-order transitions. As a proof of concept, we performed accelerated molecular dynamics simulations on a deleterious disorder-to-order transition mutation of tumor protein p63 and, in agreement with our predictions, observed an increased α-helical propensity of the region harboring the mutation. Our findings highlight the importance of mutations in IDRs and refine the traditional structure-centric view of disease mutations. The results of this study offer a new perspective on the role of mutations in disease, with implications for improving predictors of the functional impact of missense mutations

Public Library of Science (PLOS)

USFSP Digital Archive

Cold Spring Harbor Laboratory Institutional Repository

Directory of Open Access Journals

PubMed Central

Scholar Commons - University of South Florida

FigShare

Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

Author: Altschul
Andreatta
Colaert
Crooks
Fujii
Henikoff
Henikoff
Hobohm
Kullback
Martin Christen Frølund Thomsen
Morten Nielsen
Nielsen
Porter
Rammensee
Schneider
Shannon
Vacic
Workman
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2012
Field of study

Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed)

Crossref

PubMed Central

Online Research Database In Technology

DisProt: the Database of Disordered Proteins

Author: Chen Jake
Cortese Marc S.
Dunker A. Keith
Hamilton Justin A.
LeGall Tanguy
Obradovic Zoran
Sickmeier Megan
Szabo Beata
Tantos Agnes
Tompa Peter
Uversky Vladimir N.
Vacic Vladimir
Publication venue: Oxford University Press
Publication date: 01/12/2006
Field of study

The Database of Protein Disorder (DisProt) links structure and function information for intrinsically disordered proteins (IDPs). Intrinsically disordered proteins do not form a fixed three-dimensional structure under physiological conditions, either in their entireties or in segments or regions. We define IDP as a protein that contains at least one experimentally determined disordered region. Although lacking fixed structure, IDPs and regions carry out important biological functions, being typically involved in regulation, signaling and control. Such functions can involve high-specificity low-affinity interactions, the multiple binding of one protein to many partners and the multiple binding of many proteins to one partner. These three features are all enabled and enhanced by protein intrinsic disorder. One of the major hindrances in the study of IDPs has been the lack of organized information. DisProt was developed to enable IDP research by collecting and organizing knowledge regarding the experimental characterization and the functional associations of IDPs. In addition to being a unique source of biological information, DisProt opens doors for a plethora of bioinformatics studies. DisProt is openly available at

CiteSeerX

Crossref

USFSP Digital Archive

PubMed Central

Scholar Commons - University of South Florida

Anatomy of protein disorder, flexibility and disease-related mutations.

Author: Adzhubei
Al-Numair
Babu
Bandaranayake
Bromberg
Camacho
Capriotti
Cilia
Consortium
de Beer
Dobbins
Dunker
Finn
Fong
Forbes
Fornili
Haling
Hamosh
Holderfield
Hu
Iakoucheva
Kelley
Kircher
Kosciolek
Ling
Lu
Marino
Monastyrskyy
Mosca
Necsulea
Nishi
Pajkos
Pandini
Pires
Punta
R Core Team
Reva
Satoh
Scharner
Schuster-Böckler
Sherry
Shihab
Stefl
Studer
Thevakumaran
Thomas
Uversky
Uversky
Uversky
Vacic
Vacic
Vogelstein
Wang
Ward
Winter
Wolfe
Wright
Yates
Yates
Yue
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2015
Field of study

Integration of protein structural information with human genetic variation and pathogenic mutations is essential to understand molecular mechanisms associated with the effects of polymorphisms on protein interactions and cellular processes. We investigate occurrences of non-synonymous SNPs in ordered and disordered protein regions by systematic mapping of common variants and disease-related SNPs onto these regions. We show that common variants accumulate in disordered regions; conversely pathogenic variants are significantly depleted in disordered regions. These different occurrences of pathogenic and common SNPs can be attributed to a negative selection on random mutations in structurally highly constrained regions. New approaches in the study of quantitative effects of pathogenic-related mutations should effectively account for all the possible contexts and relative functional constraints in which the sequence variation occurs.This research was supported by the Biotechnology and Biological Sciences Research Council (BB/H018409/1 to FF), the British Heart Foundation (FS/12/41/29724 to AF and FF) and the Leukaemia & Lymphoma Research (to FF). SSC is funded by a Leukaemia & Lymphoma Research Gordon Piller PhD Studentship

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

Queen Mary Research Online

King's Research Portal

Genome-wide mapping of IBD segments in an Ashkenazi PD cohort identifies associated haplotypes

Author: +10 additional authors
Bar-Shira A.
Clark L. N.
Gana-Weisz M.
Guha S.
Gurevich T.
Gusev A.
Lencz T.
Orr-Urtreger A.
Ozelius L. J.
Vacic V.
Publication venue: Donald and Barbara Zucker School of Medicine Academic Works
Publication date: 01/01/2014
Field of study

The recent series of large genome-wide association studies in European and Japanese cohorts established that Parkinson disease (PD) has a substantial genetic component. To further investigate the genetic landscape of PD, we performed a genome-wide scan in the largest to date Ashkenazi Jewish cohort of 1130 Parkinson patients and 2611 pooled controls. Motivated by the reduced disease allele heterogeneity and a high degree of identical-by-descent (IBD) haplotype sharing in this founder population, we conducted a haplotype association study based on mapping of shared IBD segments. We observed significant haplotype association signals at three previously implicated Parkinson loci: LRRK2 (OR = 12.05, P = 1.23 x 10(-56)), MAPT (OR = 0.62, P = 1.78 x 10(-11)) and GBA (multiple distinct haplotypes, OR \u3e 8.28, P = 1.13 x 10(-11) and OR = 2.50, P = 1.22 x 10(-9)). In addition, we identified a novel association signal on chr2q14.3 coming from a rare haplotype (OR = 22.58, P = 1.21 x 10(-10)) and replicated it in a secondary cohort of 306 Ashkenazi PD cases and 2583 controls. Our results highlight the power of our haplotype association method, particularly useful in studies of founder populations, and reaffirm the benefits of studying complex diseases in Ashkenazi Jewish cohorts

Hofstra Northwell Academic Works (Hofstra Northwell School of Medicine)

FAAST: Flow-space Assisted Alignment Search Tool

Author: Bengt Persson
Björn Andersson
DJ Lipman
Fredrik Lysholm
J Jerlström-Hultqvist
M Droege
M Margulies
MO Dayhoff
O Gotoh
R Kofler
S Balzer
SB Needleman
SF Altschul
SF Altschul
TF Smith
V Vacic
WR Pearson
Z Ning
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background High throughput pyrosequencing (454 sequencing) is the major sequencing platform for producing long read high throughput data. While most other sequencing techniques produce reading errors mainly comparable with substitutions, pyrosequencing produce errors mainly comparable with gaps. These errors are less efficiently detected by most conventional alignment programs and may produce inaccurate alignments. Results We suggest a novel algorithm for calculating the optimal local alignment which utilises flowpeak information in order to improve alignment accuracy. Flowpeak information can be retained from a 454 sequencing run through interpretation of the binary SFF-file format. This novel algorithm has been implemented in a program named FAAST (Flow-space Assisted Alignment Search Tool). Conclusions We present and discuss the results of simulations that show that FAAST, through the use of the novel algorithm, can gain several percentage points of accuracy compared to Smith-Waterman-Gotoh alignments, depending on the 454 data quality. Furthermore, through an efficient multi-thread aware implementation, FAAST is able to perform these high quality alignments at high speed. The tool is available at <url>http://www.ifm.liu.se/bioinfo/</url></p

Publikationer från Linköpings universitet

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

FAAST: Flow-space Assisted Alignment Search Tool

Author: Fredrik Lysholm
Björn Andersson
Bengt Persson
M Margulies
M Droege
SB Needleman
TF Smith
O Gotoh
DJ Lipman
WR Pearson
SF Altschul
SF Altschul
MO Dayhoff
V Vacic
R Kofler
S Balzer
J Jerlström-Hultqvist
Z Ning
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Publikationer från Linköpings universitet

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Aston Publications Explorer

Digitala Vetenskapliga Arkivet - Academic Archive On-line

The SH2 Domain Interaction Landscape

Author: Anderson
Blagoev
Blom
Brandt
Ceol
Ceol
Chatr-Aryamontri
Chatr-Aryamontri
Cox
Diella
Dosztányi
Ernst
Flicek
Frank
Gfeller
Gong
Hanke
Hermjakob
Holgado-Madruga
Hornbeck
Hornbeck
Huang
Jones
Kiemer
Li
Licata
Linding
Liu
Liu
Machida
Marengere
Miller
Moran
Nielsen
Olsen
Panni
Pawson
Persico
Santonico
Schulze
Songyang
Su
Vacic
Vastrik
Wenschuh
Wiśniewski
Yaffe
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

Members of the SH2 domain family modulate signal transduction by binding to short peptides containing phosphorylated tyrosines. Each domain displays a distinct preference for the sequence context of the phosphorylated residue. We have developed a high-density peptide chip technology that allows for probing of the affinity of most SH2 domains for a large fraction of the entire complement of tyrosine phosphopeptides in the human proteome. Using this technique, we have experimentally identified thousands of putative SH2-peptide interactions for more than 70 different SH2 domains. By integrating this rich data set with orthogonal context-specific information, we have assembled an SH2-mediated probabilistic interaction network, which we make available as a community resource in the PepspotDB database. A predicted dynamic interaction between the SH2 domains of the tyrosine phosphatase SHP2 and the phosphorylated tyrosine in the extracellular signal-regulated kinase activation loop was validated by experiments in living cells

CiteSeerX

Elsevier - Publisher Connector

Crossref

Directory of Open Access Journals

Copenhagen University Research Information System

PubMed Central

ART

Online Research Database In Technology

MPG.PuRe