Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding

Andrew; Anfinsen; Aurora; Bacardit; Bang; Brent Wathen; Broome; Bu; Bédard; Chan; Chiti; Chou; Chou; Cohen; Colloc’h; Cootes; Costantini; Crasto; Daffner; Dasgupta; de Brevern; Dill; Dill; Dill; Doig; Doig; Dong; Dunker; Dunker; Eaton; Edgar; Englander; Ermolenko; Etchebest; Fernández-Recio; Fersht; Fetrow; Fink; Fonseca; Fooks; Galzitskaya; Gruebele; Gu; Gunasekaran; Guruprasad; Hutchinson; Hutchinson; Jiménez; Jones; Kabsch; Kapp; Karplus; Kauzmann; Klingler; Krantz; Kryshtafovych; Levinthal; Levitt; Lifson; Lifson; Liu; Luo; Mahalanobis; Maity; Mandel-Gutfreund; Marqusee; Miyazaki; Murphy; Muñoz; Nakashima; Noguchi; Onuchic; Pal; Penel; Penel; Presta; Richardson; Rigden; Romero; Rose; Rossmann; Sagermann; Santiveri; Schueler-Furman; Schwartz; Schwartz; Serrano; Serrano; Shannon; Shortle; Strait; Suyama; Swanson; Unger; Uversky; Vazquez; Ventura; Viguera; Vincent; von Heijne; Walther; Wang; Wang; Weiss; West; Wetlaufer; Wheelan; White; Wilmot; Wilson; Wolynes; Wouters; Wright; Xiong; Ye; Yon; Yoo; Zhu; Zongchao Jia

Folding by Numbers: Primary Sequence Statistics and Their Use in Studying Protein Folding

Authors: Andrew
Anfinsen
Aurora
Bacardit
Bang
Brent Wathen
Broome
Bu
Bédard
Chan
Chiti
Chou
Chou
Cohen
Colloc’h
Cootes
Costantini
Crasto
Daffner
Dasgupta
de Brevern
Dill
Dill
Dill
Doig
Doig
Dong
Dunker
Dunker
Eaton
Edgar
Englander
Ermolenko
Etchebest
Fernández-Recio
Fersht
Fetrow
Fink
Fonseca
Fooks
Galzitskaya
Gruebele
Gu
Gunasekaran
Guruprasad
Hutchinson
Hutchinson
Jiménez
Jones
Kabsch
Kapp
Karplus
Kauzmann
Klingler
Krantz
Kryshtafovych
Levinthal
Levitt
Lifson
Lifson
Liu
Luo
Mahalanobis
Maity
Mandel-Gutfreund
Marqusee
Miyazaki
Murphy
Muñoz
Nakashima
Noguchi
Onuchic
Pal
Penel
Penel
Presta
Richardson
Rigden
Romero
Rose
Rossmann
Sagermann
Santiveri
Schueler-Furman
Schwartz
Schwartz
Serrano
Serrano
Shannon
Shortle
Strait
Suyama
Swanson
Unger
Uversky
Vazquez
Ventura
Viguera
Vincent
von Heijne
Walther
Wang
Wang
Weiss
West
Wetlaufer
Wheelan
White
Wilmot
Wilson
Wolynes
Wouters
Wright
Xiong
Ye
Yon
Yoo
Zhu
Zongchao Jia
Publication date: 1 April 2009
Publisher: Molecular Diversity Preservation International (MDPI)
Doi

Abstract

The exponential growth over the past several decades in the quantity of both primary sequence data available and the number of protein structures determined has provided a wealth of information describing the relationship between protein primary sequence and tertiary structure. This growing repository of data has served as a prime source for statistical analysis, where underlying relationships between patterns of amino acids and protein structure can be uncovered. Here, we survey the main statistical approaches that have been used for identifying patterns within protein sequences, and discuss sequence pattern research as it relates to both secondary and tertiary protein structure. Limitations to statistical analyses are discussed, and a context for their role within the field of protein folding is given. We conclude by describing a novel statistical study of residue patterning in β-strands, which finds that hydrophobic (i,i+2) pairing in β-strands occurs more often than expected at locations near strand termini. Interpretations involving β-sheet nucleation and growth are discussed

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Directory of Open Access Journals

oai:doaj.org/article:dddafbfb2...

Last time updated on 17/12/2014

Crossref

Last time updated on 01/04/2019