The Plant Proteome Folding Project: Structure and Positive Selection in Plant Protein Families Genome Biology and Evolution Advance Downloaded from

Abstract

ABSTRACT Despite its importance, relatively little is known about the relationship between the structure, function and evolution of proteins, particularly in land plant species. We have developed a database with predicted protein domains for five plant proteomes (http://pfp.bio.nyu.edu), and used both protein structural fold-recognition and de novo Rosettabased protein structure prediction to predict protein structure for Arabidopsis and rice proteins. Based on sequence similarity, we have identified ~15,000 orthologous/paralogous protein family clusters among these species, and used codon-based models to predict positive selection in protein evolution within 175 of these sequence clusters. Our results show that codons that display positive selection appears to be less frequent in helical and strand regions, and are overrepresented in amino acid residues that are associated with a change in protein secondary structure. Like in other organisms, disordered protein regions also appear to have more selected sites. Structural information provides new functional insights into specific plant proteins and allows us to map positively selected amino acid sites onto protein structures and view these sites in a structural and functional context

    Similar works

    Full text

    thumbnail-image

    Available Versions