50 research outputs found

HOW MUCH VALUE IS THERE IN A PRODUCER BRANDED BRED HEIFER PROGRAM?

Author: Gerlt Todd
Parcell Joseph L.
Patterson David J.
Randle Richard

Agricultural producers are pursuing many methods by which to add value. Typically, some type of change in commodity form is used to add value. However, value can also be added through intensive management practices, particularly in seedstock production. We investigated the brand premium to a producer-owned, quality-based bred heifer program. Results indicated that producers garner in excess of a $100/head premium, while potentially reducing future search/advertising costs through building brand loyalty.
Livestock Production/Industries

Research Papers in Economics

Evolutionarily Conserved Substrate Substructures for Automated Annotation of Enzyme Superfamilies

Author: A Aharoni
AE Todd
AG Murzin
Andrej Sali
AS Mildvan
C Kalyanaraman
C Steinbeck
CM Seibert
CS Riesenfeld
CT Porter
D Weininger
DJ Weininger
DM Schmidt
DM Schmidt
GL Holliday
HM Holden
I Friedberg
I Nobeli
I Schomburg
I Shah
J Barthelmes
JA Gerlt
JA Gerlt
JA Gerlt
JA Gerlt
JC Hermann
JC Hermann
JJ Diaz-Mejia
K Tipton
KA Frazer
KN Allen
L Holm
L Song
M Ashburner
M Bashton
M Kotera
MA Marti-Renom
ME Glasner
ME Glasner
MJ Bessman
MJ Keiser
N Nagano
NH Horowitz
NH Horowitz
NM O'Boyle
Patricia C. Babbitt
PC Babbitt
PC Babbitt
PC Babbitt
R Alves
RA Nagatani
Ranyee A. Chiang
Robert B. Russell
S Light
S Schmidt
SC Pegg
SC Rison
SD Copley
TL O'Loughlin
WR Pearson
Publication venue: Public Library of Science
Publication date: 01/08/2008

The evolution of enzymes affects how well a species can adapt to new environmental conditions. During enzyme evolution, certain aspects of molecular function are conserved while other aspects can vary. Aspects of function that are more difficult to change or that need to be reused in multiple contexts are often conserved, while those that vary may indicate functions that are more easily changed or that are no longer required. In analogy to the study of conservation patterns in enzyme sequences and structures, we have examined the patterns of conservation and variation in enzyme function by analyzing graph isomorphisms among enzyme substrates of a large number of enzyme superfamilies. This systematic analysis of substrate substructures establishes the conservation patterns that typify individual superfamilies. Specifically, we determined the chemical substructures that are conserved among all known substrates of a superfamily and the substructures that are reacting in these substrates and then examined the relationship between the two. Across the 42 superfamilies that were analyzed, substantial variation was found in how much of the conserved substructure is reacting, suggesting that superfamilies may not be easily grouped into discrete and separable categories. Instead, our results suggest that many superfamilies may need to be treated individually for analyses of evolution, function prediction, and guiding enzyme engineering strategies. Annotating superfamilies with these conserved and reacting substructure patterns provides information that is orthogonal to information provided by studies of conservation in superfamily sequences and structures, thereby improving the precision with which we can predict the functions of enzymes of unknown function and direct studies in enzyme engineering. 
Because the method is automated, it is suitable for large-scale characterization and comparison of fundamental functional capabilities of both characterized and uncharacterized enzyme superfamilies.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Target selection and annotation for the structural genomics of the amidohydrolase and enolase superfamilies

Author: A Andreeva
A Sakai
A Weeks
AE Todd
Andrej Sali
C Nowlan
CH Wu
CM Seibert
D Vitkup
DA Benson
DL Wheeler
EF Pettersen
F Melo
Frank M. Raushel
H Berman
HJ Imker
J Akana
J Gough
J Lee
J. Michael Sauder
JA Gerlt
JA Gerlt
JA Gerlt
JB Bonanno
JB Thoden
JC Hermann
JC Hermann
JC Norvell
JC Venter
JE Vick
JE Vick
Jeffrey B. Bonanno
Jennifer J. Seffernick
JF Rakus
JJ Irwin
John A. Gerlt
L Holm
L Song
L Williams
Libusha Kelly
Margaret E. Glasner
Mark R. Chance
Matthew P. Jacobson
ME Glasner
ME Glasner
ME Glasner
N Eswar
N Nagano
Narayanan Eswar
P Shannon
Patricia C. Babbitt
PC Babbitt
R Marti-Arbona
R Marti-Arbona
R Marti-Arbona
R Sanchez
R Tyagi
Ranyee Chiang
RS Hall
RZ Liao
SC Almo
SC Pegg
SD Brown
SF Altschul
Shoshana D. Brown
SL Schafer
Stephen K. Burley
Steven C. Almo
Subramanyam Swaminathan
TN Porter
TT Nguyen
U Pieper
Ursula Pieper
WS Yew
WS Yew
WS Yew
Xiaojing Zheng
Y Li
Publication venue: Springer Netherlands
Publication date: 01/01/2009

To study the substrate specificity of enzymes, we use the amidohydrolase and enolase superfamilies as model systems; members of these superfamilies share a common TIM barrel fold and catalyze a wide range of chemical reactions. Here, we describe a collaboration between the Enzyme Specificity Consortium (ENSPEC) and the New York SGX Research Center for Structural Genomics (NYSGXRC) that aims to maximize the structural coverage of the amidohydrolase and enolase superfamilies. Using sequence- and structure-based protein comparisons, we first selected 535 target proteins from a variety of genomes for high-throughput structure determination by X-ray crystallography; 63 of these targets were not previously annotated as superfamily members. To date, 20 unique amidohydrolase and 41 unique enolase structures have been determined, increasing the fraction of sequences in the two superfamilies that can be modeled based on at least 30% sequence identity from 45% to 73%. We present case studies of proteins related to uronate isomerase (an amidohydrolase superfamily member) and mandelate racemase (an enolase superfamily member) to illustrate how this structure-focused approach can be used to generate hypotheses about sequence–structure–function relationships.

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

The FGGY carbohydrate kinase family : insights into the evolution of functional specificities

Author: A Osterman
A Vendeville
Adam Godzik
AE Todd
AE Todd
AM Schnoes
Andrei Osterman
B Reva
BE Engelhardt
BG Magor
CA Bonner
CA Orengo
Christos A. Ouzounis
CM Seibert
D Grueninger
D Wu
DA Lee
DA Rodionov
E Di Luccio
G Casari
GE Crooks
HM Berman
I Letunic
Irina Rodionova
JA Capra
JA Capra
JA Gerlt
JH Hurley
JH Hurley
JI Yeh
K Sjolander
K Ye
KB Xavier
LA David
M Ormo
M Pachkov
ME Glasner
MN Price
MV Omelchenko
N Krishnamurthy
Olga Zagnitko
OV Kalinina
P Shannon
R Overbeek
RC Edgar
RC Edgar
RD Finn
RK Aziz
S Cheek
SS Hannenhalli
TA Tatusova
TT Nguyen
W-D Fessner
Y Zhang
Ying Zhang
Publication venue: Public Library of Science (PLoS)
Publication date: 01/12/2011

© The Author(s), 2011. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in PLoS Computational Biology 7 (2011): e1002318, doi:10.1371/journal.pcbi.1002318.

Function diversification in large protein families is a major mechanism driving expansion of cellular networks, providing organisms with new metabolic capabilities and thus adding to their evolutionary success. However, our understanding of the evolutionary mechanisms of functional diversity in such families is very limited, which, among many other reasons, is due to the lack of functionally well-characterized sets of proteins. Here, using the FGGY carbohydrate kinase family as an example, we built a confidently annotated reference set (CARS) of proteins by propagating experimentally verified functional assignments to a limited number of homologous proteins that are supported by their genomic and functional contexts. Then, we analyzed, on both the phylogenetic and the molecular levels, the evolution of different functional specificities in this family. The results show that the different functions (substrate specificities) encoded by FGGY kinases have emerged only once in the evolutionary history, following an apparently simple divergent evolutionary model. At the same time, on the molecular level, one isofunctional group (L-ribulokinase, AraB) evolved at least two independent solutions that employed distinct specificity-determining residues for the recognition of the same substrate (L-ribulose). Our analysis provides a detailed model of the evolution of the FGGY kinase family. It also shows that only combined molecular and phylogenetic approaches can help reconstruct a full picture of functional diversifications in such diverse families.

This study was funded by NIH and DOE grants.

Public Library of Science (PLOS)

Crossref

Woods Hole Open Access Server

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Quantitative sequence-function relationships in proteins based on gene ontology

Author: A Bairoch
A Bairoch
A Bateman
A Bateman
A Conesa
AE Todd
Arthur M Lesk
CA Wilson
CZ Cai
D Devos
D Devos
Daniel J Blankenberg
E Camon
EL Sonnhammer
J Piatigorsky
JA Gerlt
JA Ranea
JC Whisstock
K Fleming
L Holm
LB Koski
LJ Jensen
M Ashburner
M Shadidy
MA Andrade
MD Ganfornina
N Hulo
Naomi Altman
P Bork
R Karp
RA Laskowski
RA Laskowski
RC Edgar
S Jones
S Nakayama
SB Needleman
SE Brenner
SF Altschul
SR Eddy
SS Jeong
T Doerks
TF Smith
TK Attwood
Vineet Sangar
X Lu
Publication venue: BioMed Central
Publication date: 01/08/2007

Background: The relationship between divergence of amino-acid sequence and divergence of function among homologous proteins is complex. The assumption that homologs share function (the basis of transfer of annotations in databases) must therefore be regarded with caution. Here, we present a quantitative study of sequence and function divergence, based on the Gene Ontology classification of function. We determined the relationship between sequence divergence and function divergence in 6828 protein families from the PFAM database. Within families there is a broad range of sequence similarity, from very closely related proteins (for instance, orthologs in different mammals) to very distantly related proteins at the limit of reliable recognition of homology.
Results: We correlated the divergence in sequences determined from pairwise alignments, and the divergence in function determined by path lengths in the Gene Ontology graph, taking into account the fact that many proteins have multiple functions. Our results show that, among homologous proteins, the proportion of divergent functions decreases dramatically above a threshold of sequence similarity at about 50% residue identity. For proteins with more than 50% residue identity, transfer of annotation between homologs will lead to an erroneous attribution with a totally dissimilar function in fewer than 6% of cases. This means that for very similar proteins (about 50% identical residues) the chance of completely incorrect annotation is low; however, because of the phenomenon of recruitment, it is still non-zero.
Conclusion: Our results describe general features of the evolution of protein function, and serve as a guide to the reliability of annotation transfer, based on the closeness of the relationship between a new protein and its nearest annotated relative.
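
The two quantities this abstract correlates can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how residue identity is normalised or how path lengths are measured, so the sketch assumes identity is counted over aligned non-gap columns and that path length is the shortest undirected path between two terms. The function names and the toy term graph are hypothetical.

```python
from collections import deque
from typing import Dict, Iterable, Optional, Set, Tuple


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned, non-gap columns holding identical residues (assumed definition)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap character.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def build_graph(edges: Iterable[Tuple[str, str]]) -> Dict[str, Set[str]]:
    """Build an undirected adjacency map from (child, parent) term pairs."""
    graph: Dict[str, Set[str]] = {}
    for child, parent in edges:
        graph.setdefault(child, set()).add(parent)
        graph.setdefault(parent, set()).add(child)
    return graph


def path_length(graph: Dict[str, Set[str]], start: str, goal: str) -> Optional[int]:
    """Shortest number of edges between two terms, found by breadth-first search."""
    if start not in graph or goal not in graph:
        return None
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if node == goal:
            return dist
        for neighbour in graph[node]:
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return None


# Hypothetical toy ontology, not real Gene Ontology data.
TOY_EDGES = [
    ("binding", "molecular_function"),
    ("catalytic activity", "molecular_function"),
    ("kinase activity", "catalytic activity"),
    ("hydrolase activity", "catalytic activity"),
]

if __name__ == "__main__":
    toy_graph = build_graph(TOY_EDGES)
    print(percent_identity("ACDE-F", "ACDQGF"))
    print(path_length(toy_graph, "kinase activity", "hydrolase activity"))
```

Under these assumptions, a pair of homologs would contribute one point relating its percent identity to the path length between its annotated terms; handling proteins with multiple functions, which the abstract mentions, is left out of the sketch.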

Crossref

Directory of Open Access Journals

PubMed Central

Searching the protein structure database for ligand-binding site similarities using CPASS v.2

Author: A Herraez
A Mercier Kelly
A Schneider
A Stark
A Stark
Adam Caprez
AE Todd
Ashu Guru
B Rost
C Dessimoz
CD Livingstone
D Petrey
D Watson James
David Swanson
EF Pettersen
G Ausiello
G Lopez
H Hegyi
HM Berman
I Friedberg
I Sfiligoi
JA Gerlt
Jaime L Stark
JD Blake
Jennifer C Copeland
JL Stark
K Park
KA Mercier
L Lo Conte
M Litzkow
MD Shortridge
MJ Zvelebil
ND Gold
R Kolodny
R Pordes
R Powers
R Powers
R Powers
RL Tatusov
Robert Powers
S Henikoff
S Henikoff
SA Cammer
SJ Hubbard
T Triplet
V Sangar
W Zhang
YJ Huang
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: A recent analysis of protein sequences deposited in the NCBI RefSeq database indicates that ~8.5 million protein sequences are encoded in prokaryotic and eukaryotic genomes, of which ~30% are explicitly annotated as "hypothetical" or "uncharacterized" proteins. Our Comparison of Protein Active-Site Structures (CPASS v.2) database and software compares the sequence and structural characteristics of experimentally determined ligand-binding sites to infer a functional relationship in the absence of global sequence or structure similarity. CPASS is an important component of our Functional Annotation Screening Technology by NMR (FAST-NMR) protocol and has been successfully applied to aid the annotation of a number of proteins of unknown function.
Findings: We report a major upgrade to our CPASS software and database that significantly improves its broad utility. CPASS v.2 is designed with a layered architecture to increase flexibility and portability that also enables job distribution over the Open Science Grid (OSG) to increase speed. Similarly, the CPASS interface was enhanced to provide more user flexibility in submitting a CPASS query. CPASS v.2 now allows for both automatic and manual definition of ligand-binding sites and permits pair-wise, one versus all, one versus list, or list versus list comparisons. Solvent accessible surface area, ligand root-mean-square difference, and Cβ distances have been incorporated into the CPASS similarity function to improve the quality of the results. The CPASS database has also been updated.
Conclusions: CPASS v.2 is more than an order of magnitude faster than the original implementation, and allows for multiple simultaneous job submissions. Similarly, the CPASS database of ligand-defined binding sites has increased in size by ~38%, dramatically increasing the likelihood of a positive search result. The modification to the CPASS similarity function is effective in reducing CPASS similarity scores for false positives by ~30%, while leaving true positives unaffected. Importantly, receiver operating characteristic (ROC) curves demonstrate the high correlation between CPASS similarity scores and an accurate functional assignment. As indicated by distribution curves, scores ≥ 30% indicate a functional similarity. Software URL: http://cpass.unl.edu.

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A Measure of the Promiscuity of Proteins and Characteristics of Residues in the Vicinity of the Catalytic Site That Regulate Promiscuity

Promiscuity, the basis for the evolution of new functions through ‘tinkering’ of residues in the vicinity of the catalytic site, is yet to be quantitatively defined. We present a computational method, Promiscuity Indices Estimator (PROMISE), based on signatures derived from the spatial and electrostatic properties of the catalytic residues, to estimate the promiscuity (PromIndex) of proteins with known active site residues and 3D structure. PromIndex reflects the number of different active site signatures that have congruent matches in close proximity to the protein's native catalytic site, the quality of the matches, and the difference in the enzymatic activity. Promiscuity in proteins is observed to follow a lognormal distribution (μ = 0.28, σ = 1.1, reduced chi-square = 3.0E-5). The promiscuous functions predicted by PROMISE in any protein can serve as the starting point for directed evolution experiments. PROMISE ranks carboxypeptidase A and ribonuclease A amongst the more promiscuous proteins. We have also investigated the properties of the residues in the vicinity of the catalytic site that regulate its promiscuity. Linear regression establishes a weak correlation (R² ∼ 0.1) between certain properties of the residues (charge, polarity, etc.) in the neighborhood of the catalytic residues and PromIndex. A stronger relationship is that most proteins with high promiscuity have high percentages of charged and polar residues within a radius of 3 Å of the catalytic site, which is validated using one-tailed hypothesis tests (P-values ∼ 0.05). Since it is known that these characteristics are key factors in catalysis, their relationship with the promiscuity index cross-validates the methodology of PROMISE.
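
For reference, the standard lognormal density is written out below with the reported parameters substituted. This assumes that μ and σ are the mean and standard deviation of the natural logarithm of PromIndex, which the abstract does not state explicitly; the symbol x for a PromIndex value is introduced here for illustration.

\[
f(x) = \frac{1}{x\,\sigma\sqrt{2\pi}}\,\exp\!\left(-\frac{(\ln x - \mu)^{2}}{2\sigma^{2}}\right), \qquad x > 0, \quad \mu = 0.28, \quad \sigma = 1.1
\]

Under that assumption the median of the distribution would be \(e^{\mu} = e^{0.28} \approx 1.32\).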

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies

Due to the rapid release of new data from genome sequencing projects, the majority of protein sequences in public databases have not been experimentally characterized; rather, sequences are annotated using computational analysis. The level of misannotation and the types of misannotation in large public databases are currently unknown and have not been analyzed in depth. We have investigated the misannotation levels for molecular function in four public protein sequence databases (UniProtKB/Swiss-Prot, GenBank NR, UniProtKB/TrEMBL, and KEGG) for a model set of 37 enzyme families for which extensive experimental information is available. The manually curated database Swiss-Prot shows the lowest annotation error levels (close to 0% for most families); the two other protein sequence databases (GenBank NR and TrEMBL) and the protein sequences in the KEGG pathways database exhibit similar and surprisingly high levels of misannotation that average 5%–63% across the six superfamilies studied. For 10 of the 37 families examined, the level of misannotation in one or more of these databases is >80%. Examination of the NR database over time shows that misannotation has increased from 1993 to 2005. The types of misannotation that were found fall into several categories, most associated with “overprediction” of molecular function. These results suggest that misannotation in enzyme superfamilies containing multiple families that catalyze different reactions is a larger problem than has been recognized. Strategies are suggested for addressing some of the systematic problems contributing to these high levels of misannotation.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Functional and informatics analysis enables glycosyltransferase activity prediction

Author: A Roy
A Sánchez-Rodríguez
AE Todd
AE Todd
AM Cartwright
Benjamin G. Davis
BW Matthews
C Peneff
Charlie Fehl
D Dong
DANovel Learmonth
Dianna J. Bowles
DR Friedmann
EK Lim
EK Lim
Eng-Kiat Lim
F Sievers
Gideon J. Davies
GJ Davies
H Kries
H Shao
J Burns
J Flint
J Schmid
J Tomé-Carneiro
JA Gerlt
JD Thompson
Karen V. Lees
KC Harper
KM Backus
L Heide
LL Lairson
LV Modolo
LV Modolo
M Brazier-Hicks
M Kotera
M Udayakumar
M Weis
M Yang
M Yang
M Yang
Matthew G. Davidson
MC McLeod
Min Yang
MS Newton
P Emsley
P Rice
PI Mackenzie
RP Pandey
RS Turner
S Nembri
S Tyagi
S Zhao
SA Osmani
Sascha Venturelli
SC Johnson
Stephen J. Roberts
T Li
T Wang
TF Smith
TM Gloster
TN Kjaer
UM Unligil
V Law
V Lombard
W Offen
Wendy A. Offen
WHB Sauer
WR Pearson
WR Pearson
Y Li
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2018

The elucidation and prediction of how changes in a protein result in altered activities and selectivities remain a major challenge in chemistry. Two hurdles have prevented accurate family-wide models: obtaining (i) diverse datasets and (ii) suitable parameter frameworks that encapsulate activities in large sets. Here, we show that a relatively small but broad activity dataset is sufficient to train algorithms for functional prediction over the entire glycosyltransferase superfamily 1 (GT1) of the plant Arabidopsis thaliana. Whereas sequence analysis alone failed for GT1 substrate utilization patterns, our chemical–bioinformatic model, GT-Predict, succeeded by coupling physicochemical features with isozyme-recognition patterns over the family. GT-Predict identified GT1 biocatalysts for novel substrates and enabled functional annotation of uncharacterized GT1s. Finally, analyses of GT-Predict decision pathways revealed structural modulators of substrate recognition, thus providing information on mechanisms. This multifaceted approach to enzyme prediction may guide the streamlined utilization (and design) of biocatalysts and the discovery of other family-wide protein functions.

Crossref

Oxford University Research Archive

White Rose Research Online

A study of the quality and value improvements of Show-Me-Select heifers [abstract]

Author: Gerlt Todd
Publication venue: University of Missouri--Columbia. Office of Undergraduate Research
Publication date: 01/01/2004

Abstract only available
Faculty Mentor: Dr. Joe Parcell, Agricultural Systems Management
Missouri's largest source of agriculture revenue, the forage-based beef cattle industry, could become a bigger player in the state's total agriculture revenue and on-farm income with some industry modifications. As a result, the Department of Animal Science and the College of Veterinary Medicine, in coordination with the Department of Agriculture Economics, decided to develop the Show Me Select Heifer Program. The project assesses the revenue and cost structures of a branded heifer development program to determine its value to producers.

University of Missouri: MOspace