578 research outputs found
Complete amino acid sequences of variable regions of two human IgM rheumatoid factors, BOR and KAS of the Wa idiotypic family, reveal restricted use of heavy and light chain variable and joining region gene segments.
Evidence derived from the complete amino acid sequences of the variable regions of both the heavy and light chains of two members (BOR and KAS) of the Wa idiotypic family of human rheumatoid factors suggests that not only are the light chains of these molecules derived from possibly one variable region gene segment, but the heavy chain variable regions are all derived from the VHI subgroup of human V region genes. These molecules exhibit a surprising conservation in the size of D region, and all use the JH4 gene element. This restriction in use of VL, VH, D, and JH suggests all of these elements may play a crucial role in either antigen binding and/or expression of the crossreactive idiotype
ProteinHistorian: Tools for the Comparative Analysis of Eukaryote Protein Origin
The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are freely available under the GNU public license at http://lighthouse.ucsf.edu/ProteinHistorian/
Mechanisms for the Evolution of a Derived Function in the Ancestral Glucocorticoid Receptor
Understanding the genetic, structural, and biophysical mechanisms that caused protein functions to evolve is a central goal of molecular evolutionary studies. Ancestral sequence reconstruction (ASR) offers an experimental approach to these questions. Here we use ASR to shed light on the earliest functions and evolution of the glucocorticoid receptor (GR), a steroid-activated transcription factor that plays a key role in the regulation of vertebrate physiology. Prior work showed that GR and its paralog, the mineralocorticoid receptor (MR), duplicated from a common ancestor roughly 450 million years ago; the ancestral functions were largely conserved in the MR lineage, but the functions of GRs—reduced sensitivity to all hormones and increased selectivity for glucocorticoids—are derived. Although the mechanisms for the evolution of glucocorticoid specificity have been identified, how reduced sensitivity evolved has not yet been studied. Here we report on the reconstruction of the deepest ancestor in the GR lineage (AncGR1) and demonstrate that GR's reduced sensitivity evolved before the acquisition of restricted hormone specificity, shortly after the GR–MR split. Using site-directed mutagenesis, X-ray crystallography, and computational analyses of protein stability to recapitulate and determine the effects of historical mutations, we show that AncGR1's reduced ligand sensitivity evolved primarily due to three key substitutions. Two large-effect mutations weakened hydrogen bonds and van der Waals interactions within the ancestral protein, reducing its stability. The degenerative effect of these two mutations is extremely strong, but a third permissive substitution, which has no apparent effect on function in the ancestral background and is likely to have occurred first, buffered the effects of the destabilizing mutations. Taken together, our results highlight the potentially creative role of substitutions that partially degrade protein structure and function and reinforce the importance of permissive mutations in protein evolution
Automatic prediction of catalytic residues by modeling residue structural neighborhood
Background: Prediction of catalytic residues is a major step in characterizing the function of enzymes. In its simpler formulation, the problem can be cast into a binary classification task at the residue level, by predicting whether the residue is directly involved in the catalytic process. The task is quite hard also when structural information is available, due to the rather wide range of roles a functional residue can play and to the large imbalance between the number of catalytic and non-catalytic residues.Results: We developed an effective representation of structural information by modeling spherical regions around candidate residues, and extracting statistics on the properties of their content such as physico-chemical properties, atomic density, flexibility, presence of water molecules. We trained an SVM classifier combining our features with sequence-based information and previously developed 3D features, and compared its performance with the most recent state-of-the-art approaches on different benchmark datasets. We further analyzed the discriminant power of the information provided by the presence of heterogens in the residue neighborhood.Conclusions: Our structure-based method achieves consistent improvements on all tested datasets over both sequence-based and structure-based state-of-the-art approaches. Structural neighborhood information is shown to be responsible for such results, and predicting the presence of nearby heterogens seems to be a promising direction for further improvements.Journal ArticleResearch Support, N.I.H. Extramuralinfo:eu-repo/semantics/publishe
Using Multiple Microenvironments to Find Similar Ligand-Binding Sites: Application to Kinase Inhibitor Binding
The recognition of cryptic small-molecular binding sites in protein structures is important for understanding off-target side effects and for recognizing potential new indications for existing drugs. Current methods focus on the geometry and detailed chemical interactions within putative binding pockets, but may not recognize distant similarities where dynamics or modified interactions allow one ligand to bind apparently divergent binding pockets. In this paper, we introduce an algorithm that seeks similar microenvironments within two binding sites, and assesses overall binding site similarity by the presence of multiple shared microenvironments. The method has relatively weak geometric requirements (to allow for conformational change or dynamics in both the ligand and the pocket) and uses multiple biophysical and biochemical measures to characterize the microenvironments (to allow for diverse modes of ligand binding). We term the algorithm PocketFEATURE, since it focuses on pockets using the FEATURE system for characterizing microenvironments. We validate PocketFEATURE first by showing that it can better discriminate sites that bind similar ligands from those that do not, and by showing that we can recognize FAD-binding sites on a proteome scale with Area Under the Curve (AUC) of 92%. We then apply PocketFEATURE to evolutionarily distant kinases, for which the method recognizes several proven distant relationships, and predicts unexpected shared ligand binding. Using experimental data from ChEMBL and Ambit, we show that at high significance level, 40 kinase pairs are predicted to share ligands. Some of these pairs offer new opportunities for inhibiting two proteins in a single pathway
Identifying and Characterizing a Novel Protein Kinase STK35L1 and Deciphering Its Orthologs and Close-Homologs in Vertebrates
The human kinome containing 478 eukaryotic protein kinases has over 100 uncharacterized kinases with unknown substrates and biological functions. The Ser/Thr kinase 35 (STK35, Clik1) is a member of the NKF 4 (New Kinase Family 4) in the kinome with unknown substrates and biological functions. Various high throughput studies indicate that STK35 could be involved in various human diseases such as colorectal cancer and malaria. In this study, we found that the previously published coding sequence of the STK35 gene is incomplete. The newly identified sequence of the STK35 gene codes for a protein of 534 amino acids with a N-terminal elongation of 133 amino acids. It has been designated as STK35L (STK35 long). Since it is the first of further homologous kinases we termed it as STK35L1. The STK35L1 protein (58 kDa on SDS-PAGE), but not STK35 (44 kDa), was found to be expressed in all human cells studied (endothelial cells, HeLa, and HEK cells) and was down-regulated after silencing with specific siRNA. EGFP-STK35L1 was localized in the nucleus and the nucleolus. By combining syntenic and gene structure pattern data and homology searches, two further STK35L1 homologs, STK35L2 (previously known as PDIK1L) and STK35L3, were found. All these protein kinase homologs were conserved throughout the vertebrates. The STK35L3 gene was specifically lost during placental mammalian evolution. Using comparative genomics, we have identified orthologous sets of these three protein kinases genes and their possible ancestor gene in two sea squirt genomes. We found the full-length coding sequence of the STK35 gene and termed it as STK35L1. We identified a new third STK35-like gene, STK35L3, in vertebrates and a possible ancestor gene in sea squirt genome. This study will provide a comprehensive platform to explore the role of STK35L kinases in cell functions and human diseases
Search for rare quark-annihilation decays, B --> Ds(*) Phi
We report on searches for B- --> Ds- Phi and B- --> Ds*- Phi. In the context
of the Standard Model, these decays are expected to be highly suppressed since
they proceed through annihilation of the b and u-bar quarks in the B- meson.
Our results are based on 234 million Upsilon(4S) --> B Bbar decays collected
with the BABAR detector at SLAC. We find no evidence for these decays, and we
set Bayesian 90% confidence level upper limits on the branching fractions BF(B-
--> Ds- Phi) Ds*- Phi)<1.2x10^(-5). These results
are consistent with Standard Model expectations.Comment: 8 pages, 3 postscript figues, submitted to Phys. Rev. D (Rapid
Communications
G-Quadruplex DNA Sequences Are Evolutionarily Conserved and Associated with Distinct Genomic Features in Saccharomyces cerevisiae
G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs) across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs). Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint
Roles of residues in the interface of transient protein-protein complexes before complexation
Transient protein-protein interactions play crucial roles in all facets of cellular physiology. Here, using an analysis on known 3-D structures of transient protein-protein complexes, their corresponding uncomplexed forms and energy calculations we seek to understand the roles of protein-protein interfacial residues in the unbound forms. We show that there are conformationally near invariant and evolutionarily conserved interfacial residues which are rigid and they account for ∼65% of the core interface. Interestingly, some of these residues contribute significantly to the stabilization of the interface structure in the uncomplexed form. Such residues have strong energetic basis to perform dual roles of stabilizing the structure of the uncomplexed form as well as the complex once formed while they maintain their rigid nature throughout. This feature is evolutionarily well conserved at both the structural and sequence levels. We believe this analysis has general bearing in the prediction of interfaces and understanding molecular recognition
- …