17 research outputs found
Prodepth: Predict Residue Depth by Support Vector Regression Approach from Protein Sequences Only
Residue depth (RD) is a solvent exposure measure that complements the information provided by conventional accessible surface area (ASA) and describes to what extent a residue is buried in the protein structure space. Previous studies have established that RD is correlated with several protein properties, such as protein stability, residue conservation and amino acid types. Accurate prediction of RD has many potentially important applications in the field of structural bioinformatics, for example, facilitating the identification of functionally important residues, or residues in the folding nucleus, or enzyme active sites from sequence information. In this work, we introduce an efficient approach that uses support vector regression to quantify the relationship between RD and protein sequence. We systematically investigated eight different sequence encoding schemes including both local and global sequence characteristics and examined their respective prediction performances. For the objective evaluation of our approach, we used 5-fold cross-validation to assess the prediction accuracies and showed that the overall best performance could be achieved with a correlation coefficient (CC) of 0.71 between the observed and predicted RD values and a root mean square error (RMSE) of 1.74, after incorporating the relevant multiple sequence features. The results suggest that residue depth could be reliably predicted solely from protein primary sequences: local sequence environments are the major determinants, while global sequence features could influence the prediction performance marginally. We highlight two examples as a comparison in order to illustrate the applicability of this approach. We also discuss the potential implications of this new structural parameter in the field of protein structure prediction and homology modeling. This method might prove to be a powerful tool for sequence analysis
Measurement of the Bottom-Strange Meson Mixing Phase in the Full CDF Data Set
We report a measurement of the bottom-strange meson mixing phase \beta_s
using the time evolution of B0_s -> J/\psi (->\mu+\mu-) \phi (-> K+ K-) decays
in which the quark-flavor content of the bottom-strange meson is identified at
production. This measurement uses the full data set of proton-antiproton
collisions at sqrt(s)= 1.96 TeV collected by the Collider Detector experiment
at the Fermilab Tevatron, corresponding to 9.6 fb-1 of integrated luminosity.
We report confidence regions in the two-dimensional space of \beta_s and the
B0_s decay-width difference \Delta\Gamma_s, and measure \beta_s in [-\pi/2,
-1.51] U [-0.06, 0.30] U [1.26, \pi/2] at the 68% confidence level, in
agreement with the standard model expectation. Assuming the standard model
value of \beta_s, we also determine \Delta\Gamma_s = 0.068 +- 0.026 (stat) +-
0.009 (syst) ps-1 and the mean B0_s lifetime, \tau_s = 1.528 +- 0.019 (stat) +-
0.009 (syst) ps, which are consistent and competitive with determinations by
other experiments.Comment: 8 pages, 2 figures, Phys. Rev. Lett 109, 171802 (2012
Positional Cloning of “Lisch-like”, a Candidate Modifier of Susceptibility to Type 2 Diabetes in Mice
In 404 Lepob/ob F2 progeny of a C57BL/6J (B6) x DBA/2J (DBA) intercross, we mapped a DBA-related quantitative trait locus (QTL) to distal Chr1 at 169.6 Mb, centered about D1Mit110, for diabetes-related phenotypes that included blood glucose, HbA1c, and pancreatic islet histology. The interval was refined to 1.8 Mb in a series of B6.DBA congenic/subcongenic lines also segregating for Lepob. The phenotypes of B6.DBA congenic mice include reduced β-cell replication rates accompanied by reduced β-cell mass, reduced insulin/glucose ratio in blood, reduced glucose tolerance, and persistent mild hypoinsulinemic hyperglycemia. Nucleotide sequence and expression analysis of 14 genes in this interval identified a predicted gene that we have designated “Lisch-like” (Ll) as the most likely candidate. The gene spans 62.7 kb on Chr1qH2.3, encoding a 10-exon, 646–amino acid polypeptide, homologous to Lsr on Chr7qB1 and to Ildr1 on Chr16qB3. The largest isoform of Ll is predicted to be a transmembrane molecule with an immunoglobulin-like extracellular domain and a serine/threonine-rich intracellular domain that contains a 14-3-3 binding domain. Morpholino knockdown of the zebrafish paralog of Ll resulted in a generalized delay in endodermal development in the gut region and dispersion of insulin-positive cells. Mice segregating for an ENU-induced null allele of Ll have phenotypes comparable to the B.D congenic lines. The human ortholog, C1orf32, is in the middle of a 30-Mb region of Chr1q23-25 that has been repeatedly associated with type 2 diabetes
Coronavirus Gene 7 Counteracts Host Defenses and Modulates Virus Virulence
Transmissible gastroenteritis virus (TGEV) genome contains three accessory genes: 3a, 3b and 7. Gene 7 is only present in members of coronavirus genus a1, and encodes a hydrophobic protein of 78 aa. To study gene 7 function, a recombinant TGEV virus lacking gene 7 was engineered (rTGEV-Δ7). Both the mutant and the parental (rTGEV-wt) viruses showed the same growth and viral RNA accumulation kinetics in tissue cultures. Nevertheless, cells infected with rTGEV-Δ7 virus showed an increased cytopathic effect caused by an enhanced apoptosis mediated by caspase activation. Macromolecular synthesis analysis showed that rTGEV-Δ7 virus infection led to host translational shut-off and increased cellular RNA degradation compared with rTGEV-wt infection. An increase of eukaryotic translation initiation factor 2 (eIF2α) phosphorylation and an enhanced nuclease, most likely RNase L, activity were observed in rTGEV-Δ7 virus infected cells. These results suggested that the removal of gene 7 promoted an intensified dsRNA-activated host antiviral response. In protein 7 a conserved sequence motif that potentially mediates binding to protein phosphatase 1 catalytic subunit (PP1c), a key regulator of the cell antiviral defenses, was identified. We postulated that TGEV protein 7 may counteract host antiviral response by its association with PP1c. In fact, pull-down assays demonstrated the interaction between TGEV protein 7, but not a protein 7 mutant lacking PP1c binding motif, with PP1. Moreover, the interaction between protein 7 and PP1 was required, during the infection, for eIF2α dephosphorylation and inhibition of cell RNA degradation. Inoculation of newborn piglets with rTGEV-Δ7 and rTGEV-wt viruses showed that rTGEV-Δ7 virus presented accelerated growth kinetics and pathology compared with the parental virus. Overall, the results indicated that gene 7 counteracted host cell defenses, and modified TGEV persistence increasing TGEV survival. Therefore, the acquisition of gene 7 by the TGEV genome most likely has provided a selective advantage to the virus
HFSP: High speed homology-driven function annotation of proteins.
Motivation: The rapid drop in sequencing costs has produced many more (predicted) protein sequences than can feasibly be functionally annotated with wet-lab experiments. Thus, many computational methods have been developed for this purpose. Most of these methods employ homology-based inference, approximated via sequence alignments, to transfer functional annotations between proteins. The increase in the number of available sequences, however, has drastically increased the search space, thus significantly slowing down alignment methods. Results: Here we describe homology-derived functional similarity of proteins (HFSP), a novel computational method that uses results of a high-speed alignment algorithm, MMseqs2, to infer functional similarity of proteins on the basis of their alignment length and sequence identity. We show that our method is accurate (85% precision) and fast (more than 40-fold speed increase over stateof-the-art). HFSP can help correct at least a 16% error in legacy curations, even for a resource of as high quality as Swiss-Prot. These findings suggest HFSP as an ideal resource for large-scale functional annotation efforts
Comprehensive in silico mutagenesis highlights functionally important residues in proteins
Motivation: Mutating residues into alanine (alanine scanning) is one of the fastest experimental means of probing hypotheses about protein function. Alanine scans can reveal functional hot spots, i.e. residues that alter function upon mutation. In vitro mutagenesis is cumbersome and costly: probing all residues in a protein is typically as impossible as substituting by all non-native amino acids. In contrast, such exhaustive mutagenesis is feasible in silico