980 research outputs found

    Backbone 1H, 13C, and 15N resonance assignments for lysozyme from bacteriophage lambda

    Get PDF
    Lysozyme from lambda bacteriophage (λ lysozyme) is an 18 kDa globular protein displaying some of the structural features common to all lysozymes; in particular, λ lysozyme consists of two structural domains connected by a helix, and has its catalytic residues located at the interface between these two domains. An interesting feature of λ lysozyme, when compared to the well-characterised hen egg-white lysozyme, is its lack of disulfide bridges; this makes λ lysozyme an interesting system for studies of protein folding. A comparison of the folding properties of λ lysozyme and hen lysozyme will provide important insights into the role that disulfide bonds play in the refolding pathway of the latter protein. Here we report the 1H, 13C and 15N backbone resonance assignments for λ lysozyme by heteronuclear multidimensional NMR spectroscopy. These assignments provide the starting point for detailed investigation of the refolding pathway using pulse-labelling hydrogen/deuterium exchange experiments monitored by NMR

    From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions

    Get PDF
    ©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.doi:10.1371/journal.pcbi.1000341DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Ca deviation from native is up to 5 Å from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein

    Prediction of nuclear proteins using SVM and HMM models

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The nucleus, a highly organized organelle, plays important role in cellular homeostasis. The nuclear proteins are crucial for chromosomal maintenance/segregation, gene expression, RNA processing/export, and many other processes. Several methods have been developed for predicting the nuclear proteins in the past. The aim of the present study is to develop a new method for predicting nuclear proteins with higher accuracy.</p> <p>Results</p> <p>All modules were trained and tested on a non-redundant dataset and evaluated using five-fold cross-validation technique. Firstly, Support Vector Machines (SVM) based modules have been developed using amino acid and dipeptide compositions and achieved a Mathews correlation coefficient (MCC) of 0.59 and 0.61 respectively. Secondly, we have developed SVM modules using split amino acid compositions (SAAC) and achieved the maximum MCC of 0.66. Thirdly, a hidden Markov model (HMM) based module/profile was developed for searching exclusively nuclear and non-nuclear domains in a protein. Finally, a hybrid module was developed by combining SVM module and HMM profile and achieved a MCC of 0.87 with an accuracy of 94.61%. This method performs better than the existing methods when evaluated on blind/independent datasets. Our method estimated 31.51%, 21.89%, 26.31%, 25.72% and 24.95% of the proteins as nuclear proteins in <it>Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster</it>, mouse and human proteomes respectively. Based on the above modules, we have developed a web server NpPred for predicting nuclear proteins <url>http://www.imtech.res.in/raghava/nppred/</url>.</p> <p>Conclusion</p> <p>This study describes a highly accurate method for predicting nuclear proteins. SVM module has been developed for the first time using SAAC for predicting nuclear proteins, where amino acid composition of N-terminus and the remaining protein were computed separately. In addition, our study is a first documentation where exclusively nuclear and non-nuclear domains have been identified and used for predicting nuclear proteins. The performance of the method improved further by combining both approaches together.</p

    Physical activity and clustered cardiovascular disease risk factors in young children: a cross-sectional study (the IDEFICS study)

    Get PDF
    &lt;p&gt;Background The relevance of physical activity (PA) for combating cardiovascular disease (CVD) risk in children has been highlighted, but to date there has been no large-scale study analyzing that association in children aged &#8804;9 years of age. This study sought to evaluate the associations between objectively-measured PA and clustered CVD risk factors in a large sample of European children, and to provide evidence for gender-specific recommendations of PA.&lt;/p&gt; &lt;p&gt;Methods Cross-sectional data from a longitudinal study in 16,224 children aged 2 to 9 were collected. Of these, 3,120 (1,016 between 2 to 6 years, 2,104 between 6 to 9 years) had sufficient data for inclusion in the current analyses. Two different age-specific and gender-specific clustered CVD risk scores associated with PA were determined. First, a CVD risk factor (CRF) continuous score was computed using the following variables: systolic blood pressure (SBP), total triglycerides (TG), total cholesterol (TC)/high-density lipoprotein cholesterol (HDL-c) ratio, homeostasis model assessment of insulin resistance (HOMA-IR), and sum of two skinfolds (score CRFs). Secondly, another CVD risk score was obtained for older children containing the score CRFs + the cardiorespiratory fitness variable (termed score CRFs + fit). Data used in the current analysis were derived from the IDEFICS (‘Identification and prevention of Dietary- and lifestyle-induced health EFfects In Children and infantS’) study.&lt;/p&gt; &lt;p&gt;Results In boys &#60;6 years, the odds ratios (OR) for CVD risk were elevated in the least active quintile of PA (OR: 2.58) compared with the most active quintile as well as the second quintile for vigorous PA (OR: 2.91). Compared with the most active quintile, older children in the first, second and third quintiles had OR for CVD risk score CRFs + fit ranging from OR 2.69 to 5.40 in boys, and from OR 2.85 to 7.05 in girls.&lt;/p&gt; &lt;p&gt;Conclusions PA is important to protect against clustering of CVD risk factors in young children, being more consistent in those older than 6 years. Healthcare professionals should recommend around 60 and 85 min/day of moderate-to-vigorous PA, including 20 min/day of vigorous PA.&lt;/p&gt

    Quality of life and cost-effectiveness of interferon-alpha in malignant melanoma: results from randomised trial

    Get PDF
    A definitive conclusion regarding the value of low-dose extended duration adjuvant interferon-alpha therapy in the treatment of malignant melanoma is only possible once data on health-related quality of life (HRQoL) and costs have been considered. This trial randomised 674 patients to interferon alpha-2a (3 megaunits three times per week for 2 years or until recurrence) or placebo. Health-related quality of life (QoL) was to be assessed up to 60 months using the European Organisation for Research and Treatment of Cancer (EORTC) QLQ-C30. Data for the economic analysis, including cost information and the EQ-5D were also collected. Patients in the observation (OBS) group had significantly better mean follow-up quality of on five dimensions of the EORTC QLQ-C30 functional scales: role functioning (P=0.033), emotional functioning (P=0.003), cognitive functioning (P=0.001), social functioning (P=0.003) and global health status (P=0.001). Patients in the OBS group had significantly better mean follow-up symptom scores on seven dimensions of the EORTC QLQ-C30 V1 symptom scales. Economic data showed that costs were £3066 higher in the interferon group and produces an incremental cost per quality-adjusted life year of £41 432 at 5 years. The results show that interferon has significant effects on QoL and symptomatology and is unlikely to be cost-effective in this patient group in the UK

    Evolution favors protein mutational robustness in sufficiently large populations

    Get PDF
    BACKGROUND: An important question is whether evolution favors properties such as mutational robustness or evolvability that do not directly benefit any individual, but can influence the course of future evolution. Functionally similar proteins can differ substantially in their robustness to mutations and capacity to evolve new functions, but it has remained unclear whether any of these differences might be due to evolutionary selection for these properties. RESULTS: Here we use laboratory experiments to demonstrate that evolution favors protein mutational robustness if the evolving population is sufficiently large. We neutrally evolve cytochrome P450 proteins under identical selection pressures and mutation rates in populations of different sizes, and show that proteins from the larger and thus more polymorphic population tend towards higher mutational robustness. Proteins from the larger population also evolve greater stability, a biophysical property that is known to enhance both mutational robustness and evolvability. The excess mutational robustness and stability is well described by existing mathematical theories, and can be quantitatively related to the way that the proteins occupy their neutral network. CONCLUSIONS: Our work is the first experimental demonstration of the general tendency of evolution to favor mutational robustness and protein stability in highly polymorphic populations. We suggest that this phenomenon may contribute to the mutational robustness and evolvability of viruses and bacteria that exist in large populations

    Study of two G-protein coupled receptor variants of human trace amine-associated receptor 5

    Get PDF
    Here we report the study of two bioengineered variants of human trace amine-associated receptor 5 (hTAAR5) that were expressed in stable tetracycline-inducible HEK293S cell lines. A systematic detergent screen showed that fos-choline-14 was the optimal detergent to solubilize and subsequently purify the receptors. Milligram quantities of both hTAAR5 variants were purified to near homogeneity using immunoaffinity chromatography followed by gel filtration. Circular dichroism showed that the purified receptors had helical secondary structures, indicating that they were properly folded. The purified receptors are not only suitable for functional analyses, but also for subsequent crystallization trials. To our knowledge, this is the first mammalian TAAR that has been heterologously expressed and purified. Our study will likely stimulate in the development of therapeutic drug targets for TAAR-associated diseases, as well as fabrication of TAAR-based sensing devices

    Decision tree supported substructure prediction of metabolites from GC-MS profiles

    Get PDF
    Gas chromatography coupled to mass spectrometry (GC-MS) is one of the most widespread routine technologies applied to the large scale screening and discovery of novel metabolic biomarkers. However, currently the majority of mass spectral tags (MSTs) remains unidentified due to the lack of authenticated pure reference substances required for compound identification by GC-MS. Here, we accessed the information on reference compounds stored in the Golm Metabolome Database (GMD) to apply supervised machine learning approaches to the classification and identification of unidentified MSTs without relying on library searches. Non-annotated MSTs with mass spectral and retention index (RI) information together with data of already identified metabolites and reference substances have been archived in the GMD. Structural feature extraction was applied to sub-divide the metabolite space contained in the GMD and to define the prediction target classes. Decision tree (DT)-based prediction of the most frequent substructures based on mass spectral features and RI information is demonstrated to result in highly sensitive and specific detections of sub-structures contained in the compounds. The underlying set of DTs can be inspected by the user and are made available for batch processing via SOAP (Simple Object Access Protocol)-based web services. The GMD mass spectral library with the integrated DTs is freely accessible for non-commercial use at http://gmd.mpimp-golm.mpg.de/. All matching and structure search functionalities are available as SOAP-based web services. A XML + HTTP interface, which follows Representational State Transfer (REST) principles, facilitates read-only access to data base entities

    Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction

    Get PDF
    BACKGROUND: The accomplishment of the various genome sequencing projects resulted in accumulation of massive amount of gene sequence information. This calls for a large-scale computational method for predicting protein localization from sequence. The protein localization can provide valuable information about its molecular function, as well as the biological pathway in which it participates. The prediction of localization of a protein at subnuclear level is a challenging task. In our previous work we proposed an SVM-based system using protein sequence information for this prediction task. In this work, we assess protein similarity with Gene Ontology (GO) and then improve the performance of the system by adding a module of nearest neighbor classifier using a similarity measure derived from the GO annotation terms for protein sequences. RESULTS: The performance of the new system proposed here was compared with our previous system using a set of proteins resided within 6 localizations collected from the Nuclear Protein Database (NPD). The overall MCC (accuracy) is elevated from 0.284 (50.0%) to 0.519 (66.5%) for single-localization proteins in leave-one-out cross-validation; and from 0.420 (65.2%) to 0.541 (65.2%) for an independent set of multi-localization proteins. The new system is available at . CONCLUSION: The prediction of protein subnuclear localizations can be largely influenced by various definitions of similarity for a pair of proteins based on different similarity measures of GO terms. Using the sum of similarity scores over the matched GO term pairs for two proteins as the similarity definition produced the best predictive outcome. Substantial improvement in predicting protein subnuclear localizations has been achieved by combining Gene Ontology with sequence information

    Structural insights into the production of 3-hydroxypropionic acid by aldehyde dehydrogenase from Azospirillum brasilense

    Get PDF
    3-Hydroxypropionic acid (3-HP) is an important platform chemical to be converted to acrylic acid and acrylamide. Aldehyde dehydrogenase (ALDH), an enzyme that catalyzes the reaction of 3-hydroxypropionaldehyde (3-HPA) to 3-HP, determines 3-HP production rate during the conversion of glycerol to 3-HP. To elucidate molecular mechanism of 3-HP production, we determined the first crystal structure of a 3-HP producing ALDH, alpha-ketoglutarate-semialdehyde dehydrogenase from Azospirillum basilensis (AbKGSADH), in its apo-form and in complex with NAD(+). Although showing an overall structure similar to other ALDHs, the AbKGSADH enzyme had an optimal substrate binding site for accepting 3-HPA as a substrate. Molecular docking simulation of 3-HPA into the AbKGSADH structure revealed that the residues Asn159, Gln160 and Arg163 stabilize the aldehyde-and the hydroxyl-groups of 3-HPA through hydrogen bonds, and several hydrophobic residues, such as Phe156, Val286, Ile288, and Phe450, provide the optimal size and shape for 3-HPA binding. We also compared AbKGSADH with other reported 3-HP producing ALDHs for the crucial amino acid residues for enzyme catalysis and substrate binding, which provides structural implications on how these enzymes utilize 3-HPA as a substrate
    corecore