138 research outputs found
New insights regarding HCV-NS5A structure/function and indication of genotypic differences
<p>Abstract</p> <p>Background</p> <p>HCV is prevalent throughout the world. It is a major cause of chronic liver disease. There is no effective vaccine and the most common therapy, based on Peginterferon, has a success rate of ~50%. The mechanisms underlying viral resistance have not been elucidated but it has been suggested that both host and virus contribute to therapy outcome. Non-structural 5A (NS5A) protein, a critical virus component, is involved in cellular and viral processes.</p> <p>Methods</p> <p>The present study analyzed structural and functional features of 345 sequences of HCV-NS5A genotypes 1 or 3, using <it>in silico </it>tools.</p> <p>Results</p> <p>There was residue type composition and secondary structure differences between the genotypes. In addition, second structural variance were statistical different for each response group in genotype 3. A motif search indicated conserved glycosylation, phosphorylation and myristoylation sites that could be important in structural stabilization and function. Furthermore, a highly conserved integrin ligation site was identified, and could be linked to nuclear forms of NS5A. ProtFun indicated NS5A to have diverse enzymatic and nonenzymatic activities, participating in a great range of cell functions, with statistical difference between genotypes.</p> <p>Conclusion</p> <p>This study presents new insights into the HCV-NS5A. It is the first study that using bioinformatics tools, suggests differences between genotypes and response to therapy that can be related to NS5A protein features. Therefore, it emphasizes the importance of using bioinformatics tools in viral studies. Data acquired herein will aid in clarifying the structure/function of this protein and in the development of antiviral agents.</p
A cluster randomized trial of standard quality improvement versus patient-centered interventions to enhance depression care for African Americans in the primary care setting: study protocol NCT00243425
<p>Abstract</p> <p>Background</p> <p>Several studies document disparities in access to care and quality of care for depression for African Americans. Research suggests that patient attitudes and clinician communication behaviors may contribute to these disparities. Evidence links patient-centered care to improvements in mental health outcomes; therefore, quality improvement interventions that enhance this dimension of care are promising strategies to improve treatment and outcomes of depression among African Americans. This paper describes the design of the BRIDGE (Blacks Receiving Interventions for Depression and Gaining Empowerment) Study. The goal of the study is to compare the effectiveness of two interventions for African-American patients with depression--a standard quality improvement program and a patient-centered quality improvement program. The main hypothesis is that patients in the patient-centered group will have a greater reduction in their depression symptoms, higher rates of depression remission, and greater improvements in mental health functioning at six, twelve, and eighteen months than patients in the standard group. The study also examines patient ratings of care and receipt of guideline-concordant treatment for depression.</p> <p>Methods/Design</p> <p>A total of 36 primary care clinicians and 132 of their African-American patients with major depressive disorder were recruited into a cluster randomized trial. The study uses intent-to-treat analyses to compare the effectiveness of standard quality improvement interventions (academic detailing about depression guidelines for clinicians and disease-oriented care management for their patients) and patient-centered quality improvement interventions (communication skills training to enhance participatory decision-making for clinicians and care management focused on explanatory models, socio-cultural barriers, and treatment preferences for their patients) for improving outcomes over 12 months of follow-up.</p> <p>Discussion</p> <p>The BRIDGE Study includes clinicians and African-American patients in under-resourced community-based practices who have not been well-represented in clinical trials to improve depression care. The patient-centered and culturally targeted approach to depression care is a relatively new one that has not been tested in most previous studies. The study will provide evidence about whether patient-centered accommodations improve quality of care and outcomes to a greater extent than standard quality improvement strategies for African Americans with depression.</p> <p>Trial Registration</p> <p>ClinicalTrials.gov NCT00243425</p
Plasmodium vivax Tryptophan-Rich Antigen PvTRAg33.5 Contains Alpha Helical Structure and Multidomain Architecture
Tryptophan-rich proteins from several malarial parasites have been identified where they play an important role in host-parasite interaction. Structural characterization of these proteins is needed to develop them as therapeutic targets. Here, we describe a novel Plasmodium vivax tryptophan-rich protein named PvTRAg33.5. It is expressed by blood stage(s) of the parasite and its gene contains two exons. The exon 1 encodes for a 23 amino acids long putative signal peptide which is likely to be cleaved off whereas the exon 2 encodes for the mature protein of 252 amino acids. The mature protein contains B-cell epitopes which were recognized by the human immune system during P.vivax infection. The PvTRAg33.5 contains 24 (9.5%) tryptophan residues and six motifs whose patterns were similar among tryptophan-rich proteins. The modeled structure of the PvTRAg33.5 consists of a multidomain architecture which is stabilized by the presence of large number of tryptophan residues. The recombinant PvTRAg33.5 showed predominantly α helical structure and alpha helix to beta sheet transition at pH below 4.5. Protein acquires an irreversible non-native state at temperature more than 50°C at neutral pH. Its secondary and tertiary structures remain stable in the presence of 35% alcohol but these structures are destabilized at higher alcohol concentrations due to the disturbance of hydrophobic interactions between tryptophanyl residues. These structural changes in the protein might occur during its translocation to interact with other proteins at its final destination for biological function such as erythrocyte invasion
Prodepth: Predict Residue Depth by Support Vector Regression Approach from Protein Sequences Only
Residue depth (RD) is a solvent exposure measure that complements the information provided by conventional accessible surface area (ASA) and describes to what extent a residue is buried in the protein structure space. Previous studies have established that RD is correlated with several protein properties, such as protein stability, residue conservation and amino acid types. Accurate prediction of RD has many potentially important applications in the field of structural bioinformatics, for example, facilitating the identification of functionally important residues, or residues in the folding nucleus, or enzyme active sites from sequence information. In this work, we introduce an efficient approach that uses support vector regression to quantify the relationship between RD and protein sequence. We systematically investigated eight different sequence encoding schemes including both local and global sequence characteristics and examined their respective prediction performances. For the objective evaluation of our approach, we used 5-fold cross-validation to assess the prediction accuracies and showed that the overall best performance could be achieved with a correlation coefficient (CC) of 0.71 between the observed and predicted RD values and a root mean square error (RMSE) of 1.74, after incorporating the relevant multiple sequence features. The results suggest that residue depth could be reliably predicted solely from protein primary sequences: local sequence environments are the major determinants, while global sequence features could influence the prediction performance marginally. We highlight two examples as a comparison in order to illustrate the applicability of this approach. We also discuss the potential implications of this new structural parameter in the field of protein structure prediction and homology modeling. This method might prove to be a powerful tool for sequence analysis
Protein structure search and local structure characterization
<p>Abstract</p> <p>Background</p> <p>Structural similarities among proteins can provide valuable insight into their functional mechanisms and relationships. As the number of available three-dimensional (3D) protein structures increases, a greater variety of studies can be conducted with increasing efficiency, among which is the design of protein structural alphabets. Structural alphabets allow us to characterize local structures of proteins and describe the global folding structure of a protein using a one-dimensional (1D) sequence. Thus, 1D sequences can be used to identify structural similarities among proteins using standard sequence alignment tools such as BLAST or FASTA.</p> <p>Results</p> <p>We used self-organizing maps in combination with a minimum spanning tree algorithm to determine the optimum size of a structural alphabet and applied the k-means algorithm to group protein fragnts into clusters. The centroids of these clusters defined the structural alphabet. We also developed a flexible matrix training system to build a substitution matrix (TRISUM-169) for our alphabet. Based on FASTA and using TRISUM-169 as the substitution matrix, we developed the SA-FAST alignment tool. We compared the performance of SA-FAST with that of various search tools in database-scale search tasks and found that SA-FAST was highly competitive in all tests conducted. Further, we evaluated the performance of our structural alphabet in recognizing specific structural domains of EGF and EGF-like proteins. Our method successfully recovered more EGF sub-domains using our structural alphabet than when using other structural alphabets. SA-FAST can be found at <url>http://140.113.166.178/safast/</url>.</p> <p>Conclusion</p> <p>The goal of this project was two-fold. First, we wanted to introduce a modular design pipeline to those who have been working with structural alphabets. Secondly, we wanted to open the door to researchers who have done substantial work in biological sequences but have yet to enter the field of protein structure research. Our experiments showed that by transforming the structural representations from 3D to 1D, several 1D-based tools can be applied to structural analysis, including similarity searches and structural motif finding.</p
Web-based tools can be used reliably to detect patients with major depressive disorder and subsyndromal depressive symptoms
BACKGROUND: Although depression has been regarded as a major public health problem, many individuals with depression still remain undetected or untreated. Despite the potential for Internet-based tools to greatly improve the success rate of screening for depression, their reliability and validity has not been well studied. Therefore the aim of this study was to evaluate the test-retest reliability and criterion validity of a Web-based system, the Internet-based Self-assessment Program for Depression (ISP-D). METHODS: The ISP-D to screen for major depressive disorder (MDD), minor depressive disorder (MinD), and subsyndromal depressive symptoms (SSD) was developed in traditional Chinese. Volunteers, 18 years and older, were recruited via the Internet and then assessed twice on the online ISP-D system to investigate the test-retest reliability of the test. They were subsequently prompted to schedule face-to-face interviews. The interviews were performed by the research psychiatrists using the Mini-International Neuropsychiatric Interview and the diagnoses made according to DSM-IV diagnostic criteria were used for the statistics of criterion validity. Kappa (κ) values were calculated to assess test-retest reliability. RESULTS: A total of 579 volunteer subjects were administered the test. Most of the subjects were young (mean age: 26.2 ± 6.6 years), female (77.7%), single (81.6%), and well educated (61.9% college or higher). The distributions of MDD, MinD, SSD and no depression specified were 30.9%, 7.4%, 15.2%, and 46.5%, respectively. The mean time to complete the ISP-D was 8.89 ± 6.77 min. One hundred and eighty-four of the respondents completed the retest (response rate: 31.8%). Our analysis revealed that the 2-week test-retest reliability for ISP-D was excellent (weighted κ = 0.801). Fifty-five participants completed the face-to-face interview for the validity study. The sensitivity, specificity, positive, and negative predictive values for major depressive disorder were 81.8% and 72.7%, 66.7%, and 85.7% respectively. The overall accuracy was 76.4%. CONCLUSION: The evidence indicates the ISP-D is a reliable and valid online tool for assessing depression. Further studies should test the ISP-D in clinical settings to increase its applications in clinical environments with different populations and in a larger sample size
A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis
<p>Abstract</p> <p>Background</p> <p>Protein remote homology detection and fold recognition are central problems in bioinformatics. Currently, discriminative methods based on support vector machine (SVM) are the most effective and accurate methods for solving these problems. A key step to improve the performance of the SVM-based methods is to find a suitable representation of protein sequences.</p> <p>Results</p> <p>In this paper, a novel building block of proteins called Top-<it>n</it>-grams is presented, which contains the evolutionary information extracted from the protein sequence frequency profiles. The protein sequence frequency profiles are calculated from the multiple sequence alignments outputted by PSI-BLAST and converted into Top-<it>n</it>-grams. The protein sequences are transformed into fixed-dimension feature vectors by the occurrence times of each Top-<it>n</it>-gram. The training vectors are evaluated by SVM to train classifiers which are then used to classify the test protein sequences. We demonstrate that the prediction performance of remote homology detection and fold recognition can be improved by combining Top-<it>n</it>-grams and latent semantic analysis (LSA), which is an efficient feature extraction technique from natural language processing. When tested on superfamily and fold benchmarks, the method combining Top-<it>n</it>-grams and LSA gives significantly better results compared to related methods.</p> <p>Conclusion</p> <p>The method based on Top-<it>n</it>-grams significantly outperforms the methods based on many other building blocks including N-grams, patterns, motifs and binary profiles. Therefore, Top-<it>n</it>-gram is a good building block of the protein sequences and can be widely used in many tasks of the computational biology, such as the sequence alignment, the prediction of domain boundary, the designation of knowledge-based potentials and the prediction of protein binding sites.</p
The Major Antigenic Membrane Protein of “Candidatus Phytoplasma asteris” Selectively Interacts with ATP Synthase and Actin of Leafhopper Vectors
Phytoplasmas, uncultivable phloem-limited phytopathogenic wall-less bacteria, represent a major threat to agriculture worldwide. They are transmitted in a persistent, propagative manner by phloem-sucking Hemipteran insects. Phytoplasma membrane proteins are in direct contact with hosts and are presumably involved in determining vector specificity. Such a role has been proposed for phytoplasma transmembrane proteins encoded by circular extrachromosomal elements, at least one of which is a plasmid. Little is known about the interactions between major phytoplasma antigenic membrane protein (Amp) and insect vector proteins. The aims of our work were to identify vector proteins interacting with Amp and to investigate their role in transmission specificity. In controlled transmission experiments, four Hemipteran species were identified as vectors of “Candidatus Phytoplasma asteris”, the chrysanthemum yellows phytoplasmas (CYP) strain, and three others as non-vectors. Interactions between a labelled (recombinant) CYP Amp and insect proteins were analysed by far Western blots and affinity chromatography. Amp interacted specifically with a few proteins from vector species only. Among Amp-binding vector proteins, actin and both the α and β subunits of ATP synthase were identified by mass spectrometry and Western blots. Immunofluorescence confocal microscopy and Western blots of plasma membrane and mitochondrial fractions confirmed the localisation of ATP synthase, generally known as a mitochondrial protein, in plasma membranes of midgut and salivary gland cells in the vector Euscelidius variegatus. The vector-specific interaction between phytoplasma Amp and insect ATP synthase is demonstrated for the first time, and this work also supports the hypothesis that host actin is involved in the internalization and intracellular motility of phytoplasmas within their vectors. Phytoplasma Amp is hypothesized to play a crucial role in insect transmission specificity
Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences
<p>Abstract</p> <p>Background</p> <p>Knowledge of structural class is used by numerous methods for identification of structural/functional characteristics of proteins and could be used for the detection of remote homologues, particularly for chains that share twilight-zone similarity. In contrast to existing sequence-based structural class predictors, which target four major classes and which are designed for high identity sequences, we predict seven classes from sequences that share twilight-zone identity with the training sequences.</p> <p>Results</p> <p>The proposed MODular Approach to Structural class prediction (MODAS) method is unique as it allows for selection of any subset of the classes. MODAS is also the first to utilize a novel, custom-built feature-based sequence representation that combines evolutionary profiles and predicted secondary structure. The features quantify information relevant to the definition of the classes including conservation of residues and arrangement and number of helix/strand segments. Our comprehensive design considers 8 feature selection methods and 4 classifiers to develop Support Vector Machine-based classifiers that are tailored for each of the seven classes. Tests on 5 twilight-zone and 1 high-similarity benchmark datasets and comparison with over two dozens of modern competing predictors show that MODAS provides the best overall accuracy that ranges between 80% and 96.7% (83.5% for the twilight-zone datasets), depending on the dataset. This translates into 19% and 8% error rate reduction when compared against the best performing competing method on two largest datasets. The proposed predictor provides accurate predictions at 58% accuracy for membrane proteins class, which is not considered by majority of existing methods, in spite that this class accounts for only 2% of the data. Our predictive model is analyzed to demonstrate how and why the input features are associated with the corresponding classes.</p> <p>Conclusions</p> <p>The improved predictions stem from the novel features that express collocation of the secondary structure segments in the protein sequence and that combine evolutionary and secondary structure information. Our work demonstrates that conservation and arrangement of the secondary structure segments predicted along the protein chain can successfully predict structural classes which are defined based on the spatial arrangement of the secondary structures. A web server is available at <url>http://biomine.ece.ualberta.ca/MODAS/</url>.</p
- …