543 research outputs found

    Improving protein secondary structure prediction based on short subsequences with local structure similarity

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>When characterizing the structural topology of proteins, protein secondary structure (PSS) plays an important role in analyzing and modeling protein structures because it represents the local conformation of amino acids into regular structures. Although PSS prediction has been studied for decades, the prediction accuracy reaches a bottleneck at around 80%, and further improvement is very difficult.</p> <p>Results</p> <p>In this paper, we present an improved dictionary-based PSS prediction method called SymPred, and a meta-predictor called SymPsiPred. We adopt the concept behind natural language processing techniques and propose synonymous words to capture local sequence similarities in a group of similar proteins. A synonymous word is an <it>n-</it>gram pattern of amino acids that reflects the sequence variation in a protein’s evolution. We generate a protein-dependent synonymous dictionary from a set of protein sequences for PSS prediction.</p> <p>On a large non-redundant dataset of 8,297 protein chains (<it>DsspNr-25</it>), the average <it>Q</it><sub>3</sub> of SymPred and SymPsiPred are 81.0% and 83.9% respectively. On the two latest independent test sets (<it>EVA Set_1</it> and <it>EVA_Set2</it>), the average <it>Q</it><sub>3</sub> of SymPred is 78.8% and 79.2% respectively. SymPred outperforms other existing methods by 1.4% to 5.4%. We study two factors that may affect the performance of SymPred and find that it is very sensitive to the number of proteins of both known and unknown structures. This finding implies that SymPred and SymPsiPred have the potential to achieve higher accuracy as the number of protein sequences in the NCBInr and PDB databases increases.</p> <p>Conclusions</p> <p>Our experiment results show that local similarities in protein sequences typically exhibit conserved structures, which can be used to improve the accuracy of secondary structure prediction. For the application of synonymous words, we demonstrate an example of a sequence alignment which is generated by the distribution of shared synonymous words of a pair of protein sequences. We can align the two sequences nearly perfectly which are very dissimilar at the sequence level but very similar at the structural level. The SymPred and SymPsiPred prediction servers are available at <url>http://bio-cluster.iis.sinica.edu.tw/SymPred/</url>.</p

    Protein subcellular localization prediction of eukaryotes using a knowledge-based approach

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The study of protein subcellular localization (PSL) is important for elucidating protein functions involved in various cellular processes. However, determining the localization sites of a protein through wet-lab experiments can be time-consuming and labor-intensive. Thus, computational approaches become highly desirable. Most of the PSL prediction systems are established for single-localized proteins. However, a significant number of eukaryotic proteins are known to be localized into multiple subcellular organelles. Many studies have shown that proteins may simultaneously locate or move between different cellular compartments and be involved in different biological processes with different roles.</p> <p>Results</p> <p>In this study, we propose a knowledge based method, called KnowPred<sub>site</sub>, to predict the localization site(s) of both single-localized and multi-localized proteins. Based on the local similarity, we can identify the "related sequences" for prediction. We construct a knowledge base to record the possible sequence variations for protein sequences. When predicting the localization annotation of a query protein, we search against the knowledge base and used a scoring mechanism to determine the predicted sites. We downloaded the dataset from ngLOC, which consisted of ten distinct subcellular organelles from 1923 species, and performed ten-fold cross validation experiments to evaluate KnowPred<sub>site</sub>'s performance. The experiment results show that KnowPred<sub>site </sub>achieves higher prediction accuracy than ngLOC and Blast-hit method. For single-localized proteins, the overall accuracy of KnowPred<sub>site </sub>is 91.7%. For multi-localized proteins, the overall accuracy of KnowPred<sub>site </sub>is 72.1%, which is significantly higher than that of ngLOC by 12.4%. Notably, half of the proteins in the dataset that cannot find any Blast hit sequence above a specified threshold can still be correctly predicted by KnowPred<sub>site</sub>.</p> <p>Conclusion</p> <p>KnowPred<sub>site </sub>demonstrates the power of identifying related sequences in the knowledge base. The experiment results show that even though the sequence similarity is low, the local similarity is effective for prediction. Experiment results show that KnowPred<sub>site </sub>is a highly accurate prediction method for both single- and multi-localized proteins. It is worth-mentioning the prediction process of KnowPred<sub>site </sub>is transparent and biologically interpretable and it shows a set of template sequences to generate the prediction result. The KnowPred<sub>site </sub>prediction server is available at <url>http://bio-cluster.iis.sinica.edu.tw/kbloc/</url>.</p

    Negative flat band magnetism in a spin-orbit coupled correlated kagome magnet

    Full text link
    It has long been speculated that electronic flat band systems can be a fertile ground for hosting novel emergent phenomena including unconventional magnetism and superconductivity. Although flat bands are known to exist in a few systems such as heavy fermion materials and twisted bilayer graphene, their microscopic roles and underlying mechanisms in generating emergent behavior remain elusive. Here we use scanning tunneling microscopy to elucidate the atomically resolved electronic states and their magnetic response in the kagome magnet Co3Sn2S2. We observe a pronounced peak at the Fermi level, which is identified to arise from the kinetically frustrated kagome flat band. Increasing magnetic field up to +-8T, this state exhibits an anomalous magnetization-polarized Zeeman shift, dominated by an orbital moment in opposite to the field direction. Such negative magnetism can be understood as spin-orbit coupling induced quantum phase effects tied to non-trivial flat band systems. We image the flat band peak, resolve the associated negative magnetism, and provide its connection to the Berry curvature field, showing that Co3Sn2S2 is a rare example of kagome magnet where the low energy physics can be dominated by the spin-orbit coupled flat band. Our methodology of probing band-resolved ordering phenomena such as spin-orbit magnetism can also be applied in future experiments to elucidate other exotic phenomena including flat band superconductivity and anomalous quantum transport.Comment: Nature Physics onlin

    Topological photocurrent responses from chiral surface Fermi arcs

    Full text link
    The nonlinear optical responses from topological semimetals are crucial in both understanding the fundamental properties of quantum materials and designing next-generation light-sensors or solar-cells. However, previous work was focusing on the optical effects from bulk states only, disregarding topological surface responses. Here we propose a new (hitherto unknown) surface-only topological photocurrent response from chiral Fermi arcs. Using the ideal topological chiral semimetal RhSi as a representative, we quantitatively compute the topologically robust photocurrents from Fermi arcs on different surfaces. By rigorous crystal symmetry analysis, we demonstrate that Fermi arc photocurrents can be perpendicular to the bulk injection currents regardless of the choice of materials' surface. We then generalize this finding to all cubic chiral space groups and predict material candidates. Our theory reveals a powerful notion where common crystalline-symmetry can be used to induce universal topological responses as well as making it possible to completely disentangle bulk and surface topological responses in many conducting material families

    BIOSMILE web search: a web application for annotating biomedical entities and relations

    Get PDF
    BIOSMILE web search (BWS), a web-based NCBI-PubMed search application, which can analyze articles for selected biomedical verbs and give users relational information, such as subject, object, location, manner, time, etc. After receiving keyword query input, BWS retrieves matching PubMed abstracts and lists them along with snippets by order of relevancy to protein–protein interaction. Users can then select articles for further analysis, and BWS will find and mark up biomedical relations in the text. The analysis results can be viewed in the abstract text or in table form. To date, BWS has been field tested by over 30 biologists and questionnaires have shown that subjects are highly satisfied with its capabilities and usability. BWS is accessible free of charge at http://bioservices.cse.yzu.edu.tw/BWS
    corecore