8,363 research outputs found

    Post-translational modifications and mass spectrometry detection

    Get PDF
    In this review, we provide a comprehensive bibliographic overview of the role of mass spectrometry and the recent technical developments in the detection of post-translational modifications (PTMs). We briefly describe the principles of mass spectrometry for detecting PTMs and the protein and peptide enrichment strategies for PTM analysis, including phosphorylation, acetylation and oxidation. This review presents a bibliographic overview of the scientific achievements and the recent technical development in the detection of PTMs is provided. In order to ascertain the state of the art in mass spectrometry and proteomics methodologies for the study of PTMs, we analyzed all the PTM data introduced in the Universal Protein Resource (UniProt) and the literature published in the last three years. The evolution of curated data in UniProt for proteins annotated as being post-translationally modified is also analyzed. Additionally, we have undertaken a careful analysis of the research articles published in the years 2010 to 2012 reporting the detection of PTMs in biological samples by mass spectrometry. © 2013 Elsevier Inc

    Tachyon search speeds up retrieval of similar sequences by several orders of magnitude

    Get PDF
    Summary: The usage of current sequence search tools becomes increasingly slower as databases of protein sequences continue to grow exponentially. Tachyon, a new algorithm that identifies closely related protein sequences ~200 times faster than standard BLAST, circumvents this limitation with a reduced database and oligopeptide matching heuristic

    Integration of Biological Sources: Exploring the Case of Protein Homology

    Get PDF
    Data integration is a key issue in the domain of bioin- formatics, which deals with huge amounts of heteroge- neous biological data that grows and changes rapidly. This paper serves as an introduction in the field of bioinformatics and the biological concepts it deals with, and an exploration of the integration problems a bioinformatics scientist faces. We examine ProGMap, an integrated protein homology system used by bioin- formatics scientists at Wageningen University, and several use cases related to protein homology. A key issue we identify is the huge manual effort required to unify source databases into a single resource. Un- certain databases are able to contain several possi- ble worlds, and it has been proposed that they can be used to significantly reduce initial integration efforts. We propose several directions for future work where uncertain databases can be applied to bioinformatics, with the goal of furthering the cause of bioinformatics integration

    A Linked Data Approach to Sharing Workflows and Workflow Results

    No full text
    A bioinformatics analysis pipeline is often highly elaborate, due to the inherent complexity of biological systems and the variety and size of datasets. A digital equivalent of the ‘Materials and Methods’ section in wet laboratory publications would be highly beneficial to bioinformatics, for evaluating evidence and examining data across related experiments, while introducing the potential to find associated resources and integrate them as data and services. We present initial steps towards preserving bioinformatics ‘materials and methods’ by exploiting the workflow paradigm for capturing the design of a data analysis pipeline, and RDF to link the workflow, its component services, run-time provenance, and a personalized biological interpretation of the results. An example shows the reproduction of the unique graph of an analysis procedure, its results, provenance, and personal interpretation of a text mining experiment. It links data from Taverna, myExperiment.org, BioCatalogue.org, and ConceptWiki.org. The approach is relatively ‘light-weight’ and unobtrusive to bioinformatics users

    ProDGe: investigating protein-protein interactions at the domain level

    Get PDF
    An important goal of systems biology is the identification and investigation of known and predicted protein-protein interactions to obtain more information about new cellular pathways and processes. Proteins interact via domains, thus it is important to know which domains a protein contains and which domains interact with each other. Here we present the Java^TM^ program ProDGe (Protein Domain Gene), which visualizes existing and suggests novel domain-domain interactions and protein-protein interactions at the domain level. The comprehensive dataset behind ProDGe consists of protein, domain and interaction information for both layers, collected and combined appropriately from UniProt, Pfam, DOMINE and IntAct. Based on known domain interactions, ProDGe suggests novel protein interactions and assigns them to four confidence classes, depending on the reliability of the underlying domain interaction. Furthermore, ProDGe is able to identify potential homologous interaction partners in other species, which is particularly helpful when investigating poorly annotated species. We further evaluated and compared experimentally identified protein interactions from IntAct with domain interactions from DOMINE for six species and noticed that 31.13% of all IntAct protein interactions in all six species can be mapped to the actual interacting domains. ProDGe and a comprehensive documentation are freely available at http://www.cogsys.cs.uni-tuebingen.de/software/ProDGe

    iLIR : a web resource for prediction of Atg8-family interacting proteins

    Get PDF
    Macroautophagy was initially considered to be a nonselective process for bulk breakdown of cytosolic material. However, recent evidence points toward a selective mode of autophagy mediated by the so-called selective autophagy receptors (SARs). SARs act by recognizing and sorting diverse cargo substrates (e.g., proteins, organelles, pathogens) to the autophagic machinery. Known SARs are characterized by a short linear sequence motif (LIR-, LRS-, or AIM-motif) responsible for the interaction between SARs and proteins of the Atg8 family. Interestingly, many LIR-containing proteins (LIRCPs) are also involved in autophagosome formation and maturation and a few of them in regulating signaling pathways. Despite recent research efforts to experimentally identify LIRCPs, only a few dozen of this class of—often unrelated—proteins have been characterized so far using tedious cell biological, biochemical, and crystallographic approaches. The availability of an ever-increasing number of complete eukaryotic genomes provides a grand challenge for characterizing novel LIRCPs throughout the eukaryotes. Along these lines, we developed iLIR, a freely available web resource, which provides in silico tools for assisting the identification of novel LIRCPs. Given an amino acid sequence as input, iLIR searches for instances of short sequences compliant with a refined sensitive regular expression pattern of the extended LIR motif (xLIR-motif) and retrieves characterized protein domains from the SMART database for the query. Additionally, iLIR scores xLIRs against a custom position-specific scoring matrix (PSSM) and identifies potentially disordered subsequences with protein interaction potential overlapping with detected xLIR-motifs. Here we demonstrate that proteins satisfying these criteria make good LIRCP candidates for further experimental verification. Domain architecture is displayed in an informative graphic, and detailed results are also available in tabular form. We anticipate that iLIR will assist with elucidating the full complement of LIRCPs in eukaryotes
    corecore