164 research outputs found

    A mass accuracy sensitive probability based scoring algorithm for database searching of tandem mass spectrometry data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has become one of the most used tools in mass spectrometry based proteomics. Various algorithms have since been developed to automate the process for modern high-throughput LC-MS/MS experiments.</p> <p>Results</p> <p>A probability based statistical scoring model for assessing peptide and protein matches in tandem MS database search was derived. The statistical scores in the model represent the probability that a peptide match is a random occurrence based on the number or the total abundance of matched product ions in the experimental spectrum. The model also calculates probability based scores to assess protein matches. Thus the protein scores in the model reflect the significance of protein matches and can be used to differentiate true from random protein matches.</p> <p>Conclusion</p> <p>The model is sensitive to high mass accuracy and implicitly takes mass accuracy into account during scoring. High mass accuracy will not only reduce false positives, but also improves the scores of true positive matches. The algorithm is incorporated in an automated database search program MassMatrix.</p

    The duck hepatitis virus 5'-UTR possesses HCV-like IRES activity that is independent of eIF4F complex and modulated by downstream coding sequences

    Get PDF
    Duck hepatitis virus (DHV-1) is a worldwide distributed picornavirus that causes acute and fatal disease in young ducklings. Recently, the complete genome of DHV-1 has been determined and comparative sequence analysis has shown that possesses the typical picornavirus organization but exhibits several unique features. For the first time, we provide evidence that the 626-nucleotide-long 5'-UTR of the DHV-1 genome contains an internal ribosome entry site (IRES) element that functions efficiently both in vitro and in mammalian cells. The prediction of the secondary structure of the DHV-1 IRES shows significant similarity to the hepatitis C virus (HCV) IRES. Moreover, similarly to HCV IRES, DHV-1 IRES can direct translation initiation in the absence of a functional eIF4F complex. We also demonstrate that the activity of the DHV-1 IRES is modulated by a viral coding sequence located downstream of the DHV-1 5'-UTR, which enhances DHV-1 IRES activity both in vitro and in vivo. Furthermore, mutational analysis of the predicted pseudo-knot structures at the 3'-end of the putative DHV-1 IRES supported the presence of conserved domains II and III and, as it has been previously described for other picornaviruses, these structures are essential for keeping the normal internal initiation of translation of DHV-1

    Harvest: an open-source tool for the validation and improvement of peptide identification metrics and fragmentation exploration

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Protein identification using mass spectrometry is an important tool in many areas of the life sciences, and in proteomics research in particular. Increasing the number of proteins correctly identified is dependent on the ability to include new knowledge about the mass spectrometry fragmentation process, into computational algorithms designed to separate true matches of peptides to unidentified mass spectra from spurious matches. This discrimination is achieved by computing a function of the various features of the potential match between the observed and theoretical spectra to give a numerical approximation of their similarity. It is these underlying "metrics" that determine the ability of a protein identification package to maximise correct identifications while limiting false discovery rates. There is currently no software available specifically for the simple implementation and analysis of arbitrary novel metrics for peptide matching and for the exploration of fragmentation patterns for a given dataset.</p> <p>Results</p> <p>We present Harvest: an open source software tool for analysing fragmentation patterns and assessing the power of a new piece of information about the MS/MS fragmentation process to more clearly differentiate between correct and random peptide assignments. We demonstrate this functionality using data metrics derived from the properties of individual datasets in a peptide identification context. Using Harvest, we demonstrate how the development of such metrics may improve correct peptide assignment confidence in the context of a high-throughput proteomics experiment and characterise properties of peptide fragmentation.</p> <p>Conclusions</p> <p>Harvest provides a simple framework in C++ for analysing and prototyping metrics for peptide matching, the core of the protein identification problem. It is not a protein identification package and answers a different research question to packages such as Sequest, Mascot, X!Tandem, and other protein identification packages. It does not aim to maximise the number of assigned peptides from a set of unknown spectra, but instead provides a method by which researchers can explore fragmentation properties and assess the power of novel metrics for peptide matching in the context of a given experiment. Metrics developed using Harvest may then become candidates for later integration into protein identification packages.</p

    Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Database searching is the most frequently used approach for automated peptide assignment and protein inference of tandem mass spectra. The results, however, depend on the sequences in target databases and on search algorithms. Recently by using an alternative splicing database, we identified more proteins than with the annotated proteins in <it>Aspergillus flavus</it>. In this study, we aimed at finding a greater number of eligible splice variants based on newly available transcript sequences and the latest genome annotation. The improved database was then used to compare four search algorithms: Mascot, OMSSA, X! Tandem, and InsPecT.</p> <p>Results</p> <p>The updated alternative splicing database predicted 15833 putative protein variants, 61% more than the previous results. There was transcript evidence for 50% of the updated genes compared to the previous 35% coverage. Database searches were conducted using the same set of spectral data, search parameters, and protein database but with different algorithms. The false discovery rates of the peptide-spectrum matches were estimated < 2%. The numbers of the total identified proteins varied from 765 to 867 between algorithms. Whereas 42% (1651/3891) of peptide assignments were unanimous, the comparison showed that 51% (568/1114) of the RefSeq proteins and 15% (11/72) of the putative splice variants were inferred by all algorithms. 12 plausible isoforms were discovered by focusing on the consensus peptides which were detected by at least three different algorithms. The analysis found different conserved domains in two putative isoforms of UDP-galactose 4-epimerase.</p> <p>Conclusions</p> <p>We were able to detect dozens of new peptides using the improved alternative splicing database with the recently updated annotation of the <it>A. flavus </it>genome. Unlike the identifications of the peptides and the RefSeq proteins, large variations existed between the putative splice variants identified by different algorithms. 12 candidates of putative isoforms were reported based on the consensus peptide-spectrum matches. This suggests that applications of multiple search engines effectively reduced the possible false positive results and validated the protein identifications from tandem mass spectra using an alternative splicing database.</p

    The influence of personality and ability on undergraduate teamwork and team performance

    Get PDF
    The ability to work effectively on a team is highly valued by employers, and collaboration among students can lead to intrinsic motivation, increased persistence, and greater transferability of skills. Moreover, innovation often arises from multidisciplinary teamwork. The influence of personality and ability on undergraduate teamwork and performance is not comprehensively understood. An investigation was undertaken to explore correlations between team outcomes, personality measures and ability in an undergraduate population. Team outcomes included various self-, peer- and instructor ratings of skills, performance, and experience. Personality measures and ability involved the Five-Factor Model personality traits and GPA. Personality, GPA, and teamwork survey data, as well as instructor evaluations were collected from upper division team project courses in engineering, business, political science, and industrial design at a large public university. Characteristics of a multidisciplinary student team project were briefly examined. Personality, in terms of extraversion scores, was positively correlated with instructors’ assessment of team performance in terms of oral and written presentation scores, which is consistent with prior research. Other correlations to instructor-, students’ self- and peer-ratings were revealed and merit further study. The findings in this study can be used to understand important influences on successful teamwork, teamwork instruction and intervention and to understand the design of effective curricula in this area moving forward

    Cell-Specific Monitoring of Protein Synthesis In Vivo

    Get PDF
    Analysis of general and specific protein synthesis provides important information, relevant to cellular physiology and function. However, existing methodologies, involving metabolic labelling by incorporation of radioactive amino acids into nascent polypeptides, cannot be applied to monitor protein synthesis in specific cells or tissues, in live specimens. We have developed a novel approach for monitoring protein synthesis in specific cells or tissues, in vivo. Fluorescent reporter proteins such as GFP are expressed in specific cells and tissues of interest or throughout animals using appropriate promoters. Protein synthesis rates are assessed by following fluorescence recovery after partial photobleaching of the fluorophore at targeted sites. We evaluate the method by examining protein synthesis rates in diverse cell types of live, wild type or mRNA translation-defective Caenorhabditis elegans animals. Because it is non-invasive, our approach allows monitoring of protein synthesis in single cells or tissues with intrinsically different protein synthesis rates. Furthermore, it can be readily implemented in other organisms or cell culture systems

    Techniques for accurate protein identification in shotgun proteomic studies of human, mouse, bovine, and chicken lenses

    Get PDF
    Analysis of shotgun proteomics datasets requires techniques to distinguish correct peptide identifications from incorrect identifications, such as linear discriminant functions and target/decoy protein databases. We report an efficient, flexible proteomic analysis workflow pipeline that implements these techniques to control both peptide and protein false discovery rates. We demonstrate its performance by analyzing two-dimensional liquid chromatography separations of lens proteins from human, mouse, bovine, and chicken lenses. We compared the use of International Protein Index databases to UniProt databases and no-enzyme SEQUEST searches to tryptic searches. Sequences present in the International Protein Index databases allowed detection of several novel crystallins. An alternate start codon isoform of βA4 was found in human lens. The minor crystallin γN was detected for the first time in bovine and chicken lenses. Chicken γS was identified and is the first member of the γ-crystallin family observed in avian lenses

    OpenMS – An open-source software framework for mass spectrometry

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Mass spectrometry is an essential analytical technique for high-throughput analysis in proteomics and metabolomics. The development of new separation techniques, precise mass analyzers and experimental protocols is a very active field of research. This leads to more complex experimental setups yielding ever increasing amounts of data. Consequently, analysis of the data is currently often the bottleneck for experimental studies. Although software tools for many data analysis tasks are available today, they are often hard to combine with each other or not flexible enough to allow for rapid prototyping of a new analysis workflow.</p> <p>Results</p> <p>We present OpenMS, a software framework for rapid application development in mass spectrometry. OpenMS has been designed to be portable, easy-to-use and robust while offering a rich functionality ranging from basic data structures to sophisticated algorithms for data analysis. This has already been demonstrated in several studies.</p> <p>Conclusion</p> <p>OpenMS is available under the Lesser GNU Public License (LGPL) from the project website at <url>http://www.openms.de</url>.</p
    corecore