78 research outputs found

    Thermodynamically based DNA strand design

    Get PDF
    We describe a new algorithm for design of strand sets, for use in DNA computations or universal microarrays. Our algorithm can design sets that satisfy any of several thermodynamic and combinatorial constraints, which aim to maximize desired hybridizations between strands and their complements, while minimizing undesired cross-hybridizations. To heuristically search for good strand sets, our algorithm uses a conflict-driven stochastic local search approach, which is known to be effective in solving comparable search problems. The PairFold program of Andronescu et al. [M. Andronescu, Z. C. Zhang and A. Condon (2005) J. Mol. Biol., 345, 987–1001; M. Andronescu, R. Aguirre-Hernandez, A. Condon, and H. Hoos (2003) Nucleic Acids Res., 31, 3416–3422.] is used to calculate the minimum free energy of hybridization between two mismatched strands. We describe new thermodynamic measures of the quality of strand sets. With respect to these measures of quality, our algorithm consistently finds, within reasonable time, sets that are significantly better than previously published sets in the literature

    Identification and Quantification of Proteoforms by Mass Spectrometry

    Get PDF
    A proteoform is a defined form of a protein derived from a given gene with a specific amino acid sequence and localized post-translational modifications. In top-down proteomic analyses, proteoforms are identified and quantified through mass spectrometric analysis of intact proteins. Recent technological developments have enabled comprehensive proteoform analyses in complex samples, and an increasing number of laboratories are adopting top-down proteomic workflows. In this review, we outline some recent advances and discuss current challenges and future directions for the field

    Enhanced protein isoform characterization through long-read proteogenomics

    Get PDF
    [Background] The detection of physiologically relevant protein isoforms encoded by the human genome is critical to biomedicine. Mass spectrometry (MS)-based proteomics is the preeminent method for protein detection, but isoform-resolved proteomic analysis relies on accurate reference databases that match the sample; neither a subset nor a superset database is ideal. Long-read RNA sequencing (e.g., PacBio or Oxford Nanopore) provides full-length transcripts which can be used to predict full-length protein isoforms.[Results] We describe here a long-read proteogenomics approach for integrating sample-matched long-read RNA-seq and MS-based proteomics data to enhance isoform characterization. We introduce a classification scheme for protein isoforms, discover novel protein isoforms, and present the first protein inference algorithm for the direct incorporation of long-read transcriptome data to enable detection of protein isoforms previously intractable to MS-based detection. We have released an open-source Nextflow pipeline that integrates long-read sequencing in a proteomic workflow for isoform-resolved analysis.[Conclusions] Our work suggests that the incorporation of long-read sequencing and proteomic data can facilitate improved characterization of human protein isoform diversity. Our first-generation pipeline provides a strong foundation for future development of long-read proteogenomics and its adoption for both basic and translational research.This work was supported by a National Institutes of Health (NIH) grant R35GM142647 (G.M.S.), NIH grant R35GM126914 (L.M.S.), and Jackson Laboratory (A.D.M.). The codeathon which initiated the project was supported by the NIH STRIDES Initiative at the NIH.Peer reviewe

    High-throughput Single-Molecule Spectroscopy in Free Solution

    No full text
    A high-speed high-throughput single-molecule imaging technique for identifying molecules in free solution based on differences in their fluorescence emission spectra is presented. Unlike previous reports, the entire spectrum, rather than selected wavelengths through optical filters, is recorded. Furthermore, the millisecond data acquisition time means that the molecules do not need to be immobilized or spatially confined. In one example, individual λDNA molecules labeled with YOYO-I, POPO-III, or a combination of the two dyes can be distinguished from one another. In another example, biotinylated 2.1- kb DNA labeled with YOYO-I was reacted with avidinconjugated R-phycoerythrin. The two different reactant molecules and the product molecule can be simultaneously imaged and identified by their spectroscopic characteristics. This technique can therefore be used for screening single molecules for disease markers and for monitoring individual molecular interactions at a rate of thousands of molecules per second

    High-Throughput Single-Molecule Spectroscopy in Free Solution

    No full text
    • …
    corecore