66 research outputs found

    Subnanogram proteomics: impact of LC column selection, MS instrumentation and data analysis strategy on proteome coverage for trace samples

    Get PDF
    One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-Ī¼m-i.d. columns increase signal intensity by >3-fold relative to those using 75-Ī¼m-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos MS significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap), leading to a āˆ¼3-fold increase in peptide identifications and 1.7-fold increase in identified protein groups for 2 ng tryptic digests of the bacterium S. oneidensis. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by āˆ¼ 95% for 0.5 ng samples and by āˆ¼42% for 2 ng samples. Using the best combination of the above variables, we were able to identify >3,000 proteins from 10 ng tryptic digests from both HeLa and THP-1 mammalian cell lines. We also identified >950 proteins from subnanogram archaeal/bacterial cocultures. The present ultrasensitive LC-MS platform achieves a level of proteome coverage not previously realized for ultra-small sample loadings, and is expected to facilitate the analysis of subnanogram samples, including single mammalian cells

    Efam: an expanded, metaproteome-supported HMM profile database of viral protein families

    Get PDF
    Motivation: Viruses infect, reprogram and kill microbes, leading to profound ecosystem consequences, from elemental cycling in oceans and soils to microbiome-modulated diseases in plants and animals. Although metagenomic datasets are increasingly available, identifying viruses in them is challenging due to poor representation and annotation of viral sequences in databases. Results: Here, we establish efam, an expanded collection of Hidden Markov Model (HMM) profiles that represent viral protein families conservatively identified from the Global Ocean Virome 2.0 dataset. This resulted in 240 311 HMM profiles, each with at least 2 protein sequences, making efam >7-fold larger than the next largest, pan-ecosystem viral HMM profile database. Adjusting the criteria for viral contig confidence from 'conservative' to 'eXtremely Conservative' resulted in 37 841 HMM profiles in our efam-XC database. To assess the value of this resource, we integrated efam-XC into VirSorter viral discovery software to discover viruses from less-studied, ecologically distinct oxygen minimum zone (OMZ) marine habitats. This expanded database led to an increase in viruses recovered from every tested OMZ virome by āˆ¼24% on average (up to āˆ¼42%) and especially improved the recovery of often-missed shorter contigs (<5 kb). Additionally, to help elucidate lesser-known viral protein functions, we annotated the profiles using multiple databases from the DRAM pipeline and virion-associated metaproteomic data, which doubled the number of annotations obtainable by standard, single-database annotation approaches. Together, these marine resources (efam and efam-XC) are provided as searchable, compressed HMM databases that will be updated bi-annually to help maximize viral sequence discovery and study from any ecosystem

    Subnanogram proteomics: impact of LC column selection, MS instrumentation and data analysis strategy on proteome coverage for trace samples

    Get PDF
    One of the greatest challenges for mass spectrometry (MS)-based proteomics is the limited ability to analyze small samples. Here we investigate the relative contributions of liquid chromatography (LC), MS instrumentation and data analysis methods with the aim of improving proteome coverage for sample sizes ranging from 0.5 ng to 50 ng. We show that the LC separations utilizing 30-Ī¼m-i.d. columns increase signal intensity by >3-fold relative to those using 75-Ī¼m-i.d. columns, leading to 32% increase in peptide identifications. The Orbitrap Fusion Lumos MS significantly boosted both sensitivity and sequencing speed relative to earlier generation Orbitraps (e.g., LTQ-Orbitrap), leading to a āˆ¼3-fold increase in peptide identifications and 1.7-fold increase in identified protein groups for 2 ng tryptic digests of the bacterium S. oneidensis. The Match Between Runs algorithm of open-source MaxQuant software further increased proteome coverage by āˆ¼ 95% for 0.5 ng samples and by āˆ¼42% for 2 ng samples. Using the best combination of the above variables, we were able to identify >3,000 proteins from 10 ng tryptic digests from both HeLa and THP-1 mammalian cell lines. We also identified >950 proteins from subnanogram archaeal/bacterial cocultures. The present ultrasensitive LC-MS platform achieves a level of proteome coverage not previously realized for ultra-small sample loadings, and is expected to facilitate the analysis of subnanogram samples, including single mammalian cells

    The human brainome: network analysis identifies \u3ci\u3eHSPA2\u3c/i\u3e as a novel Alzheimerā€™s disease target

    Get PDF
    Our hypothesis is that changes in gene and protein expression are crucial to the development of late-onset Alzheimerā€™s disease. Previously we examined how DNA alleles control downstream expression of RNA transcripts and how those relationships are changed in late-onset Alzheimerā€™s disease. We have now examined how proteins are incorporated into networks in two separate series and evaluated our outputs in two different cell lines. Our pipeline included the following steps: (i) predicting expression quantitative trait loci; (ii) determining differential expression; (iii) analysing networks of transcript and peptide relationships; and (iv) validating effects in two separate cell lines. We performed all our analysis in two separate brain series to validate effects. Our two series included 345 samples in the first set (177 controls, 168 cases; age range 65ā€“105; 58% female; KRONOSII cohort) and 409 samples in the replicate set (153 controls, 141 cases, 115 mild cognitive impairment; age range 66ā€“107; 63% female; RUSH cohort). Our top target is heat shock protein family A member 2 (HSPA2), which was identified as a key driver in our two datasets. HSPA2 was validated in two cell lines, with overexpression driving further elevation of amyloid-B40 and amyloid-B42 levels in APP mutant cells, as well as significant elevation of microtubule associated protein tau and phosphorylated-tau in a modified neuroglioma line. This work further demonstrates that studying changes in gene and protein expression is crucial to understanding late onset disease and further nominates HSPA2 as a specific key regulator of late-onset Alzheimerā€™s disease processes

    In Situ Non-Destructive Temporal Measurements of the Rhizosphere Microbiome ā€˜Hot-Spotsā€™ Using Metaproteomics

    No full text
    Rhizosphere arguably embodies the most diverse microbial ecosystem on the planet, yet it is largely a functional ā€˜black boxā€™ of belowground plant-microbiome interactions. The rhizosphere is the primary site of entry for subsurface injection of fixed carbon (C) into soil with impacts on local to global scale C biogeochemistry and ultimately Earthā€™s climate. While spatial organization of rhizosphere is central to its function, small scale and steep microbial and geochemical gradients within this dynamic region make it easily disrupted by sampling. The significant challenge presented by sampling blocks elucidation of discreet functions, drivers, and interactions within rhizosphere ecosystems. Here, we describe a non-destructive sampling method linked to metaproteomic analysis in order to measure temporal shifts in the microbial composition and function of rhizosphere. A robust, non-destructive method of sampling microbial hotspots within rhizosphere provides an unperturbed window into the elusive functional interactome of this system over time and space

    Nanoproteomics comes of age

    No full text

    Time-of-flight secondary ion mass spectrometry imaging of subcellular lipid heterogeneity: Poisson counting and spatial resolution.

    No full text
    Mass spectrometric imaging is a powerful tool to interrogate biological complexity. One such technique, time-of-flight secondary ion mass spectrometry (TOF-SIMS) imaging, has been successfully utilized for subcellular imaging of cell membrane components. In order for this technique to provide insight into biological processes, it is critical to characterize the figures of merit. Because a SIMS instrument counts individual events, the precision of the measurement is controlled by counting statistics. As the analysis area decreases, the number of molecules available for analysis diminishes. This becomes critical when imaging subcellular features; it limits the information obtainable, resulting in images with only a few counts of interest per pixel. Many features observed in low intensity images are artifacts of counting statistics, making validation of these features crucial to arriving at accurate conclusions. With TOF-SIMS imaging, the experimentally attainable spatial resolution is a function of the molecule of interest, sample matrix, concentration, primary ion, instrument transmission, and spot size of the primary ion beam. A model, based on Poisson statistics, has been developed to validate SIMS imaging data when signal is limited. This model can be used to estimate the effective spatial resolution and limits of detection prior to analysis, making it a powerful tool for tailoring future investigations. In addition, the model allows comparison of pixel-to-pixel intensity and can be used to validate the significance of observed image features. The implications and capabilities of the model are demonstrated by imaging the cell membrane of resting RBL-2H3 mast cells
    • ā€¦
    corecore