30 research outputs found

    Detecting Differential and Correlated Protein Expression in Label-Free Shotgun Proteomics

    No full text
    Recent studies have revealed a relationship between protein abundance and sampling statistics, such as sequence coverage, peptide count, and spectral count, in label-free liquid chromatography−tandem mass spectrometry (LC−MS/MS) shotgun proteomics. The use of sampling statistics offers a promising method of measuring relative protein abundance and detecting differentially expressed or coexpressed proteins. We performed a systematic analysis of various approaches to quantifying differential protein expression in eukaryotic Saccharomyces cerevisiae and prokaryotic Rhodopseudomonas palustris label-free LC−MS/MS data. First, we showed that, among three sampling statistics, the spectral count has the highest technical reproducibility, followed by the less-reproducible peptide count and relatively nonreproducible sequence coverage. Second, we used spectral count statistics to measure differential protein expression in pairwise experiments using five statistical tests:  Fisher's exact test, G-test, AC test, t-test, and LPE test. Given the S. cerevisiae data set with spiked proteins as a benchmark and the false positive rate as a metric, our evaluation suggested that the Fisher's exact test, G-test, and AC test can be used when the number of replications is limited (one or two), whereas the t-test is useful with three or more replicates available. Third, we generalized the G-test to increase the sensitivity of detecting differential protein expression under multiple experimental conditions. Out of 1622 identified R. palustris proteins in the LC−MS/MS experiment, the generalized G-test detected 1119 differentially expressed proteins under six growth conditions. Finally, we studied correlated expression of these 1119 proteins by analyzing pairwise expression correlations and by delineating protein clusters according to expression patterns. Through pairwise expression correlation analysis, we demonstrated that proteins co-located in the same operon were much more strongly coexpressed than those from different operons. Combining cluster analysis with existing protein functional annotations, we identified six protein clusters with known biological significance. In summary, the proposed generalized G-test using spectral count sampling statistics is a viable methodology for robust quantification of relative protein abundance and for sensitive detection of biologically significant differential protein expression under multiple experimental conditions in label-free shotgun proteomics. Keywords: label-free • LC−MS/MS • shotgun proteomics • differential expression • correlated expression • clustering • Saccharomyces cerevisiae • Rhodopseudomonas palustris</i

    Detecting Differential and Correlated Protein Expression in Label-Free Shotgun Proteomics

    No full text
    Recent studies have revealed a relationship between protein abundance and sampling statistics, such as sequence coverage, peptide count, and spectral count, in label-free liquid chromatography−tandem mass spectrometry (LC−MS/MS) shotgun proteomics. The use of sampling statistics offers a promising method of measuring relative protein abundance and detecting differentially expressed or coexpressed proteins. We performed a systematic analysis of various approaches to quantifying differential protein expression in eukaryotic Saccharomyces cerevisiae and prokaryotic Rhodopseudomonas palustris label-free LC−MS/MS data. First, we showed that, among three sampling statistics, the spectral count has the highest technical reproducibility, followed by the less-reproducible peptide count and relatively nonreproducible sequence coverage. Second, we used spectral count statistics to measure differential protein expression in pairwise experiments using five statistical tests:  Fisher's exact test, G-test, AC test, t-test, and LPE test. Given the S. cerevisiae data set with spiked proteins as a benchmark and the false positive rate as a metric, our evaluation suggested that the Fisher's exact test, G-test, and AC test can be used when the number of replications is limited (one or two), whereas the t-test is useful with three or more replicates available. Third, we generalized the G-test to increase the sensitivity of detecting differential protein expression under multiple experimental conditions. Out of 1622 identified R. palustris proteins in the LC−MS/MS experiment, the generalized G-test detected 1119 differentially expressed proteins under six growth conditions. Finally, we studied correlated expression of these 1119 proteins by analyzing pairwise expression correlations and by delineating protein clusters according to expression patterns. Through pairwise expression correlation analysis, we demonstrated that proteins co-located in the same operon were much more strongly coexpressed than those from different operons. Combining cluster analysis with existing protein functional annotations, we identified six protein clusters with known biological significance. In summary, the proposed generalized G-test using spectral count sampling statistics is a viable methodology for robust quantification of relative protein abundance and for sensitive detection of biologically significant differential protein expression under multiple experimental conditions in label-free shotgun proteomics. Keywords: label-free • LC−MS/MS • shotgun proteomics • differential expression • correlated expression • clustering • Saccharomyces cerevisiae • Rhodopseudomonas palustris</i

    Detecting Differential and Correlated Protein Expression in Label-Free Shotgun Proteomics

    No full text
    Recent studies have revealed a relationship between protein abundance and sampling statistics, such as sequence coverage, peptide count, and spectral count, in label-free liquid chromatography−tandem mass spectrometry (LC−MS/MS) shotgun proteomics. The use of sampling statistics offers a promising method of measuring relative protein abundance and detecting differentially expressed or coexpressed proteins. We performed a systematic analysis of various approaches to quantifying differential protein expression in eukaryotic Saccharomyces cerevisiae and prokaryotic Rhodopseudomonas palustris label-free LC−MS/MS data. First, we showed that, among three sampling statistics, the spectral count has the highest technical reproducibility, followed by the less-reproducible peptide count and relatively nonreproducible sequence coverage. Second, we used spectral count statistics to measure differential protein expression in pairwise experiments using five statistical tests:  Fisher's exact test, G-test, AC test, t-test, and LPE test. Given the S. cerevisiae data set with spiked proteins as a benchmark and the false positive rate as a metric, our evaluation suggested that the Fisher's exact test, G-test, and AC test can be used when the number of replications is limited (one or two), whereas the t-test is useful with three or more replicates available. Third, we generalized the G-test to increase the sensitivity of detecting differential protein expression under multiple experimental conditions. Out of 1622 identified R. palustris proteins in the LC−MS/MS experiment, the generalized G-test detected 1119 differentially expressed proteins under six growth conditions. Finally, we studied correlated expression of these 1119 proteins by analyzing pairwise expression correlations and by delineating protein clusters according to expression patterns. Through pairwise expression correlation analysis, we demonstrated that proteins co-located in the same operon were much more strongly coexpressed than those from different operons. Combining cluster analysis with existing protein functional annotations, we identified six protein clusters with known biological significance. In summary, the proposed generalized G-test using spectral count sampling statistics is a viable methodology for robust quantification of relative protein abundance and for sensitive detection of biologically significant differential protein expression under multiple experimental conditions in label-free shotgun proteomics. Keywords: label-free • LC−MS/MS • shotgun proteomics • differential expression • correlated expression • clustering • Saccharomyces cerevisiae • Rhodopseudomonas palustris</i

    Detecting Differential and Correlated Protein Expression in Label-Free Shotgun Proteomics

    No full text
    Recent studies have revealed a relationship between protein abundance and sampling statistics, such as sequence coverage, peptide count, and spectral count, in label-free liquid chromatography−tandem mass spectrometry (LC−MS/MS) shotgun proteomics. The use of sampling statistics offers a promising method of measuring relative protein abundance and detecting differentially expressed or coexpressed proteins. We performed a systematic analysis of various approaches to quantifying differential protein expression in eukaryotic Saccharomyces cerevisiae and prokaryotic Rhodopseudomonas palustris label-free LC−MS/MS data. First, we showed that, among three sampling statistics, the spectral count has the highest technical reproducibility, followed by the less-reproducible peptide count and relatively nonreproducible sequence coverage. Second, we used spectral count statistics to measure differential protein expression in pairwise experiments using five statistical tests:  Fisher's exact test, G-test, AC test, t-test, and LPE test. Given the S. cerevisiae data set with spiked proteins as a benchmark and the false positive rate as a metric, our evaluation suggested that the Fisher's exact test, G-test, and AC test can be used when the number of replications is limited (one or two), whereas the t-test is useful with three or more replicates available. Third, we generalized the G-test to increase the sensitivity of detecting differential protein expression under multiple experimental conditions. Out of 1622 identified R. palustris proteins in the LC−MS/MS experiment, the generalized G-test detected 1119 differentially expressed proteins under six growth conditions. Finally, we studied correlated expression of these 1119 proteins by analyzing pairwise expression correlations and by delineating protein clusters according to expression patterns. Through pairwise expression correlation analysis, we demonstrated that proteins co-located in the same operon were much more strongly coexpressed than those from different operons. Combining cluster analysis with existing protein functional annotations, we identified six protein clusters with known biological significance. In summary, the proposed generalized G-test using spectral count sampling statistics is a viable methodology for robust quantification of relative protein abundance and for sensitive detection of biologically significant differential protein expression under multiple experimental conditions in label-free shotgun proteomics. Keywords: label-free • LC−MS/MS • shotgun proteomics • differential expression • correlated expression • clustering • Saccharomyces cerevisiae • Rhodopseudomonas palustris</i

    Genome-wide DNA methylation analysis reveals loci that distinguish different types of adipose tissue in obese individuals

    No full text
    Background: Epigenetic mechanisms provide an interface between environmental factors and the genome and are known to play a role in complex diseases such as obesity. These mechanisms, including DNA methylation, influence the regulation of development, differentiation and the establishment of cellular identity. Here we employ two approaches to identify differential methylation between two white adipose tissue depots in obese individuals before and after gastric bypass and significant weight loss. We analyse genome-wide DNA methylation data using (a) traditional paired t tests to identify significantly differentially methylated loci (Bonferroni-adjusted P≤1×10−7) and (b) novel combinatorial algorithms to identify loci that differentiate between tissue types. Results: Significant differential methylation was observed for 3239 and 7722 CpG sites, including 784 and 1129 extended regions, between adipose tissue types before and after significant weight loss, respectively. The vast majority of these extended differentially methylated regions (702) were consistent across both time points and enriched for genes with a role in transcriptional regulation and/or development (e.g. homeobox genes). Other differentially methylated loci were only observed at one time point and thus potentially highlight genes important to adipose tissue dysfunction observed in obesity. Strong correlations (r>0.75, P≤0.001) were observed between changes in DNA methylation (subcutaneous adipose vs omentum) and changes in clinical trait, in particular for CpG sites within PITX2 and fasting glucose and four CpG sites within ISL2 and HDL. A single CpG site (cg00838040, ATP2C2) gave strong tissue separation, with validation in independent subcutaneous (n=681) and omental (n=33) adipose samples. Conclusions: This is the first study to report a genome-wide DNA methylome comparison of subcutaneous abdominal and omental adipose before and after weight loss. The combinatorial approach we utilised is a powerful tool for the identification of methylation loci that strongly differentiate between these tissues. This study provides a solid basis for future research focused on the development of adipose tissue and its potential dysfunction in obesity, as well as the role DNA methylation plays in these processes. </div

    Relationship between ErGeNs and counterpart networks in RMA expression data.

    No full text
    <p>Both S-score networks, ErGeN1 and ErGeN2, had counterpart networks in the basal saline and post-ethanol expression data: ErGeN1 significantly overlapped with saline network 1 and ethanol network 1; ErGeN3 significantly overlapped with saline network 4 and ethanol network 2 (<a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0033575#pone.0033575.s003" target="_blank">Figure S3</a>). Each point represents a gene that belongs to a given ErGeN's counterpart saline network (blue), ethanol network (red) or both (green). Filled-in points indicate the gene also belongs to the overlapping ErGeN. The X- and Y-axes measure gene connectivity (|Pearson correlation coefficient|≥0.7) within the saline and ethanol expression datasets, respectively.</p

    Acute ethanol transcriptional response profiles.

    No full text
    <p>(<b>A</b>) Strain frequency distributions of gene transcriptional-response classes based on PFC S-score analysis. S-scores >2 indicate a gene was up-regulated by acute ethanol, S-scores <2 indicate down-regulation and S-scores between these thresholds were considered unchanged. (<b>B</b>) S-score strain distributions for three significantly ethanol responsive genes that each represent a different class of ethanol response profile.</p

    Conserved pathways between process of cellular senescence and functional decline.

    No full text
    A group of 10 genes was identified as common to functional decline and senescence. Many of these genes interact in the MAP kinase pathway, as shown by this protein-protein interaction plot from STRING.</p
    corecore