7,504 research outputs found

    SPsimSeq : semi-parametric simulation of bulk and single-cell RNA-sequencing data

    Get PDF
    SPsimSeq is a semi-parametric simulation method to generate bulk and single-cell RNA-sequencing data. It is designed to simulate gene expression data with maximal retention of the characteristics of real data. It is reasonably flexible to accommodate a wide range of experimental scenarios, including different sample sizes, biological signals (differential expression) and confounding batch effects

    Essential guidelines for computational method benchmarking

    Get PDF
    In computational biology and other sciences, researchers are frequently faced with a choice between several computational methods for performing data analyses. Benchmarking studies aim to rigorously compare the performance of different methods using well-characterized benchmark datasets, to determine the strengths of each method or to provide recommendations regarding suitable choices of methods for an analysis. However, benchmarking studies must be carefully designed and implemented to provide accurate, unbiased, and informative results. Here, we summarize key practical guidelines and recommendations for performing high-quality benchmarking analyses, based on our experiences in computational biology.Comment: Minor update

    Evaluating single molecule detection methods for microarrays with high dynamic range for quantitative single cell analysis

    Get PDF
    Single molecule microarrays have been used in quantitative proteomics, in particular, single cell analysis requiring high sensitivity and ultra-low limits of detection. In this paper, several image analysis methods are evaluated for their ability to accurately enumerate single molecules bound to a microarray spot. Crucially, protein abundance in single cells can vary significantly and may span several orders of magnitude. This poses a challenge to single molecule image analysis. In order to quantitatively assess the performance of each method, synthetic image datasets are generated with known ground truth whereby the number of single molecules varies over 5 orders of magnitude with a range of signal to noise ratios. Experiments were performed on synthetic datasets whereby the number of single molecules per spot corresponds to realistic single cell distributions whose ground truth summary statistics are known. The methods of image analysis are assessed in their ability to accurately estimate the distribution parameters. It is shown that super-resolution image analysis methods can significantly improve counting accuracy and better cope with single molecule congestion. The results highlight the challenge posed by quantitative single cell analysis and the implications to performing such analyses using microarray based approaches are discussed
    • …
    corecore