Depth and accuracy of <i>αβ</i> pairings generated by alphabetr, for a range of overall sample sizes, sampling strategies and underlying distributions of clone sizes.
<p>Simulations were performed using <i>in silico</i> data sets of one or five plates using six different sampling strategies (see text) and different degrees of skewness in clonal frequencies, as indicated by the number of clones comprising 50% of the population when ranked by frequency. ‘Threshold’ refers to the stringency of pair association, <i>T</i> (see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1005313#sec014" target="_blank">Methods</a>). <b>(A)</b> The proportion of the most abundant 50% of clones that were identified. <b>(B)</b> The proportion of the least abundant 50% of clones that were identified. <b>(C)</b> The overall depth was influenced strongly by the tail depth, indicating that data from one plate may be sufficient for recovering the most common clones. <b>(D)</b> The rate at which CDR3<i>α</i> and CDR3<i>β</i> sequences were incorrectly paired (false positive rate, FPR).</p