Search CORE

14 research outputs found

NQS filtering improves fit of probability model to data.

Author: Alexander R. Macalalad (177162)
Bruce W. Birren (147656)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Doug E. Brackney (177174)
Elizabeth M. Ryan (177168)
Gregory D. Ebel (177185)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Kendra N. Pesko (177178)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Niall J. Lennon (177164)
Patrick Charlebois (177163)
Ruchi M. Newman (177165)
Todd M. Allen (177189)
Publication venue
Publication date
Field of study

(A) Quantile-quantile (q-q) plots under NQS filtering show good fit of the probability model to the observed distribution of errors. Since the probability model is discrete, p values are projected onto a uniform distribution, and the distribution of projected p values is compared with the expected null distribution. See <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002417#s4" target="_blank">Materials and Methods</a> section for details. (B) In contrast, q-q plots under no filtering show that no filtering skews the calibration of the probability model used by V-Phaser. Q-q plots of models based on subsets of the reads demonstrate that this effect becomes more pronounced with increasing coverage (see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002417#pcbi.1002417.s001" target="_blank">Figure S1</a>). Q-q plots are scaled to fit curve, so y = x line is not at a 45 degree angle.</p

FigShare

Phase information increased sensitivity, and base quality scores increased specificity.

Author: Alexander R. Macalalad (177162)
Bruce W. Birren (147656)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Doug E. Brackney (177174)
Elizabeth M. Ryan (177168)
Gregory D. Ebel (177185)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Kendra N. Pesko (177178)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Niall J. Lennon (177164)
Patrick Charlebois (177163)
Ruchi M. Newman (177165)
Todd M. Allen (177189)
Publication venue
Publication date
Field of study

We compared V-Phaser to alternate versions of V-Phaser with specific components disabled. In the No Phase version, V-Phaser called variants without phase information. In the Uniform Errors version, V-Phaser estimated uniform error rates within homopolymer and nonhomopolymer regions without regard to assigned base qualities. In the No Filtering version, V-Phaser did not filter out low quality bases. (A) Phase information increased sensitivity. The version without phase information attained a sensitivity of 90%, but all other versions of V-Phaser used phase information and attained a sensitivity of 97% or more. We calculated sensitivity as the percentage of known variants correctly identified. Data are from WNV mixed population control dataset. (B) Individual base quality scores increased specificity. Among loci with mismatches, the Uniform Errors version had only 91% specificity, but all other versions incorporated base quality scores in their probability model and attained 97% specificity or more. We calculated specificity as the percentage of loci in the control sample correctly identified as having no variants among loci that had at least one candidate variant. Data are from infectious clone (HIV NL4-3) control dataset.</p

FigShare

Error rates were not uniformly distributed.

Author: Alexander R. Macalalad (177162)
Bruce W. Birren (147656)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Doug E. Brackney (177174)
Elizabeth M. Ryan (177168)
Gregory D. Ebel (177185)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Kendra N. Pesko (177178)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Niall J. Lennon (177164)
Patrick Charlebois (177163)
Ruchi M. Newman (177165)
Todd M. Allen (177189)
Publication venue
Publication date
Field of study

Error rates varied by (A) read position, (B) base transition, and (C) base quality score. We counted as errors any mismatches to the consensus assembly for each of the two runs in the control read set under the assumption that the NL-43 infectious clone had no diversity. We defined the read position relative to the beginning or end of the read, whichever was closer. We defined a base transition as a dinucleotide representing the transition from the preceding base to the current base, and we scored a transition as an error if the current base was a mismatch. Base quality scores came from the sequencing process.</p

FigShare

Phase increased sensitivity to detect variants.

Author: Alexander R. Macalalad (177162)
Bruce W. Birren (147656)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Doug E. Brackney (177174)
Elizabeth M. Ryan (177168)
Gregory D. Ebel (177185)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Kendra N. Pesko (177178)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Niall J. Lennon (177164)
Patrick Charlebois (177163)
Ruchi M. Newman (177165)
Todd M. Allen (177189)
Publication venue
Publication date
Field of study

Phase increased sensitivity to detect variants, as seen over a range of error rates at coverages of 100-fold, 250-fold, and 500-fold. The phased variant detection threshold frequency (VDTF) is the lowest frequency of reads with variants at two specific loci that V-Phaser can distinguish from error among reads that span both loci. The unphased VDTF is the lowest frequency of one variant that V-Phaser can distinguish from error among reads that cover that locus. 100-fold phased sequence coverage achieves comparable detection thresholds as 500-fold unphased. We use Equation 7 to calculate the phased and unphased VDTFs. (See the <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002417#s4" target="_blank">Materials and Methods</a> section for Equation 7 and its derivation.)</p

FigShare

Phase information increased sensitivity to detect minor variants.

Author: Alexander R. Macalalad (177162)
Bruce W. Birren (147656)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Doug E. Brackney (177174)
Elizabeth M. Ryan (177168)
Gregory D. Ebel (177185)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Kendra N. Pesko (177178)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Niall J. Lennon (177164)
Patrick Charlebois (177163)
Ruchi M. Newman (177165)
Todd M. Allen (177189)
Publication venue
Publication date
Field of study

Phase information increased sensitivity to detect low frequency variants, as shown by these histograms of variants under 2.5%. All versions of V-Phaser detected 100% of the variants above 2.5% frequency, so these variants are not shown here. All versions of V-Phaser with phase information (A), (C), and (D) detected most variants below 1% in frequency, but the No Phase version (B) missed many variants below 1% and some variants as high as 2.5%. Data are from control WNV mixed population.</p

FigShare

Comparison of V-Phaser to other viral variant callers.

Author: Alexander R. Macalalad (177162)
Bruce W. Birren (147656)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Doug E. Brackney (177174)
Elizabeth M. Ryan (177168)
Gregory D. Ebel (177185)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Kendra N. Pesko (177178)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Niall J. Lennon (177164)
Patrick Charlebois (177163)
Ruchi M. Newman (177165)
Todd M. Allen (177189)
Publication venue
Publication date
Field of study

Sensitivities and specificities reported across residues interrogated by all programs. Sensitivity is measured as the fraction of the known variants found by each program in the WNV mixed population control data set. Specificity is the fraction of sites not containing known variants that were called as invariant in the HIV NL4-3 control data set; values reported in parentheses include inserted and deleted bases (see <a href="http://www.ploscompbiol.org/article/info:doi/10.1371/journal.pcbi.1002417#s4" target="_blank">Materials and Methods</a>).</p

FigShare

Comparison of sequence variant quantification by 454 deep sequencing and by PCR cloning/sequencing.

Author: Aaron M. Berlin (178709)
Adrianne D. Gladden (178756)
Alexander R. Macalalad (177162)
Allyson K. Bloom (178731)
Andrew Berical (178718)
Bruce D. Walker (64224)
Bruce W. Birren (147656)
Carmen Zedlack (178776)
Chanson J. Brumme (141876)
Christian Brander (64220)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Christoph Hess (178783)
Damien Tully (178742)
Elizabeth M. Ryan (177168)
Eric Rosenberg (178814)
Florencia Pereyra (178819)
Heiko Jessen (51152)
Hendrik Streeck (91376)
Huldrych F. Günthard (173082)
Jake P. Tinsley (178800)
Jenna Rychert (178796)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Karen L. Axten (178752)
Ken H. Mayer (178808)
Laura Battis (178759)
Lisa M. Green (178715)
Marcus Altfeld (10455)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Michael Kemper (178762)
Monica Casali (178725)
Niall J. Lennon (177164)
Olivier Gasser (178779)
Patrick Charlebois (177163)
Qiandong Zeng (178764)
Rachel L. Erlich (178712)
Ruchi Newman (178748)
Sante Gnerre (89816)
Sarah K. Young (178826)
Sharvari Gujja (178772)
Suzane Bazner (178793)
Terrance P. Shea (178767)
Tim Dudek (178737)
Todd M. Allen (177189)
Yaoyu Wang (178722)
Zabrina L. Brumme (178788)
Publication venue
Publication date
Field of study

Orthogonal regression of variant frequency estimates obtained by 454 and clonal sequence data across the highly variable 1544 nucleotide region spanning Vif to Tat in subject 9213 (slope = 1.01; 95% CI, 0.73 to 1.40).</p

FigShare

Viral escape from acute and chronic phase CD8+ T cell responses.

Author: Aaron M. Berlin (178709)
Adrianne D. Gladden (178756)
Alexander R. Macalalad (177162)
Allyson K. Bloom (178731)
Andrew Berical (178718)
Bruce D. Walker (64224)
Bruce W. Birren (147656)
Carmen Zedlack (178776)
Chanson J. Brumme (141876)
Christian Brander (64220)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Christoph Hess (178783)
Damien Tully (178742)
Elizabeth M. Ryan (177168)
Eric Rosenberg (178814)
Florencia Pereyra (178819)
Heiko Jessen (51152)
Hendrik Streeck (91376)
Huldrych F. Günthard (173082)
Jake P. Tinsley (178800)
Jenna Rychert (178796)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Karen L. Axten (178752)
Ken H. Mayer (178808)
Laura Battis (178759)
Lisa M. Green (178715)
Marcus Altfeld (10455)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Michael Kemper (178762)
Monica Casali (178725)
Niall J. Lennon (177164)
Olivier Gasser (178779)
Patrick Charlebois (177163)
Qiandong Zeng (178764)
Rachel L. Erlich (178712)
Ruchi Newman (178748)
Sante Gnerre (89816)
Sarah K. Young (178826)
Sharvari Gujja (178772)
Suzane Bazner (178793)
Terrance P. Shea (178767)
Tim Dudek (178737)
Todd M. Allen (177189)
Yaoyu Wang (178722)
Zabrina L. Brumme (178788)
Publication venue
Publication date
Field of study

Stacked heat-maps illustrate variant codon frequencies over time for each residue of the CD8 epitopes targeted by subject 9213. Shown are epitopes targeted during the acute (Day 59) and chronic (Day 476) phases of HIV-1 infection. The baseline sequence is shown at the top of each epitope, with non-HIV-1B consensus residues highlighted in blue. The magnitude of each response is shown in SFC per million PBMC.</p

FigShare

Rapidly expanding sequence diversity during HIV-1 infection.

Author: Aaron M. Berlin (178709)
Adrianne D. Gladden (178756)
Alexander R. Macalalad (177162)
Allyson K. Bloom (178731)
Andrew Berical (178718)
Bruce D. Walker (64224)
Bruce W. Birren (147656)
Carmen Zedlack (178776)
Chanson J. Brumme (141876)
Christian Brander (64220)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Christoph Hess (178783)
Damien Tully (178742)
Elizabeth M. Ryan (177168)
Eric Rosenberg (178814)
Florencia Pereyra (178819)
Heiko Jessen (51152)
Hendrik Streeck (91376)
Huldrych F. Günthard (173082)
Jake P. Tinsley (178800)
Jenna Rychert (178796)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Karen L. Axten (178752)
Ken H. Mayer (178808)
Laura Battis (178759)
Lisa M. Green (178715)
Marcus Altfeld (10455)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Michael Kemper (178762)
Monica Casali (178725)
Niall J. Lennon (177164)
Olivier Gasser (178779)
Patrick Charlebois (177163)
Qiandong Zeng (178764)
Rachel L. Erlich (178712)
Ruchi Newman (178748)
Sante Gnerre (89816)
Sarah K. Young (178826)
Sharvari Gujja (178772)
Suzane Bazner (178793)
Terrance P. Shea (178767)
Tim Dudek (178737)
Todd M. Allen (177189)
Yaoyu Wang (178722)
Zabrina L. Brumme (178788)
Publication venue
Publication date
Field of study

Heat maps illustrate sites exhibiting amino acid sequence diversity at days 0, 3, 59, 165, 476 and 1543 post-presentation. Plotted is the percentage of amino acid diversity at each position with respect to the dominant baseline (day 0) amino acid residue. All 3174 amino acids of HIV-1 are represented, with the first amino acid of Gag located in the top left corner of the grid and the last amino acid of Nef located in the bottom right corner. Completely conserved residues are dark blue, low-level variant residues (<10% divergent from baseline) are light blue, moderately variable residues (10–50%) in orange, and highly variant residues (>50%) in red. (A) 0 days p.p., (B) 3 days p.p., (C) 59 days p.p., (D) 165 days p.p., (E) 476 days p.p., (F) 1543 days p.p..</p

FigShare

Limited evolution in the HIV-1 proteome prior to establishment of viral set point.

Author: Aaron M. Berlin (178709)
Adrianne D. Gladden (178756)
Alexander R. Macalalad (177162)
Allyson K. Bloom (178731)
Andrew Berical (178718)
Bruce D. Walker (64224)
Bruce W. Birren (147656)
Carmen Zedlack (178776)
Chanson J. Brumme (141876)
Christian Brander (64220)
Christian L. Boutwell (177170)
Christine M. Malboeuf (177167)
Christoph Hess (178783)
Damien Tully (178742)
Elizabeth M. Ryan (177168)
Eric Rosenberg (178814)
Florencia Pereyra (178819)
Heiko Jessen (51152)
Hendrik Streeck (91376)
Huldrych F. Günthard (173082)
Jake P. Tinsley (178800)
Jenna Rychert (178796)
Joshua Z. Levin (177182)
Karen A. Power (177172)
Karen L. Axten (178752)
Ken H. Mayer (178808)
Laura Battis (178759)
Lisa M. Green (178715)
Marcus Altfeld (10455)
Matthew R. Henn (103220)
Michael C. Zody (155402)
Michael Kemper (178762)
Monica Casali (178725)
Niall J. Lennon (177164)
Olivier Gasser (178779)
Patrick Charlebois (177163)
Qiandong Zeng (178764)
Rachel L. Erlich (178712)
Ruchi Newman (178748)
Sante Gnerre (89816)
Sarah K. Young (178826)
Sharvari Gujja (178772)
Suzane Bazner (178793)
Terrance P. Shea (178767)
Tim Dudek (178737)
Todd M. Allen (177189)
Yaoyu Wang (178722)
Zabrina L. Brumme (178788)
Publication venue
Publication date
Field of study

Sequence diversity is plotted for all evolving codons in each HIV-1 protein as the percent of sequences with an amino acid residue different from the dominant baseline residue. Colored lines denote individual evolving amino acid residues within each protein. The time of infection prior to the establishment of viral set point (day 165) is highlighted in grey.</p

FigShare