Search CORE

242 research outputs found

W-curve Z-Axis view of complete HIV-1 Genome showing all points (A), points near origin with radius <.5 (B) and further out, with radius >.5 (C).

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

The last group of points (C) is used to re-align the curves after a gap. CG dinucleotide deficiencies are seen in B and C.</p

The Francis Crick Institute

The W-curve for “CG” showing both Cartesian & Cylindrical notation for the points.

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

The curve is shown in blue, layout lines indicating the X-Y locations and line for half-distance rule between the points for C & G are shown in red.</p

The Francis Crick Institute

Autoregression in the W-curve after a gap inserted into sequences.

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

The gap (green, right panel) offsets the matching portion of the right curve by seven bases. After three bases for the curves to converge (shown in blue) the curves converge again. Comparing bases 1204 and 1211 (gap + convergence window) will show the curves aligning.</p

The Francis Crick Institute

W-curves of a whole HIV-1 genome (A) and embedded pol gene sequence (B, C).

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

A W-curve of an entire genome of HIV-1 01TH.OUR6091 (Accession number AY358040 is shown in panel A. Panel B shows a “zoomed in” projection of pol gene HIV-1 99TH.OUR1991 (Accession number AY358039) with respect to base pair position in the whole genome. Panel C shows the same pol sequence extracted from the fasta file and renumbered with respect to base pair position. Sequences can be input into one of two graphical packages for the W-curve existing on the internet (10, 11).</p

The Francis Crick Institute

Difference measure for W-curves stored in Cylindrical notation.

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

Subtracting the projection of the smaller radius onto the larger one smooths out small differences after the curves have largely converged. For small angles (shown) the projection is subtracted and produces a small difference. As the angle increases the projection becomes small; points in opposite quadrants have obtuse angles with a negative cosine, adding the projection onto the larger radius.</p

The Francis Crick Institute

W-curve nucleotide coordinate positions (left) and W-curve projection of each nucleotide (right).

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

The W-curve is generated using a square centered at the origin with corners on the axes (left). Each moving point moves halfway from a starting point P to P' halfway to the corner for the next base in sequence (right). The numbers iterate within the square as follows: P'(T) = [(Px+1)/2, (Py)/2]; P'(A) = [(Px)/2, (Py+1)/2]; P'(G) = [(Px–1)/2], (Py)/2]; P'(C) = [(Px)/2, (Py–1)/2]</p

The Francis Crick Institute

Benchmark results for sequence comparison using the W-curve.

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

For each group, the number of sequences determines how many comparisons must be made, average number of bases in each sample can be used to estimate the processing rate in terms of the sequence lengths (vs. count of sequences). These times were taken from processing the upper-triangular distance matrices processed by neighbor and drawgram to produce the phenograms from the Mother/Infant study data <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0010829#pone.0010829-httpwwwbioinformaticsorgwcurve1" target="_blank">[11]</a>. Variations in rates are largely due to timesharing overhead, which reflects the likely environment of any clinical application. The data was measured using Perl-5.10.1 on linux-2.6.31 on an Asus M3N-HT (i.e., commodity desktop) motherboard with 4GB RAM and AMD Phenom 3.0 GHz processor.</p

The Francis Crick Institute

Building a fragment library.

Author: Douglas J. Cork (364364)
Jerome H. Kim (132312)
Merlin L. Robb (132305)
Sodsai Tovanabutra (364366)
Steven Lembark (364365)
Publication venue
Publication date
Field of study

Panels A, B, C, D describe the process of extracting curve fragments. Autoregression allows re-use of the curve fragments for comparisons between curves. Starting with the W-curve for a genome or gene (A), regions corresponding to the sequences of interest are found (B) and the remainder of the curve dropped (C), leaving a set of smaller curves (D). The set of curve fragments can be used to search for a list of regions in or score only part of a gene. For example, scoring only the conserved regions of gp120 may prove more effective for generating phenograms than using the entire gp120 sequence or env gene.</p

The Francis Crick Institute

Clinical Chemistry reference ranges (median and 95th-percentile) derived from HIV-seronegative adults in Kericho, Kenya.

Author: Christian T. Bautista (248661)
Deborah L. Birx (376470)
Douglas N. Shaffer (376469)
Frederick K. Sawe (376468)
Mark S. de Souza (147332)
Merlin L. Robb (132305)
Nelson L. Michael (132309)
Paul T. Scott (112131)
Rukia S. Kibaya (376467)
Warren B. Sateren (375572)
Publication venue
Publication date
Field of study

a: P value of <0.05 using the Wilcoxon rank test.</p

The Francis Crick Institute

Hematology reference ranges from Kericho, Kenya compared to other sources in Africa and the United States of America.

Author: Christian T. Bautista (248661)
Deborah L. Birx (376470)
Douglas N. Shaffer (376469)
Frederick K. Sawe (376468)
Mark S. de Souza (147332)
Merlin L. Robb (132305)
Nelson L. Michael (132309)
Paul T. Scott (112131)
Rukia S. Kibaya (376467)
Warren B. Sateren (375572)
Publication venue
Publication date
Field of study

a: NA-Not Available.</p

The Francis Crick Institute