Search CORE

24 research outputs found

Network nodes in the model and corresponding Boolean update rules.

Author: Daniel MacLean (132098)
David J. Studholme (57620)
Publication venue
Publication date
Field of study

Network nodes in the model and corresponding Boolean update rules.</p

FigShare

Network of the hrp regulon of P. syringae.

Author: Daniel MacLean (132098)
David J. Studholme (57620)
Publication venue
Publication date
Field of study

Nodes (Blue circles) represent the proteins in the network and edges (black lines) represent regulatory interactions, arrow headed edges represent a positive regulatory interaction and T-headed edges represent a negative regulatory interaction.</p

FigShare

Attractor trees of the model for the 128 different start states.

Author: Daniel MacLean (132098)
David J. Studholme (57620)
Publication venue
Publication date
Field of study

We ran the model in synchronous mode starting from each of the 128 possible combinations of states. Each circle represents a possible state of the model and the edge indicates the state to which the model evolves on the next iteration. The tree with the terminal node labelled ‘ON’ has an attractor with the same state as the steady state of runs with the model i.e GacSGacA = True; RpoN = True; HrpV = False; HrpG = True; HrpRS = True; HrpL = True; HrpA = True; The tree with terminal node labelled ‘OFF’ has an attractor in which all states are false.</p

FigShare

States of the model at step 10 in runs with simulated knock-outs of individual genes.

Author: Daniel MacLean (132098)
David J. Studholme (57620)
Publication venue
Publication date
Field of study

We ran the model in synchronous mode for 10 steps from the initial state and simulated a knock-out of a single gene, recording the model's state at step 10. Each column represents the results from a single run with a single knocked out gene, indicated above the column, each row represents a gene. Blue colour indicates that the model showed the gene was in the ‘True’ state at step 10, no colour indicates the model showed ‘False’ for the protein at step 10.</p

FigShare

Improvement of SNP/indel calling accuracy by Coval-Refine in targeted alignment.

Author: Daniel MacLean (132098)
Kentaro Yoshida (165749)
Liliana Cano (432612)
Ryohei Terauchi (165768)
Satoshi Natsume (432603)
Shunichi Kosugi (432611)
Sophien Kamoun (6132)
Publication venue
Publication date
Field of study

The whole chromosomes (All chr), chromosome 10 (Chr10), a 1 Mb fragment of chromosome 10 (Chr10-1M: positions 1000001 to 2000000 of Chr10) from the simulated rice genome were aligned with 75-bp paired-end reads sequenced from the whole rice genome using BWA. The alignments were filtered (+, bars in dark- and middle-red and in dark- and middle-blue) or not filtered (–, bars in light red and in light blue) with Coval-Refine in the basic mode. Two different filtering conditions of Coval-Refine for mismatch reads were applied; one is the default option for removing reads with three or more mismatches (middle-red and middle-blue bars), the other removing the second paired-end mate read when the first mate is filtered and removing a read pair that contained more than two total mismatches (dark red and dark blue bars). The mean coverage of read depth before and after (indicated with parentheses) the Coval-Refine treatment is indicated under the reference chromosome name. Homozygous SNPs and indels were called as in <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0075402#pone-0075402-g001" target="_blank">Figure 1</a>. TPR and FPR for the called SNPs are shown with red and blue bars, respectively.</p

FigShare

Coval: Improving Alignment Quality and Variant Calling Accuracy for Next-Generation Sequencing Data

Author: Daniel MacLean (132098)
Kentaro Yoshida (165749)
Liliana Cano (432612)
Ryohei Terauchi (165768)
Satoshi Natsume (432603)
Shunichi Kosugi (432611)
Sophien Kamoun (6132)
Publication venue
Publication date: 08/10/2013
Field of study

<div>Accurate identification of DNA polymorphisms using next-generation sequencing technology is challenging because of a high rate of sequencing error and incorrect mapping of reads to reference genomes. Currently available short read aligners and DNA variant callers suffer from these problems. We developed the Coval software to improve the quality of short read alignments. Coval is designed to minimize the incidence of spurious alignment of short reads, by filtering mismatched reads that remained in alignments after local realignment and error correction of mismatched reads. The error correction is executed based on the base quality and allele frequency at the non-reference positions for an individual or pooled sample. We demonstrated the utility of Coval by applying it to simulated genomes and experimentally obtained short-read data of rice, nematode, and mouse. Moreover, we found an unexpectedly large number of incorrectly mapped reads in ‘targeted’ alignments, where the whole genome sequencing reads had been aligned to a local genomic segment, and showed that Coval effectively eliminated such spurious alignments. We conclude that Coval significantly improves the quality of short-read sequence alignments, thereby increasing the calling accuracy of currently available tools for SNP and indel identification. Coval is available at <a href="http://sourceforge.net/projects/coval105/" target="_blank">http://sourceforge.net/projects/coval105/</a>.</div

Directory of Open Access Journals

PubMed Central

FigShare

Number of mismatches in aligned reads.

Author: Daniel MacLean (132098)
Kentaro Yoshida (165749)
Liliana Cano (432612)
Ryohei Terauchi (165768)
Satoshi Natsume (432603)
Shunichi Kosugi (432611)
Sophien Kamoun (6132)
Publication venue
Publication date
Field of study

aThe percentage of reads with mismatches, out of the total number of aligned reads for each species or simulated reads. Aligned reads are paired-end reads of 100 bp for nematode, 76 bp for mouse, and 75 bp for the others. Artificial reads reflecting the error tendency of the rice reads were generated with a dwgsim. The total error rates (%) are indicated in the last line.</p

FigShare

Calling accuracy of SNPs from alignment data containing multiple samples.

Author: Daniel MacLean (132098)
Kentaro Yoshida (165749)
Liliana Cano (432612)
Ryohei Terauchi (165768)
Satoshi Natsume (432603)
Shunichi Kosugi (432611)
Sophien Kamoun (6132)
Publication venue
Publication date
Field of study

The experimentally obtained rice reads (60, 30, and 15 millions) were mixed with the simulated 75 bp paired-end reads (60, 90, and 105 millions) generated by dwgsim with the rice simulated genome as template, respectively, yielding 120 millions of reads. The read mixtures were aligned to the rice simulated genome, resulting in alignments with average read depth of 24×, and each read set (sample) in the read mixtures was discriminated from the other read set using the RG tag. The SNPs were called using Coval-Call with a maximum of 80 reads covering the called positions, a minimum allele frequency at the called position of 0.2 (for 50% homozygous sample), 0.1 (for 50% heterozygous and 25% homozygous samples), or 0.05 (for 25% heterozygous and 12.5% homozygous samples), a minimum of three reads (for 50% homozygous sample) or two reads (for the others) supporting the called allele.aPercentage of the experimentally obtained rice read sample in the read mixture.bHeterozygosity of the experimentally obtained rice read sample (Homo: 0% heterozygosity, Hetero: 50% heterozygosity).</p

FigShare

Improvement by Coval-Refine of SNP/indel calling accuracy of variant calling tools for mouse alignment data.

Author: Daniel MacLean (132098)
Kentaro Yoshida (165749)
Liliana Cano (432612)
Ryohei Terauchi (165768)
Satoshi Natsume (432603)
Shunichi Kosugi (432611)
Sophien Kamoun (6132)
Publication venue
Publication date
Field of study

(A) SNP calling accuracy with or without Coval-Refine. (B) Indel calling accuracy with or without Coval-Refine. A simulated mouse genome was aligned with real mouse read data using BWA. The alignments were filtered (+, striped bars) or not filtered (–, plain bars) with Coval-Refine. Homozygous SNPs and indels were called with the indicated variant callers under the same conditions as in <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0075402#pone-0075402-g001" target="_blank">Figure 1</a>.</p

FigShare

Improvement of SNP/indel calling accuracies of various DNA variant callers by Coval-Refine.

Author: Daniel MacLean (132098)
Kentaro Yoshida (165749)
Liliana Cano (432612)
Ryohei Terauchi (165768)
Satoshi Natsume (432603)
Shunichi Kosugi (432611)
Sophien Kamoun (6132)
Publication venue
Publication date
Field of study

(A) SNP calling accuracy with or without Coval-Refine. (B) Indel calling accuracy with or without Coval-Refine. The simulated rice genome was aligned with reads of the real rice genome (experimental reads) using BWA. Alignment data were filtered (+, red striped and blue striped bars) or not filtered (–, light red and light blue bars) with the Coval-Refine component (Coval-Refine, error correction mode), and homozygous SNPs and indels were called using the indicated variant callers. The SNPs and indels extracted by all the callers were further filtered under the same conditions, as described in the text. True positive rate (TPR, the number of successfully called SNPs or indels divided with the number of SNPs or indels introduced into the simulated genome, followed by multiplying with 100) is shown with light red and red striped bars, and false positive rate (FPR, the number of wrongly called SNPs or indels divided with the number of the totally called SNPs or indels, followed by multiplying with 100) with light blue and blue striped bars. The GATK pileline was carried out with (GATK BQSR) or without (GATK) the base quality score recalibration. A variant quality score recalibration in the GATK pipeline was omitted because of its unsuitability for our data. Instead it was replaced by simple filtering: a minimum allele frequency of 0.8 and a minimum allelic read depth of 2 (see Materials and Methods for details).</p

FigShare