Search CORE

26 research outputs found

Additional file 1: Table S1. of Chloroplast genomes: diversity, evolution, and applications in genetic engineering

Author: Choun-Sea Lin (173375)
Henry Daniell (18263)
Ming Yu (78996)
Wan-Jung Chang (3586448)
Publication venue
Publication date
Field of study

The chloroplast genes which are absent in specific species, their knock out phenotypes and transfer to nuclear genomes. (DOCX 23 kb

FigShare

Heterozygous variations, including heterozygous SNPs and hemizygous insertions/deletions/inversions, detected during assembly of diploid genome.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

Heterozygous variations, including heterozygous SNPs and hemizygous insertions/deletions/inversions, detected during assembly of diploid genome.</p

FigShare

A Genetic Algorithm for Diploid Genome Reconstruction Using Paired-End Sequencing

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date: 01/01/2016
Field of study

<div>The genome of many species in the biosphere is a diploid consisting of paternal and maternal haplotypes. The differences between these two haplotypes range from single nucleotide polymorphisms (SNPs) to large-scale structural variations (SVs). Existing genome assemblers for next-generation sequencing platforms attempt to reconstruct one consensus sequence, which is a mosaic of two parental haplotypes. Reconstructing paternal and maternal haplotypes is an important task in linkage analysis and association studies. This study designs and implemented HapSVAssembler on the basis of Genetic Algorithm (GA) and paired-end sequencing. The proposed method builds a consensus sequence, identifies various types of heterozygous variants, and reconstructs the paternal and maternal haplotypes by solving an optimization problem with a GA algorithm. Experimental results indicate that the HapSVAssembler has high accuracy and contiguity under various sequencing coverage, error rates, and insert sizes. The program is tested on pilot sequencing of a highly heterozygous genome, and 12,781 heterozygous SNPs and 602 hemizygous SVs are identified. We observe that, although the number of SVs is much less than that of SNPs, the genomic regions occupied by SVs are much larger, implying the heterozygosity computed using SNPs or k-mer spectrum may be under-estimated.</div

Directory of Open Access Journals

PubMed Central

FigShare

Assembly accuracy and contiguity for different sequencing coverage and error rates.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

(a) The accuracy higher than 90% can be obtained with low error rate simulations even in low coverage; (b) The comparison of N10/N50 for different sequencing coverage.</p

FigShare

Identification of insertions or deletions.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

A discordant read rj is mapped on the reference with two mapping locis, and . The spanning region of rj is from to . And the potential breakpoint pair of SVi is initialized from to .</p

FigShare

Illustration of breakpoint reads across SV boundaries.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

(a) A breakpoint Read rj whose right end matches perfectly first 4 nucleotides whether the remainder bases are mismatched with the reference. The guessing breakpoint can be inferred at the 4th base of the right end on rj; (b) The actual breakpoints of SV can be determined by breakpoint reads.</p

FigShare

The accuracy for different genome size and read length.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

The paternal and maternal genomes differes in 1% SNPs. The mean insert size is 250bp with 25bp standard deviation, the sequencing coverage is 20X, and the sequencing error rate is 1%. (a) The accuracy for different genome sizes; (b) The accuracy for different read lengths.</p

FigShare

Flowchart of hybrid de novo assembly approach.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

The flowchart of the de novo assembly using hybrid approach with.</p

FigShare

Illustration of converting paired-reads to SNP matrix and SV matrix.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

(a) Paired-end read r1 and r2 both contain SNPs but r3 does not, therefore, r1 and r2 can be successfully converted to read fragment f1 and f2 respectively. SNP s2 is covered by r2, and the allele at s2 can be obtained by the 4-th nucleotide on r2; (b) Single-end mapped read r1 and r2 whose unmapped ends are overlapping with sv1 (e.g., a deletion), both of and can be assigned by 1.</p

FigShare

Illustration of extended Haplotype blocks via heterozygous SVs.

Author: Choun-Sea Lin (173375)
Chuan-Kang Ting (3353495)
Jian-Wei Chen (452345)
Ming-Tsai Chan (3353498)
Sheng-Yu Chuang (3353501)
Yao-Ting Huang (2436382)
Publication venue
Publication date
Field of study

One end is represented by a solid arrow and two ends from the same read are connected by a dotted line. There is a heterozygous SV1 between SNP10 and SNP11. (a) Without considering SVs, the entire haplotype will be broken into three haplotype blocks; (b) In our approach, Block2 and Block3 in (a) are merged by bridging read x, y in Block2 and bridging read z in Block3 that indicate heterozygous SV1.</p

FigShare