7 research outputs found

    Iterative Compression of End-to-End ASR Model using AutoML

    Full text link
    Increasing demand for on-device Automatic Speech Recognition (ASR) systems has resulted in renewed interests in developing automatic model compression techniques. Past research have shown that AutoML-based Low Rank Factorization (LRF) technique, when applied to an end-to-end Encoder-Attention-Decoder style ASR model, can achieve a speedup of up to 3.7x, outperforming laborious manual rank-selection approaches. However, we show that current AutoML-based search techniques only work up to a certain compression level, beyond which they fail to produce compressed models with acceptable word error rates (WER). In this work, we propose an iterative AutoML-based LRF approach that achieves over 5x compression without degrading the WER, thereby advancing the state-of-the-art in ASR compression

    The complete chloroplast genome of Paeonia lactiflora Pall. (Paeoniaceae)

    No full text
    The complete chloroplast genome sequence of Paeonia lactiflora was obtained by de novo assembly using next-generation sequencing. The circular molecule of the genome constituted of four parts, with 152,731 bp in the complete chloroplast genome, including a large single-copy (LSC) region of 84,402 bp, a small single-copy (SSC) region of 16,969 bp, and two inverted repeat (IRa and IRb) regions of 25,680 bp each. The genome annotation predicted a total of 111 genes, including 77 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Phylogenetic analysis showed the close taxonomical relationship to P. veitchii

    The complete chloroplast genome of Paeonia cv. Hwang-Moran (Paeoniaceae)

    No full text
    The complete chloroplast genome of Paeonia cv. Hwang-Moran (PHM), a yellow flowering tree peony, was de novo assembled and characterized from high-throughput next-generation sequencing data. The total length of the circular PHM chloroplast genome was 152,519 bp, including a large single-copy (LSC) region of 84,214 bp, a small single-copy (SSC) region of 17,026 bp, and a pair of inverted repeat regions (IRs) of 25,640 bp. The entire chloroplast genome contained 111 genes, including 77 protein-coding genes, 30 tRNAs, and four rRNAs. A phylogenetic tree constructed using the PHM and related chloroplast genome sequences revealed its close taxonomic relationship with P. ludlowii within the genus Paeonia

    The complete chloroplast genome of Paeonia suffruticosa Andrew (Paeoniaceae)

    No full text
    The complete chloroplast genome sequence of Paeonia suffruticosa was determined by next-generation sequencing. The total length of the plastid genome of P. suffruticosa is 152,811 bp, including a large single-copy region (LSC) of 84,466 bp, a small single-copy region (SSC) of 17,051 bp, and a pair of identical inverted repeat regions (IRs) of 25,647 bp. A total of 111 genes was annotated, resulting in 77 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. The phylogenetic analysis of P. suffruticosa with related chloroplast genome sequences in this study provided the close taxonomical relationship to Paeonia delavayi and Paeonia ludlowii in the genus Paeonia

    The complete chloroplast genome of Ziziphus jujuba cv. Bokjo (Rhamnaceae)

    No full text
    We have sequenced the Ziziphus jujuba cv. Bokjo chloroplast genome by de novo assembly using next-generation sequencing. The complete circular chloroplast genome consisted of 161,714 bp and contained four parts: a large single-copy (LSC) region of 89,323 bp, a small single-copy (SSC) region of 19,361 bp, and two inverted repeat regions (IRa and IRb) of 26,515 bp each. The genome annotation predicted a total of 110 genes, including 76 protein-coding genes, 30 tRNA genes, and four rRNA genes. Phylogenetic analysis demonstrated the close taxonomic relationship between Z. jujuba cv. Bokjo and two other members of the Ziziphus genus, Z. spina-christi and Z. mauritiana. We found 135 polymorphic loci, 63 single nucleotide polymorphism (SNP) and 72 insertion–deletion (InDel), from the comparison of Z. jujuba cultivar Bokjo and Z. jujuba reference (NC_030299). The polymorphic loci could be used for the differentiation of Z. jujuba genetic resources and for breeding in the future

    Development of Genome-Wide SSR Markers from Angelica gigas Nakai Using Next Generation Sequencing

    No full text
    Angelica gigas Nakai is an important medicinal herb, widely utilized in Asian countries especially in Korea, Japan, and China. Although it is a vital medicinal herb, the lack of sequencing data and efficient molecular markers has limited the application of a genetic approach for horticultural improvements. Simple sequence repeats (SSRs) are universally accepted molecular markers for population structure study. In this study, we found over 130,000 SSRs, ranging from di- to deca-nucleotide motifs, using the genome sequence of Manchu variety (MV) of A. gigas, derived from next generation sequencing (NGS). From the putative SSR regions identified, a total of 16,496 primer sets were successfully designed. Among them, we selected 848 SSR markers that showed polymorphism from in silico analysis and contained tri- to hexa-nucleotide motifs. We tested 36 SSR primer sets for polymorphism in 16 A. gigas accessions. The average polymorphism information content (PIC) was 0.69; the average observed heterozygosity (HO) values, and the expected heterozygosity (HE) values were 0.53 and 0.73, respectively. These newly developed SSR markers would be useful tools for molecular genetics, genotype identification, genetic mapping, molecular breeding, and studying species relationships of the Angelica genus
    corecore