46 research outputs found

    De novo assembly and characterization of transcriptome using Illumina paired-end sequencing and identification of CesA gene in ramie (Boehmeria nivea L. Gaud)

    Get PDF
    BACKGROUND: Ramie fiber, extracted from vegetative organ stem bast, is one of the most important natural fibers. Understanding the molecular mechanisms of the vegetative growth of the ramie and the formation and development of bast fiber is essential for improving the yield and quality of the ramie fiber. However, only 418 expressed tag sequences (ESTs) of ramie deposited in public databases are far from sufficient to understand the molecular mechanisms. Thus, high-throughput transcriptome sequencing is essential to generate enormous ramie transcript sequences for the purpose of gene discovery, especially genes such as the cellulose synthase (CesA) gene. RESULTS: Using Illumina paired-end sequencing, about 53 million sequencing reads were generated. De novo assembly yielded 43,990 unigenes with an average length of 824 bp. By sequence similarity searching for known proteins, a total of 34,192 (77.7%) genes were annotated for their function. Out of these annotated unigenes, 16,050 and 13,042 unigenes were assigned to gene ontology and clusters of orthologous group, respectively. Searching against the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG) indicated that 19,846 unigenes were mapped to 126 KEGG pathways, and 565 genes were assigned to http://starch and sucrose metabolic pathway which was related with cellulose biosynthesis. Additionally, 51 CesA genes involved in cellulose biosynthesis were identified. Analysis of tissue-specific expression pattern of the 51 CesA genes revealed that there were 36 genes with a relatively high expression levels in the stem bark, which suggests that they are most likely responsible for the biosynthesis of bast fiber. CONCLUSION: To the best of our knowledge, this study is the first to characterize the ramie transcriptome and the substantial amount of transcripts obtained will accelerate the understanding of the ramie vegetative growth and development mechanism. Moreover, discovery of the 36 CesA genes with relatively high expression levels in the stem bark will present an opportunity to understand the ramie bast fiber formation and development mechanisms

    Metadata Caching in Presto: Towards Fast Data Processing

    Full text link
    Presto is an open-source distributed SQL query engine for OLAP, aiming for "SQL on everything". Since open-sourced in 2013, Presto has been consistently gaining popularity in large-scale data analytics and attracting adoption from a wide range of enterprises. From the development and operation of Presto, we witnessed a significant amount of CPU consumption on parsing column-oriented data files in Presto worker nodes. This blocks some companies, including Meta, from increasing analytical data volumes. In this paper, we present a metadata caching layer, built on top of the Alluxio SDK cache and incorporated in each Presto worker node, to cache the intermediate results in file parsing. The metadata cache provides two caching methods: caching the decompressed metadata bytes from raw data files and caching the deserialized metadata objects. Our evaluation of the TPC-DS benchmark on Presto demonstrates that when the cache is warm, the first method can reduce the query's CPU consumption by 10%-20%, whereas the second method can minimize the CPU usage by 20%-40%.Comment: 5 pages, 8 figure

    Identification of an NAC Transcription Factor Family by Deep Transcriptome Sequencing in Onion (Allium cepa L.).

    No full text
    Although onion has been used extensively in the past for cytogenetic studies, molecular analysis has been lacking because the availability of genetic resources is limited. NAM, ATAF, and CUC (NAC) transcription factors (TFs) are plant-specific proteins, and they play key roles in plant growth, development, and stress tolerance. However, none of the onion NAC (CepNAC) genes had been identified thus far. In this study, the transcriptome of onion leaves was analyzed by Illumina paired-end sequencing. Approximately 102.9 million clean sequence reads were produced and used for de novo assembly, which generated 117,189 non-redundant transcripts. Of these transcripts, 39,472 were annotated for their function. In order to mine the CepNAC TFs, CepNAC genes were searched from the transcripts assembled, resulting in the identification of all 39 CepNAC genes. These 39 CepNAC proteins were subjected to phylogenetic analysis together with 47 NAC proteins of known function that were previously identified in other species. The results showed that they can be divided into five groups (NAC-I-V). Interestingly, the NAC-IV and -V groups were found to be likely related to the processes of secondary wall synthesis and stress response, respectively. The transcriptome analysis generated a substantial amount of transcripts, which will aid immensely in identifying important genes and accelerating our understanding of onion growth and development. Moreover, the discovery of 39 CepNAC TFs and the identification of the sequence conservation between them and NAC proteins published will provide a basis for further characterization and validation of their functions in the future

    Investigation on the Influence of Flow Passage Structure on the Performance of Bionic Pumps

    No full text
    The flapping hydrofoil bionic pump drives the hydrofoil to make simple harmonic motion and completes one-way water pumping in the flow passage. As a new pump device that can realize ultra-low head water delivery, the flapping hydrofoil device can effectively enrich the drainage methods of plain rivers and improve water delivery efficiency, and the passage structure is the key factor of ultra-low head devices. In this paper, the two-dimensional flow passage models are established, and the flapping of the airfoil is realized by using the dynamic grid technology. Based on the continuity equation, k-ε turbulence model, and Reynolds time-averaged equation, the flapping hydrofoil device is simulated by transient calculation. The hydraulic performance characteristics of various passages with different widths, such as square passages, micro-arc passages, and convergent–divergent passages, are calculated and simulated. The results show that, under the fixed motion parameters, the narrower the passage width, the higher the outlet velocity, lift, and efficiency of the device, the lower the flow rate. The contraction–expansion pipe can effectively improve the efficiency and flow rate of the device, and, before the wake is stable, the longer the contraction section the better the lifting effect. However, the micro-arc pipeline will affect the formation of a double-row anti-Karman vortex street, resulting in greater energy loss and in its hydraulic performance being inferior to that of the square passage

    Genetic structure and relationships of an associated population in ramie (Boehmeria nivea L. Gaud) evaluated by SSR markers

    No full text
    Ramie (Boehmeria nivea L. Gaud) is one of the most important natural fibre crops. For enhanced crop development, it is necessary to understand its population structure and genetic relationships. In this study, we assessed the genetic diversity and population structure of 134 ramie accessions (with three plants per accession) from 12 regions by using 36 simple sequence repeat markers. The 36 microsatellite primers revealed 149 alleles in 134 ramie populations, with an average of 4.14 alleles per locus. The structure analysis divided the 134 ramie accessions into three groups (I, II and III), and into further six subgroups (a, b, c, d, e and f). In Subgroup b, 13 accessions were from Guizhou Province, 9 accessions were from Sichuan Province and the remaining 20 accessions were from Chongqing (4), Hunan (8), Guangxi (4), Jiangxi (2), Yunan (1) and Taiwan (1). In Subgroup d, 22 accessions were from Guizhou Province and the remaining 17 accessions were from Chongqing (6), Sichuan (5) and Yunnan (6). It can be inferred that the genetic background of these ramie accessions did not always correlate with their geographical regions. Similar results were found in Subgroups a and f. The pair-wise genetic similarity coefficients between the 134 accessions ranged from 0.390 to 0.939, which suggested that there was abundant genetic diversity in the ramie accessions. These markers have provided important information about the genetic structure of ramie, which can contribute to future breeding and improvement programmes for these resources

    NAND Gate Computational Model based on the DNA Origami Template

    No full text
    The essence of NAND gate is the superposition of AND gate and NOT gate. The AND gate operation is performed first, and then the NOT gate is performed. It is the basis of the DNA computer. In order to realize the computing of NAND gate, a NAND gate computational model is established based on the DNA origami template. The inputs of the logic value are completed by the Hybridization Chain Reaction (HCR) on the DNA origami template. The input strands first react with the AND gate region and then react with the NOT gate region. The result of the reaction is shown by dynamically separation of the gold nanoparticles on the DNA origami template. The simulation of the model through Visual DSD shows that the system has the advantages of high feasibility

    Gene ontology classifications of assembled transcripts; the red, blue, and green histogram bar represented three main categories: biological process, molecular function, and cellular component.

    No full text
    <p>Gene ontology classifications of assembled transcripts; the red, blue, and green histogram bar represented three main categories: biological process, molecular function, and cellular component.</p
    corecore