33 research outputs found

    GBParsy: A GenBank flatfile parser library with high speed

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>GenBank flatfile (GBF) format is one of the most popular sequence file formats because of its detailed sequence features and ease of readability. To use the data in the file by a computer, a parsing process is required and is performed according to a given grammar for the sequence and the description in a GBF. Currently, several parser libraries for the GBF have been developed. However, with the accumulation of DNA sequence information from eukaryotic chromosomes, parsing a eukaryotic genome sequence with these libraries inevitably takes a long time, due to the large GBF file and its correspondingly large genomic nucleotide sequence and related feature information. Thus, there is significant need to develop a parsing program with high speed and efficient use of system memory.</p> <p>Results</p> <p>We developed a library, GBParsy, which was C language-based and parses GBF files. The parsing speed was maximized by using content-specified functions in place of regular expressions that are flexible but slow. In addition, we optimized an algorithm related to memory usage so that it also increased parsing performance and efficiency of memory usage. GBParsy is at least 5 - 100× faster than current parsers in benchmark tests.</p> <p>Conclusion</p> <p>GBParsy is estimated to extract annotated information from almost 100 Mb of a GenBank flatfile for chromosomal sequence information within a second. Thus, it should be used for a variety of applications such as on-time visualization of a genome at a web site.</p

    Quadruple 9-mer-based protein binding microarray with DsRed fusion protein

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The interaction between a transcription factor and DNA motif (<it>cis</it>-acting element) is an important regulatory step in gene regulation. Comprehensive genome-wide methods have been developed to characterize protein-DNA interactions. Recently, the universal protein binding microarray (PBM) was introduced to determine if a DNA motif interacts with proteins in a genome-wide manner.</p> <p>Results</p> <p>We facilitated the PBM technology using a DsRed fluorescent protein and a concatenated sequence of oligonucleotides. The PBM was designed in such a way that target probes were synthesized as quadruples of all possible 9-mer combinations, permitting unequivocal interpretation of the <it>cis</it>-acting elements. The complimentary DNA strands of the features were synthesized with a primer and DNA polymerase on microarray slides. Proteins were labeled via N-terminal fusion with DsRed fluorescent protein, which circumvents the need for a multi-step incubation. The PBM presented herein confirmed the well-known DNA binding sequences of Cbf1 and CBF1/DREB1B, and it was also applied to elucidate the unidentified <it>cis</it>-acting element of the OsNAC6 rice transcription factor.</p> <p>Conclusion</p> <p>Our method demonstrated PBM can be conveniently performed by adopting: (1) quadruple 9-mers may increase protein-DNA binding interactions in the microarray, and (2) a one-step incubation shortens the wash and hybridization steps. This technology will facilitate greater understanding of genome-wide interactions between proteins and DNA.</p

    Deep and comparative analysis of the mycelium and appressorium transcriptomes of Magnaporthe grisea using MPSS, RL-SAGE, and oligoarray methods

    Get PDF
    BACKGROUND: Rice blast, caused by the fungal pathogen Magnaporthe grisea, is a devastating disease causing tremendous yield loss in rice production. The public availability of the complete genome sequence of M. grisea provides ample opportunities to understand the molecular mechanism of its pathogenesis on rice plants at the transcriptome level. To identify all the expressed genes encoded in the fungal genome, we have analyzed the mycelium and appressorium transcriptomes using massively parallel signature sequencing (MPSS), robust-long serial analysis of gene expression (RL-SAGE) and oligoarray methods. RESULTS: The MPSS analyses identified 12,531 and 12,927 distinct significant tags from mycelia and appressoria, respectively, while the RL-SAGE analysis identified 16,580 distinct significant tags from the mycelial library. When matching these 12,531 mycelial and 12,927 appressorial significant tags to the annotated CDS, 500 bp upstream and 500 bp downstream of CDS, 6,735 unique genes in mycelia and 7,686 unique genes in appressoria were identified. A total of 7,135 mycelium-specific and 7,531 appressorium-specific significant MPSS tags were identified, which correspond to 2,088 and 1,784 annotated genes, respectively, when matching to the same set of reference sequences. Nearly 85% of the significant MPSS tags from mycelia and appressoria and 65% of the significant tags from the RL-SAGE mycelium library matched to the M. grisea genome. MPSS and RL-SAGE methods supported the expression of more than 9,000 genes, representing over 80% of the predicted genes in M. grisea. About 40% of the MPSS tags and 55% of the RL-SAGE tags represent novel transcripts since they had no matches in the existing M. grisea EST collections. Over 19% of the annotated genes were found to produce both sense and antisense tags in the protein-coding region. The oligoarray analysis identified the expression of 3,793 mycelium-specific and 4,652 appressorium-specific genes. A total of 2,430 mycelial genes and 1,886 appressorial genes were identified by both MPSS and oligoarray. CONCLUSION: The comprehensive and deep transcriptome analysis by MPSS and RL-SAGE methods identified many novel sense and antisense transcripts in the M. grisea genome at two important growth stages. The differentially expressed transcripts that were identified, especially those specifically expressed in appressoria, represent a genomic resource useful for gaining a better understanding of the molecular basis of M. grisea pathogenicity. Further analysis of the novel antisense transcripts will provide new insights into the regulation and function of these genes in fungal growth, development and pathogenesis in the host plants

    A pepper MSRB2 gene confer drought tolerance in rice through the protection of chloroplast-targeted genes

    Get PDF
    Background: The perturbation of the steady state of reactive oxygen species (ROS) due to biotic and abiotic stresses in a plant could lead to protein denaturation through the modification of amino acid residues, including the oxidation of methionine residues. Methionine sulfoxide reductases (MSRs) catalyze the reduction of methionine sulfoxide back to the methionine residue. To assess the role of this enzyme, we generated transgenic rice using a pepper CaMSRB2 gene under the control of the rice Rab21 (responsive to ABA protein 21) promoter with/without a selection marker, the bar gene. Results: A drought resistance test on transgenic plants showed that CaMSRB2 confers drought tolerance to rice, as evidenced by less oxidative stress symptoms and a strengthened PSII quantum yield under stress conditions, and increased survival rate and chlorophyll index after the re-watering. The results from immunoblotting using a methionine sulfoxide antibody and nano-LC-MS/MS spectrometry suggest that porphobilinogen deaminase (PBGD), which is involved in chlorophyll synthesis, is a putative target of CaMSRB2. The oxidized methionine content of PBGD expressed in E. coli increased in the presence of H2O2, and the Met-95 and Met-227 residues of PBGD were reduced by CaMSRB2 in the presence of dithiothreitol (DTT). An expression profiling analysis of the overexpression lines also suggested that photosystems are less severely affected by drought stress. Conclusions: Our results indicate that CaMSRB2 might play an important functional role in chloroplasts for conferring drought stress tolerance in rice.OAIID:oai:osos.snu.ac.kr:snu2014-01/102/0000005113/8SEQ:8PERF_CD:SNU2014-01EVAL_ITEM_CD:102USER_ID:0000005113ADJUST_YN:NEMP_ID:A077085DEPT_CD:517CITE_RATE:3.73FILENAME:msrb-김연기.pdfDEPT_NM:식물생산과학부EMAIL:[email protected]_YN:YCONFIRM:

    Complete Genome Sequence of Burkholderia glumae BGR1

    No full text
    corecore