68 research outputs found
The Rice Annotation Project Database (RAP-DB): hub for Oryza sativa ssp. japonica genome information
With the completion of the rice genome sequencing, a standardized annotation is necessary so that the information from the genome sequence can be fully utilized in understanding the biology of rice and other cereal crops. An annotation jamboree was held in Japan with the aim of annotating and manually curating all the genes in the rice genome. Here we present the Rice Annotation Project Database (RAP-DB), which has been developed to provide access to the annotation data. The RAP-DB has two different types of annotation viewers, BLAST and BLAT search, and other useful features. By connecting the annotations to other rice genomics data, such as full-length cDNAs and Tos17 mutant lines, the RAP-DB serves as a hub for rice genomics. All of the resources can be accessed through
KAIKObase: An integrated silkworm genome database and data mining tool
<p>Abstract</p> <p>Background</p> <p>The silkworm, <it>Bombyx mori</it>, is one of the most economically important insects in many developing countries owing to its large-scale cultivation for silk production. With the development of genomic and biotechnological tools, <it>B. mori </it>has also become an important bioreactor for production of various recombinant proteins of biomedical interest. In 2004, two genome sequencing projects for <it>B. mori </it>were reported independently by Chinese and Japanese teams; however, the datasets were insufficient for building long genomic scaffolds which are essential for unambiguous annotation of the genome. Now, both the datasets have been merged and assembled through a joint collaboration between the two groups.</p> <p>Description</p> <p>Integration of the two data sets of silkworm whole-genome-shotgun sequencing by the Japanese and Chinese groups together with newly obtained fosmid- and BAC-end sequences produced the best continuity (~3.7 Mb in N50 scaffold size) among the sequenced insect genomes and provided a high degree of nucleotide coverage (88%) of all 28 chromosomes. In addition, a physical map of BAC contigs constructed by fingerprinting BAC clones and a SNP linkage map constructed using BAC-end sequences were available. In parallel, proteomic data from two-dimensional polyacrylamide gel electrophoresis in various tissues and developmental stages were compiled into a silkworm proteome database. Finally, a <it>Bombyx </it>trap database was constructed for documenting insertion positions and expression data of transposon insertion lines.</p> <p>Conclusion</p> <p>For efficient usage of genome information for functional studies, genomic sequences, physical and genetic map information and EST data were compiled into KAIKObase, an integrated silkworm genome database which consists of 4 map viewers, a gene viewer, and sequence, keyword and position search systems to display results and data at the level of nucleotide sequence, gene, scaffold and chromosome. Integration of the silkworm proteome database and the <it>Bombyx </it>trap database with KAIKObase led to a high-grade, user-friendly, and comprehensive silkworm genome database which is now available from URL: <url>http://sgp.dna.affrc.go.jp/KAIKObase/</url>.</p
Rice Annotation Database (RAD): a contig-oriented database for map-based rice genomics
A contig-oriented database for annotation of the rice genome has been constructed to facilitate map-based rice genomics. The Rice Annotation Database has the following functional features: (i) extensive effort of manual annotations of P1-derived artificial chromosome/bacterial artificial chromosome clones can be merged at chromosome and contig-level; (ii) concise visualization of the annotation information such as the predicted genes, results of various prediction programs (RiceHMM, Genscan, Genscan+, Fgenesh, GeneMark, etc.), homology to expressed sequence tag, full-length cDNA and protein; (iii) user-friendly clone / gene query system; (iv) download functions for nucleotide, amino acid and coding sequences; (v) analysis of various features of the genome (GC-content, average value, etc.); and (vi) genome-wide homology search (BLAST) of contig- and chromosome-level genome sequence to allow comparative analysis with the genome sequence of other organisms. As of October 2004, the database contains a total of 215 Mb sequence with relevant annotation results including 30 000 manually curated genes. The database can provide the latest information on manual annotation as well as a comprehensive structural analysis of various features of the rice genome. The database can be accessed at http://rad.dna.affrc.go.jp/
A genome-wide gain-of-function analysis of rice genes using the FOX-hunting system
Funding Information: Acknowledgements This work was supported by a grant from the Ministry of Agriculture, Forestry and Fisheries of Japan (Green Technology Project EF-1004). We are grateful to Dr. Takuji Sasaki for his encouragement throughout the project and his excellent advice on the improvement of this manuscript, and to Dr. Shoshi Kikuchi for providing useful information on rice FL-cDNAs. We thank Professors Kokichi Hinata, Atsushi Hirai, Hiroshi Kamada and Masashi Ugaki for their encouragement, critical comments and helpful suggestions, and Drs. Hisato Okuizumi and Hiroyuki Kawahigashi for their administrative support throughout the project. We also thank Mayumi Akagawa, Hiroko Abe, Keiko Mori, Etsuko Sugai, Yumiko Nakane, Ken-ichi Watanabe, Mayumi Takeya, and Kana Miyata for their technical assistance; the members of the Technical Support Section of the National Institute of Agrobiological Sciences for their help in the care of the FOX-rice plants; Haruko Onodera and Kazuko Ono for their technical assistance and advice on rice transformation; Inplanta Innovations Inc. for their technical help on the construction of theThe latest report has estimated the number of rice genes to be ∼32 000. To elucidate the functions of a large population of rice genes and to search efficiently for agriculturally useful genes, we have been taking advantage of the Full-length cDNA Over-eXpresser (FOX) gene-hunting system. This system is very useful for analyzing various gain-of-function phenotypes from large populations of transgenic plants overexpressing cDNAs of interest and others with unknown or important functions. We collected the plasmid DNAs of 13 980 independent full-length cDNA (FL-cDNA) clones to produce a FOX library by placing individual cDNAs under the control of the maize Ubiquitin-1 promoter. The FOX library was transformed into rice by Agrobacterium-mediated high-speed transformation. So far, we have generated approximately 12 000 FOX-rice lines. Genomic PCR analysis indicated that the average number of FL-cDNAs introduced into individual lines was 1.04. Sequencing analysis of the PCR fragments carrying FL-cDNAs from 8615 FOX-rice lines identified FL-cDNAs in 8225 lines, and a database search classified the cDNAs into 5462 independent ones. Approximately 16.6% of FOX-rice lines examined showed altered growth or morphological characteristics. Three super-dwarf mutants overexpressed a novel gibberellin 2-oxidase gene, confirming the importance of this system. We also show here the other morphological alterations caused by individual FL-cDNA expression. These dominant phenotypes should be valuable indicators for gene discovery and functional analysis.publishersversionPeer reviewe
Rice TOGO Browser: A Platform to Retrieve Integrated Information on Rice Functional and Applied Genomics
The Rice TOGO Browser is an online public resource designed to facilitate integration and visualization of mapping data of bacterial artificial chromosome (BAC)/P1-derived artificial chromosome (PAC) clones, genes, restriction fragment length polymorphism (RFLP)/simple sequence repeat (SSR) markers and phenotype data represented as quantitative trait loci (QTLs) onto the genome sequence, and to provide a platform for more efficient utilization of genome information from the point of view of applied genomics as well as functional genomics. Three search options, namely keyword search, region search and trait search, generate various types of data in a user-friendly interface with three distinct viewers, a chromosome viewer, an integrated map viewer and a sequence viewer, thereby providing the opportunity to view the position of genes and/or QTLs at the chromosomal level and to retrieve any sequence information in a user-defined genome region. Furthermore, the gene list, marker list and genome sequence in a specified region delineated by RFLP/SSR markers and any sequences designed as primers can be viewed and downloaded to support forward genetics approaches. An additional feature of this database is the graphical viewer for BLAST search to reveal information not only for regions with significant sequence similarity but also for regions adjacent to those with similarity but with no hits between sequences. An easy to use and intuitive user interface can help a wide range of users in retrieving integrated mapping information including agronomically important traits on the rice genome sequence. The database can be accessed at http://agri-trait.dna.affrc.go.jp/
Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana
We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ~32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene
An Integrated High-density Linkage Map of Soybean with RFLP, SSR, STS, and AFLP Markers Using A Single F2 Population
Soybean [Glycine max (L.) Merrill] is the most important leguminous crop in the world due to its high contents of high-quality protein and oil for human and animal consumption as well as for industrial uses. An accurate and saturated genetic linkage map of soybean is an essential tool for studies on modern soybean genomics. In order to update the linkage map of a F2 population derived from a cross between Misuzudaizu and Moshidou Gong 503 and to make it more informative and useful to the soybean genome research community, a total of 318 AFLP, 121 SSR, 108 RFLP, and 126 STS markers were newly developed and integrated into the framework of the previously described linkage map. The updated genetic map is composed of 509 RFLP, 318 SSR, 318 AFLP, 97 AFLP-derived STS, 29 BAC-end or EST-derived STS, 1 RAPD, and five morphological markers, covering a map distance of 3080 cM (Kosambi function) in 20 linkage groups (LGs). To our knowledge, this is presently the densest linkage map developed from a single F2 population in soybean. The average intermarker distance was reduced to 2.41 from 5.78 cM in the earlier version of the linkage map. Most SSR and RFLP markers were relatively evenly distributed among different LGs in contrast to the moderately clustered AFLP markers. The number of gaps of more than 25 cM was reduced to 6 from 19 in the earlier version of the linkage map. The coverage of the linkage map was extended since 17 markers were mapped beyond the distal ends of the previous linkage map. In particular, 17 markers were tagged in a 5.7 cM interval between CE47M5a and Satt100 on LG C2, where several important QTLs were clustered. This newly updated soybean linkage map will enable to streamline positional cloning of agronomically important trait locus genes, and promote the development of physical maps, genome sequencing, and other genomic research activities
Gene Organization in Rice Revealed by Full-Length cDNA Mapping and Gene Expression Analysis through Microarray
Rice (Oryza sativa L.) is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA) sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE) genes, 33K annotated non-expressed (ANE) genes, and 5.5K non-annotated expressed (NAE) genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria
- …