32 research outputs found

    Whole Genome Characterization of a Few EMS-Induced Mutants of Upland Rice Variety Nagina 22 Reveals a Staggeringly High Frequency of SNPs Which Show High Phenotypic Plasticity Towards the Wild-Type

    Get PDF
    The Indian initiative, in creating mutant resources for the functional genomics in rice, has been instrumental in the development of 87,000 ethylmethanesulfonate (EMS)-induced mutants, of which 7,000 are in advanced generations. The mutants have been created in the background of Nagina 22, a popular drought- and heat-tolerant upland cultivar. As it is a pregreen revolution cultivar, as many as 573 dwarf mutants identified from this resource could be useful as an alternate source of dwarfing. A total of 541 mutants, including the macromutants and the trait-specific ones, obtained after appropriate screening, are being maintained in the mutant garden. Here, we report on the detailed characterizations of the 541 mutants based on the distinctness, uniformity, and stability (DUS) descriptors at two different locations. About 90% of the mutants were found to be similar to the wild type (WT) with high similarity index (>0.6) at both the locations. All 541 mutants were characterized for chlorophyll and epicuticular wax contents, while a subset of 84 mutants were characterized for their ionomes, namely, phosphorous, silicon, and chloride contents. Genotyping of these mutants with 54 genomewide simple sequence repeat (SSR) markers revealed 93% of the mutants to be either completely identical to WT or nearly identical with just one polymorphic locus. Whole genome resequencing (WGS) of four mutants, which have minimal differences in the SSR fingerprint pattern and DUS characters from the WT, revealed a staggeringly high number of single nucleotide polymorphisms (SNPs) on an average (16,453 per mutant) in the genic sequences. Of these, nearly 50% of the SNPs led to non-synonymous codons, while 30% resulted in synonymous codons. The number of insertions and deletions (InDels) varied from 898 to 2,595, with more than 80% of them being 1–2 bp long. Such a high number of SNPs could pose a serious challenge in identifying gene(s) governing the mutant phenotype by next generation sequencing-based mapping approaches such as Mutmap. From the WGS data of the WT and the mutants, we developed a genic resource of the WT with a novel analysis pipeline. The entire information about this resource along with the panicle architecture of the 493 mutants is made available in a mutant database EMSgardeN22 (http://14.139.229.201/EMSgardeN22)

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    Shifting the limits in wheat research and breeding using a fully annotated reference genome

    Get PDF
    Introduction: Wheat (Triticum aestivum L.) is the most widely cultivated crop on Earth, contributing about a fifth of the total calories consumed by humans. Consequently, wheat yields and production affect the global economy, and failed harvests can lead to social unrest. Breeders continuously strive to develop improved varieties by fine-tuning genetically complex yield and end-use quality parameters while maintaining stable yields and adapting the crop to regionally specific biotic and abiotic stresses. Rationale: Breeding efforts are limited by insufficient knowledge and understanding of wheat biology and the molecular basis of central agronomic traits. To meet the demands of human population growth, there is an urgent need for wheat research and breeding to accelerate genetic gain as well as to increase and protect wheat yield and quality traits. In other plant and animal species, access to a fully annotated and ordered genome sequence, including regulatory sequences and genome-diversity information, has promoted the development of systematic and more time-efficient approaches for the selection and understanding of important traits. Wheat has lagged behind, primarily owing to the challenges of assembling a genome that is more than five times as large as the human genome, polyploid, and complex, containing more than 85% repetitive DNA. To provide a foundation for improvement through molecular breeding, in 2005, the International Wheat Genome Sequencing Consortium set out to deliver a high-quality annotated reference genome sequence of bread wheat. Results: An annotated reference sequence representing the hexaploid bread wheat genome in the form of 21 chromosome-like sequence assemblies has now been delivered, giving access to 107,891 high-confidence genes, including their genomic context of regulatory sequences. This assembly enabled the discovery of tissue- and developmental stage–related gene coexpression networks using a transcriptome atlas representing all stages of wheat development. The dynamics of change in complex gene families involved in environmental adaptation and end-use quality were revealed at subgenome resolution and contextualized to known agronomic single-gene or quantitative trait loci. Aspects of the future value of the annotated assembly for molecular breeding and research were exemplarily illustrated by resolving the genetic basis of a quantitative trait locus conferring resistance to abiotic stress and insect damage as well as by serving as the basis for genome editing of the flowering-time trait. Conclusion: This annotated reference sequence of wheat is a resource that can now drive disruptive innovation in wheat improvement, as this community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding. Importantly, the bioinformatics capacity developed for model-organism genomes will facilitate a better understanding of the wheat genome as a result of the high-quality chromosome-based genome assembly. By necessity, breeders work with the genome at the whole chromosome level, as each new cross involves the modification of genome-wide gene networks that control the expression of complex traits such as yield. With the annotated and ordered reference genome sequence in place, researchers and breeders can now easily access sequence-level information to precisely define the necessary changes in the genomes for breeding programs. This will be realized through the implementation of new DNA marker platforms and targeted breeding technologies, including genome editing

    Not Available

    No full text
    Not AvailableOur aim was to screen new hyper-variable mango SSRs (MSSRs) designed from Amrapali genome sequences to find polymorphic ones between Amrapall and Sensation parental genotypes. Out of 253 MSSRs screened against Amrapaliand Sensation genotypes, 212 were polymorphic, of which 8 MSSRs showed aa, bb or bb, aa allelic patterns. MSSRs showing polymorphism in terms of homozygous for 'a' allele in Amrapali and homozygous 'b' allele for Sensation and vice versa were selected for assignment of parentage of hand-pollinated (Amrapali x Sensation) hybrids and open-pollinated Amrapali. MSSR markers were mapped back to the Amrapali genome version 3.0 NCBI GenBank database. MSSRs (M109,P125, M15, and M73) assigned parentage to more than 88.0% of hand-pollinated hybrids. Pl25 confirmed parentage of 93.61% progenies followed by Ml09 (90.43%), Ml5 (8936%), and M73 (88.29%). Randomly three MSSRs,viz. Ml09,P125, and M73,were used for assignment of parentage of OP seedlings of Amrapal i.Out of 22 OP seedlings, only 2 seedlings showed the identical hetero­ zygous allelic pattern and their hybrid origin was confirmed.It is concluded that MSSRs (P125, M109,M15, and M73) from the Amrapaligenome have immense value in MAS in mango and can be utilised for early stage selection of true hybrids.Not Availabl

    Analysis of Magnaporthe oryzae Genome Reveals a Fungal Effector, Which Is Able to Induce Resistance Response in Transgenic Rice Line Containing Resistance Gene, Pi54

    Get PDF
    Rice blast caused by Magnaporthe oryzae is one of the most important diseases of rice. Pi54, a rice gene that imparts resistance to M. oryzae isolates prevalent in India, was already cloned but its avirulent counterpart in the pathogen was not known.. After decoding the whole genome of an avirulent isolate of M. oryzae, we predicted 11440 protein coding genes and then identified four candidate effector proteins which are exclusively expressed in the infectious structure, appresoria. In silico protein modeling followed by interaction analysis between Pi54 protein model and selected four candidate effector proteins models revealed that Mo-01947_9 protein model encoded by a gene located at chromosome 4 of M. oryzae, interacted best at the Leucine Rich Repeat domain of Pi54 protein model. Yeast-two-hybrid analysis showed that Mo-01947_9 protein physically interacts with Pi54 protein. Nicotiana benthamiana leaf infiltration assay confirmed induction of hypersensitive response in the presence of Pi54 gene in a heterologous system. Genetic complementation test also proved that Mo-01947_9 protein induces avirulence response in the pathogen in the presence of Pi54 gene. Here, we report identification and cloning of a new fungal effector gene which interacts with resistance gene Pi54 gene in rice

    TinoTranscriptDB: A Database of Transcripts and Microsatellite Markers of Tinospora cordifolia, an Important Medicinal Plant

    No full text
    Tinospora cordifolia, commonly known as “Giloe” in India, is a shrub belonging to the family Menispermaceae. It is an important medicinal plant known for its antipyretic, anti-inflammatory, antispasmodic, and antidiabetic properties and is used in the treatment of jaundice, gout, and rheumatism. Despite its economic importance, the limited information related to its genomic resources prohibits its judicious exploitation through molecular breeding or biotechnological approaches. In this study, we generated a meta-transcriptome assembly of 43,090 non-redundant transcripts by merging the RNASeq data obtained from Roche 454 GS-FLX, and Illumina platforms, and report the first transcriptome-based database for simple sequence repeats and transcription factors (“TinoTranscriptDB” (Tinospora cordifolia Transcriptome Database)). We annotated 26,716 (62%) of the total transcripts successfully from National Center for Biotechnology Information non-redundant protein (NCBI-NR), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Swiss-Prot, and Pfam databases. This database contains information of 2620 perfect simple sequence repeats (P-SSRs) with a relative abundance of 340.12 (loci/Mb), and relative density of 6309.29 (bp/Mb). Excluding mono-nucleotides, the most abundant SSR motifs were tri-nucleotides (54.31%), followed by di-nucleotides (37.51%), tetra-nucleotides (4.54%), penta-nucleotides (3.16%) and hexa-nucleotides (0.45%). Additionally, we also identified 4,311 transcription factors (TFs) and categorized them into 55 sub-families. This database is expected to fill the gap in genomic resource availability in T. cordifolia and thus accelerate molecular breeding and related functional and other applied studies aimed towards genetic improvements of T. cordifolia and related species

    De novo transcriptome assembly and identification of brassinosteroid biosynthetic pathway in Safflower

    No full text
    Safflower (Carthamus tinctorius L.) is known for its oil quality and ability to grow in drought conditions. The threat to global food security posed by challenges of climate change has stressed the use of functional genomics in current plant breeding approaches. Plant steroid hormones called brassinosteroids (BRs) are at the nexus of regulating plant growth and development, and plant stress responses. Enhanced BR levels or signalling have increased crop yields by up to 40% while also conferring broad range stress tolerance. Currently there is no information on the BR biosynthesis and signalling pathways in safflower. A de novo transcriptomic analysis of untreated and 24-epibrassinolide (EBR)-treated safflower leaves was conducted using the Illumina sequencing platform. Approximately 5 GB clean data were generated from untreated and EBR-treated samples that assembled into 34,456 and 36,997 transcripts (combined 50,630), and 30,180 and 32,333 CoDing sequences (CDS) (combined 43,637), respectively. More than 71% of the CDS were annotated with majority of hits against Cynara cardunculus var. scolymus, a thistle in the safflower family. A total of 74 KEGG pathways were identified in safflower. Six genes, including DWF4 that codes for a rate-limiting enzyme in BR biosynthesis, were mapped to the BR biosynthesis pathway using the KEGG mapper
    corecore