290 research outputs found
Hybrid assembly with long and short reads improves discovery of gene family expansions
BACKGROUND: Long-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation. METHODS: We developed a hybrid assembly pipeline called "Alpaca" that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation. RESULTS: Compared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies. CONCLUSION: Our results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations
USDA Plant Genome Research Program
The U.S. Congress appropriated funds in 1991 for the USDA Plant Genome Research Program, four years after its initial conception in 1987. The goal of the USDA Plant Genome Research Program is to improve plants (agronomic, horticultural, and forest tree species) by locating marker DNA or genes on chromosomes, determining gene structure, and transferring genes to improve plant performance with accompanying reduced environmental impact to meet marketplace needs and niches. The Plant Genome Research Program is one program with two parts: National Research Initiative and Plant Genome Database (PGD). The PGD is now a real and functioning information and data resource for agricultural and other plant science genome researchers, and it is in the public domain. Additional progress is given according to major plant groups. The PGD is a suite of several information products produced at the National Agricultural Library (NAL) in collaboration with the Agricultural Research Service and Forest Service species coordinators
Open access resources for genome-wide association mapping in rice.
Increasing food production is essential to meet the demands of a growing human population, with its rising income levels and nutritional expectations. To address the demand, plant breeders seek new sources of genetic variation to enhance the productivity, sustainability and resilience of crop varieties. Here we launch a high-resolution, open-access research platform to facilitate genome-wide association mapping in rice, a staple food crop. The platform provides an immortal collection of diverse germplasm, a high-density single-nucleotide polymorphism data set tailored for gene discovery, well-documented analytical strategies, and a suite of bioinformatics resources to facilitate biological interpretation. Using grain length, we demonstrate the power and resolution of our new high-density rice array, the accompanying genotypic data set, and an expanded diversity panel for detecting major and minor effect QTLs and subpopulation-specific alleles, with immediate implications for rice improvement.Article number:10532
Natural history of Arabidopsis thaliana and oomycete symbioses
Molecular ecology of plant–microbe interactions has immediate significance for filling a gap in knowledge between the laboratory discipline of molecular biology and the largely theoretical discipline of evolutionary ecology. Somewhere in between lies conservation biology, aimed at protection of habitats and the diversity of species housed within them. A seemingly insignificant wildflower called Arabidopsis thaliana has an important contribution to make in this endeavour. It has already transformed botanical research with deepening understanding of molecular processes within the species and across the Plant Kingdom; and has begun to revolutionize plant breeding by providing an invaluable catalogue of gene sequences that can be used to design the most precise molecular markers attainable for marker-assisted selection of valued traits. This review describes how A. thaliana and two of its natural biotrophic parasites could be seminal as a model for exploring the biogeography and molecular ecology of plant–microbe interactions, and specifically, for testing hypotheses proposed from the geographic mosaic theory of co-evolution
Genetic dissection of haploid male fertility in maize (Zea mays L.)
Haploid genome doubling is a key limiting step of haploid breeding in maize. Spontaneous restoration of haploid male fertility (HMF) provides a more promising method than the artificial doubling process. To reveal the genetic basis of HMF, haploids were obtained from the offspring of 285 F2:3 families, derived from the cross Zheng58 × K22. The F2:3 families were used as the female donor and Yu high inducer No. 1 (YHI‐1) as the male inducer line. The rates of HMF from each family line were evaluated at two field sites over two planting seasons. HMF displayed incomplete dominance. Transgressive segregation of haploids from F2:3 families was observed relative to haploids derived from the two parents of the mapping population. A total of nine quantitative trait loci (QTL) were detected, which were distributed on chromosomes 1, 3, 4, 7 and 8. Three major QTL, qHMF3b, qHMF7a and qHMF7b were detected in both locations, respectively. These QTL could be useful to predict the ability of spontaneous haploid genome doubling, and to accelerate the haploid breeding process by introgression or aggregation of those QTL
Mobilizing Crop Biodiversity
Over the past 70 years, the world has witnessed extraordinary growth in crop productivity, 1 enabled by a suite of technological advances, including higher yielding crop varieties, improved farm management, synthetic agrochemicals, and agricultural mechanization. While this “Green Revolution” intensified crop production, and is credited with reducing famine and malnutrition, its benefits were accompanied by several undesirable collateral effects (Pingali, 2012). These include a narrowing of agricultural biodiversity, stemming from increased monoculture and greater reliance on a smaller number of crops and
crop varieties for the majority of our calories. This reduction in diversity has created vulnerabilities to pest and disease epidemics, climate variation, and ultimately to human health (Harlan, 1972). The value of crop diversity has long been recognized (Vavilov, 1992). A global system of genebanks (e.g.www.genebanks.org/genebanks/) was established in the 1970s to preserve the abundant genetic variation found in traditional “landrace” varieties of crops and in crop wild relatives (Harlan, 1972). While preserving crop variation is a critical first step, the time has come to make use of this variation to breed more resilient crops. The DivSeek International Network (https://divseekintl.org/) is a scientific, not-for profit organization that aims to accelerate such effort
Fine mapping of qSTV11KAS, a major QTL for rice stripe disease resistance
Rice stripe disease, caused by rice stripe virus (RSV), is one of the most serious diseases in temperate rice-growing areas. In the present study, we performed quantitative trait locus (QTL) analysis for RSV resistance using 98 backcross inbred lines derived from the cross between the highly resistant variety, Kasalath, and the highly susceptible variety, Nipponbare. Under artificial inoculation in the greenhouse, two QTLs for RSV resistance, designated qSTV7 and qSTV11KAS, were detected on chromosomes 7 and 11 respectively, whereas only one QTL was detected in the same location of chromosome 11 under natural inoculation in the field. The stability of qSTV11KAS was validated using 39 established chromosome segment substitution lines. Fine mapping of qSTV11KAS was carried out using 372 BC3F2:3 recombinants and 399 BC3F3:4 lines selected from 7,018 BC3F2 plants of the cross SL-234/Koshihikari. The qSTV11KAS was localized to a 39.2 kb region containing seven annotated genes. The most likely candidate gene, LOC_Os11g30910, is predicted to encode a sulfotransferase domain-containing protein. The predicted protein encoded by the Kasalath allele differs from Nipponbare by a single amino acid substitution and the deletion of two amino acids within the sulfotransferase domain. Marker-resistance association analysis revealed that the markers L104-155 bp and R48-194 bp were highly correlated with RSV resistance in the 148 landrace varieties. These results provide a basis for the cloning of qSTV11KAS, and the markers may be used for molecular breeding of RSV resistant rice varieties
Characterization of the major fragance gene from an aromatic japonica rice and analysis of its diversity in Asian cultivated rice
In Asian cultivated rice (Oryza sativa L.), aroma is one of the most valuable traits in grain quality and 2-ACP is the main volatile compound contributing to the characteristic popcorn-like odour of aromatic rices. Although the major locus for grain fragrance (frg gene) has been described recently in Basmati rice, this gene has not been characterised in true japonica varieties and molecular information available on the genetic diversity and evolutionary origin of this gene among the different varieties is still limited. Here we report on characterisation of the frg gene in the Azucena variety, one of the few aromatic japonica cultivars. We used a RIL population from a cross between Azucena and IR64, a non-aromatic indica, the reference genomic sequence of Nipponbare (japonica) and 93–11 (indica) as well as an Azucena BAC library, to identify the major fragance gene in Azucena. We thus identified a betaine aldehyde dehydrogenase gene, badh2, as the candidate locus responsible for aroma, which presented exactly the same mutation as that identified in Basmati and Jasmine-like rices. Comparative genomic analyses showed very high sequence conservation between Azucena and Nipponbare BADH2, and a MITE was identified in the promotor region of the BADH2 allele in 93–11. The badh2 mutation and MITE were surveyed in a representative rice collection, including traditional aromatic and non-aromatic rice varieties, and strongly suggested a monophylogenetic origin of this badh2 mutation in Asian cultivated rices. Altogether these new data are discussed here in the light of current hypotheses on the origin of rice genetic diversity
Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints
Undomesticated wild species, crop wild relatives, and landraces represent sources of variation for wheat improvement to address challenges from climate change and the growing human population. Here, we study 56,342 domesticated hexaploid, 18,946 domesticated tetraploid and 3,903 crop wild relatives in a massive-scale genotyping and diversity analysis. Using DArTseqTM technology, we identify more than 300,000 high-quality SNPs and SilicoDArT markers and align them to three reference maps: the IWGSC RefSeq v1.0 genome assembly, the durum wheat genome assembly (cv. Svevo), and the DArT genetic map. On average, 72% of the markers are uniquely placed on these maps and 50% are linked to genes. The analysis reveals landraces with unexplored diversity and genetic footprints defined by regions under selection. This provides fertile ground to develop wheat varieties of the future by exploring specific gene or chromosome regions and identifying germplasm conserving allelic diversity missing in current breeding programs
Genome-Wide Distribution and Organization of Microsatellites in Plants: An Insight into Marker Development in Brachypodium
Plant genomes are complex and contain large amounts of repetitive DNA including microsatellites that are distributed across entire genomes. Whole genome sequences of several monocot and dicot plants that are available in the public domain provide an opportunity to study the origin, distribution and evolution of microsatellites, and also facilitate the development of new molecular markers. In the present investigation, a genome-wide analysis of microsatellite distribution in monocots (Brachypodium, sorghum and rice) and dicots (Arabidopsis, Medicago and Populus) was performed. A total of 797,863 simple sequence repeats (SSRs) were identified in the whole genome sequences of six plant species. Characterization of these SSRs revealed that mono-nucleotide repeats were the most abundant repeats, and that the frequency of repeats decreased with increase in motif length both in monocots and dicots. However, the frequency of SSRs was higher in dicots than in monocots both for nuclear and chloroplast genomes. Interestingly, GC-rich repeats were the dominant repeats only in monocots, with the majority of them being present in the coding region. These coding GC-rich repeats were found to be involved in different biological processes, predominantly binding activities. In addition, a set of 22,879 SSR markers that were validated by e-PCR were developed and mapped on different chromosomes in Brachypodium for the first time, with a frequency of 101 SSR markers per Mb. Experimental validation of 55 markers showed successful amplification of 80% SSR markers in 16 Brachypodium accessions. An online database ‘BraMi’ (Brachypodium microsatellite markers) of these genome-wide SSR markers was developed and made available in the public domain. The observed differential patterns of SSR marker distribution would be useful for studying microsatellite evolution in a monocot–dicot system. SSR markers developed in this study would be helpful for genomic studies in Brachypodium and related grass species, especially for the map based cloning of the candidate gene(s)
- …