9 research outputs found

    ANALYSIS OF CLASSIFICATION ALGORITHMS ON DIFFERENT DATASETS

    Get PDF
    Purpose. Data mining is the forthcoming research area to solve different problems and classification is one of main problem in the field of data mining. In this paper, we use two classification algorithms J48 and Sequential Minimal Optimization alias SMO of the Weka interface. Methodology. It can be used for testing several datasets. The performance of J48 and Sequential Minimal Optimization has been analyzed to choose the better algorithm based on the conditions of the datasets. The datasets have been chosen from UCI Machine Learning Repository. Findings. Algorithm J48 is based on C4.5 decision-based learning and algorithm Sequential Minimal Optimization uses the Support Vector Machine approach for classification of datasets. When comparing the performance of both algorithms we found Sequential Minimal Optimization is better algorithm in most of the cases. Originality. This is the first implemented research work up to my knowledge, data sets classification problem handled in data mining using SMO with Weka interface

    Genomic selection for target traits in the Australian lentil breeding program

    Get PDF
    Genomic selection (GS) uses associations between markers and phenotypes to predict the breeding values of individuals. It can be applied early in the breeding cycle to reduce the cross-to-cross generation interval and thereby increase genetic gain per unit of time. The development of cost-effective, high-throughput genotyping platforms has revolutionized plant breeding programs by enabling the implementation of GS at the scale required to achieve impact. As a result, GS is becoming routine in plant breeding, even in minor crops such as pulses. Here we examined 2,081 breeding lines from Agriculture Victoria’s national lentil breeding program for a range of target traits including grain yield, ascochyta blight resistance, botrytis grey mould resistance, salinity and boron stress tolerance, 100-grain weight, seed size index and protein content. A broad range of narrow-sense heritabilities was observed across these traits (0.24-0.66). Genomic prediction models were developed based on 64,781 genome-wide SNPs using Bayesian methodology and genomic estimated breeding values (GEBVs) were calculated. Forward cross-validation was applied to examine the prediction accuracy of GS for these targeted traits. The accuracy of GEBVs was consistently higher (0.34-0.83) than BLUP estimated breeding values (EBVs) (0.22-0.54), indicating a higher expected rate of genetic gain with GS. GS-led parental selection using early generation breeding materials also resulted in higher genetic gain compared to BLUP-based selection performed using later generation breeding lines. Our results show that implementing GS in lentil breeding will fast track the development of high-yielding cultivars with increased resistance to biotic and abiotic stresses, as well as improved seed quality traits

    ANALYSIS OF CLASSIFICATION ALGORITHMS ON DIFFERENT DATASETS

    Get PDF
    Purpose. Data mining is the forthcoming research area to solve different problems and classification is one of main problem in the field of data mining. In this paper, we use two classification algorithms J48 and Sequential Minimal Optimization alias SMO of the Weka interface. Methodology. It can be used for testing several datasets. The performance of J48 and Sequential Minimal Optimization has been analyzed to choose the better algorithm based on the conditions of the datasets. The datasets have been chosen from UCI Machine Learning Repository. Findings. Algorithm J48 is based on C4.5 decision-based learning and algorithm Sequential Minimal Optimization uses the Support Vector Machine approach for classification of datasets. When comparing the performance of both algorithms we found Sequential Minimal Optimization is better algorithm in most of the cases. Originality. This is the first implemented research work up to my knowledge, data sets classification problem handled in data mining using SMO with Weka interface

    Physiology Based Approaches for Breeding of Next-Generation Food Legumes

    No full text
    Plant breeders and agricultural scientists of the 21st century are challenged to increase the yield potentials of crops to feed the growing world population. Climate change, the resultant stresses and increasing nutrient deficiencies are factors that are to be considered in designing modern plant breeding pipelines. Underutilized food legumes have the potential to address these issues and ensure food security in developing nations of the world. Food legumes in the past have drawn limited research funding and technological attention when compared to cereal crops. Physiological breeding strategies that were proven to be successful in cereals are to be adapted to legume crop improvement to realize their potential. The gap between breeders and physiologists should be narrowed by collaborative approaches to understand complex traits in legumes. This review discusses the potential of physiology based approaches in food legume breeding and how they impact yield gains and abiotic stress tolerance in these crops. The influence of roots and root system architectures in food legumes’ breeding is also discussed. Molecular breeding to map the relevant physiological traits and the potentials of gene editing those traits are detailed. It is imperative to unlock the potentials of these underutilized crops to attain sustainable environmental and nutritional food security

    Construction of high-density linkage maps for mapping quantitative trait loci for multiple traits in field pea (Pisum sativum L.)

    No full text
    Abstract Background The objective of this research was to map quantitative trait loci (QTLs) of multiple traits of breeding importance in pea (Pisum sativum L.). Three recombinant inbred line (RIL) populations, PR-02 (Orb x CDC Striker), PR-07 (Carerra x CDC Striker) and PR-15 (1–2347-144 x CDC Meadow) were phenotyped for agronomic and seed quality traits under field conditions over multiple environments in Saskatchewan, Canada. The mapping populations were genotyped using genotyping-by-sequencing (GBS) method for simultaneous single nucleotide polymorphism (SNP) discovery and construction of high-density linkage maps. Results After filtering for read depth, segregation distortion, and missing values, 2234, 3389 and 3541 single nucleotide polymorphism (SNP) markers identified by GBS in PR-02, PR-07 and PR-15, respectively, were used for construction of genetic linkage maps. Genetic linkage groups were assigned by anchoring to SNP markers previously positioned on these linkage maps. PR-02, PR-07 and PR-15 genetic maps represented 527, 675 and 609 non-redundant loci, and cover map distances of 951.9, 1008.8 and 914.2 cM, respectively. Based on phenotyping of the three mapping populations in multiple environments, 375 QTLs were identified for important traits including days to flowering, days to maturity, lodging resistance, Mycosphaerella blight resistance, seed weight, grain yield, acid and neutral detergent fiber concentration, seed starch concentration, seed shape, seed dimpling, and concentration of seed iron, selenium and zinc. Of all the QTLs identified, the most significant in terms of explained percentage of maximum phenotypic variance (PVmax) and occurrence in multiple environments were the QTLs for days to flowering (PVmax = 47.9%), plant height (PVmax = 65.1%), lodging resistance (PVmax = 35.3%), grain yield (PVmax = 54.2%), seed iron concentration (PVmax = 27.4%), and seed zinc concentration (PVmax = 43.2%). Conclusion We have identified highly significant and reproducible QTLs for several agronomic and seed quality traits of breeding importance in pea. The QTLs identified will be the basis for fine mapping candidate genes, while some of the markers linked to the highly significant QTLs are useful for immediate breeding applications

    MeioCapture: an efficient method for staging and isolation of meiocytes in the prophase I sub-stages of meiosis in wheat

    No full text
    Background Molecular analysis of meiosis has been hindered by difficulties in isolating high purity subpopulations of sporogenous cells representing the succeeding stages of meiosis. Isolation of purified male meiocytes from defined meiotic stages is crucial in discovering meiosis specific genes and associated regulatory networks. Results We describe an optimized method termed MeioCapture for simultaneous isolation of uncontaminated male meiocytes from wheat (Triticum spp.), specifically from the pre-meiotic G2 and the five sub-stages of meiotic prophase I. The MeioCapture protocol builds on the traditional anther squash technique and the capillary collection method, and involves extrusion of intact sporogenous archesporial columns (SACs) containing meiocytes. This improved method exploits the natural meiotic synchrony between anthers of the same floret, the correlation between the length of anthers and meiotic stage, and the occurrence of meiocytes in intact SACs largely free of somatic cells. The main advantage of MeioCapture, compared to previous methods, is that it allows simultaneous collection of meiocytes from different sub-stages of prophase I at a very high level of purity, through correlation of stages with anther sizes. A detailed description is provided for all steps, including the collection of tissue, isolation and size sorting of anthers, extrusion of intact SACs, and staging of meiocytes. Precautions for individual steps throughout the procedure are also provided to facilitate efficient isolation of pure meiocytes. The proof-of-concept was successfully established in wheat, and a light microscopic atlas of meiosis, encompassing all stages from pre-meiosis to telophase II, was developed. Conclusion The MeioCapture method provides an essential technique to study the molecular basis of chromosome pairing and exchange of genetic information in wheat, leading to strategies for manipulating meiotic recombination frequencies. The method also provides a foundation for similar studies in other crop species

    Gene-based SNP discovery and genetic mapping in pea

    Get PDF
    International audienceGene-based SNPs were identified and mapped in pea using five recombinant inbred line populations segregating for traits of agronomic importance. Pea (Pisum sativum L.) is one of the world's oldest domesticated crops and has been a model system in plant biology and genetics since the work of Gregor Mendel. Pea is the second most widely grown pulse crop in the world following common bean. The importance of pea as a food crop is growing due to its combination of moderate protein concentration, slowly digestible starch, high dietary fiber concentration, and its richness in micronutrients; however, pea has lagged behind other major crops in harnessing recent advances in molecular biology, genomics and bioinformatics, partly due to its large genome size with a large proportion of repetitive sequence, and to the relatively limited investment in research in this crop globally. The objective of this research was the development of a genome-wide transcriptome-based pea single-nucleotide polymorphism (SNP) marker platform using next-generation sequencing technology. A total of 1,536 polymorphic SNP loci selected from over 20,000 non-redundant SNPs identified using deep transcriptome sequencing of eight diverse Pisum accessions were used for genotyping in five RIL populations using an Illumina GoldenGate assay. The first high-density pea SNP map defining all seven linkage groups was generated by integrating with previously published anchor markers. Syntenic relationships of this map with the model legume Medicago truncatula and lentil (Lens culinaris Medik.) maps were established. The genic SNP map establishes a foundation for future molecular breeding efforts by enabling both the identification and tracking of introgression of genomic regions harbouring QTLs related to agronomic and seed quality traits
    corecore