Search CORE

5 research outputs found

DataSheet_1_Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning.zip

Author: Alvaro Fuentes (10974617)
Dong Sun Park (4114609)
Hyongsuk Kim (7609646)
Jucheng Yang (11227173)
Mingle Xu (13773109)
Sook Yoon (3739201)
Taehyun Kim (3997091)
Yao Meng (594490)
Publication venue
Publication date: 22/09/2023
Field of study

Recent advancements in deep learning have brought significant improvements to plant disease recognition. However, achieving satisfactory performance often requires high-quality training datasets, which are challenging and expensive to collect. Consequently, the practical application of current deep learning–based methods in real-world scenarios is hindered by the scarcity of high-quality datasets. In this paper, we argue that embracing poor datasets is viable and aims to explicitly define the challenges associated with using these datasets. To delve into this topic, we analyze the characteristics of high-quality datasets, namely, large-scale images and desired annotation, and contrast them with the limited and imperfect nature of poor datasets. Challenges arise when the training datasets deviate from these characteristics. To provide a comprehensive understanding, we propose a novel and informative taxonomy that categorizes these challenges. Furthermore, we offer a brief overview of existing studies and approaches that address these challenges. We point out that our paper sheds light on the importance of embracing poor datasets, enhances the understanding of the associated challenges, and contributes to the ambitious objective of deploying deep learning in real-world applications. To facilitate the progress, we finally describe several outstanding questions and point out potential future directions. Although our primary focus is on plant disease recognition, we emphasize that the principles of embracing and analyzing poor datasets are applicable to a wider range of domains, including agriculture. Our project is public available at https://github.com/xml94/EmbracingLimitedImperfectTrainingDatasets.</p

FigShare

Additional file 3: Table S3. of Whole genome scan reveals the genetic signature of African Ankole cattle breed and potential for higher quality beef

Author: Hak-Kyo Lee (540982)
Heebal Kim (9991)
Jaemin Kim (386282)
Kelsey Caetano-Anolles (3443945)
Mengistie Taye (3739204)
Okeyo Mwai (3739198)
Olivier Hanotte (89422)
Seoae Cho (75685)
Sook Yoon (3739201)
Stephen Kemp (373419)
Sung Oh (3739207)
Tadelle Dessie (3546146)
Wonseok Lee (3705475)
Publication venue
Publication date
Field of study

Summary of genes common for both XP-EHH and XP-CLR test statistics. (XLS 45 kb

FigShare

Additional file 4: Table S4. of Whole genome scan reveals the genetic signature of African Ankole cattle breed and potential for higher quality beef

Author: Hak-Kyo Lee (540982)
Heebal Kim (9991)
Jaemin Kim (386282)
Kelsey Caetano-Anolles (3443945)
Mengistie Taye (3739204)
Okeyo Mwai (3739198)
Olivier Hanotte (89422)
Seoae Cho (75685)
Sook Yoon (3739201)
Stephen Kemp (373419)
Sung Oh (3739207)
Tadelle Dessie (3546146)
Wonseok Lee (3705475)
Publication venue
Publication date
Field of study

Gene Ontology Biological Process terms obtained from DAVID gene ontology analysis using all XP-EHH and XP-CLR gene lists. (XLS 47 kb

FigShare

Additional file 5: Figure S1. of Whole genome scan reveals the genetic signature of African Ankole cattle breed and potential for higher quality beef

Author: Hak-Kyo Lee (540982)
Heebal Kim (9991)
Jaemin Kim (386282)
Kelsey Caetano-Anolles (3443945)
Mengistie Taye (3739204)
Okeyo Mwai (3739198)
Olivier Hanotte (89422)
Seoae Cho (75685)
Sook Yoon (3739201)
Stephen Kemp (373419)
Sung Oh (3739207)
Tadelle Dessie (3546146)
Wonseok Lee (3705475)
Publication venue
Publication date
Field of study

Tajima’s D and FST plot of positively selected gene regions in Sanga and indicus cattle populations. The Tajima’s D plot for each gene region (upper plot for each gene) is the Tajima’s D value within 50 kb window plotted for both populations. The smaller (negative) Tajima’s D value in Sanga population shows that the gene region considered is under positive selection. The FST plot (lower plot for each gene) is the FST values within 50 kb windows separated by 5 kb steps. (DOCX 300 kb

FigShare

Additional file 6: Table S5. of Whole genome scan reveals the genetic signature of African Ankole cattle breed and potential for higher quality beef

Author: Hak-Kyo Lee (540982)
Heebal Kim (9991)
Jaemin Kim (386282)
Kelsey Caetano-Anolles (3443945)
Mengistie Taye (3739204)
Okeyo Mwai (3739198)
Olivier Hanotte (89422)
Seoae Cho (75685)
Sook Yoon (3739201)
Stephen Kemp (373419)
Sung Oh (3739207)
Tadelle Dessie (3546146)
Wonseok Lee (3705475)
Publication venue
Publication date
Field of study

Names and descriptions of major candidate meat quality related genes. All gene names and descriptions are based on RefSeq. (DOCX 24 kb

FigShare