Vitamins are nutrients that are essential to human health, and deficiencies have been shown to cause severe diseases. In this study, a computational approach was used to identify vitamin deficiency diseases and plant-based foods with vitamin content. Data from the United States Department of Agriculture Standard Reference (SR27), National Library of Medicine\u27s Medical Subject Headings and MEDLINE, and Wikipedia were combined to identify vitamin deficiency diseases and vitamin content of plant-based foods. A total of 41,584 vitamin-disease associations were identified from MEDLINE-indexed articles as well as from entries in Wikipedia. The SR27 identified 1912 foods that contained at least one vitamin, with an average of 1276 foods per vitamin. Vitamin B12 and D contained the fewest number of foods (n=135 and 70, respectively). The results of this study establish the foundation for developing a process to link vitamin deficiency diseases to vitamin-rich foods