Additional file 3: Fig. S2. The random forest model was constructed for the genus taxonomic level (a). Comparison of model performance of random forests with different numbers of species, with the largest ROC values obtained for the 20 species selected (b). The AUC (Area Under Curve) is defined as the area under the ROC curve. Typically, it has a value between 1.0 and 0.5. For AUC > 0.5, the closer the AUC is to 1, the better the classification prediction is