NA is supported by a fellowship from King Saud University (Riyadh, Saudi Arabia). The authors thank the management team of the ALICE High Performance Computing Facility at the University of Leicester for their work. JDR is supported by the BBSRC grant BB/P504737/1.

Data Availability Statement

The datasets generated for this study can be found in GenBank (accession numbers SAMN12840193–SAMN12840250).