The alignment consists of 42,626 masked 16S rDNA sequences sampled from GenBank Gammaproteobacteria, together with all known endosymbionts from lice, including the taxa newly sequenced for this study (KX146199-KX146216). The tree files consist of six phylogenetic trees built from different samples of the 42,626 taxa. The reduced trees are consensus trees of sequences sampled by a clustering algorithm and selected to represent sequence diversity. The number in each file name indicates the stringency of clustering. For example, Ribo.70.con.tre indicates that the sequences were selected from clusters in which every sequence had to be at least 70% similar to the others. Ribo.80.con.tre means the tree was built from sequences clustered at 80% similarity or more. Finally, Ribo.100.con.tre indicates that 100% of the sequences were used in this tree.
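The file naming convention can be read programmatically. The following is a minimal Python sketch, not part of the dataset itself: it assumes every tree file follows the pattern Ribo.<number>.con.tre seen in the three names above, and the helper names clustering_threshold and describe are hypothetical.

```python
import re

# Pattern assumed from the three file names given in the description:
# Ribo.<number>.con.tre
TREE_NAME = re.compile(r"^Ribo\.(\d+)\.con\.tre$")


def clustering_threshold(filename):
    """Return the number embedded in a tree file name, or None if it does not match."""
    match = TREE_NAME.match(filename)
    return int(match.group(1)) if match else None


def describe(filename):
    """Turn a tree file name into a one-line description (hypothetical helper)."""
    value = clustering_threshold(filename)
    if value is None:
        return f"{filename}: not a recognised tree file name"
    if value == 100:
        # Per the description, 100 marks the tree built from all sequences.
        return f"{filename}: all sequences used"
    return f"{filename}: sequences sampled from clusters at >= {value}% similarity"


if __name__ == "__main__":
    for name in ["Ribo.70.con.tre", "Ribo.80.con.tre", "Ribo.100.con.tre"]:
        print(describe(name))
```

Only the three file names mentioned in the description are used here; the names of the remaining tree files are not stated, so they are not guessed.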