7 research outputs found
Trinity_assemblies
Transcriptome assemblies for 21 eupulmonate species. The transcriptomes were assembled using the program Trinity
Manual curation: 500 gene alignments
The alignments for 500 single copy, orthologous, nuclear genes across 21 representatives of the eupulmonates. Orthology was assessed through manual curation and gene tree assessment. Each alignment contains a mask, 'x' denotes regions that were masked out (i.e. remove from further analyses). The alignments contain dummy sequences for missing taxa
Camaenidae alignment
The concatenated alignment of the 2,648 exons which were sequenced from representatives of the family Camaenidae using exon capture. This alignment was used to produce the camaenidae phylogeny presented in the paper
Camaenidae_exon_capture_probe_set
This file contains the probes for the Camaenidae exon capture design. These probes target exons from 490 orthologous genes. The probes were designed for use with the Mycroarray Mybaits custom kit which consists of 120 bp RNA probes
'Agalma equivalent' alignments
The alignments representing a subset of the output of Agalma, run on 21 eupulmonate transcriptomes.
This subset is the 635 orthologous clusters identified by the automated pipeline Agalma, which correspond to the 500 nuclear single copy, orthologous genes identified by manual curation. The alignments contain dummy sequences for missing taxa
'Agalma best' alignments
The alignments representing a subset of the output of Agalma, run on 21 eupulmonate transcriptomes. This subset is the 546 orthologous clusters identified by Agalma, where each orthologous cluster was the only one produced from the respective homolog cluster and had sequences for at least 18 taxa. The alignments contain dummy sequences for missing taxa
Manual_curation_500_genes_seperated_into_exons
This file contains the alignments for the 500 manually curated genes seperated out into alignments per exon based on the exon boundaries from the Lottia gigantea genome. These alignments contain the regions which are masked out in the gene alignments but the mask is not presented