118 research outputs found

    Phenotype data

    No full text
    This file contains tapeworm parasite infection scoring data for Tribolium castaneum

    Chromosome-level genomic information.

    No full text
    Schistosomiasis is one of the world’s most devastating parasitic diseases, afflicting 251 million people globally. The Neotropical snail Biomphalaria glabrata is an important intermediate host of the human blood fluke Schistosoma mansoni and a predominant model for schistosomiasis research. To fully exploit this model snail for biomedical research, here we report a haplotype-like, chromosome-level assembled and annotated genome of the homozygous iM line of B. glabrata that we developed at the University of New Mexico. Using multiple sequencing platforms, including Illumina, PacBio, and Omni-C sequencing, 18 sequence contact matrices representing 18 haploid chromosomes (2n = 36) were generated (337x genome coverage), and 96.5% of the scaffold sequences were anchored to the 18 chromosomes. Protein-coding genes (n = 34,559), non-coding RNAs (n = 2,406), and repetitive elements (42.52% of the genome) were predicted for the whole genome, and detailed annotations for individual chromosomes were also provided. Using this genomic resource, we have investigated the genomic structure and organization of the Toll-like receptor (TLR) and fibrinogen-domain containing protein (FReD) genes, the two important immune-related gene families. Notably, TLR-like genes are scattered on 13 chromosomes. In contrast, almost all (39 of 40) fibrinogen-related genes (FREPs) (immunoglobulin superfamily (IgSF) + fibrinogen (FBG)) are clustered within a 5-million nucleotide region on chromosome 13, yielding insight into mechanisms involved in the diversification of FREPs. This is the first genome of schistosomiasis vector snails that has been assembled at the chromosome level, annotated, and analyzed. It serves as a valuable resource for a deeper understanding of the biology of vector snails, especially Biomphalaria snails.</div

    Phylogeny and domain structure of FReDs in the <i>B</i>. <i>glabrata</i> genome.

    No full text
    Left panel: A ML tree was constructed using full length aa sequences of 80 FReDs. ModelFinder selected WAG+F+R7 as the best-fit model for tree inference (Bayesian Information Criterion). A test of 1,000 bootstrap replicates was performed and nodes with bootstrap support of 60 or higher are marked with different colorss. Right panel: structure of FReD gene products. EGF: epidermal growth factor; IgSF: immunoglobulin superfamily; FBG: fibrinogen domain.</p

    Functional annotation for predicted gene models in <i>B</i>. <i>glabrata</i>.

    No full text
    Four major bioinformatics databases including Uniports database, NCBI-non-redundant nucleotide (NT) database, NCBI non-redundant protein (NR) database, and InterProScan integrated conserved database were applied. Percentages were calculated based on a total of 34,559 gene models. (TIFF)</p

    Distribution of <i>TLR</i> genes on <i>B</i>. <i>glabrata</i> chromosomes.

    No full text
    The number in parenthesis is the number of TLR genes identified on the chromosome. Only chromosomes possessing TLR are shown.</p

    Fig 2 -

    No full text
    Synteny plot analyses show the comparison among the 18 chromosomes (A) and between the 18 chromosome-level assemblies and the 18 LGs (B). Blue and pink rectangles show matches in the same and reverse directions, respectively. The chromosome- and scaffold-level assemblies for the iM line of B. glabrata were compared at the nucleotide level using Minimap2.</p

    Phylogenetic relationship of 39 <i>FREPs</i> and their locations on chromosome 13.

    No full text
    The ML tree was constructed with 1,000 bootstrap replicates. ModelFinder selected JTT+F+R4 as the best-fit model for tree inference (Bayesian Information Criterion). A test of 1,000 bootstrap replicates was performed and nodes with bootstrap support of 60 or higher are indicated by different colors FREP genes are grouped into 3 large clusters, in which their sequences are indicated by purple, green, and orange lines, respectively.</p

    Density and distribution of protein-coding genes, repetitive elements, and non-coding RNAs on the 18 chromosomes.

    No full text
    Density and distribution of protein-coding genes, repetitive elements, and non-coding RNAs on the 18 chromosomes.</p
    corecore