111 research outputs found

    The antibody loci of the domestic goat (Capra hircus)

    Get PDF
    The domestic goat (Capra hircus) is an important ruminant species both as a source of antibody-based reagents for research and biomedical applications and as an economically important animal for agriculture, particularly for developing nations that maintain most of the global goat population. Characterization of the loci encoding the goat immune repertoire would be highly beneficial for both vaccine and immune reagent development. However, in goat and other species whose reference genomes were generated using short-read sequencing technologies, the immune loci are poorly assembled as a result of their repetitive nature. Our recent construction of a long-read goat genome assembly (ARS1) has facilitated characterization of all three antibody loci with high confidence and comparative analysis to cattle. We observed broad similarity of goat and cattle antibody-encoding loci but with notable differences that likely influence formation of the functional antibody repertoire. The goat heavy-chain locus is restricted to only four functional and nearly identical IGHV genes, in contrast to the ten observed in cattle. Repertoire analysis indicates that light-chain usage is more balanced in goats, with greater representation of kappa light chains (~ 20-30%) compared to that in cattle (~ 5%). The present study represents the first characterization of the goat antibody loci and will help inform future investigations of their antibody responses to disease and vaccination

    The evolution of the natural killer complex; a comparison between mammals using new high-quality genome assemblies and targeted annotation.

    Get PDF
    Natural killer (NK) cells are a diverse population of lymphocytes with a range of biological roles including essential immune functions. NK cell diversity is in part created by the differential expression of cell surface receptors which modulate activation and function, including multiple subfamilies of C-type lectin receptors encoded within the NK complex (NKC). Little is known about the gene content of the NKC beyond rodent and primate lineages, other than it appears to be extremely variable between mammalian groups. We compared the NKC structure between mammalian species using new high-quality draft genome assemblies for cattle and goat; re-annotated sheep, pig, and horse genome assemblies; and the published human, rat, and mouse lemur NKC. The major NKC genes are largely in the equivalent positions in all eight species, with significant independent expansions and deletions between species, allowing us to propose a model for NKC evolution during mammalian radiation. The ruminant species, cattle and goats, have independently evolved a second KLRC locus flanked by KLRA and KLRJ, and a novel KLRH-like gene has acquired an activating tail. This novel gene has duplicated several times within cattle, while other activating receptor genes have been selectively disrupted. Targeted genome enrichment in cattle identified varying levels of allelic polymorphism between the NKC genes concentrated in the predicted extracellular ligand-binding domains. This novel recombination and allelic polymorphism is consistent with NKC evolution under balancing selection, suggesting that this diversity influences individual immune responses and may impact on differential outcomes of pathogen infection and vaccination

    Insertion sequence content reflects genome plasticity in strains of the root nodule actinobacterium Frankia

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genome analysis of three <it>Frankia sp. </it>strains has revealed a high number of transposable elements in two of the strains. Twelve out of the 20 major families of bacterial Insertion Sequence (IS) elements are represented in the 148 annotated transposases of <it>Frankia </it>strain HFPCcI3 (CcI3) comprising 3% of its total coding sequences (CDS). EAN1pec (EAN) has 183 transposase ORFs from 13 IS families comprising 2.2% of its CDS. Strain ACN14a (ACN) differs significantly from the other strains with only 33 transposase ORFs (0.5% of the total CDS) from 9 IS families.</p> <p>Results</p> <p>Insertion sequences in the <it>Frankia </it>genomes were analyzed using BLAST searches, PHYML phylogenies and the IRF (Inverted Repeat Finder) algorithms. To identify putative or decaying IS elements, a PSI-TBLASTN search was performed on all three genomes, identifying 36%, 39% and 12% additional putative transposase ORFs than originally annotated in strains CcI3, EAN and ACN, respectively. The distribution of transposase ORFs in each strain was then analysed using a sliding window, revealing significant clustering of elements in regions of the EAN and CcI3 genomes. Lastly the three genomes were aligned with the MAUVE multiple genome alignment tool, revealing several Large Chromosome Rearrangement (LCR) events; many of which correlate to transposase clusters.</p> <p>Conclusion</p> <p>Analysis of transposase ORFs in <it>Frankia </it>sp. revealed low inter-strain diversity of transposases, suggesting that the majority of transposase proliferation occurred without recent horizontal transfer of novel mobile elements from outside the genus. Exceptions to this include representatives from the IS3 family in strain EAN and seven IS4 transposases in all three strains that have a lower G+C content, suggesting recent horizontal transfer. The clustering of transposase ORFs near LCRs revealed a tendency for IS elements to be associated with regions of chromosome instability in the three strains. The results of this study suggest that IS elements may help drive chromosome differences in different <it>Frankia </it>sp. strains as they have adapted to a variety of hosts and environments.</p

    RAPTR-SV: A Hybrid Method for the Detection of Structural Variants

    Get PDF
    Identification of Structural Variants (SV) in sequence data results in a large number of false positive calls using existing software, which overburdens subsequent validation. Simulations using RAPTR-SV and other, similar algorithms for SV detection revealed that RAPTR-SV had superior sensitivity and precision, as it recovered 66.4% of simulated tandem duplications with a precision of 99.2%. When compared to calls made by Delly and LUMPY on available datasets from the 1000 genomes project, RAPTR-SV showed superior sensitivity for tandem duplications, as it identified two-fold more duplications than Delly, while making approximately 85% fewer duplication predictions. RAPTR-SV is written in Java and uses new features in the collections framework in the latest release of the Java version 8 language specifications. A compiled version of the software, instructions for usage and test results files are available on the GitHub repository page: https://github.com/njdbickhart/RAPTR-SV

    Characterization of the domestic goat γδ T cell receptor gene loci and gene usage

    Get PDF
    Goats and cattle diverged 30 million years ago but retain similarities in immune system genes. Here, the caprine T cell receptor (TCR) gene loci and transcription of its genes were examined and compared to cattle. We annotated the TCR loci using an improved genome assembly (ARS1) of a highly homozygous San Clemente goat. This assembly has already proven useful for describing other immune system genes including antibody and leucocyte receptors. Both the TCRγ (TRG) and TCRδ (TRD) loci were similarly organized in goats as in cattle and the gene sequences were highly conserved. However, the number of genes varied slightly as a result of duplications and differences occurred in mutations resulting in pseudogenes. WC1+ γδ T cells in cattle have been shown to use TCRγ genes from only one of the six available cassettes. The structure of that Cγ gene product is unique and may be necessary to interact with WC1 for signal transduction following antigen ligation. Using RT-PCR and PacBio sequencing, we observed the same restriction for goat WC1+ γδ T cells. In contrast, caprine WC1+ and WC1− γδ T cell populations had a diverse TCRδ gene usage although the propensity for particular gene usage differed between the two cell populations. Noncanonical recombination signal sequences (RSS) largely correlated with restricted expression of TCRγ and δ genes. Finally, caprine γδ T cells were found to incorporate multiple TRD diversity gene sequences in a single transcript, an unusual feature among mammals but also previously observed in cattle

    Genome characteristics of facultatively symbiotic Frankia sp. strains reflect host range and host plant biogeography

    Get PDF
    Soil bacteria that also form mutualistic symbioses in plants encounter two major levels of selection. One occurs during adaptation to and survival in soil, and the other occurs in concert with host plant speciation and adaptation. Actinobacteria from the genus Frankia are facultative symbionts that form N2-fixing root nodules on diverse and globally distributed angiosperms in the “actinorhizal” symbioses. Three closely related clades of Frankia sp. strains are recognized; members of each clade infect a subset of plants from among eight angiosperm families. We sequenced the genomes from three strains; their sizes varied from 5.43 Mbp for a narrow host range strain (Frankia sp. strain HFPCcI3) to 7.50 Mbp for a medium host range strain (Frankia alni strain ACN14a) to 9.04 Mbp for a broad host range strain (Frankia sp. strain EAN1pec.) This size divergence is the largest yet reported for such closely related soil bacteria (97.8%–98.9% identity of 16S rRNA genes). The extent of gene deletion, duplication, and acquisition is in concert with the biogeographic history of the symbioses and host plant speciation. Host plant isolation favored genome contraction, whereas host plant diversification favored genome expansion. The results support the idea that major genome expansions as well as reductions can occur in facultative symbiotic soil bacteria as they respond to new environments in the context of their symbioses

    An improved ovine reference genome assembly to facilitate in depth functional annotation of the sheep genome

    Get PDF
    BACKGROUND: The domestic sheep (Ovis aries) is an important agricultural species raised for meat, wool, and milk across the world. A high-quality reference genome for this species enhances the ability to discover genetic mechanisms influencing biological traits. Furthermore, a high-quality reference genome allows for precise functional annotation of gene regulatory elements. The rapid advances in genome assembly algorithms and emergence of sequencing technologies with increasingly long reads provide the opportunity for an improved de novo assembly of the sheep reference genome. FINDINGS: Short-read Illumina (55× coverage), long-read Pacific Biosciences (75× coverage), and Hi-C data from this ewe retrieved from public databases were combined with an additional 50× coverage of Oxford Nanopore data and assembled with canu v1.9. The assembled contigs were scaffolded using Hi-C data with Salsa v2.2, gaps filled with PBsuitev15.8.24, and polished with Nanopolish v0.12.5. After duplicate contig removal with PurgeDups v1.0.1, chromosomes were oriented and polished with 2 rounds of a pipeline that consisted of freebayes v1.3.1 to call variants, Merfin to validate them, and BCFtools to generate the consensus fasta. The ARS-UI_Ramb_v2.0 assembly is 2.63 Gb in length and has improved continuity (contig NG50 of 43.18 Mb), with a 19- and 38-fold decrease in the number of scaffolds compared with Oar_rambouillet_v1.0 and Oar_v4.0. ARS-UI_Ramb_v2.0 has greater per-base accuracy and fewer insertions and deletions identified from mapped RNA sequence than previous assemblies. CONCLUSIONS: The ARS-UI_Ramb_v2.0 assembly is a substantial improvement in contiguity that will optimize the functional annotation of the sheep genome and facilitate improved mapping accuracy of genetic variant and expression data for traits in sheep

    Effect of consuming endophyte-infected fescue seed on transcript abundance in the mammary gland of lactating and dry cows, as assessed by RNA sequencing

    Get PDF
    Ergot alkaloids in endophyte-infected grasses inhibit prolactin secretion and reduce milk production in lactating cows. However, we previously showed that prepartum consumption of infected seed throughout the dry period did not inhibit subsequent milk production and prior exposure to bromocriptine (ergot peptide) actually increased production in the next lactation. To identify changes in the transcriptome and molecular pathways mediating the mammary gland's response to ergot alkaloids in the diet, RNA sequencing (RNA-seq) was performed on mammary tissues obtained from 22 multiparous Holstein cows exposed to 1 of 3 treatments. Starting at 90 ± 4 d prepartum, cows were fed endophyte-free fescue seed (control; CON), endophyte-free fescue seed plus 3×/wk subcutaneous injections of bromocriptine (BROMO; 0.1 mg/kg of BW), or endophyte-infected fescue seed (INF) as 10% of the diet. Cows were dried off 60 ± 2 d prepartum. Mammary biopsies from 4 (BROMO, INF) or 5 (CON) cows/treatment at each of the 3 phases were obtained: 7 d before dry off during the initial lactation (L1), mid-dry period (D), and 10 d postpartum (L2). Although tissue from the same cow was preferentially used at 3 phases (L1, D, L2), tissue from additional cows were used to as necessary to provide RNA of sufficient quality. Individual samples were used to generate individual RNA-seq libraries. Normalized reads of the RNA-seq data were organized into technical and biological replicates before processing with the RSEM software package. Each lactation phase was processed separately and genes that differed between any of 3 treatments were identified. A large proportion of genes differentially expressed in at least 1 treatment (n = 866) were found to be similarly expressed in BROMO and INF treatments, but differentially expressed from CON (n = 575, total for 3 phases). Of genes differentially expressed compared with CON, 104 genes were common to the L1 and L2 phases. Consistent with the production findings, networks most affected by treatments in L1 and L2 included lipid metabolism, small molecule biochemistry, and molecular transport, whereas networks related more to developmental and cellular functions and maintenance were evident during D phase. Similar patterns of expression in BROMO and INF during late and early lactation suggest involvement of similar cell signaling pathways or mechanisms of action for BROMO and INF and the importance of prolactin messaging pathways

    Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies

    Get PDF
    Advantages of pangenomes over linear reference assemblies for genome research have recently been established. However, potential effects of sequence platform and assembly approach, or of combining assemblies created by different approaches, on pangenome construction have not been investigated. Here we generate haplotype-resolved assemblies from the offspring of three bovine trios representing increasing levels of heterozygosity that each demonstrate a substantial improvement in contiguity, completeness, and accuracy over the current Bos taurus reference genome. Diploid coverage as low as 20x for HiFi or 60x for ONT is sufficient to produce two haplotype-resolved assemblies meeting standards set by the Vertebrate Genomes Project. Structural variant-based pangenomes created from the haplotype-resolved assemblies demonstrate significant consensus regardless of sequence platform, assembler algorithm, or coverage. Inspecting pangenome topologies identifies 90 thousand structural variants including 931 overlapping with coding sequences; this approach reveals variants affecting QRICH2, PRDM9, HSPA1A, TAS2R46, and GC that have potential to affect phenotype

    Structural variant-based pangenome construction has low sensitivity to variability of haplotype-resolved bovine assemblies

    Full text link
    Advantages of pangenomes over linear reference assemblies for genome research have recently been established. However, potential effects of sequence platform and assembly approach, or of combining assemblies created by different approaches, on pangenome construction have not been investigated. Here we generate haplotype-resolved assemblies from the offspring of three bovine trios representing increasing levels of heterozygosity that each demonstrate a substantial improvement in contiguity, completeness, and accuracy over the current Bos taurus reference genome. Diploid coverage as low as 20x for HiFi or 60x for ONT is sufficient to produce two haplotype-resolved assemblies meeting standards set by the Vertebrate Genomes Project. Structural variant-based pangenomes created from the haplotype-resolved assemblies demonstrate significant consensus regardless of sequence platform, assembler algorithm, or coverage. Inspecting pangenome topologies identifies 90 thousand structural variants including 931 overlapping with coding sequences; this approach reveals variants affecting QRICH2, PRDM9, HSPA1A, TAS2R46, and GC that have potential to affect phenotype
    corecore