17 research outputs found

    An Improved Canine Genome and a Comprehensive Catalogue of Coding Genes and Non-Coding Transcripts

    Get PDF
    The domestic dog, Canis familiaris, is a well-established model system for mapping trait and disease loci. While the original draft sequence was of good quality, gaps were abundant particularly in promoter regions of the genome, negatively impacting the annotation and study of candidate genes. Here, we present an improved genome build, canFam3.1, which includes 85 MB of novel sequence and now covers 99.8% of the euchromatic portion of the genome. We also present multiple RNA-Sequencing data sets from 10 different canine tissues to catalog ∼175,000 expressed loci. While about 90% of the coding genes previously annotated by EnsEMBL have measurable expression in at least one sample, the number of transcript isoforms detected by our data expands the EnsEMBL annotations by a factor of four. Syntenic comparison with the human genome revealed an additional ∼3,000 loci that are characterized as protein coding in human and were also expressed in the dog, suggesting that those were previously not annotated in the EnsEMBL canine gene set. In addition to ∼20,700 high-confidence protein coding loci, we found ∼4,600 antisense transcripts overlapping exons of protein coding genes, ∼7,200 intergenic multi-exon transcripts without coding potential, likely candidates for long intergenic non-coding RNAs (lincRNAs) and ∼11,000 transcripts were reported by two different library construction methods but did not fit any of the above categories. Of the lincRNAs, about 6,000 have no annotated orthologs in human or mouse. Functional analysis of two novel transcripts with shRNA in a mouse kidney cell line altered cell morphology and motility. All in all, we provide a much-improved annotation of the canine genome and suggest regulatory functions for several of the novel non-coding transcripts

    Population genomic sequencing of Coccidioides fungi reveals recent hybridization and transposon control

    No full text
    We have sequenced the genomes of 18 isolates of the closely related human pathogenic fungi Coccidioides immitis and Coccidioides posadasii to more clearly elucidate population genomic structure, bringing the total number of sequenced genomes for each species to 10. Our data confirm earlier microsatellite-based findings that these species are genetically differentiated, but our population genomics approach reveals that hybridization and genetic introgression have recently occurred between the two species. The directionality of introgression is primarily from C. posadasii to C. immitis, and we find more than 800 genes exhibiting strong evidence of introgression in one or more sequenced isolates. We performed PCR-based sequencing of one region exhibiting introgression in 40 C. immitis isolates to confirm and better define the extent of gene flow between the species. We find more coding sequence than expected by chance in the introgressed regions, suggesting that natural selection may play a role in the observed genetic exchange. We find notable heterogeneity in repetitive sequence composition among the sequenced genomes and present the first detailed genome-wide profile of a repeat-induced point mutation (RIP) process distinctly different from what has been observed in Neurospora. We identify promiscuous HLA-I and HLA-II epitopes in both proteomes and discuss the possible implications of introgression and population genomic data for public health and vaccine candidate prioritization. This study highlights the importance of population genomic data for detecting subtle but potentially important phenomena such as introgression

    Distance trees of expression profiles.

    No full text
    <p>We constructed neighbor-joining trees based on the correlation between expression values (FPKM>1.0) between samples, with 1 minus Spearman's rho defining the distance. Colors denote library construction methods (poly-A: blue, DSN: red). We divided transcribed loci into (a) protein coding genes with RNA-Seq support, either annotated by EnsEMBL in dog or EnsEMBL in the human orthologous regions. Replicates cluster together, so do the library constructions methods poly-A and DSN, as well as related tissues, such as heart and muscle; (b) antisense transcripts, that overlap at least one exon of a protein coding gene, as defined in (a). With the exception of testis, poly-A and DSN separate the samples, with both the poly-A and DSN sub-trees maintaining closer relationships between the related tissues heart and muscle; (c) spliced intergenic loci, excluding sequences that have coding potential. Similar to protein coding genes, the poly-A and DSN group by tissue first, with the exception of kidney DSN; and (d) intergenic and uncharacterized single-exon transcript loci. In this set, DSN and poly-A are, similar to antisense loci, the most dominant factor when grouping samples.</p

    Gene expression profiling in pachyonychia congenita skin

    No full text
    BACKGROUND: Pachyonychia congenita (PC) is a skin disorder resulting from mutations in keratin (K) proteins including K6a, K6b, K16, and K17. One of the major symptoms is painful plantar keratoderma. The pathogenic sequelae resulting from the keratin mutations remain unclear. OBJECTIVE: To better understand PC pathogenesis. METHODS: RNA profiling was performed on biopsies taken from PC-involved and uninvolved plantar skin of seven genotyped PC patients (two K6a, one K6b, three K16, and one K17) as well as from control volunteers. Protein profiling was generated from tape-stripping samples. RESULTS: A comparison of PC-involved skin biopsies to adjacent uninvolved plantar skin identified 112 differentially-expressed mRNAs common to patient groups harboring K6 (i.e., both K6a and K6b) and K16 mutations. Among these mRNAs, 25 encode structural proteins including keratins, small proline-rich and late cornified envelope proteins, 20 are related to metabolism and 16 encode proteases, peptidases, and their inhibitors including kallikrein-related peptidases (KLKs), and serine protease inhibitors (SERPINs). mRNAs were also identified to be differentially expressed only in K6 (81) or K16 (141) patient samples. Furthermore, 13 mRNAs were identified that may be involved in pain including nociception and neuropathy. Protein profiling, comparing three K6a plantar tape-stripping samples to non-PC controls, showed changes in the PC corneocytes similar, but not identical, to the mRNA analysis. CONCLUSION: Many differentially-expressed genes identified in PC-involved skin encode components critical for skin barrier homeostasis including keratinocyte proliferation, differentiation, cornification, and desquamation. The profiling data provide a foundation for unraveling the pathogenesis of PC and identifying targets for developing effective PC therapeutics
    corecore