63 research outputs found

    Nucleotide Polymorphisms in the Canine Noggin Gene and Their Distribution Among Dog (Canis lupus familiaris) Breeds

    Get PDF
    Noggin (NOG) is an important regulator for the signaling of bone morphogenetic proteins. In this study, we sequenced the complete coding sequence of the canine NOG gene and characterized the nucleotide polymorphisms. The sequence length varied from 717 to 729 bp, depending on the number of a 6-bp tandem repeat unit (GGCGCG), an insertion that has not been observed in other mammalian NOG genes investigated to date. It results in extensions of (Gly–Ala)3–5 in the putative NOG protein. To survey the distribution of these tandem repeat polymorphisms, we analyzed 126 individuals in seven dog breeds. We identified only three alleles: (GGCGCG)3, (GGCGCG)4, and (GGCGCG)5. Although the allele frequencies were remarkably different among the breeds, the three alleles were present in all seven of the breeds and did not show any deviation from Hardy–Weinberg equilibrium

    Analysis of Microsatellite Variation in Drosophila melanogaster with Population-Scale Genome Sequencing

    Get PDF
    Genome sequencing technologies promise to revolutionize our understanding of genetics, evolution, and disease by making it feasible to survey a broad spectrum of sequence variation on a population scale. However, this potential can only be realized to the extent that methods for extracting and interpreting distinct forms of variation can be established. The error profiles and read length limitations of early versions of next-generation sequencing technologies rendered them ineffective for some sequence variant types, particularly microsatellites and other tandem repeats, and fostered the general misconception that such variants are inherently inaccessible to these platforms. At the same time, tandem repeats have emerged as important sources of functional variation. Tandem repeats are often located in and around genes, and frequent mutations in their lengths exert quantitative effects on gene function and phenotype, rapidly degrading linkage disequilibrium between markers and traits. Sensitive identification of these variants in large-scale next-gen sequencing efforts will enable more comprehensive association studies capable of revealing previously invisible associations. We present a population-scale analysis of microsatellite repeats using whole-genome data from 158 inbred isolates from the Drosophila Genetics Reference Panel, a collection of over 200 extensively phenotypically characterized isolates from a single natural population, to uncover processes underlying repeat mutation and to enable associations with behavioral, morphological, and life-history traits. Analysis of repeat variation from next-generation sequence data will also enhance studies of genome stability and neurodegenerative diseases

    Bovine proteins containing poly-glutamine repeats are often polymorphic and enriched for components of transcriptional regulatory complexes

    Get PDF
    peer-reviewedBackground: About forty human diseases are caused by repeat instability mutations. A distinct subset of these diseases is the result of extreme expansions of polymorphic trinucleotide repeats; typically CAG repeats encoding poly-glutamine (poly-Q) tracts in proteins. Polymorphic repeat length variation is also apparent in human poly-Q encoding genes from normal individuals. As these coding sequence repeats are subject to selection in mammals, it has been suggested that normal variations in some of these typically highly conserved genes are implicated in morphological differences between species and phenotypic variations within species. At present, poly-Q encoding genes in non-human mammalian species are poorly documented, as are their functions and propensities for polymorphic variation. Results: The current investigation identified 178 bovine poly-Q encoding genes (Q ≥ 5) and within this group, 26 genes with orthologs in both human and mouse that did not contain poly-Q repeats. The bovine poly-Q encoding genes typically had ubiquitous expression patterns although there was bias towards expression in epithelia, brain and testes. They were also characterised by unusually large sizes. Analysis of gene ontology terms revealed that the encoded proteins were strongly enriched for functions associated with transcriptional regulation and many contributed to physical interaction networks in the nucleus where they presumably act cooperatively in transcriptional regulatory complexes. In addition, the coding sequence CAG repeats in some bovine genes impacted mRNA splicing thereby generating unusual transcriptional diversity, which in at least one instance was tissue-specific. The poly-Q encoding genes were prioritised using multiple criteria for their likelihood of being polymorphic and then the highest ranking group was experimentally tested for polymorphic variation within a cattle diversity panel. Extensive and meiotically stable variation was identified. Conclusions: Transcriptional diversity can potentially be generated in poly-Q encoding genes by the impact of CAG repeat tracts on mRNA alternative splicing. This effect, combined with the physical interactions of the encoded proteins in large transcriptional regulatory complexes suggests that polymorphic variations of proteins in these complexes have strong potential to affect phenotype.Dairy Australia (through the Innovative Dairy Cooperative Research Center

    Unraveling the mysteries of dog evolution

    Get PDF
    The increased battery of molecular markers, derived from comparative genomics, is aiding our understanding of the genetics of domestication. The recent BMC Biology article pertaining to the evolution of small size in dogs is an example of how such methods can be used to study the origin and diversification of the domestic dog. We are still challenged, however, to appreciate the genetic mechanisms responsible for the phenotypic diversity seen in 'our best friend'

    Diversification and Molecular Evolution of ATOH8, a Gene Encoding a bHLH Transcription Factor

    Get PDF
    ATOH8 is a bHLH domain transcription factor implicated in the development of the nervous system, kidney, pancreas, retina and muscle. In the present study, we collected sequence of ATOH8 orthologues from 18 vertebrate species and 24 invertebrate species. The reconstruction of ATOH8 phylogeny and sequence analysis showed that this gene underwent notable divergences during evolution. For those vertebrate species investigated, we analyzed the gene structure and regulatory elements of ATOH8. We found that the bHLH domain of vertebrate ATOH8 was highly conserved. Mammals retained some specific amino acids in contrast to the non-mammalian orthologues. Mammals also developed another potential isoform, verified by a human expressed sequence tag (EST). Comparative genomic analyses of the regulatory elements revealed a replacement of the ancestral TATA box by CpG-islands in the eutherian mammals and an evolutionary tendency for TATA box reduction in vertebrates in general. We furthermore identified the region of the effective promoter of human ATOH8 which could drive the expression of EGFP reporter in the chicken embryo. In the opossum, both the coding region and regulatory elements of ATOH8 have some special features, such as the unique extended C-terminus encoded by the third exon and absence of both CpG islands and TATA elements in the regulatory region. Our gene mapping data showed that in human, ATOH8 was hosted in one chromosome which is a fusion product of two orthologous chromosomes in non-human primates. This unique chromosomal environment of human ATOH8 probably subjects its expression to the regulation at chromosomal level. We deduce that the great interspecific differences found in both ATOH8 gene sequence and its regulatory elements might be significant for the fine regulation of its spatiotemporal expression and roles of ATOH8, thus orchestrating its function in different tissues and organisms

    Variation within the Huntington's Disease Gene Influences Normal Brain Structure

    Get PDF
    Genetics of the variability of normal and diseased brain structure largely remains to be elucidated. Expansions of certain trinucleotide repeats cause neurodegenerative disorders of which Huntington's disease constitutes the most common example. Here, we test the hypothesis that variation within the IT15 gene on chromosome 4, whose expansion causes Huntington's disease, influences normal human brain structure. In 278 normal subjects, we determined CAG repeat length within the IT15 gene on chromosome 4 and analyzed high-resolution T1-weighted magnetic resonance images by the use of voxel-based morphometry. We found an increase of GM with increasing long CAG repeat and its interaction with age within the pallidum, which is involved in Huntington's disease. Our study demonstrates that a certain trinucleotide repeat influences normal brain structure in humans. This result may have important implications for the understanding of both the healthy and diseased brain

    The IGF1 small dog haplotype is derived from Middle Eastern grey wolves

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>A selective sweep containing the insulin-like growth factor 1 (<it>IGF1</it>) gene is associated with size variation in domestic dogs. Intron 2 of <it>IGF1 </it>contains a SINE element and single nucleotide polymorphism (SNP) found in all small dog breeds that is almost entirely absent from large breeds. In this study, we surveyed a large sample of grey wolf populations to better understand the ancestral pattern of variation at <it>IGF1 </it>with a particular focus on the distribution of the small dog haplotype and its relationship to the origin of the dog.</p> <p>Results</p> <p>We present DNA sequence data that confirms the absence of the derived small SNP allele in the intron 2 region of <it>IGF1 </it>in a large sample of grey wolves and further establishes the absence of a small dog associated SINE element in all wild canids and most large dog breeds. Grey wolf haplotypes from the Middle East have higher nucleotide diversity suggesting an origin there. Additionally, PCA and phylogenetic analyses suggests a closer kinship of the small domestic dog <it>IGF1 </it>haplotype with those from Middle Eastern grey wolves.</p> <p>Conclusions</p> <p>The absence of both the SINE element and SNP allele in grey wolves suggests that the mutation for small body size post-dates the domestication of dogs. However, because all small dogs possess these diagnostic mutations, the mutations likely arose early in the history of domestic dogs. Our results show that the small dog haplotype is closely related to those in Middle Eastern wolves and is consistent with an ancient origin of the small dog haplotype there. Thus, in concordance with past archeological studies, our molecular analysis is consistent with the early evolution of small size in dogs from the Middle East.</p> <p>See associated opinion by Driscoll and Macdonald: <url>http://jbiol.com/content/9/2/10</url></p

    An analysis of single amino acid repeats as use case for application specific background models

    Get PDF
    Background Sequence analysis aims to identify biologically relevant signals against a backdrop of functionally meaningless variation. Increasingly, it is recognized that the quality of the background model directly affects the performance of analyses. State-of-the-art approaches rely on classical sequence models that are adapted to the studied dataset. Although performing well in the analysis of globular protein domains, these models break down in regions of stronger compositional bias or low complexity. While these regions are typically filtered, there is increasing anecdotal evidence of functional roles. This motivates an exploration of more complex sequence models and application-specific approaches for the investigation of biased regions. Results Traditional Markov-chains and application-specific regression models are compared using the example of predicting runs of single amino acids, a particularly simple class of biased regions. Cross-fold validation experiments reveal that the alternative regression models capture the multi-variate trends well, despite their low dimensionality and in contrast even to higher-order Markov-predictors. We show how the significance of unusual observations can be computed for such empirical models. The power of a dedicated model in the detection of biologically interesting signals is then demonstrated in an analysis identifying the unexpected enrichment of contiguous leucine-repeats in signal-peptides. Considering different reference sets, we show how the question examined actually defines what constitutes the 'background'. Results can thus be highly sensitive to the choice of appropriate model training sets. Conversely, the choice of reference data determines the questions that can be investigated in an analysis. Conclusions Using a specific case of studying biased regions as an example, we have demonstrated that the construction of application-specific background models is both necessary and feasible in a challenging sequence analysis situation
    corecore