73 research outputs found

    Compositional biases and polyalanine runs in humans

    Get PDF

    Genome-Wide Linkage in a Highly Consanguineous Pedigree Reveals Two Novel Loci on Chromosome 7 for Non-Syndromic Familial Premature Ovarian Failure

    Get PDF
    BACKGROUND: The human condition known as Premature Ovarian Failure (POF) is characterized by loss of ovarian function before the age of 40. A majority of POF cases are sporadic, but 10-15% are familial, suggesting a genetic origin of the disease. Although several causal mutations have been identified, the etiology of POF is still unknown for about 90% of the patients.¦METHODOLOGY/PRINCIPAL FINDINGS: We report a genome-wide linkage and homozygosity analysis in one large consanguineous Middle-Eastern POF-affected family presenting an autosomal recessive pattern of inheritance. We identified two regions with a LOD(max) of 3.26 on chromosome 7p21.1-15.3 and 7q21.3-22.2, which are supported as candidate regions by homozygosity mapping. Sequencing of the coding exons and known regulatory sequences of three candidate genes (DLX5, DLX6 and DSS1) included within the largest region did not reveal any causal mutations.¦CONCLUSIONS/SIGNIFICANCE: We detect two novel POF-associated loci on human chromosome 7, opening the way to the identification of new genes involved in the control of ovarian development and function

    A homozygous FANCM mutation underlies a familial case of non-syndromic primary ovarian insufficiency

    Get PDF
    Primary Ovarian Insufficiency (P01) affects 1% of women under forty. Exome sequencing of two Finnish sisters with non-syndromic P01 revealed a homozygous mutation in FANCM, leading to a truncated protein (p.GIn1701*). FANCM is a DNA-damage response gene whose heterozygous mutations predispose to breast cancer. Compared to the mother's cells, the patients' lymphocytes displayed higher levels of basal and mitomycin C (MMC)-induced chromosomal abnormalities. Their lymphoblasts were hypersensitive to MMC and MMC-induced monoubiquitination of FANCD2 was impaired. Genetic complementation of patient's cells with wild-type FANCM improved their resistance to MMC re-establishing FANCD2 monoubiquitination. FANCM was more strongly expressed in human fetal germ cells than in somatic cells. FANCM protein was preferentially expressed along the chromosomes in pachytene cells, which undergo meiotic recombination. This mutation may provoke meiotic defects leading to a depleted follicular stock, as in Fancrril- mice. Our findings document the first Mendelian phenotype due to a biallelic FANCM mutation

    An analysis of single amino acid repeats as use case for application specific background models

    Get PDF
    Background Sequence analysis aims to identify biologically relevant signals against a backdrop of functionally meaningless variation. Increasingly, it is recognized that the quality of the background model directly affects the performance of analyses. State-of-the-art approaches rely on classical sequence models that are adapted to the studied dataset. Although performing well in the analysis of globular protein domains, these models break down in regions of stronger compositional bias or low complexity. While these regions are typically filtered, there is increasing anecdotal evidence of functional roles. This motivates an exploration of more complex sequence models and application-specific approaches for the investigation of biased regions. Results Traditional Markov-chains and application-specific regression models are compared using the example of predicting runs of single amino acids, a particularly simple class of biased regions. Cross-fold validation experiments reveal that the alternative regression models capture the multi-variate trends well, despite their low dimensionality and in contrast even to higher-order Markov-predictors. We show how the significance of unusual observations can be computed for such empirical models. The power of a dedicated model in the detection of biologically interesting signals is then demonstrated in an analysis identifying the unexpected enrichment of contiguous leucine-repeats in signal-peptides. Considering different reference sets, we show how the question examined actually defines what constitutes the 'background'. Results can thus be highly sensitive to the choice of appropriate model training sets. Conversely, the choice of reference data determines the questions that can be investigated in an analysis. Conclusions Using a specific case of studying biased regions as an example, we have demonstrated that the construction of application-specific background models is both necessary and feasible in a challenging sequence analysis situation

    Large Scale Comparative Codon-Pair Context Analysis Unveils General Rules that Fine-Tune Evolution of mRNA Primary Structure

    Get PDF
    BACKGROUND: Codon usage and codon-pair context are important gene primary structure features that influence mRNA decoding fidelity. In order to identify general rules that shape codon-pair context and minimize mRNA decoding error, we have carried out a large scale comparative codon-pair context analysis of 119 fully sequenced genomes. METHODOLOGIES/PRINCIPAL FINDINGS: We have developed mathematical and software tools for large scale comparative codon-pair context analysis. These methodologies unveiled general and species specific codon-pair context rules that govern evolution of mRNAs in the 3 domains of life. We show that evolution of bacterial and archeal mRNA primary structure is mainly dependent on constraints imposed by the translational machinery, while in eukaryotes DNA methylation and tri-nucleotide repeats impose strong biases on codon-pair context. CONCLUSIONS: The data highlight fundamental differences between prokaryotic and eukaryotic mRNA decoding rules, which are partially independent of codon usage

    C14ORF39/SIX6OS1 is a constituent of the synaptonemal complex and is essential for mouse fertility

    Get PDF
    Meiotic recombination generates crossovers between homologous chromosomes that are essential for genome haploidization. The synaptonemal complex is a ‘zipper’-like protein assembly that synapses homologue pairs together and provides the structural framework for processing recombination sites into crossovers. Humans show individual differences in the number of crossovers generated across the genome. Recently, an anonymous gene variant in C14ORF39/SIX6OS1 was identified that influences the recombination rate in humans. Here we show that C14ORF39/SIX6OS1 encodes a component of the central element of the synaptonemal complex. Yeast two-hybrid analysis reveals that SIX6OS1 interacts with the well-established protein synaptonemal complex central element 1 (SYCE1). Mice lacking SIX6OS1 are defective in chromosome synapsis at meiotic prophase I, which provokes an arrest at the pachytene-like stage and results in infertility. In accordance with its role as a modifier of the human recombination rate, SIX6OS1 is essential for the appropriate processing of intermediate recombination nodules before crossover formation.This work was supported by BFU_2014-59307-R, MEIONet and JCyLe (CSI052U16). LGH and NFM are supported by European Social Fund/JCyLe grants (EDU/1083/2013 and EDU/310/2015). ORD is a Sir Henry Dale Fellow jointly funded by the Wellcome Trust and Royal Society (Grant Number 104158/Z/14/Z). RB is funded by DFG (grant Be1168/8-1). AT and ID were supported by DFG grants TO421/8-2 and TO421/6-1, respectively.Peer reviewe

    Bovine proteins containing poly-glutamine repeats are often polymorphic and enriched for components of transcriptional regulatory complexes

    Get PDF
    peer-reviewedBackground: About forty human diseases are caused by repeat instability mutations. A distinct subset of these diseases is the result of extreme expansions of polymorphic trinucleotide repeats; typically CAG repeats encoding poly-glutamine (poly-Q) tracts in proteins. Polymorphic repeat length variation is also apparent in human poly-Q encoding genes from normal individuals. As these coding sequence repeats are subject to selection in mammals, it has been suggested that normal variations in some of these typically highly conserved genes are implicated in morphological differences between species and phenotypic variations within species. At present, poly-Q encoding genes in non-human mammalian species are poorly documented, as are their functions and propensities for polymorphic variation. Results: The current investigation identified 178 bovine poly-Q encoding genes (Q ≥ 5) and within this group, 26 genes with orthologs in both human and mouse that did not contain poly-Q repeats. The bovine poly-Q encoding genes typically had ubiquitous expression patterns although there was bias towards expression in epithelia, brain and testes. They were also characterised by unusually large sizes. Analysis of gene ontology terms revealed that the encoded proteins were strongly enriched for functions associated with transcriptional regulation and many contributed to physical interaction networks in the nucleus where they presumably act cooperatively in transcriptional regulatory complexes. In addition, the coding sequence CAG repeats in some bovine genes impacted mRNA splicing thereby generating unusual transcriptional diversity, which in at least one instance was tissue-specific. The poly-Q encoding genes were prioritised using multiple criteria for their likelihood of being polymorphic and then the highest ranking group was experimentally tested for polymorphic variation within a cattle diversity panel. Extensive and meiotically stable variation was identified. Conclusions: Transcriptional diversity can potentially be generated in poly-Q encoding genes by the impact of CAG repeat tracts on mRNA alternative splicing. This effect, combined with the physical interactions of the encoded proteins in large transcriptional regulatory complexes suggests that polymorphic variations of proteins in these complexes have strong potential to affect phenotype.Dairy Australia (through the Innovative Dairy Cooperative Research Center

    Frequent Missense and Insertion/Deletion Polymorphisms in the Ovine Shadoo Gene Parallel Species-Specific Variation in PrP

    Get PDF
    BACKGROUND: The cellular prion protein PrP(C) is encoded by the Prnp gene. This protein is expressed in the central nervous system (CNS) and serves as a precursor to the misfolded PrP(Sc) isoform in prion diseases. The prototype prion disease is scrapie in sheep, and whereas Prnp exhibits common missense polymorphisms for V136A, R154H and Q171R in ovine populations, genetic variation in mouse Prnp is limited. Recently the CNS glycoprotein Shadoo (Sho) has been shown to resemble PrP(C) both in a central hydrophobic domain and in activity in a toxicity assay performed in cerebellar neurons. Sho protein levels are reduced in prion infections in rodents. Prompted by these properties of the Sho protein we investigated the extent of natural variation in SPRN. PRINCIPAL FINDINGS: Paralleling the case for ovine versus human and murine PRNP, we failed to detect significant coding polymorphisms that alter the mature Sho protein in a sample of neurologically normal humans, or in diverse strains of mice. However, ovine SPRN exhibited 4 missense mutations and expansion/contraction in a series of 5 tandem Ala/Gly-containing repeats R1-R5 encoding Sho's hydrophobic domain. A Val71Ala polymorphism and polymorphic expansion of wt 67(Ala)(3)Gly70 to 67(Ala)(5)Gly72 reached frequencies of 20%, with other alleles including Delta67-70 and a 67(Ala)(6)Gly73 expansion. Sheep V71, A71, Delta67-70 and 67(Ala)(6)Gly73 SPRN alleles encoded proteins with similar stability and posttranslational processing in transfected neuroblastoma cells. SIGNIFICANCE: Frequent coding polymorphisms are a hallmark of the sheep PRNP gene and our data indicate a similar situation applies to ovine SPRN. Whether a common selection pressure balances diversity at both loci remains to be established

    Mouse Ribosomal RNA Genes Contain Multiple Differentially Regulated Variants

    Get PDF
    Previous cytogenetic studies suggest that various rDNA chromosomal loci are not equally active in different cell types. Consistent with this variability, rDNA polymorphism is well documented in human and mouse. However, attempts to identify molecularly rDNA variant types, which are regulated individually (i.e., independent of other rDNA variants) and tissue-specifically, have not been successful. We report here the molecular cloning and characterization of seven mouse rDNA variants (v-rDNA). The identification of these v-rDNAs was based on restriction fragment length polymorphisms (RFLPs), which are conserved among individuals and mouse strains. The total copy number of the identified variants is less than 100 and the copy number of each individual variant ranges from 4 to 15. Sequence analysis of the cloned v-rDNA identified variant-specific single nucleotide polymorphisms (SNPs) in the transcribed region. These SNPs were used to develop a set of variant-specific PCR assays, which permitted analysis of the v-rDNAs' expression profiles in various tissues. These profiles show that three v-rDNAs are expressed in all tissues (constitutively active), two are expressed in some tissues (selectively active), and two are not expressed (silent). These expression profiles were observed in six individuals from three mouse strains, suggesting the pattern is not randomly determined. Thus, the mouse rDNA array likely consists of genetically distinct variants, and some are regulated tissue-specifically. Our results provide the first molecular evidence for cell-type-specific regulation of a subset of rDNA

    SUMOylation of the Forkhead Transcription Factor FOXL2 Promotes Its Stabilization/Activation through Transient Recruitment to PML Bodies

    Get PDF
    International audienceBACKGROUND: FOXL2 is a transcription factor essential for ovarian development and maintenance. It is mutated in the genetic condition called Blepharophimosis Ptosis Epicantus inversus Syndrome (BPES) and in cases of isolated premature ovarian failure. We and others have previously shown that FOXL2 undergoes several post-translational modifications. METHODS AND PRINCIPAL FINDINGS: Here, using cells in culture, we show that interference with FOXL2 SUMOylation leads to a robust inhibition of its transactivation ability, which correlates with a decreased stability. Interestingly, FOXL2 SUMOylation promotes its transient recruitment to subnuclear structures that we demonstrate to be PML (Promyelocytic Leukemia) Nuclear Bodies. Since PML bodies are known to be sites where post-translational modifications of nuclear factors take place, we used tandem mass spectrometry to identify new post-translational modifications of FOXL2. Specifically, we detected four phosphorylated, one sulfated and three acetylated sites. CONCLUSIONS: By analogy with other transcription factors, we propose that PML Nuclear Bodies might transiently recruit FOXL2 to the vicinity of locally concentrated enzymes that could be involved in the post-translational maturation of FOXL2. FOXL2 acetylation, sulfation, phosphorylation as well as other modifications yet to be discovered might alter the transactivation capacity of FOXL2 and/or its stability, thus modulating its global intracellular activity
    corecore