12 research outputs found

    Polyglutamine Repeats Are Associated to Specific Sequence Biases That Are Conserved among Eukaryotes

    Get PDF
    Nine human neurodegenerative diseases, including Huntington's disease and several spinocerebellar ataxia, are associated to the aggregation of proteins comprising an extended tract of consecutive glutamine residues (polyQs) once it exceeds a certain length threshold. This event is believed to be the consequence of the expansion of polyCAG codons during the replication process. This is in apparent contradiction with the fact that many polyQs-containing proteins remain soluble and are encoded by invariant genes in a number of eukaryotes. The latter suggests that polyQs expansion and/or aggregation might be counter-selected through a genetic and/or protein context. To identify this context, we designed a software that scrutinize entire proteomes in search for imperfect polyQs. The nature of residues flanking the polyQs and that of residues other than Gln within polyQs (insertions) were assessed. We discovered strong amino acid residue biases robustly associated to polyQs in the 15 eukaryotic proteomes we examined, with an over-representation of Pro, Leu and His and an under-representation of Asp, Cys and Gly amino acid residues. These biases are conserved amongst unrelated proteins and are independent of specific functional classes. Our findings suggest that specific residues have been co-selected with polyQs during evolution. We discuss the possible selective pressures responsible of the observed biases

    Precision medicine in cancer: Challenges and recommendations from an EU-funded cervical cancer biobanking study

    Get PDF
    Background:Cervical cancer (CC) remains a leading cause of gynaecological cancer-related mortality worldwide. CC pathogenesis is triggered when human papillomavirus (HPV) inserts into the genome, resulting in tumour suppressor gene inactivation and oncogene activation. Collecting tumour and blood samples is critical for identifying these genetic alterations.Methods:BIO-RAIDs is the first prospective molecular profiling clinical study to include a substantial biobanking effort that used uniform high-quality standards and control of samples. In this European Union (EU)-funded study, we identified the challenges that were impeding the effective implementation of such a systematic and comprehensive biobanking effort.Results:The challenges included a lack of uniform international legal and ethical standards, complexities in clinical and molecular data management, and difficulties in determining the best technical platforms and data analysis techniques. Some difficulties were encountered by all investigators, while others affected only certain institutions, regions, or countries.Conclusions:The results of the BIO-RAIDs programme highlight the need to facilitate and standardise regulatory procedures, and we feel that there is also a need for international working groups that make recommendations to regulatory bodies, governmental funding agencies, and academic institutions to achieve a proficient biobanking programme throughout EU countries. This represents the first step in precision medicine

    Sequence biases associated to polyQs in the human proteome.

    No full text
    <p>(A) Sequence biases within polyQs (insertions). The relative abundances of each residue type within polyQs are represented. (B–G) Sequences biases at the flanks of polyQs. Each point represents the relative abundance of Pro (B), His (C), Leu (D), Asp (E), Cys (F) and Gly (G) at each of the 30 positions within the Nt (black circles) and Ct (gray circles) flanks. The relative abundances within the polyQ insertions are also indicated (stars). The solid lines are the best fit to an exponential function. The dotted lines indicate the threshold for residues over- (residue twice as frequent as in the proteome) or under- (residue twice less frequent than in the proteome) representation.</p

    Sequence biases associated to polyQs are conserved throughout eukaryotic proteomes.

    No full text
    <p>The relative abundances of Pro (A), His (B), Leu (C), Asp (D), Cys (E) and Gly (F) within polyQs insertions in the 15 eukaryotic proteomes we analyzed are represented. The dotted lines indicate the threshold for residues over- (residue twice as frequent as in the proteome) or under- (residue twice less frequent than in the proteome) representation.</p

    Schematic of a polyQ.

    No full text
    <p>A polyQ contains a minimum of five consecutive Q residues. The maximum proportion of residues other than Q (insertions) is 25% and each insertion cannot be over 5-residues long. The N- and C-terminal flanks of the polyQ are labeled Nt and Ct flanks, respectively. The numbering scheme for the residues within the flanks is shown.</p

    The polyQ zone.

    No full text
    <p>PolyQs insertions and flanks share identical residue biases: Pro, Leu and His are over-represented within these zones while Asp, Cys and Gly are under-represented.</p

    Size distribution of the polyQs in the human proteome.

    No full text
    <p>Black, imperfect polyQs; dark gray, pure polyQs; light gray, pure polyCAGs.</p

    A survey of additional biases within polyQs.

    No full text
    <p>(<b>A</b>) <b>Codon biases within polyQ insertions</b>. The relative abundances of the different Pro, His and Leu codons within polyQs insertions in human, chimpanzee, orangutan, mouse and dog are represented. The dotted lines indicate the threshold for codons over- (codons twice as frequent as in the rest of the open reading frames) or under- (codons twice less frequent than in the rest of the open reading frames) representation. (<b>B</b>) <b>Sequence biases within human polyQs are not specific to transcription factors</b>. The relative abundance of each residue within polyQs is represented for all human polyQs (black), for human polyQs tagged by the GO term “regulation of transcription DNA-dependent” (dark gray) and human polyQs that are not tagged by the GO term “regulation of transcription DNA-dependent” (light gray). The dotted lines indicate the threshold for residues over- (residue twice as frequent as in the proteome) or under- (residue twice less frequent than in the proteome) representation.</p

    PolyQs associated to human diseases are significantly longer than those that are not.

    No full text
    <p>(A) Length of the imperfect polyQs. (B) Length of the longest pure polyQ within each polyQ zone. (C) Length of the longest pure polyCAG within each polyQ zone. ***, p<0.001; **, p<0.01; *, p<0.05 (Mann-Whitney and Kolmogorov-Smirnov tests).</p

    Eukaryotic polyQ-containing proteins are not orthologous.

    No full text
    <p>The number of polyQ-containing proteins that are orthologous to human polyQ-containing proteins in different organisms are highlighted in black. Those that are not are in gray.</p
    corecore