Search CORE

22,414 research outputs found

Investigation of sequence features of hinge-bending regions in proteins with domain movements using kernel logistic regression

Author: Cawley Gavin
Hayward Steven
Veevers Ruth
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/04/2020
Field of study

Background: Hinge-bending movements in proteins comprising two or more domains form a large class of functional movements. Hinge-bending regions demarcate protein domains and collectively control the domain movement. Consequently, the ability to recognise sequence features of hinge-bending regions and to be able to predict them from sequence alone would benefit various areas of protein research. For example, an understanding of how the sequence features of these regions relate to dynamic properties in multi-domain proteins would aid in the rational design of linkers in therapeutic fusion proteins. Results: The DynDom database of protein domain movements comprises sequences annotated to indicate whether the amino acid residue is located within a hinge-bending region or within an intradomain region. Using statistical methods and Kernel Logistic Regression (KLR) models, this data was used to determine sequence features that favour or disfavour hinge-bending regions. This is a difficult classification problem as the number of negative cases (intradomain residues) is much larger than the number of positive cases (hinge residues). The statistical methods and the KLR models both show that cysteine has the lowest propensity for hinge-bending regions and proline has the highest, even though it is the most rigid amino acid. As hinge-bending regions have been previously shown to occur frequently at the terminal regions of the secondary structures, the propensity for proline at these regions is likely due to its tendency to break secondary structures. The KLR models also indicate that isoleucine may act as a domain-capping residue. We have found that a quadratic KLR model outperforms a linear KLR model and that improvement in performance occurs up to very long window lengths (eighty residues) indicating long-range correlations. Conclusion: In contrast to the only other approach that focused solely on interdomain hinge-bending regions, the method provides a modest and statistically significant improvement over a random classifier. An explanation of the KLR results is that in the prediction of hinge-bending regions a long-range correlation is at play between a small number amino acids that either favour or disfavour hinge-bending regions. The resulting sequence-based prediction tool, HingeSeek, is available to run through a webserver at hingeseek.cmp.uea.ac.uk

University of East Anglia digital repository

Squares and difference sets in finite fields

Author: Bachoc C.
Matolcsi Máté
Ruzsa Z. Imre
Publication venue: State University of West Georgia, Charles University, DIMATIA
Publication date: 01/01/2013
Field of study

For infinitely many primes p = 4k+1 we give a slightly improved upper bound for the maximal cardinality of a set B ⊂ Z p such that the difference set B−B contains only quadratic residues. Namely, instead of the ”trivial” bound |B| ≤ √p we prove |B √p | ≤ − 1, under suitable conditions on p. The new bound is valid for approximately three quarters of the primes p = 4k + 1

Repository of the Academy's Library