14 research outputs found

    A structure filter for the Eukaryotic Linear Motif Resource

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many proteins are highly modular, being assembled from globular domains and segments of natively disordered polypeptides. Linear motifs, short sequence modules functioning independently of protein tertiary structure, are most abundant in natively disordered polypeptides but are also found in accessible parts of globular domains, such as exposed loops. The prediction of novel occurrences of known linear motifs attempts the difficult task of distinguishing functional matches from stochastically occurring non-functional matches. Although functionality can only be confirmed experimentally, confidence in a putative motif is increased if a motif exhibits attributes associated with functional instances such as occurrence in the correct taxonomic range, cellular compartment, conservation in homologues and accessibility to interacting partners. Several tools now use these attributes to classify putative motifs based on confidence of functionality.</p> <p>Results</p> <p>Current methods assessing motif accessibility do not consider much of the information available, either predicting accessibility from primary sequence or regarding any motif occurring in a globular region as low confidence. We present a method considering accessibility and secondary structural context derived from experimentally solved protein structures to rectify this situation. Putatively functional motif occurrences are mapped onto a representative domain, given that a high quality reference SCOP domain structure is available for the protein itself or a close relative. Candidate motifs can then be scored for solvent-accessibility and secondary structure context. The scores are calibrated on a benchmark set of experimentally verified motif instances compared with a set of random matches. A combined score yields 3-fold enrichment for functional motifs assigned to high confidence classifications and 2.5-fold enrichment for random motifs assigned to low confidence classifications. The structure filter is implemented as a pipeline with both a graphical interface via the ELM resource <url>http://elm.eu.org/</url> and through a Web Service protocol.</p> <p>Conclusion</p> <p>New occurrences of known linear motifs require experimental validation as the bioinformatics tools currently have limited reliability. The ELM structure filter will aid users assessing candidate motifs presenting in globular structural regions. Most importantly, it will help users to decide whether to expend their valuable time and resources on experimental testing of interesting motif candidates.</p

    Optimal allocation of policy deductibles for exchangeable risks

    No full text
    © 2016 Elsevier B.V. Let X1,…,Xn be a set of n continuous and non-negative random variables, with log-concave joint density function f, faced by a person who seeks for an optimal deductible coverage for these n risks. Let d=(d1,…dn) and d∗=(d1∗,…dn∗) be two vectors of deductibles such that d∗ is majorized by d. It is shown that ∑i=1n(Xi∧di∗) is larger than ∑i=1n(Xi∧di) in stochastic dominance, provided f is exchangeable. As a result, the vector (∑i=1ndi,0,…,0) is an optimal allocation that maximizes the expected utility of the policyholder's wealth. It is proven that the same result remains to hold in some situations if we drop the assumption that f is log-concave.status: publishe

    Community proteogenomics highlights strain-variant protein expression within activated sludge performing enhanced biological phosphorus removal

    No full text
    Enhanced biological phosphorus removal (EBPR) selects for polyphosphate accumulating microorganisms to achieve phosphate removal from wastewater. We used high-resolution community proteomics to identify key metabolic pathways in ‘Candidatus Accumulibacter phosphatis’ (A. phosphatis)-mediated EBPR and to evaluate the contributions of co-existing strains within the dominant population. Overall, 702 proteins from the A. phosphatis population were identified. Results highlight the importance of denitrification, fatty acid cycling and the glyoxylate bypass in EBPR. Strong similarity in protein profiles under anaerobic and aerobic conditions was uncovered (only 3% of A. phosphatis-associated proteins exhibited statistically significant abundance differences). By comprehensive genome-wide alignment of 13?930 orthologous proteins, we uncovered substantial differences in protein abundance for enzyme variants involved in both core-metabolism and EBPR-specific pathways among the A. phosphatis population. These findings suggest an essential role for genetic diversity in maintaining the stable performance of EBPR systems and, hence, demonstrate the power of integrated cultivation-independent genomics and proteomics for the analysis of complex biotechnological systems
    corecore