34 research outputs found

    A structure filter for the Eukaryotic Linear Motif Resource

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many proteins are highly modular, being assembled from globular domains and segments of natively disordered polypeptides. Linear motifs, short sequence modules functioning independently of protein tertiary structure, are most abundant in natively disordered polypeptides but are also found in accessible parts of globular domains, such as exposed loops. The prediction of novel occurrences of known linear motifs attempts the difficult task of distinguishing functional matches from stochastically occurring non-functional matches. Although functionality can only be confirmed experimentally, confidence in a putative motif is increased if a motif exhibits attributes associated with functional instances such as occurrence in the correct taxonomic range, cellular compartment, conservation in homologues and accessibility to interacting partners. Several tools now use these attributes to classify putative motifs based on confidence of functionality.</p> <p>Results</p> <p>Current methods assessing motif accessibility do not consider much of the information available, either predicting accessibility from primary sequence or regarding any motif occurring in a globular region as low confidence. We present a method considering accessibility and secondary structural context derived from experimentally solved protein structures to rectify this situation. Putatively functional motif occurrences are mapped onto a representative domain, given that a high quality reference SCOP domain structure is available for the protein itself or a close relative. Candidate motifs can then be scored for solvent-accessibility and secondary structure context. The scores are calibrated on a benchmark set of experimentally verified motif instances compared with a set of random matches. A combined score yields 3-fold enrichment for functional motifs assigned to high confidence classifications and 2.5-fold enrichment for random motifs assigned to low confidence classifications. The structure filter is implemented as a pipeline with both a graphical interface via the ELM resource <url>http://elm.eu.org/</url> and through a Web Service protocol.</p> <p>Conclusion</p> <p>New occurrences of known linear motifs require experimental validation as the bioinformatics tools currently have limited reliability. The ELM structure filter will aid users assessing candidate motifs presenting in globular structural regions. Most importantly, it will help users to decide whether to expend their valuable time and resources on experimental testing of interesting motif candidates.</p

    Predicting the Risk of Rheumatoid Arthritis and Its Age of Onset through Modelling Genetic Risk Variants with Smoking

    Get PDF
    The improved characterisation of risk factors for rheumatoid arthritis (RA) suggests they could be combined to identify individuals at increased disease risks in whom preventive strategies may be evaluated. We aimed to develop an RA prediction model capable of generating clinically relevant predictive data and to determine if it better predicted younger onset RA (YORA). Our novel modelling approach combined odds ratios for 15 four-digit/10 two-digit HLA-DRB1 alleles, 31 single nucleotide polymorphisms (SNPs) and ever-smoking status in males to determine risk using computer simulation and confidence interval based risk categorisation. Only males were evaluated in our models incorporating smoking as ever-smoking is a significant risk factor for RA in men but not women. We developed multiple models to evaluate each risk factor's impact on prediction. Each model's ability to discriminate anti-citrullinated protein antibody (ACPA)-positive RA from controls was evaluated in two cohorts: Wellcome Trust Case Control Consortium (WTCCC: 1,516 cases; 1,647 controls); UK RA Genetics Group Consortium (UKRAGG: 2,623 cases; 1,500 controls). HLA and smoking provided strongest prediction with good discrimination evidenced by an HLA-smoking model area under the curve (AUC) value of 0.813 in both WTCCC and UKRAGG. SNPs provided minimal prediction (AUC 0.660 WTCCC/0.617 UKRAGG). Whilst high individual risks were identified, with some cases having estimated lifetime risks of 86%, only a minority overall had substantially increased odds for RA. High risks from the HLA model were associated with YORA (P<0.0001); ever-smoking associated with older onset disease. This latter finding suggests smoking's impact on RA risk manifests later in life. Our modelling demonstrates that combining risk factors provides clinically informative RA prediction; additionally HLA and smoking status can be used to predict the risk of younger and older onset RA, respectively

    Genomewide Association Scan of Suicidal Thoughts and Behaviour in Major Depression

    Get PDF
    Background Suicidal behaviour can be conceptualised as a continuum from suicidal ideation, to suicidal attempts to completed suicide. In this study we identify genes contributing to suicidal behaviour in the depression study RADIANT. Methodology/Principal Findings A quantitative suicidality score was composed of two items from the SCAN interview. In addition, the 251 depression cases with a history of serious suicide attempts were classified to form a discrete trait. The quantitative trait was correlated with younger onset of depression and number of episodes of depression, but not with gender. A genome-wide association study of 2,023 depression cases was performed to identify genes that may contribute to suicidal behaviour. Two Munich depression studies were used as replication cohorts to test the most strongly associated SNPs. No SNP was associated at genome-wide significance level. For the quantitative trait, evidence of association was detected at GFRA1, a receptor for the neurotrophin GDRA (p = 2e-06). For the discrete trait of suicide attempt, SNPs in KIAA1244 and RGS18 attained p-values of <5e-6. None of these SNPs showed evidence for replication in the additional cohorts tested. Candidate gene analysis provided some support for a polymorphism in NTRK2, which was previously associated with suicidality. Conclusions/Significance This study provides a genome-wide assessment of possible genetic contribution to suicidal behaviour in depression but indicates a genetic architecture of multiple genes with small effects. Large cohorts will be required to dissect this further

    REGENT:a risk assessment and classification algorithm for genetic and environmental factors

    No full text
    The identification of environmental and genetic factors that contribute to disease risk requires appropriate statistical methods and software that can integrate different sources of risk, provide statistical assessment of combined risk factors, and facilitate interpretation of this risk. We have developed an R package, REGENT, to calculate risks conferred by genetic factors and multilevel environmental factors. This is performed at a population level, with the option to also analyse individual-level data. REGENT incorporates variability in risk factors to calculate confidence intervals for risk estimates and to classify the population into different categories of risk based on significant differences from the baseline average member of the population. REGENT is an R package available from CRAN: http://cran.r-project.org/web/packages/REGENT. It will be of value to genetic researchers exploring the utility of the variants detected for their disorder, and to clinical researchers interested in genetic risk studies
    corecore