
    Proteotoxic stress reprograms the chromatin landscape of SUMO modification


    Quantification of the variation in percentage identity for protein sequence alignments

    BACKGROUND: Percentage Identity (PID) is frequently quoted in discussion of sequence alignments since it appears simple and easy to understand. However, although there are several different ways to calculate percentage identity and each may yield a different result for the same alignment, the method of calculation is rarely reported. Accordingly, quantification of the variation in PID caused by the different calculations would help in interpreting PID values in the literature. In this study, the variation in PID was quantified systematically on a reference set of 1028 alignments generated by comparison of the protein three-dimensional structures. Since the alignment algorithm may also affect the range of PID, this study also considered the effect of algorithm, and the combination of algorithm and PID method. RESULTS: The maximum variation in PID due to the calculation method was 11.5% while the effect of alignment algorithm on PID was up to 14.6% across three popular alignment methods. The combined effect of alignment algorithm and PID calculation gave a variation of up to 22% on the test data, with an average of 5.3% ± 2.8% for sequence pairs with < 30% identity. In order to see which PID method was most highly correlated with structural similarity, four different PID calculations were compared to similarity scores (Sc) from the comparison of the corresponding protein three-dimensional structures. The highest correlation coefficient for a PID calculation was 0.80. In contrast, the more sophisticated Z-score calculated by reference to randomized sequences gave a correlation coefficient of 0.84. CONCLUSION: Although it is well known amongst expert sequence analysts that PID is a poor score for discriminating between protein sequences, the apparent simplicity of the percentage identity score encourages its widespread use in establishing cutoffs for structural similarity. 
    This paper illustrates that not only is PID a poor measure of sequence similarity when compared to the Z-score, but that there is also a large uncertainty in reported PID values. Since better alternatives to PID exist to quantify sequence similarity, these should be quoted where possible in preference to PID. The findings presented here should prove helpful to those new to sequence analysis, and in warning those who seek to interpret the value of a PID reported in the literature.
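To make the point about calculation method concrete, the sketch below computes percentage identity for one pairwise alignment using four different denominators. The abstract does not list the four PID calculations it compares, so the denominators, the function name `pid_variants`, and the example sequences here are illustrative assumptions, not the paper's definitions.

```python
def pid_variants(a: str, b: str, gap: str = "-") -> dict[str, float]:
    """Percentage identity of two aligned sequences under four denominators.

    The four denominators are common choices and are an assumption here;
    the source abstract does not specify which calculations it used.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")

    identical = 0      # columns where both residues match
    aligned_pairs = 0  # columns where neither sequence has a gap
    columns = 0        # columns where at least one sequence has a residue
    for x, y in zip(a, b):
        if x == gap and y == gap:
            continue
        columns += 1
        if x != gap and y != gap:
            aligned_pairs += 1
            if x == y:
                identical += 1

    len_a = sum(c != gap for c in a)  # ungapped length of a
    len_b = sum(c != gap for c in b)  # ungapped length of b
    if aligned_pairs == 0:
        raise ValueError("no aligned residue pairs")

    return {
        "over_alignment_columns": 100.0 * identical / columns,
        "over_aligned_pairs": 100.0 * identical / aligned_pairs,
        "over_shorter_sequence": 100.0 * identical / min(len_a, len_b),
        "over_mean_length": 100.0 * identical / ((len_a + len_b) / 2),
    }


if __name__ == "__main__":
    # Hypothetical toy alignment, not taken from the paper's data set.
    for name, value in pid_variants("ACDEFGHIK--", "ACD-FGHLKMN").items():
        print(f"{name}: {value:.1f}%")
```

On this toy alignment the same seven identities give values from about 64% to 87.5% depending on the denominator, which is the kind of spread the abstract warns about when the method of calculation goes unreported.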

    Cleaved end-face quality of microstructured polymer optical fibres

    The cutting of a microstructured polymer optical fibre to form an optical end-face is studied. The effect of the temperature and speed of the cutting blade on the end-face is qualitatively assessed and it is found that for fibres at temperatures in the range 70–90 °C, a blade at a similar temperature moving at a speed of less than 0.5 mm/s produces a good quality end-face. The nature of the damage caused by the cutting process was examined and found to vary with fibre temperature, blade quality and cut depth. Thermo-mechanical analysis showed that the drawn material was significantly more visco-elastic than the annealed raw material in the 70–90 °C temperature range. The behaviour of the surface damage with cut depth was found to be consistent with the behaviour of a visco-elastic material.

    GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes

    BACKGROUND: The function of a novel gene product is typically predicted by transitive assignment of annotation from similar sequences. We describe a novel method, GOtcha, for predicting gene product function by annotation with Gene Ontology (GO) terms. GOtcha predicts GO term associations with term-specific probability (P-score) measures of confidence. Term-specific probabilities are a novel feature of GOtcha and allow the identification of conflicts or uncertainty in annotation. RESULTS: The GOtcha method was applied to the recently sequenced genome for Plasmodium falciparum and six other genomes. GOtcha was compared quantitatively for retrieval of assigned GO terms against direct transitive assignment from the highest scoring annotated BLAST search hit (TOPBLAST). GOtcha exploits information deep into the 'twilight zone' of similarity search matches, making use of much information that is otherwise discarded by more simplistic approaches. At a P-score cutoff of 50%, GOtcha provided 60% better recovery of annotation terms and 20% higher selectivity than annotation with TOPBLAST at an E-value cutoff of 10⁻⁴. CONCLUSIONS: The GOtcha method is a useful tool for genome annotators. It has identified both errors and omissions in the original Plasmodium falciparum annotation and is being adopted by many other genome sequencing projects.

    A Numerical and Experimental Study of Adhesively-Bonded Polyethylene Pipelines

    Adhesive bonding of polyethylene gas pipelines is receiving increasing attention as a replacement for traditional electrofusion welding due to its potential to produce rapid and low-cost joints with structural integrity and pressure tight sealing. In this paper a mode-dependent cohesive zone model for the simulation of adhesively bonded medium density polyethylene (MDPE) pipeline joints is directly determined by following three consecutive steps. Firstly, the bulk stress–strain response of the MDPE adherend was obtained via tensile testing to provide a multi-linear numerical approximation to simulate the plastic deformation of the material. Secondly, the mechanical responses of double cantilever beam and end-notched flexure test specimens were utilised for the direct extraction of the energy release rate and cohesive strength of the adhesive in failure mode I and II. Finally, these material properties were used as inputs to develop a finite element model using a cohesive zone model with triangular shape traction separation law. The developed model was successfully validated against experimental tensile lap-shear test results and was able to accurately predict the strength of adhesively-bonded MDPE pipeline joints with a maximum variation of <3%.
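The triangular traction separation law mentioned in the abstract can be illustrated with a minimal single-mode sketch. It assumes the standard bilinear form: linear elastic loading up to the cohesive strength, then linear softening to zero traction, with the area under the curve equal to the energy release rate. The function name, the initial stiffness parameter `k0`, and all numerical values are hypothetical and are not taken from the paper, which uses a mode-dependent model inside a finite element code.

```python
def triangular_traction(delta: float, sigma_max: float, g_c: float, k0: float) -> float:
    """Traction for a given separation under a bilinear (triangular) law.

    delta     : separation across the interface (mm)
    sigma_max : cohesive strength, the peak traction (MPa)
    g_c       : critical energy release rate, area under the curve (N/mm)
    k0        : initial penalty stiffness (MPa/mm); an assumed input
    """
    if delta < 0:
        raise ValueError("separation must be non-negative")

    delta_0 = sigma_max / k0         # separation at damage onset
    delta_f = 2.0 * g_c / sigma_max  # separation at complete failure
    if delta_f <= delta_0:
        raise ValueError("g_c too small for the chosen sigma_max and k0")

    if delta <= delta_0:
        return k0 * delta  # linear elastic branch
    if delta < delta_f:
        # linear softening branch from sigma_max down to zero
        return sigma_max * (delta_f - delta) / (delta_f - delta_0)
    return 0.0  # fully debonded


if __name__ == "__main__":
    # Hypothetical parameter values chosen only to exercise each branch.
    for d in (0.005, 0.01, 0.105, 0.2, 0.3):
        print(d, triangular_traction(d, sigma_max=10.0, g_c=1.0, k0=1000.0))
```

With these assumed values, damage starts at a separation of 0.01 mm and the interface fails completely at 0.2 mm, so the triangle's area is 0.5 × 10 MPa × 0.2 mm = 1.0 N/mm, matching `g_c`. A mode-dependent model would hold one such law for mode I and another for mode II, combined through a mixed-mode criterion that this sketch does not attempt.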

    Establishing Pine Monocultures and Mixed Pine-Hardwood Stands on Reclaimed Surface Mined Land in Eastern Kentucky: Implications for Forest Resilience in a Changing Climate

    Surface mining and mine reclamation practices have caused significant forest loss and forest fragmentation in Appalachia. Shortleaf pine (Pinus echinata) is threatened by a variety of stresses, including diseases, pests, poor management, altered fire regimes, and climate change, and the species is the subject of a widescale restoration effort. Surface mines may present an opportunity for shortleaf pine restoration; however, the survival and growth of shortleaf pine on these harsh sites has not been critically evaluated. This paper presents first-year survival and growth of native shortleaf pine planted on a reclaimed surface mine, compared to non-native loblolly pine (Pinus taeda), which has been highly successful in previous mined land reclamation plantings. Pine monoculture plots are also compared to pine-hardwood polyculture plots to evaluate effects of planting mix on tree growth and survival, as well as soil health. Initial survival of shortleaf pine is low (42%), but height growth is similar to that of loblolly pine. No differences in survival or growth were observed between monoculture and polyculture treatments. Additional surveys in coming years will address longer-term growth and survival patterns of these species, as well as changes to relevant soil health endpoints, such as soil carbon.

    AlmostSignificant: simplifying quality control of high-throughput sequencing data

    Motivation: The current generation of DNA sequencing technologies produces a large amount of data quickly. All of these data need to pass some form of quality control (QC) processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of QC an onerous and time-consuming task that requires multiple pieces of information from several sources. Results: AlmostSignificant is an open-source platform for aggregating multiple sources of quality metrics as well as run and sample meta-data associated with DNA sequencing runs from Illumina sequencing machines. AlmostSignificant is a graphical platform to streamline the QC of DNA sequencing data and to store these data for future reference together with extra meta-data associated with the sequencing runs not typically retained. This simplifies the challenge of monitoring the volume of data produced by Illumina sequencers. AlmostSignificant has been used to track the quality of over 80 sequencing runs covering over 2500 samples produced over the last three years. Availability and Implementation: The code and documentation for AlmostSignificant are freely available at https://github.com/bartongroup/AlmostSignificant. Contacts: [email protected] or [email protected]. Supplementary information: Supplementary data are available at Bioinformatics online.