14 research outputs found

    MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Multiple Sequence Alignment (MSA) is a basic tool for bioinformatics research and analysis. It has been used essentially in almost all bioinformatics tasks such as protein structure modeling, gene and protein function prediction, DNA motif recognition, and phylogenetic analysis. Therefore, improving the accuracy of multiple sequence alignment is important for advancing many bioinformatics fields.</p> <p>Results</p> <p>We designed and developed a new method, MSACompro, to synergistically incorporate predicted secondary structure, relative solvent accessibility, and residue-residue contact information into the currently most accurate posterior probability-based MSA methods to improve the accuracy of multiple sequence alignments. The method is different from the multiple sequence alignment methods (e.g. 3D-Coffee) that use the tertiary structure information of some sequences since the structural information of our method is fully predicted from sequences. To the best of our knowledge, applying predicted relative solvent accessibility and contact map to multiple sequence alignment is novel. The rigorous benchmarking of our method to the standard benchmarks (i.e. BAliBASE, SABmark and OXBENCH) clearly demonstrated that incorporating predicted protein structural information improves the multiple sequence alignment accuracy over the leading multiple protein sequence alignment tools without using this information, such as MSAProbs, ProbCons, Probalign, T-coffee, MAFFT and MUSCLE. And the performance of the method is comparable to the state-of-the-art method PROMALS of using structural features and additional homologous sequences by slightly lower scores.</p> <p>Conclusion</p> <p>MSACompro is an efficient and reliable multiple protein sequence alignment tool that can effectively incorporate predicted protein structural information into multiple sequence alignment. The software is available at <url>http://sysbio.rnet.missouri.edu/multicom_toolbox/</url>.</p

    Morphogenesis of the T4 tail and tail fibers

    Get PDF
    Remarkable progress has been made during the past ten years in elucidating the structure of the bacteriophage T4 tail by a combination of three-dimensional image reconstruction from electron micrographs and X-ray crystallography of the components. Partial and complete structures of nine out of twenty tail structural proteins have been determined by X-ray crystallography and have been fitted into the 3D-reconstituted structure of the "extended" tail. The 3D structure of the "contracted" tail was also determined and interpreted in terms of component proteins. Given the pseudo-atomic tail structures both before and after contraction, it is now possible to understand the gross conformational change of the baseplate in terms of the change in the relative positions of the subunit proteins. These studies have explained how the conformational change of the baseplate and contraction of the tail are related to the tail's host cell recognition and membrane penetration function. On the other hand, the baseplate assembly process has been recently reexamined in detail in a precise system involving recombinant proteins (unlike the earlier studies with phage mutants). These experiments showed that the sequential association of the subunits of the baseplate wedge is based on the induced-fit upon association of each subunit. It was also found that, upon association of gp53 (gene product 53), the penultimate subunit of the wedge, six of the wedge intermediates spontaneously associate to form a baseplate-like structure in the absence of the central hub. Structure determination of the rest of the subunits and intermediate complexes and the assembly of the hub still require further study
    corecore