1,370 research outputs found

    Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment

    Get PDF
    Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique in bioinformatics used to infer related residues among biological sequences. Thus alignment accuracy is crucial to a vast range of analyses, often in ways difficult to assess in those analyses. To compare the performance of different aligners and help detect systematic errors in alignments, a number of benchmarking strategies have been pursued. Here we present an overview of the main strategies--based on simulation, consistency, protein structure, and phylogeny--and discuss their different advantages and associated risks. We outline a set of desirable characteristics for effective benchmarking, and evaluate each strategy in light of them. We conclude that there is currently no universally applicable means of benchmarking MSA, and that developers and users of alignment tools should base their choice of benchmark depending on the context of application--with a keen awareness of the assumptions underlying each benchmarking strategy.Comment: Revie

    T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension

    Get PDF
    This article introduces a new interface for T-Coffee, a consistency-based multiple sequence alignment program. This interface provides an easy and intuitive access to the most popular functionality of the package. These include the default T-Coffee mode for protein and nucleic acid sequences, the M-Coffee mode that allows combining the output of any other aligners, and template-based modes of T-Coffee that deliver high accuracy alignments while using structural or homology derived templates. These three available template modes are Expresso for the alignment of protein with a known 3D-Structure, R-Coffee to align RNA sequences with conserved secondary structures and PSI-Coffee to accurately align distantly related sequences using homology extension. The new server benefits from recent improvements of the T-Coffee algorithm and can align up to 150 sequences as long as 10 000 residues and is available from both http://www.tcoffee.org and its main mirror http://tcoffee.crg.cat

    Lateral Dose Distribution of Proton Beams at Energies from 4 to 11 MeV

    Full text link

    A study of the ground state of acceptors in silicon from thermal transport experiments

    Get PDF
    Thermal conductivity measurements of silicon crystals doped with Bor In have shown the presence of several phonon scattering processes. The resonant effect observed below 1 K is ascribed to the existence of a distribution of splittings N(δ) of the Γ8 ground state of the acceptor, which could be related to the presence of oxygen and carbon impurities. In two cases, the maximum of N(δ) occurs for δ max near 6 GHz, in agreement with previous ultrasonic studies

    Characteristics of TCR repertoire associated with successful immune checkpoint therapy responses

    Get PDF
    Immunotherapies have revolutionized cancer treatment. In particular, immune checkpoint therapy (ICT) leads to durable responses in some patients with some cancers. However, the majority of treated patients do not respond. Understanding immune mechanisms that underlie responsiveness to ICT will help identify predictive biomarkers of response and develop treatments to convert non-responding patients to responding ones. ICT primarily acts at the level of adaptive immunity. The specificity of adaptive immune cells, such as T and B cells, is determined by antigen-specific receptors. T cell repertoires can be comprehensively profiled by high-throughput sequencing at the bulk and single-cell level. T cell receptor (TCR) sequencing allows for sensitive tracking of dynamic changes in antigen-specific T cells at the clonal level, giving unprecedented insight into the mechanisms by which ICT alters T cell responses. Here, we review how the repertoire influences response to ICT and conversely how ICT affects repertoire diversity. We will also explore how changes to the repertoire in different anatomical locations can better correlate and perhaps predict treatment outcome. We discuss the advantages and limitations of current metrics used to characterize and represent TCR repertoire diversity. Discovery of predictive biomarkers could lie in novel analysis approaches, such as network analysis of amino acids similarities between TCR sequences. Single-cell sequencing is a breakthrough technology that can link phenotype with specificity, identifying T cell clones that are crucial for successful ICT. The field of immuno-sequencing is rapidly developing and cross-disciplinary efforts are required to maximize the analysis, application, and validation of sequencing data. Unravelling the dynamic behavior of the TCR repertoire during ICT will be highly valuable for tracking and understanding anti-tumor immunity, biomarker discovery, and ultimately for the development of novel strategies to improve patient outcomes

    Sequence alignment, mutual information, and dissimilarity measures for constructing phylogenies

    Get PDF
    Existing sequence alignment algorithms use heuristic scoring schemes which cannot be used as objective distance metrics. Therefore one relies on measures like the p- or log-det distances, or makes explicit, and often simplistic, assumptions about sequence evolution. Information theory provides an alternative, in the form of mutual information (MI) which is, in principle, an objective and model independent similarity measure. MI can be estimated by concatenating and zipping sequences, yielding thereby the "normalized compression distance". So far this has produced promising results, but with uncontrolled errors. We describe a simple approach to get robust estimates of MI from global pairwise alignments. Using standard alignment algorithms, this gives for animal mitochondrial DNA estimates that are strikingly close to estimates obtained from the alignment free methods mentioned above. Our main result uses algorithmic (Kolmogorov) information theory, but we show that similar results can also be obtained from Shannon theory. Due to the fact that it is not additive, normalized compression distance is not an optimal metric for phylogenetics, but we propose a simple modification that overcomes the issue of additivity. We test several versions of our MI based distance measures on a large number of randomly chosen quartets and demonstrate that they all perform better than traditional measures like the Kimura or log-det (resp. paralinear) distances. Even a simplified version based on single letter Shannon entropies, which can be easily incorporated in existing software packages, gave superior results throughout the entire animal kingdom. But we see the main virtue of our approach in a more general way. For example, it can also help to judge the relative merits of different alignment algorithms, by estimating the significance of specific alignments.Comment: 19 pages + 16 pages of supplementary materia
    corecore