1,370 research outputs found
Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment
Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique
in bioinformatics used to infer related residues among biological sequences.
Thus alignment accuracy is crucial to a vast range of analyses, often in ways
difficult to assess in those analyses. To compare the performance of different
aligners and help detect systematic errors in alignments, a number of
benchmarking strategies have been pursued. Here we present an overview of the
main strategies--based on simulation, consistency, protein structure, and
phylogeny--and discuss their different advantages and associated risks. We
outline a set of desirable characteristics for effective benchmarking, and
evaluate each strategy in light of them. We conclude that there is currently no
universally applicable means of benchmarking MSA, and that developers and users
of alignment tools should base their choice of benchmark depending on the
context of application--with a keen awareness of the assumptions underlying
each benchmarking strategy.Comment: Revie
T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension
This article introduces a new interface for T-Coffee, a consistency-based multiple sequence alignment program. This interface provides an easy and intuitive access to the most popular functionality of the package. These include the default T-Coffee mode for protein and nucleic acid sequences, the M-Coffee mode that allows combining the output of any other aligners, and template-based modes of T-Coffee that deliver high accuracy alignments while using structural or homology derived templates. These three available template modes are Expresso for the alignment of protein with a known 3D-Structure, R-Coffee to align RNA sequences with conserved secondary structures and PSI-Coffee to accurately align distantly related sequences using homology extension. The new server benefits from recent improvements of the T-Coffee algorithm and can align up to 150 sequences as long as 10 000 residues and is available from both http://www.tcoffee.org and its main mirror http://tcoffee.crg.cat
A study of the ground state of acceptors in silicon from thermal transport experiments
Thermal conductivity measurements of silicon crystals doped with Bor In have shown the presence of several phonon scattering processes. The resonant effect observed below 1 K is ascribed to the existence of a distribution of splittings N(δ) of the Γ8 ground state of the acceptor, which could be related to the presence of oxygen and carbon impurities. In two cases, the maximum of N(δ) occurs for δ max near 6 GHz, in agreement with previous ultrasonic studies
Characteristics of TCR repertoire associated with successful immune checkpoint therapy responses
Immunotherapies have revolutionized cancer treatment. In particular, immune checkpoint therapy (ICT) leads to durable responses in some patients with some cancers. However, the majority of treated patients do not respond. Understanding immune mechanisms that underlie responsiveness to ICT will help identify predictive biomarkers of response and develop treatments to convert non-responding patients to responding ones. ICT primarily acts at the level of adaptive immunity. The specificity of adaptive immune cells, such as T and B cells, is determined by antigen-specific receptors. T cell repertoires can be comprehensively profiled by high-throughput sequencing at the bulk and single-cell level. T cell receptor (TCR) sequencing allows for sensitive tracking of dynamic changes in antigen-specific T cells at the clonal level, giving unprecedented insight into the mechanisms by which ICT alters T cell responses. Here, we review how the repertoire influences response to ICT and conversely how ICT affects repertoire diversity. We will also explore how changes to the repertoire in different anatomical locations can better correlate and perhaps predict treatment outcome. We discuss the advantages and limitations of current metrics used to characterize and represent TCR repertoire diversity. Discovery of predictive biomarkers could lie in novel analysis approaches, such as network analysis of amino acids similarities between TCR sequences. Single-cell sequencing is a breakthrough technology that can link phenotype with specificity, identifying T cell clones that are crucial for successful ICT. The field of immuno-sequencing is rapidly developing and cross-disciplinary efforts are required to maximize the analysis, application, and validation of sequencing data. Unravelling the dynamic behavior of the TCR repertoire during ICT will be highly valuable for tracking and understanding anti-tumor immunity, biomarker discovery, and ultimately for the development of novel strategies to improve patient outcomes
Sequence alignment, mutual information, and dissimilarity measures for constructing phylogenies
Existing sequence alignment algorithms use heuristic scoring schemes which
cannot be used as objective distance metrics. Therefore one relies on measures
like the p- or log-det distances, or makes explicit, and often simplistic,
assumptions about sequence evolution. Information theory provides an
alternative, in the form of mutual information (MI) which is, in principle, an
objective and model independent similarity measure. MI can be estimated by
concatenating and zipping sequences, yielding thereby the "normalized
compression distance". So far this has produced promising results, but with
uncontrolled errors. We describe a simple approach to get robust estimates of
MI from global pairwise alignments. Using standard alignment algorithms, this
gives for animal mitochondrial DNA estimates that are strikingly close to
estimates obtained from the alignment free methods mentioned above. Our main
result uses algorithmic (Kolmogorov) information theory, but we show that
similar results can also be obtained from Shannon theory. Due to the fact that
it is not additive, normalized compression distance is not an optimal metric
for phylogenetics, but we propose a simple modification that overcomes the
issue of additivity. We test several versions of our MI based distance measures
on a large number of randomly chosen quartets and demonstrate that they all
perform better than traditional measures like the Kimura or log-det (resp.
paralinear) distances. Even a simplified version based on single letter Shannon
entropies, which can be easily incorporated in existing software packages, gave
superior results throughout the entire animal kingdom. But we see the main
virtue of our approach in a more general way. For example, it can also help to
judge the relative merits of different alignment algorithms, by estimating the
significance of specific alignments.Comment: 19 pages + 16 pages of supplementary materia
- …