116 research outputs found

    Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms

    Get PDF
    BACKGROUND: Identifying syntenic regions, i.e., blocks of genes or other markers with evolutionary conserved order, and quantifying evolutionary relatedness between genomes in terms of chromosomal rearrangements is one of the central goals in comparative genomics. However, the analysis of synteny and the resulting assessment of genome rearrangements are sensitive to the choice of a number of arbitrary parameters that affect the detection of synteny blocks. In particular, the choice of a set of markers and the effect of different aggregation strategies, which enable coarse graining of synteny blocks and exclusion of micro-rearrangements, need to be assessed. Therefore, existing tools and resources that facilitate identification, visualization and analysis of synteny need to be further improved to provide a flexible platform for such analysis, especially in the context of multiple genomes. RESULTS: We present a new tool, Cinteny, for fast identification and analysis of synteny with different sets of markers and various levels of coarse graining of syntenic blocks. Using Hannenhalli-Pevzner approach and its extensions, Cinteny also enables interactive determination of evolutionary relationships between genomes in terms of the number of rearrangements (the reversal distance). In particular, Cinteny provides: i) integration of synteny browsing with assessment of evolutionary distances for multiple genomes; ii) flexibility to adjust the parameters and re-compute the results on-the-fly; iii) ability to work with user provided data, such as orthologous genes, sequence tags or other conserved markers. In addition, Cinteny provides many annotated mammalian, invertebrate and fungal genomes that are pre-loaded and available for analysis at . CONCLUSION: Cinteny allows one to automatically compare multiple genomes and perform sensitivity analysis for synteny block detection and for the subsequent computation of reversal distances. Cinteny can also be used to interactively browse syntenic blocks conserved in multiple genomes, to facilitate genome annotation and validation of assemblies for newly sequenced genomes, and to construct and assess phylogenomic trees

    The CanOE Strategy: Integrating Genomic and Metabolic Contexts across Multiple Prokaryote Genomes to Find Candidate Genes for Orphan Enzymes

    Get PDF
    Of all biochemically characterized metabolic reactions formalized by the IUBMB, over one out of four have yet to be associated with a nucleic or protein sequence, i.e. are sequence-orphan enzymatic activities. Few bioinformatics annotation tools are able to propose candidate genes for such activities by exploiting context-dependent rather than sequence-dependent data, and none are readily accessible and propose result integration across multiple genomes. Here, we present CanOE (Candidate genes for Orphan Enzymes), a four-step bioinformatics strategy that proposes ranked candidate genes for sequence-orphan enzymatic activities (or orphan enzymes for short). The first step locates “genomic metabolons”, i.e. groups of co-localized genes coding proteins catalyzing reactions linked by shared metabolites, in one genome at a time. These metabolons can be particularly helpful for aiding bioanalysts to visualize relevant metabolic data. In the second step, they are used to generate candidate associations between un-annotated genes and gene-less reactions. The third step integrates these gene-reaction associations over several genomes using gene families, and summarizes the strength of family-reaction associations by several scores. In the final step, these scores are used to rank members of gene families which are proposed for metabolic reactions. These associations are of particular interest when the metabolic reaction is a sequence-orphan enzymatic activity. Our strategy found over 60,000 genomic metabolons in more than 1,000 prokaryote organisms from the MicroScope platform, generating candidate genes for many metabolic reactions, of which more than 70 distinct orphan reactions. A computational validation of the approach is discussed. Finally, we present a case study on the anaerobic allantoin degradation pathway in Escherichia coli K-12

    Tracking Antigen-Specific T-Cells during Clinical Tolerance Induction in Humans

    Get PDF
    Allergen immunotherapy presents an opportunity to define mechanisms of induction of clinical tolerance in humans. Significant progress has been made in our understanding of changes in T cell responses during immunotherapy, but existing work has largely been based on functional T cell assays. HLA-peptide-tetrameric complexes allow the tracking of antigen-specific T-cell populations based on the presence of specific T-cell receptors and when combined with functional assays allow a closer assessment of the potential roles of T-cell anergy and clonotype evolution. We sought to develop tools to facilitate tracking of antigen-specific T-cell populations during wasp-venom immunotherapy in people with wasp-venom allergy. We first defined dominant immunogenic regions within Ves v 5, a constituent of wasp venom that is known to represent a target antigen for T-cells. We next identified HLA-DRB1*1501 restricted epitopes and used HLA class II tetrameric complexes alongside cytokine responses to Ves v 5 to track T-cell responses during immunotherapy. In contrast to previous reports, we show that there was a significant initial induction of IL-4 producing antigen-specific T-cells within the first 3–5 weeks of immunotherapy which was followed by reduction of circulating effector antigen-specific T-cells despite escalation of wasp-venom dosage. However, there was sustained induction of IL-10-producing and FOXP3 positive antigen-specific T cells. We observed that these IL-10 producing cells could share a common precursor with IL-4-producing T cells specific for the same epitope. Clinical tolerance induction in humans is associated with dynamic changes in frequencies of antigen-specific T-cells, with a marked loss of IL-4-producing T-cells and the acquisition of IL-10-producing and FOXP3-positive antigen-specific CD4+ T-cells that can derive from a common shared precursor to pre-treatment effector T-cells. The development of new approaches to track antigen specific T-cell responses during immunotherapy can provide novel insights into mechanisms of tolerance induction in humans and identify new potential treatment targets

    MultiRTA: A simple yet reliable method for predicting peptide binding affinities for multiple class II MHC allotypes

    Get PDF
    abstract: Background The binding of peptide fragments of antigens to class II MHC is a crucial step in initiating a helper T cell immune response. The identification of such peptide epitopes has potential applications in vaccine design and in better understanding autoimmune diseases and allergies. However, comprehensive experimental determination of peptide-MHC binding affinities is infeasible due to MHC diversity and the large number of possible peptide sequences. Computational methods trained on the limited experimental binding data can address this challenge. We present the MultiRTA method, an extension of our previous single-type RTA prediction method, which allows the prediction of peptide binding affinities for multiple MHC allotypes not used to train the model. Thus predictions can be made for many MHC allotypes for which experimental binding data is unavailable. Results We fit MultiRTA models for both HLA-DR and HLA-DP using large experimental binding data sets. The performance in predicting binding affinities for novel MHC allotypes, not in the training set, was tested in two different ways. First, we performed leave-one-allele-out cross-validation, in which predictions are made for one allotype using a model fit to binding data for the remaining MHC allotypes. Comparison of the HLA-DR results with those of two other prediction methods applied to the same data sets showed that MultiRTA achieved performance comparable to NetMHCIIpan and better than the earlier TEPITOPE method. We also directly tested model transferability by making leave-one-allele-out predictions for additional experimentally characterized sets of overlapping peptide epitopes binding to multiple MHC allotypes. In addition, we determined the applicability of prediction methods like MultiRTA to other MHC allotypes by examining the degree of MHC variation accounted for in the training set. An examination of predictions for the promiscuous binding CLIP peptide revealed variations in binding affinity among alleles as well as potentially distinct binding registers for HLA-DR and HLA-DP. Finally, we analyzed the optimal MultiRTA parameters to discover the most important peptide residues for promiscuous and allele-specific binding to HLA-DR and HLA-DP allotypes. Conclusions The MultiRTA method yields competitive performance but with a significantly simpler and physically interpretable model compared with previous prediction methods. A MultiRTA prediction webserver is available at http://bordnerlab.org/MultiRTA.The electronic version of this article is the complete one and can be found online at: http://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-11-48

    Crystal Structure of an Integron Gene Cassette-Associated Protein from Vibrio cholerae Identifies a Cationic Drug-Binding Module

    Get PDF
    Background The direct isolation of integron gene cassettes from cultivated and environmental microbial sources allows an assessment of the impact of the integron/gene cassette system on the emergence of new phenotypes, such as drug resistance or virulence. A structural approach is being exploited to investigate the modularity and function of novel integron gene cassettes. Methodology/Principal Findings We report the 1.8 A crystal structure of Cass2, an integron-associated protein derived from an environmental V. cholerae. The structure defines a monomeric beta-barrel protein with a fold related to the effector-binding portion of AraC/XylS transcription activators. The closest homologs of Cass2 are multi-drug binding proteins, such as BmrR. Consistent with this, a binding pocket made up of hydrophobic residues and a single glutamate side chain is evident in Cass2, occupied in the crystal form by polyethylene glycol. Fluorescence assays demonstrate that Cass2 is capable of binding cationic drug compounds with submicromolar affinity. The Cass2 module possesses a protein interaction surface proximal to its drug-binding cavity with features homologous to those seen in multi-domain transcriptional regulators. Conclusions/Significance Genetic analysis identifies Cass2 to be representative of a larger family of independent effector-binding proteins associated with lateral gene transfer within Vibrio and closely-related species. We propose that the Cass2 family not only has capacity to form functional transcription regulator complexes, but represents possible evolutionary precursors to multi-domain regulators associated with cationic drug compounds.National Health and Medical Research Council (Australia) (NHMRC grant 488502)National Institutes of Health (U.S.) (Grant GM62414-0 )Ontario. Ministry of Revenue (Challenge Fund

    Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities

    Get PDF
    High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks

    Biomedical Discovery Acceleration, with Applications to Craniofacial Development

    Get PDF
    The profusion of high-throughput instruments and the explosion of new results in the scientific literature, particularly in molecular biomedicine, is both a blessing and a curse to the bench researcher. Even knowledgeable and experienced scientists can benefit from computational tools that help navigate this vast and rapidly evolving terrain. In this paper, we describe a novel computational approach to this challenge, a knowledge-based system that combines reading, reasoning, and reporting methods to facilitate analysis of experimental data. Reading methods extract information from external resources, either by parsing structured data or using biomedical language processing to extract information from unstructured data, and track knowledge provenance. Reasoning methods enrich the knowledge that results from reading by, for example, noting two genes that are annotated to the same ontology term or database entry. Reasoning is also used to combine all sources into a knowledge network that represents the integration of all sorts of relationships between a pair of genes, and to calculate a combined reliability score. Reporting methods combine the knowledge network with a congruent network constructed from experimental data and visualize the combined network in a tool that facilitates the knowledge-based analysis of that data. An implementation of this approach, called the Hanalyzer, is demonstrated on a large-scale gene expression array dataset relevant to craniofacial development. The use of the tool was critical in the creation of hypotheses regarding the roles of four genes never previously characterized as involved in craniofacial development; each of these hypotheses was validated by further experimental work

    Lawson criterion for ignition exceeded in an inertial fusion experiment

    Get PDF
    For more than half a century, researchers around the world have been engaged in attempts to achieve fusion ignition as a proof of principle of various fusion concepts. Following the Lawson criterion, an ignited plasma is one where the fusion heating power is high enough to overcome all the physical processes that cool the fusion plasma, creating a positive thermodynamic feedback loop with rapidly increasing temperature. In inertially confined fusion, ignition is a state where the fusion plasma can begin "burn propagation" into surrounding cold fuel, enabling the possibility of high energy gain. While "scientific breakeven" (i.e., unity target gain) has not yet been achieved (here target gain is 0.72, 1.37 MJ of fusion for 1.92 MJ of laser energy), this Letter reports the first controlled fusion experiment, using laser indirect drive, on the National Ignition Facility to produce capsule gain (here 5.8) and reach ignition by nine different formulations of the Lawson criterion

    Synthetic lethal therapies for cancer: what's next after PARP inhibitors?

    Get PDF
    The genetic concept of synthetic lethality has now been validated clinically through the demonstrated efficacy of poly(ADP-ribose) polymerase (PARP) inhibitors for the treatment of cancers in individuals with germline loss-of-function mutations in either BRCA1 or BRCA2. Three different PARP inhibitors have now been approved for the treatment of patients with BRCA-mutant ovarian cancer and one for those with BRCA-mutant breast cancer; these agents have also shown promising results in patients with BRCA-mutant prostate cancer. Here, we describe a number of other synthetic lethal interactions that have been discovered in cancer. We discuss some of the underlying principles that might increase the likelihood of clinical efficacy and how new computational and experimental approaches are now facilitating the discovery and validation of synthetic lethal interactions. Finally, we make suggestions on possible future directions and challenges facing researchers in this field
    corecore