
    Concordance of copy number abnormality detection using SNP arrays and Multiplex Ligation-dependent Probe Amplification (MLPA) in acute lymphoblastic leukaemia

    In acute lymphoblastic leukaemia, MLPA has been used in research studies to identify clinically relevant copy number abnormality (CNA) profiles. However, in diagnostic settings other techniques are often employed. We assess whether equivalent CNA profiles are called using SNP arrays, ensuring platform independence. We demonstrate concordance between SNP6.0 and MLPA CNA calling on 143 leukaemia samples from two UK trials; comparing 1,287 calls within eight genes and a region. The techniques are 99% concordant using manually augmented calling, and 98% concordant using an automated pipeline. We classify these discordant calls and examine reasons for discordance. In nine cases the circular binary segmentation (CBS) algorithm failed to detect focal abnormalities or those flanking gaps in IKZF1 probe coverage. Eight cases were discordant due to probe design differences, with focal abnormalities detectable using one technique not observable by the other. Risk classification using manually augmented array calling resulted in four out of 143 patients being assigned to a different CNA risk group and eight patients using the automated pipeline. We conclude that MLPA defined CNA profiles can be accurately mirrored by SNP6.0 or similar array platforms. Automated calling using the CBS algorithm proved successful, except for IKZF1 which should be manually inspected
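The concordance figures quoted above are, in essence, the share of paired calls on which the two platforms agree. A minimal sketch of that arithmetic follows; the function name, the `(sample, locus)` keying, the call labels and the toy data are all assumptions made for illustration and are not taken from the study's pipeline.

```python
from typing import Dict, List, Tuple

# Hypothetical key: (sample_id, locus). Labels such as "loss" are illustrative.
Key = Tuple[str, str]


def concordance(array_calls: Dict[Key, str],
                mlpa_calls: Dict[Key, str]) -> Tuple[float, List[Key]]:
    """Return the fraction of shared calls that agree, plus the discordant keys."""
    # Only compare sample/locus pairs that were called by both techniques.
    shared = sorted(set(array_calls) & set(mlpa_calls))
    if not shared:
        raise ValueError("no sample/locus pairs were called by both techniques")
    discordant = [key for key in shared if array_calls[key] != mlpa_calls[key]]
    rate = 1.0 - len(discordant) / len(shared)
    return rate, discordant


# Toy data (invented): four paired calls, one of which disagrees.
array_calls = {
    ("S1", "IKZF1"): "normal",
    ("S1", "locus_B"): "loss",
    ("S2", "IKZF1"): "loss",
    ("S2", "locus_B"): "normal",
}
mlpa_calls = {
    ("S1", "IKZF1"): "loss",
    ("S1", "locus_B"): "loss",
    ("S2", "IKZF1"): "loss",
    ("S2", "locus_B"): "normal",
}

rate, discordant = concordance(array_calls, mlpa_calls)
print(f"concordance: {rate:.0%}; discordant calls: {discordant}")
```

Returning the discordant keys alongside the rate mirrors the abstract's second step, where each disagreement is classified and inspected rather than only counted.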

    Dynamic clonal progression in xenografts of acute lymphoblastic leukemia with intrachromosomal amplification of chromosome 21

    Intrachromosomal amplification of chromosome 21 is a heterogeneous chromosomal rearrangement occurring in 2% of childhood precursor B-cell acute lymphoblastic leukemia. There are no cell lines with iAMP21 and these abnormalities are too complex to faithfully engineer in animal models. As a resource for future functional and pre-clinical studies, we have created xenografts from intrachromosomal amplification of chromosome 21 leukemia patient blasts and characterised them by in-vivo and ex-vivo luminescent imaging, FLOW immunophenotyping, and histological and ultrastructural analysis of bone marrow and the central nervous system. Investigation of up to three generations of xenografts revealed phenotypic evolution, branching genomic architecture and, compared with other B-cell acute lymphoblastic leukemia genetic subtypes, greater clonal diversity of leukemia initiating cells. In support of intrachromosomal amplification of chromosome 21 as a primary genetic abnormality, it was always retained through generations of xenografts, although we also observed the first example of structural evolution of this rearrangement. Clonal segregation in xenografts revealed convergent evolution of different secondary genomic abnormalities implicating several known tumour suppressor genes and a region, containing the B-cell adaptor, PIK3AP1, and nuclear receptor co-repressor, LCOR, in the progression of B-ALL. Tracking of mutations in patients and derived xenografts provided evidence for co-operation between abnormalities activating the RAS pathway in B-ALL and for their aggressive clonal expansion in the xeno-environment. Bi-allelic loss of the CDKN2A/B locus was recurrently maintained or emergent in xenografts and also strongly selected as RNA sequencing demonstrated a complete absence of reads for genes associated with the deletions

    IKZF1 Deletions with COBL Breakpoints Are Not Driven by RAG-Mediated Recombination Events in Acute Lymphoblastic Leukemia

    IKZF1 deletion (ΔIKZF1) is an important predictor of relapse in both childhood and adult B-cell precursor acute lymphoblastic leukemia (B-ALL). Previously, we revealed that COBL is a hotspot for breakpoints in leukemia and could promote IKZF1 deletions. Through an international collaboration, we provide a detailed genetic and clinical picture of B-ALL with COBL rearrangements (COBL-r). Patients with B-ALL and IKZF1 deletion (n = 133) were included. IKZF1 ∆1-8 were associated with large alterations within chromosome 7: monosomy 7 (18%), isochromosome 7q (10%), 7p loss (19%), and interstitial deletions (53%). The latter included COBL-r, which were found in 12% of the IKZF1 ∆1-8 cohort. Patients with COBL-r are mostly classified as intermediate cytogenetic risk and frequently harbor ETV6, PAX5, CDKN2A/B deletions. Overall, 56% of breakpoints were located within COBL intron 5. Cryptic recombination signal sequence motifs were broadly distributed within the sequence of COBL, and no enrichment for the breakpoint cluster region was found. In summary, a diverse spectrum of alterations characterizes ΔIKZF1 and they also include deletion breakpoints within COBL. We confirmed that COBL is a hotspot associated with ΔIKZF1, but these rearrangements are not driven by RAG-mediated recombination

    Composite structural motifs of binding sites for delineating biological functions of proteins

    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs which represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. Comment: 34 pages, 7 figures

    Crowdsourcing genomic analyses of ash and ash dieback – power to the people

    Ash dieback is a devastating fungal disease of ash trees that has swept across Europe and recently reached the UK. This emergent pathogen has received little study in the past and its effect threatens to overwhelm the ash population. In response to this we have produced some initial genomics datasets and taken the unusual step of releasing them to the scientific community for analysis without first performing our own. In this manner we hope to ‘crowdsource’ analyses and bring the expertise of the community to bear on this problem as quickly as possible. Our data has been released through our website at oadb.tsl.ac.uk and a public GitHub repository

    FLORA: a novel method to predict protein function from structure in diverse superfamilies

    Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues

    Minimal methylation classifier (MIMIC): A novel method for derivation and rapid diagnostic detection of disease-associated DNA methylation signatures

    Rapid and reliable detection of disease-associated DNA methylation patterns has major potential to advance molecular diagnostics and underpin research investigations. We describe the development and validation of minimal methylation classifier (MIMIC), combining CpG signature design from genome-wide datasets, multiplex-PCR and detection by single-base extension and MALDI-TOF mass spectrometry, in a novel method to assess multi-locus DNA methylation profiles within routine clinically-applicable assays. We illustrate the application of MIMIC to successfully identify the methylation-dependent diagnostic molecular subgroups of medulloblastoma (the most common malignant childhood brain tumour), using scant/low-quality samples remaining from the most recently completed pan-European medulloblastoma clinical trial, refractory to analysis by conventional genome-wide DNA methylation analysis. Using this approach, we identify critical DNA methylation patterns from previously inaccessible cohorts, and reveal novel survival differences between the medulloblastoma disease subgroups with significant potential for clinical exploitation

    Combinatorial Clustering of Residue Position Subsets Predicts Inhibitor Affinity across the Human Kinome

    The protein kinases are a large family of enzymes that play fundamental roles in propagating signals within the cell. Because of the high degree of binding site similarity shared among protein kinases, designing drug compounds with high specificity among the kinases has proven difficult. However, computational approaches to comparing the 3-dimensional geometry and physicochemical properties of key binding site residue positions have been shown to be informative of inhibitor selectivity. The Combinatorial Clustering Of Residue Position Subsets (CCORPS) method, introduced here, provides a semi-supervised learning approach for identifying structural features that are correlated with a given set of annotation labels. Here, CCORPS is applied to the problem of identifying structural features of the kinase ATP binding site that are informative of inhibitor binding. CCORPS is demonstrated to make perfect or near-perfect predictions for the binding affinity profile of 8 of the 38 kinase inhibitors studied, while only having overall poor predictive ability for 1 of the 38 compounds. Additionally, CCORPS is shown to identify shared structural features across phylogenetically diverse groups of kinases that are correlated with binding affinity for particular inhibitors; such instances of structural similarity among phylogenetically diverse kinases are also shown to not be rare among kinases. Finally, these function-specific structural features may serve as potential starting points for the development of highly specific kinase inhibitors

    An integrated national scale SARS-CoV-2 genomic surveillance network

    The Coronavirus Disease 2019 (COVID-19) Genomics UK Consortium (COG-UK) was launched in March, 2020, with £20 million support from UK Research and Innovation, the UK Department of Health and Social Care, and Wellcome Trust. The goal of this consortium is to sequence severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) for up to 230 000 patients, health-care workers, and other essential workers in the UK with COVID-19, which will help to enable the tracking of SARS-CoV-2 transmission, identify viral mutations, and integrate with health data to assess how the viral genome interacts with cofactors and consequences of COVID-19

    d-Omix: a mixer of generic protein domain analysis tools

    Domain combination provides important clues to the roles of protein domains in protein function, interaction and evolution. We have developed a web server, d-Omix (a Mixer of Protein Domain Analysis Tools), intended as a unified platform to analyze, compare and visualize protein data sets in various aspects of protein domain combinations. With InterProScan files for protein sets of interest provided by users, the server incorporates four services for domain analyses. First, it constructs a protein phylogenetic tree based on a distance matrix calculated from protein domain architectures (DAs), allowing comparison with a sequence-based tree. Second, it calculates and visualizes the versatility, abundance and co-presence of protein domains via a domain graph. Third, it compares the similarity of proteins based on DA alignment. Fourth, it builds a putative protein network derived from domain–domain interactions from DOMINE. Users may select a variety of input data files and flexibly choose domain search tools (e.g. hmmpfam, superfamily) for a specific analysis. Results from d-Omix can be interactively explored and exported into various formats such as SVG, JPG, BMP and CSV. Users with only protein sequences can also prepare an InterProScan file using a service provided by the server. The d-Omix web server is freely available at http://www.biotec.or.th/isl/Domix
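To make the first service more concrete, the sketch below builds a pairwise distance matrix from domain architectures. The abstract does not state which distance d-Omix uses, so Jaccard distance over the set of domains in each protein is swapped in here purely as an assumption; the function names, protein identifiers and domain labels are likewise illustrative.

```python
from typing import Dict, List, Sequence, Tuple


def jaccard_distance(a: Sequence[str], b: Sequence[str]) -> float:
    """1 - |A ∩ B| / |A ∪ B| over the domain sets (domain order is ignored)."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        return 0.0  # two empty architectures are treated as identical
    return 1.0 - len(set_a & set_b) / len(union)


def distance_matrix(
    architectures: Dict[str, Sequence[str]]
) -> Tuple[List[str], List[List[float]]]:
    """Return sorted protein names and the symmetric matrix of pairwise distances."""
    names = sorted(architectures)
    matrix = [
        [jaccard_distance(architectures[x], architectures[y]) for y in names]
        for x in names
    ]
    return names, matrix


# Toy domain architectures (invented for illustration).
architectures = {
    "P1": ["SH3", "SH2", "Kinase"],
    "P2": ["SH2", "Kinase"],
    "P3": ["PDZ"],
}

names, matrix = distance_matrix(architectures)
for name, row in zip(names, matrix):
    print(name, [round(value, 3) for value in row])
```

A matrix of this shape could then be passed to any standard distance-based tree-building method; an order-aware measure, such as one derived from DA alignment, would be a natural alternative to the set-based distance assumed here.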