
    Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline

    BACKGROUND: Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS: We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted-repeat transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS: The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA
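The six performance metrics named in this abstract all follow from a single confusion matrix. The sketch below shows the standard definitions only; it is not code from EDTA, the names `Confusion` and `te_metrics` are hypothetical, and the counts in the usage lines are invented for illustration (whether they stand for base pairs or whole elements is left as an assumption).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    """Counts from comparing a predicted annotation with a curated one."""
    tp: int  # annotated as TE, truly TE
    fp: int  # annotated as TE, truly non-TE
    tn: int  # annotated as non-TE, truly non-TE
    fn: int  # annotated as non-TE, truly TE


def te_metrics(c: Confusion) -> dict[str, float]:
    """Standard definitions; assumes every denominator is non-zero."""
    total = c.tp + c.fp + c.tn + c.fn
    return {
        "sensitivity": c.tp / (c.tp + c.fn),   # true positive rate
        "specificity": c.tn / (c.tn + c.fp),   # true negative rate
        "accuracy": (c.tp + c.tn) / total,
        "precision": c.tp / (c.tp + c.fp),
        "fdr": c.fp / (c.tp + c.fp),           # equals 1 - precision
        "f1": 2 * c.tp / (2 * c.tp + c.fp + c.fn),
    }


if __name__ == "__main__":
    # Invented counts, for illustration only.
    for name, value in te_metrics(Confusion(tp=80, fp=20, tn=890, fn=10)).items():
        print(f"{name}: {value:.3f}")
```

Because FDR is the complement of precision, reporting both is redundant in information but convenient when comparing programs that differ mainly in false positives.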

    Novel algorithmic approach predicts tumor mutation load and correlates with immunotherapy clinical outcomes using a defined gene mutation set

    BACKGROUND: While clinical outcomes following immunotherapy have shown an association with tumor mutation load using whole exome sequencing (WES), its clinical applicability is currently limited by cost and bioinformatics requirements. METHODS: We developed a method to accurately derive the predicted total mutation load (PTML) within individual tumors from a small set of genes that can be used in clinical next generation sequencing (NGS) panels. PTML was derived from the actual total mutation load (ATML) of 575 distinct melanoma and lung cancer samples and validated using independent melanoma (n = 312) and lung cancer (n = 217) cohorts. The correlation of PTML status with clinical outcome, following distinct immunotherapies, was assessed using the Kaplan–Meier method. RESULTS: PTML (derived from 170 genes) was highly correlated with ATML in cutaneous melanoma and lung adenocarcinoma validation cohorts (R² = 0.73 and R² = 0.82, respectively). PTML was strongly associated with clinical outcome following ipilimumab (anti-CTLA-4, three cohorts) and adoptive T-cell therapy (one cohort) in melanoma. Clinical benefit from pembrolizumab (anti-PD-1) in lung cancer was also shown to significantly correlate with PTML status (log rank P value < 0.05 in all cohorts). CONCLUSIONS: The approach of using small NGS gene panels, already applied to guide employment of targeted therapies, may have utility in the personalized use of immunotherapy in cancer. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12916-016-0705-4) contains supplementary material, which is available to authorized users.
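The abstract assesses outcome with the Kaplan–Meier method. As a reminder of what that estimator computes, here is a minimal pure-Python sketch of the standard product-limit formula. It is not the authors' analysis code, the function name `kaplan_meier` is hypothetical, and the follow-up times in the usage lines are invented.

```python
def kaplan_meier(times, events):
    """Product-limit survival estimate.

    times  -- follow-up time for each patient
    events -- 1 if the event (e.g. death) was observed at that time,
              0 if the patient was censored
    Returns a list of (time, survival probability) pairs, one per
    distinct time at which at least one event occurred.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all patients sharing this time; censored patients at
        # time t are still counted as at risk at t (usual convention).
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


if __name__ == "__main__":
    # Invented follow-up data (months), for illustration only.
    for t, s in kaplan_meier([2, 3, 3, 5, 8], [1, 1, 0, 1, 0]):
        print(f"t={t}: S(t)={s:.2f}")
```

Comparing two such curves, for example high versus low PTML, is what the log rank test reported in the abstract does; that test is not implemented here.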

    Novel insights into the genomic basis of citrus canker based on the genome sequences of two strains of Xanthomonas fuscans subsp. aurantifolii

    Background: Citrus canker is a disease that has severe economic impact on the citrus industry worldwide. There are three types of canker, called A, B, and C. The three types have different phenotypes and affect different citrus species. The causative agent for type A is Xanthomonas citri subsp. citri, whose genome sequence was made available in 2002. Xanthomonas fuscans subsp. aurantifolii strain B causes canker B and Xanthomonas fuscans subsp. aurantifolii strain C causes canker C. Results: We have sequenced the genomes of strains B and C to draft status. We have compared their genomic content to X. citri subsp. citri and to other Xanthomonas genomes, with special emphasis on type III secreted effector repertoires. In addition to pthA, already known to be present in all three citrus canker strains, two additional effector genes, xopE3 and xopAI, are also present in all three strains and are both located on the same putative genomic island. These two effector genes, along with one other effector-like gene in the same region, are thus good candidates for being pathogenicity factors on citrus. Numerous gene content differences also exist between the three canker strains, which can be correlated with their different virulence and host range. Particular attention was placed on the analysis of genes involved in biofilm formation and quorum sensing, type IV secretion, flagellum synthesis and motility, lipopolysaccharide synthesis, and on the gene xacPNP, which codes for a natriuretic protein. Conclusion: We have uncovered numerous commonalities and differences in gene content between the genomes of the pathogenic agents causing citrus canker A, B, and C and other Xanthomonas genomes. Molecular genetics can now be employed to determine the role of these genes in plant-microbe interactions. The gained knowledge will be instrumental for improving citrus canker control. Funding: Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP); Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq); Coordenacao para Aperfeicoamento de Pessoal de Ensino Superior (CAPES); Fundo de Defesa da Citricultura (FUNDECITRUS)

    A molecular-based identification resource for the arthropods of Finland

    Publisher Copyright: © 2021 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. To associate specimens identified by molecular characters to other biological knowledge, we need reference sequences annotated by Linnaean taxonomy. In this study, we (1) report the creation of a comprehensive reference library of DNA barcodes for the arthropods of an entire country (Finland), (2) publish this library, and (3) deliver a new identification tool for insects and spiders, based on this resource. The reference library contains mtDNA COI barcodes for 11,275 (43%) of 26,437 arthropod species known from Finland, including 10,811 (45%) of 23,956 insect species. To quantify the improvement in identification accuracy enabled by the current reference library, we ran 1000 Finnish insect and spider species through the Barcode of Life Data system (BOLD) identification engine. Of these, 91% were correctly assigned to a unique species when compared to the new reference library alone, 85% were correctly identified when compared to BOLD with the new material included, and 75% with the new material excluded. To capitalize on this resource, we used the new reference material to train a probabilistic taxonomic assignment tool, FinPROTAX, scoring high success. For the full-length barcode region, the accuracy of taxonomic assignments at the level of classes, orders, families, subfamilies, tribes, genera, and species reached 99.9%, 99.9%, 99.8%, 99.7%, 99.4%, 96.8%, and 88.5%, respectively. The FinBOL arthropod reference library and FinPROTAX are available through the Finnish Biodiversity Information Facility (www.laji.fi) at https://laji.fi/en/theme/protax. Overall, the FinBOL investment represents a massive capacity-transfer from the taxonomic community of Finland to all sectors of society. Peer reviewed.
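The coverage percentages quoted in this abstract can be checked directly from the species counts it gives. The short sketch below does only that arithmetic; the helper name `coverage_percent` is hypothetical, and the non-insect figure is derived here by subtraction rather than stated in the abstract.

```python
def coverage_percent(barcoded: int, known: int) -> float:
    """Share of known species that have a reference barcode, in percent."""
    return 100 * barcoded / known


# Counts as stated in the abstract.
ARTHROPODS = (11_275, 26_437)
INSECTS = (10_811, 23_956)

# Derived by subtraction (not stated in the abstract): non-insect arthropods.
NON_INSECTS = (ARTHROPODS[0] - INSECTS[0], ARTHROPODS[1] - INSECTS[1])

if __name__ == "__main__":
    print(f"arthropods:  {coverage_percent(*ARTHROPODS):.1f}%")   # 42.6%
    print(f"insects:     {coverage_percent(*INSECTS):.1f}%")      # 45.1%
    print(f"non-insects: {coverage_percent(*NON_INSECTS):.1f}%")  # 18.7%
```

The first two values round to the 43% and 45% reported; the derived third value suggests that coverage outside the insects is considerably lower.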

    A simplified interventional mapping system (SIMS) for the selection of combinations of targeted treatments in non-small cell lung cancer

    Non-small cell lung cancer (NSCLC) is a leading cause of death worldwide. Targeted monotherapies produce high regression rates, albeit for limited patient subgroups, who inevitably succumb. We present a novel strategy for identifying customized combinations of triplets of targeted agents, utilizing a simplified interventional mapping system (SIMS) that merges knowledge about existing drugs and their impact on the hallmarks of cancer. Based on interrogation of matched lung tumor and normal tissue using targeted genomic sequencing, copy number variation, transcriptomics, and miRNA expression, the activation status of 24 interventional nodes was elucidated. An algorithm was developed to create a scoring system that enables ranking of the activated interventional nodes for each patient. Based on the trends of co-activation at interventional points, combinations of drug triplets were defined in order to overcome resistance. This methodology will inform a prospective trial to be conducted by the WIN consortium, aiming to significantly impact survival in metastatic NSCLC and other malignancies.

    Valorization of sugarcane bagasse ash: Producing glass-ceramic materials

    Some aluminosilicates, for example mullite and wollastonite, are very important in the ceramic and construction industries. The most significant glass-ceramic for building applications has wollastonite as the main crystal phase. In this work we report on the use of sugarcane bagasse ash (SCBA) to produce glass-ceramics with silicates as the major crystalline phases. The glasses (frits) were prepared by mixing ash, limestone (calcium and magnesium carbonates) and potassium carbonate as the fluxing agent. X-ray fluorescence was used to determine the chemical composition of the glasses and their crystallization was assessed by using thermal analysis (DTA/DSC/TGA) and X-ray diffraction. The results showed that glass-ceramic material can be produced with wollastonite as the major phase, at a temperature lower than 900 °C. © 2014 Elsevier Ltd. All rights reserved. Funding: Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)