7 research outputs found

    Integrating transcriptomic and proteomic data for accurate assembly and annotation of genomes

    Get PDF
    © 2017 Wong et al.; Published by Cold Spring Harbor Laboratory Press. Complementing genome sequence with deep transcriptome and proteome data could enable more accurate assembly and annotation of newly sequenced genomes. Here, we provide a proof-of-concept of an integrated approach for analysis of the genome and proteome of Anopheles stephensi, which is one of the most important vectors of the malaria parasite. To achieve broad coverage of genes, we carried out transcriptome sequencing and deep proteome profiling of multiple anatomically distinct sites. Based on transcriptomic data alone, we identified and corrected 535 events of incomplete genome assembly involving 1196 scaffolds and 868 protein-coding gene models. This proteogenomic approach enabled us to add 365 genes that were missed during genome annotation and identify 917 gene correction events through discovery of 151 novel exons, 297 protein extensions, 231 exon extensions, 192 novel protein start sites, 19 novel translational frames, 28 events of joining of exons, and 76 events of joining of adjacent genes as a single gene. Incorporation of proteomic evidence allowed us to change the designation of more than 87 predicted noncoding RNAs to conventional mRNAs coded by protein-coding genes. Importantly, extension of the newly corrected genome assemblies and gene models to 15 other newly assembled Anopheline genomes led to the discovery of a large number of apparent discrepancies in assembly and annotation of these genomes. Our data provide a framework for how future genome sequencing efforts should incorporate transcriptomic and proteomic analysis in combination with simultaneous manual curation to achieve near complete assembly and accurate annotation of genomes

    Metabolite Dysregulation by Pranlukast in Mycobacterium tuberculosis

    No full text
    Mycobacterium tuberculosis has been infecting millions of people worldwide over the years, causing tuberculosis. Drugs targeting distinct cellular mechanisms including synthesis of the cell wall, lipids, proteins, and nucleic acids in Mtb are currently being used for the treatment of TB. Although extensive research is being carried out at the molecular level in the infected host and pathogen, the identification of suitable drug targets and drugs remains under explored. Pranlukast, an allosteric inhibitor of MtArgJ (Mtb ornithine acetyltransferase) has previously been shown to inhibit the survival and virulence of Mtb. The main objective of this study was to identify the altered metabolic pathways and biological processes associated with the differentially expressed metabolites by PRK in Mtb. Here in this study, metabolomics was carried out using an LC-MS/MS-based approach. Collectively, 50 metabolites were identified to be differentially expressed with a significant p-value through a global metabolomic approach using a high-resolution mass spectrometer. Metabolites downstream of argJ were downregulated in the arginine biosynthetic pathway following pranlukast treatment. Predicted human protein interactors of pranlukast-treated Mtb metabolome were identified in association with autophagy, inflammation, DNA repair, and other immune-related processes. Further metabolites including N-acetylglutamate, argininosuccinate, L-arginine, succinate, ergothioneine, and L-phenylalanine were validated by multiple reaction monitoring, a targeted mass spectrometry-based metabolomic approach. This study facilitates the understanding of pranlukast-mediated metabolic changes in Mtb and holds the potential to identify novel therapeutic approaches using metabolic pathways in Mtb

    Rifampicin-Mediated Metabolic Changes in Mycobacterium tuberculosis

    No full text
    Mycobacterium tuberculosis (Mtb) is considered to be a devastating pathogen worldwide, affecting millions of people globally. Several drugs targeting distinct pathways are utilized for the treatment of tuberculosis. Despite the monumental efforts being directed at the discovery of drugs for Mtb, the pathogen has also developed mechanisms to evade the drug action and host processes. Rifampicin was an early anti-tuberculosis drug, and is still being used as the first line of treatment. This study was carried out in order to characterize the in-depth rifampicin-mediated metabolic changes in Mtb, facilitating a better understanding of the physiological processes based on the metabolic pathways and predicted protein interactors associated with the dysregulated metabolome. Although there are various metabolomic studies that have been carried out on rifampicin mutants, this is the first study that reports a large number of significantly altered metabolites in wild type Mtb upon rifampicin treatment. In this study, a total of 173 metabolites, associated with pyrimidine, purine, arginine, phenylalanine, tyrosine, and tryptophan metabolic pathways, were significantly altered by rifampicin. The predicted host protein interactors of the rifampicin-dysregulated Mtb metabolome were implicated in transcription, inflammation, apoptosis, proteolysis, and DNA replication. Further, tricarboxylic acidcycle metabolites, arginine, and phosphoenolpyruvate were validated by multiple-reaction monitoring. This study provides a comprehensive list of altered metabolites that serves as a basis for understanding the rifampicin-mediated metabolic changes, and associated functional processes, in Mtb, which holds therapeutic potential for the treatment of Mtb

    Characterization of human pineal gland proteome

    No full text
    We employed a high-resolution mass spectrometry-based approach to characterize the proteome of the human pineal gland.</p

    A draft map of the human proteome

    No full text
    The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples, including 17 adult tissues, 7 fetal tissues and 6 purified primary haematopoietic cells, resulted in identification of proteins encoded by 17,294 genes accounting for approximately 84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream open reading frames. This large human proteome catalogue (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease. © 2014 Macmillan Publishers Limited
    corecore