Abstract

Background
Discovery of new medicinal agents from natural sources has largely been an adventitious process based on screening of plant and microbial extracts combined with bioassay-guided identification and natural product structure elucidation. Increasingly rapid and more cost-effective genome sequencing technologies coupled with advanced computational power have converged to transform this trend toward a more rational and predictive pursuit.

Results
We have developed a rapid method of scanning genome sequences for multiple polyketide, nonribosomal peptide, and mixed combination natural products with output in a text format that can be readily converted to two- and three-dimensional structures using conventional software. Our open-source and web-based program can assemble various small molecules composed of twenty standard amino acids and twenty-two other chain-elongation intermediates used in nonribosomal peptide systems, and four acyl-CoA extender units incorporated into polyketides by reading a hidden Markov model of DNA. This process evaluates and selects the substrate specificities along the assembly line of nonribosomal synthetases and modular polyketide synthases.

Conclusion
Using this approach we have predicted the structures of natural products from a diverse range of bacteria based on a limited number of signature sequences. In accelerating direct DNA-to-metabolomic analysis, this method bridges the interface between chemists and biologists and enables rapid scanning for compounds with potential therapeutic value.
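The Results paragraph describes selecting a substrate for each module of an assembly line and emitting the product in a text format that structure software can read. The following is a minimal sketch of that idea, not the authors' program. It assumes SMILES as the text format (the abstract does not name one), uses placeholder signature labels (`sig-A` and so on) that are not real signature sequences, and covers only four amino acids with stereochemistry omitted. All names in the block are hypothetical.

```python
from typing import Dict, List

# Hypothetical residue fragments written from N-terminus to C-terminus,
# with stereochemistry deliberately omitted to keep the sketch simple.
RESIDUE_SMILES: Dict[str, str] = {
    "Gly": "NCC(=O)",
    "Ala": "NC(C)C(=O)",
    "Val": "NC(C(C)C)C(=O)",
    "Ser": "NC(CO)C(=O)",
}

# Placeholder lookup table: the keys are illustrative labels, NOT real
# signature sequences. A real tool would derive these from sequence analysis.
SIGNATURE_TO_SUBSTRATE: Dict[str, str] = {
    "sig-A": "Gly",
    "sig-B": "Ala",
    "sig-C": "Val",
    "sig-D": "Ser",
}


def predict_substrates(signatures: List[str]) -> List[str]:
    """Map each module's signature label to its assumed substrate."""
    return [SIGNATURE_TO_SUBSTRATE[s] for s in signatures]


def linear_peptide_smiles(residues: List[str]) -> str:
    """Join residue fragments in module order and cap the C-terminus with OH."""
    if not residues:
        raise ValueError("at least one residue is required")
    return "".join(RESIDUE_SMILES[r] for r in residues) + "O"


if __name__ == "__main__":
    # Three hypothetical modules, read in assembly-line order.
    modules = ["sig-A", "sig-B", "sig-D"]
    print(linear_peptide_smiles(predict_substrates(modules)))
```

The resulting string can be pasted into any tool that accepts SMILES to obtain a two-dimensional depiction. Cyclization, tailoring steps, and polyketide extender units are outside the scope of this sketch.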

Garneau-Tsodikova, Sylvie

Li, Michael HT

Sherman, David H

Ung, Peter MU

Zajkowski, James

English

PubMed

http://deepblue.lib.umich.edu/bitstream/2027.42/112362/1/12859_2008_Article_2915.pd

Deep Blue at the University of Michigan

Automated genome mining for natural products

Michael HT Li

Peter MU Ung

James Zajkowski

Sylvie Garneau-Tsodikova

David H Sherman

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

BMC Bioinformatics

http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2712472
