80 research outputs found
Engineered polyketides: Synergy between protein and host level engineering
Metabolic engineering efforts toward rewiring metabolism of cells to produce new compounds often require the utilization of non-native enzymatic machinery that is capable of producing a broad range of chemical functionalities. Polyketides encompass one of the largest classes of chemically diverse natural products. With thousands of known polyketides, modular polyketide synthases (PKSs) share a particularly attractive biosynthetic logic for generating chemical diversity. The engineering of modular PKSs could open access to the deliberate production of both existing and novel compounds. In this review, we discuss PKS engineering efforts applied at both the protein and cellular level for the generation of a diverse range of chemical structures, and we examine future applications of PKSs in the production of medicines, fuels and other industrially relevant chemicals
ClusterCAD: a computational platform for type I modular polyketide synthase design
ClusterCAD is a web-based toolkit designed to leverage the collinear structure and deterministic logic of type I modular polyketide synthases (PKSs) for synthetic biology applications. The unique organization of these megasynthases, combined with the diversity of their catalytic domain building blocks, has fueled an interest in harnessing the biosynthetic potential of PKSs for the microbial production of both novel natural product analogs and industrially relevant small molecules. However, a limited theoretical understanding of the determinants of PKS fold and function poses a substantial barrier to the design of active variants, and identifying strategies to reliably construct functional PKS chimeras remains an active area of research. In this work, we formalize a paradigm for the design of PKS chimeras and introduce ClusterCAD as a computational platform to streamline and simplify the process of designing experiments to test strategies for engineering PKS variants. ClusterCAD provides chemical structures with stereochemistry for the intermediates generated by each PKS module, as well as sequence- and structure-based search tools that allow users to identify modules based either on amino acid sequence or on the chemical structure of the cognate polyketide intermediate. ClusterCAD can be accessed at https://clustercad.jbei.org and at http://clustercad.igb.uci.edu
Recommended from our members
Chemoinformatic-Guided Engineering of Polyketide Synthases.
Polyketide synthase (PKS) engineering is an attractive method to generate new molecules such as commodity, fine and specialty chemicals. A significant challenge is re-engineering a partially reductive PKS module to produce a saturated β-carbon through a reductive loop (RL) exchange. In this work, we sought to establish that chemoinformatics, a field traditionally used in drug discovery, offers a viable strategy for RL exchanges. We first introduced a set of donor RLs of diverse genetic origin and chemical substrates into the first extension module of the lipomycin PKS (LipPKS1). Product titers of these engineered unimodular PKSs correlated with chemical structure similarity between the substrate of the donor RLs and recipient LipPKS1, reaching a titer of 165 mg/L of short-chain fatty acids produced by the host Streptomyces albus J1074. Expanding this method to larger intermediates that require bimodular communication, we introduced RLs of divergent chemosimilarity into LipPKS2 and determined triketide lactone production. Collectively, we observed a statistically significant correlation between atom pair chemosimilarity and production, establishing a new chemoinformatic method that may aid in the engineering of PKSs to produce desired, unnatural products
The Baryon Oscillation Spectroscopic Survey of SDSS-III
The Baryon Oscillation Spectroscopic Survey (BOSS) is designed to measure the
scale of baryon acoustic oscillations (BAO) in the clustering of matter over a
larger volume than the combined efforts of all previous spectroscopic surveys
of large scale structure. BOSS uses 1.5 million luminous galaxies as faint as
i=19.9 over 10,000 square degrees to measure BAO to redshifts z<0.7.
Observations of neutral hydrogen in the Lyman alpha forest in more than 150,000
quasar spectra (g<22) will constrain BAO over the redshift range 2.15<z<3.5.
Early results from BOSS include the first detection of the large-scale
three-dimensional clustering of the Lyman alpha forest and a strong detection
from the Data Release 9 data set of the BAO in the clustering of massive
galaxies at an effective redshift z = 0.57. We project that BOSS will yield
measurements of the angular diameter distance D_A to an accuracy of 1.0% at
redshifts z=0.3 and z=0.57 and measurements of H(z) to 1.8% and 1.7% at the
same redshifts. Forecasts for Lyman alpha forest constraints predict a
measurement of an overall dilation factor that scales the highly degenerate
D_A(z) and H^{-1}(z) parameters to an accuracy of 1.9% at z~2.5 when the survey
is complete. Here, we provide an overview of the selection of spectroscopic
targets, planning of observations, and analysis of data and data quality of
BOSS.Comment: 49 pages, 16 figures, accepted by A
A deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns.
In cancer, the primary tumour's organ of origin and histopathology are the strongest determinants of its clinical behaviour, but in 3% of cases a patient presents with a metastatic tumour and no obvious primary. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we train a deep learning classifier to predict cancer type based on patterns of somatic passenger mutations detected in whole genome sequencing (WGS) of 2606 tumours representing 24 common cancer types produced by the PCAWG Consortium. Our classifier achieves an accuracy of 91% on held-out tumor samples and 88% and 83% respectively on independent primary and metastatic samples, roughly double the accuracy of trained pathologists when presented with a metastatic tumour without knowledge of the primary. Surprisingly, adding information on driver mutations reduced accuracy. Our results have clinical applicability, underscore how patterns of somatic passenger mutations encode the state of the cell of origin, and can inform future strategies to detect the source of circulating tumour DNA
Sloan Digital Sky Survey IV: Mapping the Milky Way, Nearby Galaxies, and the Distant Universe
We describe the Sloan Digital Sky Survey IV (SDSS-IV), a project encompassing three major spectroscopic programs. The Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) is observing hundreds of thousands of Milky Way stars at high resolution and high signal-to-noise ratios in the near-infrared. The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey is obtaining spatially resolved spectroscopy for thousands of nearby galaxies (median ). The extended Baryon Oscillation Spectroscopic Survey (eBOSS) is mapping the galaxy, quasar, and neutral gas distributions between and 3.5 to constrain cosmology using baryon acoustic oscillations, redshift space distortions, and the shape of the power spectrum. Within eBOSS, we are conducting two major subprograms: the SPectroscopic IDentification of eROSITA Sources (SPIDERS), investigating X-ray AGNs and galaxies in X-ray clusters, and the Time Domain Spectroscopic Survey (TDSS), obtaining spectra of variable sources. All programs use the 2.5 m Sloan Foundation Telescope at the Apache Point Observatory; observations there began in Summer 2014. APOGEE-2 also operates a second near-infrared spectrograph at the 2.5 m du Pont Telescope at Las Campanas Observatory, with observations beginning in early 2017. Observations at both facilities are scheduled to continue through 2020. In keeping with previous SDSS policy, SDSS-IV provides regularly scheduled public data releases; the first one, Data Release 13, was made available in 2016 July
The Baryon Oscillation Spectroscopic Survey of SDSS-III
The Baryon Oscillation Spectroscopic Survey (BOSS) is designed to measure the scale of baryon acoustic oscillations (BAO) in the clustering of matter over a larger volume than the combined efforts of all previous spectroscopic surveys of large-scale structure. BOSS uses 1.5 million luminous galaxies as faint as i = 19.9 over 10,000 deg(2) to measure BAO to redshifts z < 0.7. Observations of neutral hydrogen in the Ly alpha forest in more than 150,000 quasar spectra (g < 22) will constrain BAO over the redshift range 2.15 < z < 3.5. Early results from BOSS include the first detection of the large-scale three-dimensional clustering of the Ly alpha forest and a strong detection from the Data Release 9 data set of the BAO in the clustering of massive galaxies at an effective redshift z = 0.57. We project that BOSS will yield measurements of the angular diameter distance d(A) to an accuracy of 1.0% at redshifts z = 0.3 and z = 0.57 and measurements of H(z) to 1.8% and 1.7% at the same redshifts. Forecasts for Ly alpha forest constraints predict a measurement of an overall dilation factor that scales the highly degenerate D-A(z) and H-1(z) parameters to an accuracy of 1.9% at z similar to 2.5 when the survey is complete. Here, we provide an overview of the selection of spectroscopic targets, planning of observations, and analysis of data and data quality of BOSS
Sloan Digital Sky Survey IV: mapping the Milky Way, nearby galaxies, and the distant universe
We describe the Sloan Digital Sky Survey IV (SDSS-IV), a project encompassing three major spectroscopic programs. The Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) is observing hundreds of thousands of Milky Way stars at high resolution and high signal-to-noise ratios in the near-infrared. The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey is obtaining spatially resolved spectroscopy for thousands of nearby galaxies (median ). The extended Baryon Oscillation Spectroscopic Survey (eBOSS) is mapping the galaxy, quasar, and neutral gas distributions between and 3.5 to constrain cosmology using baryon acoustic oscillations, redshift space distortions, and the shape of the power spectrum. Within eBOSS, we are conducting two major subprograms: the SPectroscopic IDentification of eROSITA Sources (SPIDERS), investigating X-ray AGNs and galaxies in X-ray clusters, and the Time Domain Spectroscopic Survey (TDSS), obtaining spectra of variable sources. All programs use the 2.5 m Sloan Foundation Telescope at the Apache Point Observatory; observations there began in Summer 2014. APOGEE-2 also operates a second near-infrared spectrograph at the 2.5 m du Pont Telescope at Las Campanas Observatory, with observations beginning in early 2017. Observations at both facilities are scheduled to continue through 2020. In keeping with previous SDSS policy, SDSS-IV provides regularly scheduled public data releases; the first one, Data Release 13, was made available in 2016 July
The 13th Data Release of the Sloan Digital Sky Survey: First Spectroscopic Data from the SDSS-IV Survey Mapping Nearby Galaxies at Apache Point Observatory
The fourth generation of the Sloan Digital Sky Survey (SDSS-IV) began observations in July 2014. It pursues three core programs: APOGEE-2,MaNGA, and eBOSS. In addition, eBOSS contains two major subprograms: TDSS and SPIDERS. This paper describes the first data release from SDSS-IV, Data Release 13 (DR13), which contains new data, reanalysis of existing data sets and, like all SDSS data releases, is inclusive of previously released data. DR13 makes publicly available 1390 spatially resolved integral field unit observations of nearby galaxies from MaNGA,the first data released from this survey. It includes new observations from eBOSS, completing SEQUELS. In addition to targeting galaxies and quasars, SEQUELS also targeted variability-selected objects from TDSS and X-ray selected objects from SPIDERS. DR13 includes new reductions ofthe SDSS-III BOSS data, improving the spectrophotometric calibration and redshift classification. DR13 releases new reductions of the APOGEE-1data from SDSS-III, with abundances of elements not previously included and improved stellar parameters for dwarf stars and cooler stars. For the SDSS imaging data, DR13 provides new, more robust and precise photometric calibrations. Several value-added catalogs are being released in tandem with DR13, in particular target catalogs relevant for eBOSS, TDSS, and SPIDERS, and an updated red-clump catalog for APOGEE.This paper describes the location and format of the data now publicly available, as well as providing references to the important technical papers that describe the targeting, observing, and data reduction. The SDSS website, http://www.sdss.org, provides links to the data, tutorials and examples of data access, and extensive documentation of the reduction and analysis procedures. DR13 is the first of a scheduled set that will contain new data and analyses from the planned ~6-year operations of SDSS-IV.PostprintPeer reviewe
- …