83 research outputs found

    Solving Common-Payoff Games with Approximate Policy Iteration

    Full text link
    For artificially intelligent learning systems to have widespread applicability in real-world settings, it is important that they be able to operate decentrally. Unfortunately, decentralized control is difficult -- computing even an epsilon-optimal joint policy is a NEXP complete problem. Nevertheless, a recently rediscovered insight -- that a team of agents can coordinate via common knowledge -- has given rise to algorithms capable of finding optimal joint policies in small common-payoff games. The Bayesian action decoder (BAD) leverages this insight and deep reinforcement learning to scale to games as large as two-player Hanabi. However, the approximations it uses to do so prevent it from discovering optimal joint policies even in games small enough to brute force optimal solutions. This work proposes CAPI, a novel algorithm which, like BAD, combines common knowledge with deep reinforcement learning. However, unlike BAD, CAPI prioritizes the propensity to discover optimal joint policies over scalability. While this choice precludes CAPI from scaling to games as large as Hanabi, empirical results demonstrate that, on the games to which CAPI does scale, it is capable of discovering optimal joint policies even when other modern multi-agent reinforcement learning algorithms are unable to do so. Code is available at https://github.com/ssokota/capi .Comment: AAAI 202

    A New High Contrast Imaging Program at Palomar Observatory

    Get PDF
    We describe a new instrument that forms the core of a long-term high contrast imaging program at the 200-inch Hale Telescope at Palomar Observatory. The primary scientific thrust is to obtain images and low-resolution spectroscopy of brown dwarfs and young Jovian mass exoplanets in the vicinity of stars within 50 parsecs of the Sun. The instrument is a microlens-based integral field spectrograph integrated with a diffraction limited, apodized-pupil Lyot coronagraph, mounted behind the Palomar adaptive optics system. The spectrograph obtains imaging in 23 channels across the J and H bands (1.06 - 1.78 microns). In addition to obtaining spectra, this wavelength resolution allows suppression of the chromatically dependent speckle noise, which we describe. We have recently installed a novel internal wave front calibration system that will provide continuous updates to the AO system every 0.5 - 1.0 minutes by sensing the wave front within the coronagraph. The Palomar AO system is undergoing an upgrade to a much higher-order AO system ("PALM-3000"): a 3388-actuator tweeter deformable mirror working together with the existing 241-actuator mirror. This system will allow correction with subapertures as small as 8cm at the telescope pupil using natural guide stars. The coronagraph alone has achieved an initial dynamic range in the H-band of 2 X 10^-4 at 1 arcsecond, without speckle noise suppression. We demonstrate that spectral speckle suppression is providing a factor of 10-20 improvement over this bringing our current contrast at an arcsecond to ~2 X 10^-5. This system is the first of a new generation of apodized pupil coronagraphs combined with high-order adaptive optics and integral field spectrographs (e.g. GPI, SPHERE, HiCIAO), and we anticipate this instrument will make a lasting contribution to high contrast imaging in the Northern Hemisphere for years.Comment: Accepted to PASP: 12 pages, 12 figure

    Electric Field Conjugation with the Project 1640 coronagraph

    Full text link
    The Project 1640 instrument on the 200-inch Hale telescope at Palomar Observatory is a coronagraphic instrument with an integral field spectrograph at the back end, designed to find young, self-luminous planets around nearby stars. To reach the necessary contrast for this, the PALM-3000 adaptive optics system corrects for fast atmospheric speckles, while CAL, a phase-shifting interferometer in a Mach-Zehnder configuration, measures the quasistatic components of the complex electric field in the pupil plane following the coronagraphic stop. Two additional sensors measure and control low-order modes. These field measurements may then be combined with a system model and data taken separately using a white-light source internal to the AO system to correct for both phase and amplitude aberrations. Here, we discuss and demonstrate the procedure to maintain a half-plane dark hole in the image plane while the spectrograph is taking data, including initial on-sky performance.Comment: 9 pages, 7 figures, in Proceedings of SPIE, 8864-19 (2013

    Spectral Typing of Late Type Stellar Companions to Young Stars from Low Dispersion Near-Infrared Integral Field Unit Data

    Get PDF
    We used the Project 1640 near-infrared coronagraph and integral field spectrograph to observe 19 young solar type stars. Five of these stars are known binary stars and we detected the late-type secondaries and were able to measure their JH spectra with a resolution of R\sim30. The reduced, extracted, and calibrated spectra were compared to template spectra from the IRTF spectral library. With this comparison we test the accuracy and consistency of spectral type determination with the low-resolution near-infrared spectra from P1640. Additionally, we determine effective temperature and surface gravity of the companions by fitting synthetic spectra calculated with the PHOENIX model atmosphere code. We also present several new epochs of astrometry of each of the systems. Together these data increase our knowledge and understanding of the stellar make up of these systems. In addition to the astronomical results, the analysis presented helps validate the Project 1640 data reduction and spectral extraction processes and the utility of low-resolution, near-infrared spectra for characterizing late-type companions in multiple systems.Comment: Accepted to Astronomical Journal, 25 pages, 8 figure

    Project 1640: the world's first ExAO coronagraphic hyperspectral imager for comparative planetary science

    Get PDF
    Project 1640, a high-contrast spectral-imaging effort involving a coordinated set of instrumentation and software, built at AMNH, JPL, Cambridge and Caltech, has been commissioned and is fully operational. This novel suite of instrumentation includes a 3388+241-actuator adaptive optics system, an optimized apodized pupil Lyot coronagraph, an integral field spectrograph, and an interferometric calibration wave front sensor. Project 1640 is the first of its kind of instrumentation, designed to image and characterize planetary systems around nearby stars, employing a variety of techniques to break the speckle-noise barrier. It is operational roughly one year before any similar project, with the goal of reaching a contrast of 10^(-7) at 1 arcsecond separation. We describe the instrument, highlight recent results, and document on-sky performance at the start of a 3-year, 99-night survey at the Palomar 5-m Hale telescope

    High-resolution Infrared Imaging and Spectroscopy of the Z Canis Majoris System during Quiescence and Outburst

    Get PDF
    We present adaptive optics photometry and spectra in the JHKL bands along with high spectral resolution K-band spectroscopy for each component of the Z Canis Majoris system. Our high angular resolution photometry of this very young (≾1 Myr) binary, comprised of an FU Ori object and a Herbig Ae/Be star, was gathered shortly after the 2008 outburst while our high-resolution spectroscopy was gathered during a quiescent phase. Our photometry conclusively determines that the outburst was due solely to the embedded Herbig Ae/Be member, supporting results from earlier works, and that the optically visible FU Ori component decreased slightly (~30%) in luminosity during the same period, consistent with previous works on the variability of FU Ori type systems. Further, our high-resolution K-band spectra definitively demonstrate that the 2.294 μm CO absorption feature seen in composite spectra of the system is due solely to the FU Ori component, while a prominent CO emission feature at the same wavelength, long suspected to be associated with the innermost regions of a circumstellar accretion disk, can be assigned to the Herbig Ae/Be member. These findings clarify previous analyses of the origin of the CO emission in this complex system
    corecore