13,953 research outputs found

    Using Synthetic Data to Train Neural Networks is Model-Based Reasoning

    Full text link
    We draw a formal connection between using synthetic training data to optimize neural network parameters and approximate, Bayesian, model-based reasoning. In particular, training a neural network using synthetic data can be viewed as learning a proposal distribution generator for approximate inference in the synthetic-data generative model. We demonstrate this connection in a recognition task where we develop a novel Captcha-breaking architecture and train it using synthetic data, demonstrating both state-of-the-art performance and a way of computing task-specific posterior uncertainty. Using a neural network trained this way, we also demonstrate successful breaking of real-world Captchas currently used by Facebook and Wikipedia. Reasoning from these empirical results and drawing connections with Bayesian modeling, we discuss the robustness of synthetic data results and suggest important considerations for ensuring good neural network generalization when training with synthetic data.Comment: 8 pages, 4 figure

    Deep Variational Reinforcement Learning for POMDPs

    Full text link
    Many real-world sequential decision making problems are partially observable by nature, and the environment model is typically unknown. Consequently, there is great need for reinforcement learning methods that can tackle such problems given only a stream of incomplete and noisy observations. In this paper, we propose deep variational reinforcement learning (DVRL), which introduces an inductive bias that allows an agent to learn a generative model of the environment and perform inference in that model to effectively aggregate the available information. We develop an n-step approximation to the evidence lower bound (ELBO), allowing the model to be trained jointly with the policy. This ensures that the latent state representation is suitable for the control task. In experiments on Mountain Hike and flickering Atari we show that our method outperforms previous approaches relying on recurrent neural networks to encode the past

    Auto-Encoding Sequential Monte Carlo

    Full text link
    We build on auto-encoding sequential Monte Carlo (AESMC): a method for model and proposal learning based on maximizing the lower bound to the log marginal likelihood in a broad family of structured probabilistic models. Our approach relies on the efficiency of sequential Monte Carlo (SMC) for performing inference in structured probabilistic models and the flexibility of deep neural networks to model complex conditional probability distributions. We develop additional theoretical insights and introduce a new training procedure which improves both model and proposal learning. We demonstrate that our approach provides a fast, easy-to-implement and scalable means for simultaneous model learning and proposal adaptation in deep generative models

    The shapes of light curves of Mira-type variables

    Full text link
    Using a sample of 454 mira light curves from the ASAS survey we study the shape of the light variations in this kind of variable stars. Opposite to earlier studies, we choose a general approach to identify any deviation from a sinusoidal light change. We find that about 30% of the studied light curves show a significant deviation from the sinusoidal reference shape. Among these stars two characteristic light curve shapes of comparable frequency could be identified. Some hint for a connection between atmospheric chemistry and light curve shape was found, but beside that no or only very weak relations between light curve shape and other stellar parameters seem to exist.Comment: 7 pages, 7 figures, accepted for publication in A

    Why metallic surfaces with grooves a few nanometers deep and wide may strongly absorb visible light

    Full text link
    It is theoretically shown that nanometric silver lamellar gratings present very strong visible light absorption inside the grooves, leading to electric field intensities by several orders of magnitude larger than that of the impinging light. This effect, due to the excitation of long wave vector surface plasmon polaritons with particular small penetration depth in the metal, may explain the abnormal optical absorption observed a long time ago on almost flat Ag films. Surface enhanced Raman scattering in rough metallic films could also be due to the excitation of such plasmon polaritons in the grain boundaries or notches of the films.Comment: 5 pages, 5 figure, submitted to Phys. Rev. Let

    Bayesian Optimization for Probabilistic Programs

    Full text link
    We present the first general purpose framework for marginal maximum a posteriori estimation of probabilistic program variables. By using a series of code transformations, the evidence of any probabilistic program, and therefore of any graphical model, can be optimized with respect to an arbitrary subset of its sampled variables. To carry out this optimization, we develop the first Bayesian optimization package to directly exploit the source code of its target, leading to innovations in problem-independent hyperpriors, unbounded optimization, and implicit constraint satisfaction; delivering significant performance improvements over prominent existing packages. We present applications of our method to a number of tasks including engineering design and parameter optimization

    Variability and spectral classification of LMC giants: results from DENIS and EROS

    Get PDF
    We present the first cross-identifications of sources in the near-infrared DENIS survey and in the micro-lensing EROS survey in a field of about 0.5 square degrees around the optical center (OC) of the Large Magellanic Cloud. We analyze the photometric history of these stars in the EROS data base and obtain light-curves for about 800 variables. Most of the stars are long period variables (Miras and Semi-Regulars), a few Cepheids are also present. We also present new spectroscopic data on 126 asymptotic giant branch stars in the OC field, 30 previously known and 96 newly discovered by the DENIS survey. The visible spectra are used to assign a carbon- (C-) or oxygen-rich (O-rich) nature to the observed stars on the basis of the presence of molecular bands of TiO, VO, CN, C2. For the remaining of the stars we used the (J-Ks) color to determine whether they are O-rich or C-rich. Plotting Log(period) versus Ks we find three very distinct period-luminosity relations, mainly populated by Semi-Regular of type a (SRa), b (SRb) and Mira variables. Carbon-rich stars occupy mostly the upper part of these relations. We find that 65% of the asymptotic giant branch population are long period variables (LPVs).Comment: 9 pages, 7 figures, 4 tables (2 via CDS), accepted by A&A journa
    • …
    corecore