521 research outputs found

    Ambient Sound Provides Supervision for Visual Learning

    Full text link
    The sound of crashing waves, the roar of fast-moving cars -- sound conveys important information about the objects in our surroundings. In this work, we show that ambient sounds can be used as a supervisory signal for learning visual models. To demonstrate this, we train a convolutional neural network to predict a statistical summary of the sound associated with a video frame. We show that, through this process, the network learns a representation that conveys information about objects and scenes. We evaluate this representation on several recognition tasks, finding that its performance is comparable to that of other state-of-the-art unsupervised learning methods. Finally, we show through visualizations that the network learns units that are selective to objects that are often associated with characteristic sounds.Comment: ECCV 201

    Visually Indicated Sounds

    Get PDF
    Objects make distinctive sounds when they are hit or scratched. These sounds reveal aspects of an object's material properties, as well as the actions that produced them. In this paper, we propose the task of predicting what sound an object makes when struck as a way of studying physical interactions within a visual scene. We present an algorithm that synthesizes sound from silent videos of people hitting and scratching objects with a drumstick. This algorithm uses a recurrent neural network to predict sound features from videos and then produces a waveform from these features with an example-based synthesis procedure. We show that the sounds predicted by our model are realistic enough to fool participants in a "real or fake" psychophysical experiment, and that they convey significant information about material properties and physical interactions

    Just how difficult can it be counting up R&D funding for emerging technologies (and is tech mining with proxy measures going to be any better?)

    Get PDF
    Decision makers considering policy or strategy related to the development of emerging technologies expect high quality data on the support for different technological options. A natural starting point would be R&D funding statistics. This paper explores the limitations of such aggregated data in relation to the substance and quantification of funding for emerging technologies. Using biotechnology as an illustrative case, we test the utility of a novel taxonomy to demonstrate the endemic weaknesses in the availability and quality of data from public and private sources. Using the same taxonomy, we consider the extent to which tech-mining presents an alternative, or potentially complementary, way to determine support for emerging technologies using proxy measures such as patents and scientific publications

    Newsprint coverage of smoking in cars carrying children : a case study of public and scientific opinion driving the policy debate

    Get PDF
    Acknowledgements Date of Acceptance:17/10/2014 Acknowledgements: This project was funded by Cancer Research UK (MC_U130085862) and the Scottish School of Public Health Research. Cancer Research UK and the Scottish School of Public Health Research was not involved in the collection, analysis, and interpretation of data, writing of the manuscript or the decision to submit the manuscript for publication. Shona Hilton, Karen Wood, Josh Bain and Chris Patterson are funded by the UK Medical Research Council as part of the Understandings and Uses of Public Health Research programme (MC_UU_12017/6) at the MRC/CSO Social and Public Health Sciences Unit, University of Glasgow. We thank Alan Pollock who provided assistance with coding.Peer reviewedPublisher PD

    GC-337 Analysis of Alternatives for Workstation Deployment for GTRI

    Get PDF
    Georgia Tech Research Institute is requesting an analysis of alternatives of workstation deployment solution that can deploy an operating system to a workstation. This deployment solution must at least be able to deploy Windows 10, Windows 11, Red Hat Enterprise Linux 8 and 9; support multiple users simultaneously, support various models, and run custom scripts. The objective of the analysis of alternatives is to provide Georgia Tech Research Institute with one deployment solution that can replace their current software that facilitates this process. There are inefficiencies to resolve in their current environment such as deploying various operating systems and applying packages, updates, and applicable security policies. This multi-step process takes time to attain the desired outcome. By providing an alternate solution, it is foreseeable to provide cost savings along with overall efficiency to Georgia Tech Research Institute
    corecore