40 research outputs found

    MusCaps: generating captions for music audio

    Get PDF
    Content-based music information retrieval has seen rapid progress with the adoption of deep learning. Current approaches to high-level music description typically make use of classification models, such as in auto tagging or genre and mood classification. In this work, we propose to address music description via audio captioning, defined as the task of generating a natural language description of music audio content in a human-like manner. To this end, we present the first music audio captioning model, MusCaps, consisting of an encoder-decoder with temporal attention. Our method combines convolutional and recurrent neural network architectures to jointly process audio-text inputs through a multimodal encoder and leverages pre-training on audio data to obtain representations that effectively capture and summarise musical features in the input. Evaluation of the generated captions through automatic metrics shows that our method outperforms a baseline designed for non-music audio captioning. Through an ablation study, we unveil that this performance boost can be mainly attributed to pre-training of the audio encoder, while other design choices – modality fusion, decoding strategy and the use of attention -- contribute only marginally. Our model represents a shift away from classification-based music description and combines tasks requiring both auditory and linguistic understanding to bridge the semantic gap in music information retrieval

    Social Media, Gender and the Mediatisation of War: Exploring the German Armed Forces’ Visual Representation of the Afghanistan Operation on Facebook

    Get PDF
    Studies on the mediatisation of war point to attempts of governments to regulate the visual perspective of their involvements in armed conflict – the most notable example being the practice of ‘embedded reporting’ in Iraq and Afghanistan. This paper focuses on a different strategy of visual meaning-making, namely, the publication of images on social media by armed forces themselves. Specifically, we argue that the mediatisation of war literature could profit from an increased engagement with feminist research, both within Critical Security/Critical Military Studies and within Science and Technology Studies that highlight the close connection between masculinity, technology and control. The article examines the German military mission in Afghanistan as represented on the German armed forces’ official Facebook page. Germany constitutes an interesting, and largely neglected, case for the growing literature on the mediatisation of war: its strong antimilitarist political culture makes the representation of war particularly delicate. The paper examines specific representational patterns of Germany’s involvement in Afghanistan and discusses the implications which arise from what is placed inside the frame of visibility and what remains out of its view

    An Open Resource for Non-human Primate Imaging.

    Get PDF
    Non-human primate neuroimaging is a rapidly growing area of research that promises to transform and scale translational and cross-species comparative neuroscience. Unfortunately, the technological and methodological advances of the past two decades have outpaced the accrual of data, which is particularly challenging given the relatively few centers that have the necessary facilities and capabilities. The PRIMatE Data Exchange (PRIME-DE) addresses this challenge by aggregating independently acquired non-human primate magnetic resonance imaging (MRI) datasets and openly sharing them via the International Neuroimaging Data-sharing Initiative (INDI). Here, we present the rationale, design, and procedures for the PRIME-DE consortium, as well as the initial release, consisting of 25 independent data collections aggregated across 22 sites (total = 217 non-human primates). We also outline the unique pitfalls and challenges that should be considered in the analysis of non-human primate MRI datasets, including providing automated quality assessment of the contributed datasets

    Code for Factorial Switching Linear Dynamical System (FSLDS) Monitoring of Intensive Care Unit Data ï»ż

    No full text
    The submission contains: (1) realtime - C++ code for performing Factoral Switching Linear Dynamical System (FSLDS) inference in real time on CSV input files; (2) matlab - Matlab code used for offline parameter estimation and inference on historical data;(3) preprocessor - C++ code, documentation and executables for the conversion of high frequency waveform data to 1Hz data; (4) licensing information; and (5) a report describing the development of this code and results of its use on a set of patients from the Neuro ICU in the Southern General Hospital, Glasgow.Williams, Chris; Lal, Partha; Shaw, Martin. (2015). Code for Factorial Switching Linear Dynamical System (FSLDS) Monitoring of Intensive Care Unit Data, [software]. University of Edinburgh, School of Informatics, Institute for Adaptive and Neural Computation. http://dx.doi.org/10.7488/ds/300

    Effective input variable selection for function approximation

    No full text
    Input variable selection is a key preprocess step in any I/O modelling problem. Normally, better generalization performance is obtained when unneeded parameters coming from irrelevant or redundant variables are eliminated. Information theory provides a robust theoretical framework for performing input variable selection thanks to the concept of mutual information. Nevertheless, for continuous variables, it is usually a more difficult task to determine the mutual information between the input variables and the output variable than for classification problems. This paper presents a modified approach for variable selection for continuous variables adapted from a previous approach for classification problems, making use of a mutual information estimator based on the k-nearest neighbors
    corecore