515 research outputs found

    Query-based Deep Improvisation

    Get PDF
    In this paper we explore techniques for generating new music using a Variational Autoencoder (VAE) neural network that was trained on a corpus of specific style. Instead of randomly sampling the latent states of the network to produce free improvisation, we generate new music by querying the network with musical input in a style different from the training corpus. This allows us to produce new musical output with longer-term structure that blends aspects of the query to the style of the network. In order to control the level of this blending we add a noisy channel between the VAE encoder and decoder using bit-allocation algorithm from communication rate-distortion theory. Our experiments provide new insight into relations between the representational and structural information of latent states and the query signal, suggesting their possible use for composition purposes

    The Variable Markov Oracle: Algorithms for Human Gesture Applications

    Get PDF
    This article introduces the Variable Markov Oracle (VMO) data structure for multivariate time series indexing. VMO can identify repetitive fragments and find sequential similarities between observations. VMO can also be viewed as a combination of online clustering algorithms with variable-order Markov constraints. The authors use VMO for gesture query-by-content and gesture following. A probabilistic interpretation of the VMO query-matching algorithm is proposed to find an analogy to the inference problem in a hidden Markov model (HMM). This probabilistic interpretation extends VMO to be not only a data structure but also a model for time series. Query-by-content experiments were conducted on a gesture database that was recorded using a Kinect 3D camera, showing state-of-the-art performance. The query-by-content experiments' results are compared to previous works using HMM and dynamic time warping. Gesture following is described in the context of an interactive dance environment that aims to integrate human movements with computer-generated graphics to create an augmented reality performance

    Rethinking Recurrent Latent Variable Model for Music Composition

    Full text link
    We present a model for capturing musical features and creating novel sequences of music, called the Convolutional Variational Recurrent Neural Network. To generate sequential data, the model uses an encoder-decoder architecture with latent probabilistic connections to capture the hidden structure of music. Using the sequence-to-sequence model, our generative model can exploit samples from a prior distribution and generate a longer sequence of music. We compare the performance of our proposed model with other types of Neural Networks using the criteria of Information Rate that is implemented by Variable Markov Oracle, a method that allows statistical characterization of musical information dynamics and detection of motifs in a song. Our results suggest that the proposed model has a better statistical resemblance to the musical structure of the training data, which improves the creation of new sequences of music in the style of the originals.Comment: Published as a conference paper at IEEE MMSP 201

    Adversarial Reprogramming of Text Classification Neural Networks

    Get PDF
    Adversarial Reprogramming has demonstrated success in utilizing pre-trained neural network classifiers for alternative classification tasks without modification to the original network. An adversary in such an attack scenario trains an additive contribution to the inputs to repurpose the neural network for the new classification task. While this reprogramming approach works for neural networks with a continuous input space such as that of images, it is not directly applicable to neural networks trained for tasks such as text classification, where the input space is discrete. Repurposing such classification networks would require the attacker to learn an adversarial program that maps inputs from one discrete space to the other. In this work, we introduce a context-based vocabulary remapping model to reprogram neural networks trained on a specific sequence classification task, for a new sequence classification task desired by the adversary. We propose training procedures for this adversarial program in both white-box and black-box settings. We demonstrate the application of our model by adversarially repurposing various text-classification models including LSTM, bi-directional LSTM and CNN for alternate classification tasks

    On Ecological Fallacy and Assessment Errors Stemming From Misguided Variable Selection: Investigating the Effect of Data Aggregation on the Outcome of Epidemiological Study

    Get PDF
    In behavioral studies, ecological fallacy is a wrong assumption about an individual based on aggregate data for a group. In the present study, the validity of this assumption was tested using both individual estimates of exposure to air pollution and aggregate air pollution data estimated for 1,492 schoolchildren living in the in vicinity of a major coal-fired power station in the Hadera sub-district of Israel. In 1996 and 1999, the children underwent subsequent pulmonary function (PF) tests, and their parents completed a detailed questionnaire on their health status, and housing conditions. The association between children’s PF development and their long-term exposure to air pollution was then investigated in two phases. During the first phase, the average rates of PF change observed in small statistical areas in which the children reside were compared with average levels of air pollution detected in these areas. During the second phase of the analysis, an individual pollution estimate was calculated for each child covered by the survey, using a "spatial join" tool in ArcGIS. While the analysis of aggregate data showed no significant differences in the PF development among the schoolchildren surveyed, the comparison of individual pollution estimates with the results of PF tests detected a significant negative association between changes in PF results and the estimated level of air pollution. As argued, these differences are attributed to the fact that average exposure levels are likely to cause a misclassification bias of individual exposure, as further demonstrated in the study using pattern detection techniques of spatial analysis (local Moran's I and Gettis-Ord statistic). The implications of the results of the analysis for geographical and epidemiological studies are discussed, and recommendations for public health policy are formulated.
    • …
    corecore