224 research outputs found

    MorpheuS: Generating Structured Music with Constrained Patterns and Tension

    Get PDF
    Automatic music generation systems have gained in popularity and sophistication as advances in cloud computing have enabled large-scale complex computations such as deep models and optimization algorithms on personal devices. Yet, they still face an important challenge, that of long-term structure, which is key to conveying a sense of musical coherence. We present the MorpheuS music generation system designed to tackle this problem. MorpheuS' novel framework has the ability to generate polyphonic pieces with a given tension profile and long- and short-term repeated pattern structures. A mathematical model for tonal tension quantifies the tension profile and state-of-the-art pattern detection algorithms extract repeated patterns in a template piece. An efficient optimization metaheuristic, variable neighborhood search, generates music by assigning pitches that best fit the prescribed tension profile to the template rhythm while hard constraining long-term structure through the detected patterns. This ability to generate affective music with specific tension profile and long-term structure is particularly useful in a game or film music context. Music generated by the MorpheuS system has been performed live in concerts.Comment: IEEE Transactions on Affective Computing. PP(99

    A Functional Taxonomy of Music Generation Systems

    Get PDF
    Digital advances have transformed the face of automatic music generation since its beginnings at the dawn of computing. Despite the many breakthroughs, issues such as the musical tasks targeted by different machines and the degree to which they succeed remain open questions. We present a functional taxonomy for music generation systems with reference to existing systems. The taxonomy organizes systems according to the purposes for which they were designed. It also reveals the inter-relatedness amongst the systems. This design-centered approach contrasts with predominant methods-based surveys and facilitates the identification of grand challenges to set the stage for new breakthroughs.Comment: survey, music generation, taxonomy, functional survey, survey, automatic composition, algorithmic compositio

    Revisiting the Onsets and Frames Model with Additive Attention

    Get PDF
    Accepted in IJCNN 2021 Special Session S04. https://dr-costas.github.io/rlasmp2021-website/Accepted in IJCNN 2021 Special Session S04. https://dr-costas.github.io/rlasmp2021-website/Recent advances in automatic music transcription (AMT) have achieved highly accurate polyphonic piano transcription results by incorporating onset and offset detection. The existing literature, however, focuses mainly on the leverage of deep and complex models to achieve state-of-the-art (SOTA) accuracy, without understanding model behaviour. In this paper, we conduct a comprehensive examination of the Onsets-and-Frames AMT model, and pinpoint the essential components contributing to a strong AMT performance. This is achieved through exploitation of a modified additive attention mechanism. The experimental results suggest that the attention mechanism beyond a moderate temporal context does not benefit the model, and that rule-based post-processing is largely responsible for the SOTA performance. We also demonstrate that the onsets are the most significant attentive feature regardless of model complexity. The findings encourage AMT research to weigh more on both a robust onset detector and an effective post-processor

    Impact of inactivated poliovirus vaccine on mucosal immunity: implications for the polio eradication endgame.

    Get PDF
    The polio eradication endgame aims to bring transmission of all polioviruses to a halt. To achieve this aim, it is essential to block viral replication in individuals via induction of a robust mucosal immune response. Although it has long been recognized that inactivated poliovirus vaccine (IPV) is incapable of inducing a strong mucosal response on its own, it has recently become clear that IPV may boost immunity in the intestinal mucosa among individuals previously immunized with oral poliovirus vaccine. Indeed, mucosal protection appears to be stronger following a booster dose of IPV than oral poliovirus vaccine, especially in older children. Here, we review the available evidence regarding the impact of IPV on mucosal immunity, and consider the implications of this evidence for the polio eradication endgame. We conclude that the implementation of IPV in both routine and supplementary immunization activities has the potential to play a key role in halting poliovirus transmission, and thereby hasten the eradication of polio

    The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy

    Get PDF
    Most of the state-of-the-art automatic music transcription (AMT) models break down the main transcription task into sub-tasks such as onset prediction and offset prediction and train them with onset and offset labels. These predictions are then concatenated together and used as the input to train another model with the pitch labels to obtain the final transcription. We attempt to use only the pitch labels (together with spectrogram reconstruction loss) and explore how far this model can go without introducing supervised sub-tasks. In this paper, we do not aim at achieving state-of-the-art transcription accuracy, instead, we explore the effect that spectrogram reconstruction has on our AMT model. Our proposed model consists of two U-nets: the first U-net transcribes the spectrogram into a posteriorgram, and a second U-net transforms the posteriorgram back into a spectrogram. A reconstruction loss is applied between the original spectrogram and the reconstructed spectrogram to constrain the second U-net to focus only on reconstruction. We train our model on three different datasets: MAPS, MAESTRO, and MusicNet. Our experiments show that adding the reconstruction loss can generally improve the note-level transcription accuracy when compared to the same model without the reconstruction part. Moreover, it can also boost the frame-level precision to be higher than the state-of-the-art models. The feature maps learned by our U-net contain gridlike structures (not present in the baseline model) which implies that with the presence of the reconstruction loss, the model is probably trying to count along both the time and frequency axis, resulting in a higher note-level transcription accuracy

    Revisiting the onsets and frames model with additive attention

    Get PDF
    Recent advances in automatic music transcription (AMT) have achieved highly accurate polyphonic piano transcription results by incorporating onset and offset detection. The existing literature, however, focuses mainly on the leverage of deep and complex models to achieve state-of-the-art (SOTA) accuracy, without understanding model behaviour. In this paper, we conduct a comprehensive examination of the Onsets-and-Frames AMT model, and pinpoint the essential components contributing to a strong AMT performance. This is achieved through exploitation of a modified additive attention mechanism. The experimental results suggest that the attention mechanism beyond a moderate temporal context does not benefit the model, and that rule-based post-processing is largely responsible for the SOTA performance. We also demonstrate that the onsets are the most significant attentive feature regardless of model complexity. The findings encourage AMT research to weigh more on both a robust onset detector and an effective post-processor

    Characterizing the tissue of apple air-dried and osmo-air-dried rings by X-CT and OCT and relationship with ring crispness and fruit maturity at harvest measured by TRS

    Get PDF
    Air-dried apple rings were prepared from ‘Golden Delicious’ apples selected at harvest as less mature and more mature according to the absorption coefficient measured at 670 nm by time-resolved reflectance spectroscopy (TRS), stored in air for 5 months, and subjected to air-drying with (OSMO) and without (noOSMO) osmodehydration pre-treatment (60% sucrose syrup). Selected rings were submitted to microstructural analysis by X-ray computed tomography (X-CT), to subsurface structure analysis by optical coherence tomography (OCT) and to texture and sound emission analysis by bending–snapping test. Higher crispness index, higher number of sound events and higher average sound pressure level (SPL) characterized the OSMO rings. Total porosity was related to SPLav 60, pore fragmentation index to fracturability and specific surface area to the work required to snap the ring. A differentiation of the drying treatments, as well as of the products according to the TRS maturity class at harvest was obtained analyzing by principal component analysis (PCA) microstructure parameters and texture and acoustic parameters. The differences in mechanical and acoustic characteristics between OSMO and noOSMO rings were due to the different subsurface structure as found with OCT analysis

    Machine learning research that matters for music creation : a case study

    Get PDF
    Research applying machine learning to music modeling and generation typically proposes model architectures, training methods and datasets, and gauges system performance using quantitative measures like sequence likelihoods and/or qualitative listening tests. Rarely does such work explicitly question and analyse its usefulness for and impact on real-world practitioners, and then build on those outcomes to inform the development and application of machine learning. This article attempts to do these things for machine learning applied to music creation. Together with practitioners, we develop and use several applications of machine learning for music creation, and present a public concert of the results. We reflect on the entire experience to arrive at several ways of advancing these and similar applications of machine learning to music creation.QC 20180827</p
    • …
    corecore