Search CORE

19 research outputs found

Crowdsourcing Linked Data on listening experiences through reuse and enhancement of library data

Author: A Basharat
Alessandro Adamou
C Bizer
C Page
Carlo Allocca
H Halpin
Helen Barlow
M Bradley
Mathieu d’Aquin
P Golden
PN Juslin
RC Wegman
S Burstyn
Simon Brown
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/02/2018
Field of study

Research has approached the practice of musical reception in a multitude of ways, such as the analysis of professional critique, sales figures and psychological processes activated by the act of listening. Studies in the Humanities, on the other hand, have been hindered by the lack of structured evidence of actual experiences of listening as reported by the listeners themselves, a concern that was voiced since the early Web era. It was however assumed that such evidence existed, albeit in pure textual form, but could not be leveraged until it was digitised and aggregated. The Listening Experience Database (LED) responds to this research need by providing a centralised hub for evidence of listening in the literature. Not only does LED support search and reuse across nearly 10,000 records, but it also provides machine-readable structured data of the knowledge around the contexts of listening. To take advantage of the mass of formal knowledge that already exists on the Web concerning these contexts, the entire framework adopts Linked Data principles and technologies. This also allows LED to directly reuse open data from the British Library for the source documentation that is already published. Reused data are re-published as open data with enhancements obtained by expanding over the model of the original data, such as the partitioning of published books and collections into individual stand-alone documents. The database was populated through crowdsourcing and seamlessly incorporates data reuse from the very early data entry phases. As the sources of the evidence often contain vague, fragmentary of uncertain information, facilities were put in place to generate structured data out of such fuzziness. Alongside elaborating on these functionalities, this article provides insights into the most recent features of the latest instalment of the dataset and portal, such as the interlinking with the MusicBrainz database, the relaxation of geographical input constraints through text mining, and the plotting of key locations in an interactive geographical browser

Crossref

Open Research Online (The Open University)

Sigmoidal NMFD : convolutional NMF with saturating activations for drum mixture decomposition

Author: De Bie Tijl
De Boom Cedric
Vande Veire Len
Publication venue: 'MDPI AG'
Publication date: 01/01/2021
Field of study

In many types of music, percussion plays an essential role to establish the rhythm and the groove of the music. Algorithms that can decompose the percussive signal into its constituent components would therefore be very useful, as they would enable many analytical and creative applications. This paper describes a method for the unsupervised decomposition of percussive recordings, building on the non-negative matrix factor deconvolution (NMFD) algorithm. Given a percussive music recording, NMFD discovers a dictionary of time-varying spectral templates and corresponding activation functions, representing its constituent sounds and their positions in the mix. We observe, however, that the activation functions discovered using NMFD do not show the expected impulse-like behavior for percussive instruments. We therefore enforce this behavior by specifying that the activations should take on binary values: either an instrument is hit, or it is not. To this end, we rewrite the activations as the output of a sigmoidal function, multiplied with a per-component amplitude factor. We furthermore define a regularization term that biases the decomposition to solutions with saturated activations, leading to the desired binary behavior. We evaluate several optimization strategies and techniques that are designed to avoid poor local minima. We show that incentivizing the activations to be binary indeed leads to the desired impulse-like behavior, and that the resulting components are better separated, leading to more interpretable decompositions

Multidisciplinary Digital Publishing Institute

Ghent University Academic Bibliography

Directory of Open Access Journals

DMRN+11: Digital Music Research Network Workshop Proceedings 2016

Author: DMRN+11: Digital Music Research Network Workshop 2016
KUDUMAKIS P
SANDLER M
Publication venue: 'Queen Mary University of London'
Publication date: 26/01/2017
Field of study

Crossref

Queen Mary Research Online

CoDi: Leveraging Compatibility and Diversity in Computational Mashup Creation from Large Loop Collections

Author: Gonçalo Nuno Botelho Amaral Rolão Bernardo
Publication venue
Publication date: 14/07/2021
Field of study

Repositório Aberto da Universidade do Porto

Trennung und Schätzung der Anzahl von Audiosignalquellen mit Zeit- und Frequenzüberlappung

Author: Stöter Fabian-Robert
Publication venue
Publication date: 01/01/2020
Field of study

Everyday audio recordings involve mixture signals: music contains a mixture of instruments; in a meeting or conference, there is a mixture of human voices. For these mixtures, automatically separating or estimating the number of sources is a challenging task. A common assumption when processing mixtures in the time-frequency domain is that sources are not fully overlapped. However, in this work we consider some cases where the overlap is severe — for instance, when instruments play the same note (unison) or when many people speak concurrently ("cocktail party") — highlighting the need for new representations and more powerful models. To address the problems of source separation and count estimation, we use conventional signal processing techniques as well as deep neural networks (DNN). We ﬁrst address the source separation problem for unison instrument mixtures, studying the distinct spectro-temporal modulations caused by vibrato. To exploit these modulations, we developed a method based on time warping, informed by an estimate of the fundamental frequency. For cases where such estimates are not available, we present an unsupervised model, inspired by the way humans group time-varying sources (common fate). This contribution comes with a novel representation that improves separation for overlapped and modulated sources on unison mixtures but also improves vocal and accompaniment separation when used as an input for a DNN model. Then, we focus on estimating the number of sources in a mixture, which is important for real-world scenarios. Our work on count estimation was motivated by a study on how humans can address this task, which lead us to conduct listening experiments, conﬁrming that humans are only able to estimate the number of up to four sources correctly. To answer the question of whether machines can perform similarly, we present a DNN architecture, trained to estimate the number of concurrent speakers. Our results show improvements compared to other methods, and the model even outperformed humans on the same task. In both the source separation and source count estimation tasks, the key contribution of this thesis is the concept of “modulation”, which is important to computationally mimic human performance. Our proposed Common Fate Transform is an adequate representation to disentangle overlapping signals for separation, and an inspection of our DNN count estimation model revealed that it proceeds to ﬁnd modulation-like intermediate features.Im Alltag sind wir von gemischten Signalen umgeben: Musik besteht aus einer Mischung von Instrumenten; in einem Meeting oder auf einer Konferenz sind wir einer Mischung menschlicher Stimmen ausgesetzt. Für diese Mischungen ist die automatische Quellentrennung oder die Bestimmung der Anzahl an Quellen eine anspruchsvolle Aufgabe. Eine häuﬁge Annahme bei der Verarbeitung von gemischten Signalen im Zeit-Frequenzbereich ist, dass die Quellen sich nicht vollständig überlappen. In dieser Arbeit betrachten wir jedoch einige Fälle, in denen die Überlappung immens ist zum Beispiel, wenn Instrumente den gleichen Ton spielen (unisono) oder wenn viele Menschen gleichzeitig sprechen (Cocktailparty) —, so dass neue Signal-Repräsentationen und leistungsfähigere Modelle notwendig sind. Um die zwei genannten Probleme zu bewältigen, verwenden wir sowohl konventionelle Signalverbeitungsmethoden als auch tiefgehende neuronale Netze (DNN). Wir gehen zunächst auf das Problem der Quellentrennung für Unisono-Instrumentenmischungen ein und untersuchen die speziellen, durch Vibrato ausgelösten, zeitlich-spektralen Modulationen. Um diese Modulationen auszunutzen entwickelten wir eine Methode, die auf Zeitverzerrung basiert und eine Schätzung der Grundfrequenz als zusätzliche Information nutzt. Für Fälle, in denen diese Schätzungen nicht verfügbar sind, stellen wir ein unüberwachtes Modell vor, das inspiriert ist von der Art und Weise, wie Menschen zeitveränderliche Quellen gruppieren (Common Fate). Dieser Beitrag enthält eine neuartige Repräsentation, die die Separierbarkeit für überlappte und modulierte Quellen in Unisono-Mischungen erhöht, aber auch die Trennung in Gesang und Begleitung verbessert, wenn sie in einem DNN-Modell verwendet wird. Im Weiteren beschäftigen wir uns mit der Schätzung der Anzahl von Quellen in einer Mischung, was für reale Szenarien wichtig ist. Unsere Arbeit an der Schätzung der Anzahl war motiviert durch eine Studie, die zeigt, wie wir Menschen diese Aufgabe angehen. Dies hat uns dazu veranlasst, eigene Hörexperimente durchzuführen, die bestätigten, dass Menschen nur in der Lage sind, die Anzahl von bis zu vier Quellen korrekt abzuschätzen. Um nun die Frage zu beantworten, ob Maschinen dies ähnlich gut können, stellen wir eine DNN-Architektur vor, die erlernt hat, die Anzahl der gleichzeitig sprechenden Sprecher zu ermitteln. Die Ergebnisse zeigen Verbesserungen im Vergleich zu anderen Methoden, aber vor allem auch im Vergleich zu menschlichen Hörern. Sowohl bei der Quellentrennung als auch bei der Schätzung der Anzahl an Quellen ist ein Kernbeitrag dieser Arbeit das Konzept der “Modulation”, welches wichtig ist, um die Strategien von Menschen mittels Computern nachzuahmen. Unsere vorgeschlagene Common Fate Transformation ist eine adäquate Darstellung, um die Überlappung von Signalen für die Trennung zugänglich zu machen und eine Inspektion unseres DNN-Zählmodells ergab schließlich, dass sich auch hier modulationsähnliche Merkmale ﬁnden lassen

Proceedings of the first Symposium of the ICTM Study Group on Sound, Movement, and the Sciences (SoMoS)

Author
Publication venue
Publication date: 17/09/2021
Field of study

MPG.PuRe

Bittm: A core biterms-based topic model for targeted analysis

Author: Chen L
Li L
Wang J
Wu X
Publication venue: 'MDPI AG'
Publication date: 18/05/2022
Field of study

While most of the existing topic models perform a full analysis on a set of documents to discover all topics, it is noticed recently that in many situations users are interested in fine-grained topics related to some specific aspects only. As a result, targeted analysis (or focused analysis) has been proposed to address this problem. Given a corpus of documents from a broad area, targeted analysis discovers only topics related with user-interested aspects that are expressed by a set of user-provided query keywords. Existing approaches for targeted analysis suffer from problems such as topic loss and topic suppression because of their inherent assumptions and strategies. Moreover, existing approaches are not designed to address computation efficiency, while targeted analysis is supposed to provide responses to user queries as soon as possible. In this paper, we propose a core BiTerms-based Topic Model (BiTTM). By modelling topics from core biterms that are potentially relevant to the target query, on one hand, BiTTM captures the context information across documents to alleviate the problem of topic loss or suppression; on the other hand, our proposed model enables the efficient modelling of topics related to specific aspects. Our experiments on nine real-world datasets demonstrate BiTTM outperforms existing approaches in terms of both effectiveness and efficiency

OPUS - University of Technology Sydney

Between Occitania and Al-Andalus: Reconsidering the Emergence of Troubadour Melody Through Algorithmic Analysis

Author: da Rosa Guimaraes Veronica Maria
Publication venue: CUNY Academic Works
Publication date: 01/09/2021
Field of study

How the musical and poetic traditions of the troubadours arose remains unknown, despite a century of scholarship that has attempted to account for their seemingly ex nihilo appearance in late twelfth-century Europe. Scholarly debate was particularly intense during the first half of the twentieth century and revolved around two competing theories: the Andalusi theory, which linked the troubadours to the poetic-musical traditions of medieval Muslim Iberia (also known by its Arabic name al-Andalus), and the Aquitanian theory, which argued that the troubadours were rooted in the folk and sacred traditions of the Aquitanian region. Since the 1980s, interest in the topic has mostly focused on new evidence that supports the Andalusi theory. However, because of the paucity of musical sources from al-Andalus and the first generation of troubadours, all scholarship on the topic has based itself upon isolated case studies of the lyric texts, extending conclusions drawn from textual analysis into the musical realm. This dissertation, by contrast, focuses upon the emergence of troubadour melody, using algorithmic analysis to perform a large comparative study of three repertoires: (1) the complete corpus of extant and complete troubadour melodies, (2) a sample of 158 melodies from contemporary unwritten Andalusi music of Morocco, and (3) 380 melodies from the sacred repertoires of Saint Martial of Limoges. I have used two of the most popular algorithms in bioinformatics – the Pairwise Sequence Alignment algorithm (PSA) and the Multiple Sequence Alignment (MSA) – to compare the melodies, and study common musical idioms. I found three melodic pairs between troubadour and contemporary Moroccan Andalusi melodies, thus demonstrating the existence of musical exchange between Occitania and al-Andalus. In addition, based on common musical idioms (such as common ways of beginning a melody and shared motives) found among all three traditions, I posit that the boundaries between the sacred and the secular were fluid, as both musical spheres drew on a pool of well-known unwritten melodies. Thus, I argue that the generic boundaries through which sacred and secular repertoires are theorized today are anachronistic. Finally, I reconstruct two proto-melodies of the troubadour tradition based on the common idioms found

City University of New York

Kulturelle Diversität in der empirischen Rhythmusforschung. Drei Analysen eines Audio-Korpus von Percussion-Ensemblemusik aus Mali

Author: Justin London
Nori Jacoby
Rainer Polak
Publication venue: 'Gesellschaft fur Musiktheorie (GMTH)'
Publication date: 01/01/2016
Field of study

Die Erörterung kultureller Varianz sowohl zwischen als auch innerhalb von Gesellschaften leidet unter der Neigung, das je eigene kulturelle Umfeld als ›natürlich‹ wahrzunehmen. Diese Neigung ist in der Musikforschung nicht zuletzt als eurozentrisches Bias nach wie vor präsent. Unser Artikel möchte zu punktuellen Korrekturen dieses Bias im Feld der empirischen Rhythmusforschung beitragen. Wir analysieren einen Korpus von Audio-Aufnahmen aus Mali und untersuchen dabei das Verhältnis von Rhythmus und Metrum (Studie 1), den theoretischen Status ungleichmäßiger Grundschlagsunterteilung (›Swing-Timing‹, Studie 2) und die Interaktion musikalischer Rollen bei der Ensemble-Synchronisation (Studie 3). Discussions of cultural variation, both between and within societies, suffer from the disposition to conceive of one’s own cultural environment as ›natural.‹ This disposition is still present not least as a euro-centric bias in the study of music. Our article strives to offer a partial correction to the lingering euro-centric bias in the field of empirical rhythm research. We analyze a corpus of audio recordings from Mali, wherein we study three issues: rhythm-meter relations (study 1), the theoretical status of uneven beat subdivision or ›swing timing‹ (study 2), and the interaction amongst musical roles in ensemble synchronization (study 3)

Directory of Open Access Journals

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets