497 research outputs found

    Score following: An artificially intelligent musical accompanist

    Get PDF
    Score Following is the process by which a musician can be tracked through their performance of a piece, for the purpose of accompanying the musician with the appropriate notes. This tracking is done by following the progress of the musician through the score (written music) of the piece, using observations of the notes they are playing. Artificially intelligent musical accompaniment is where a human musician is accompanied by a computer musician. The computer musician is able to produce musical accompaniment that relates musically to the human performance. Hidden Markov Models (HMMs) are a stochastic modelling tool that can be used to represent real-world systems in a variety of domains. This project discusses how HMMs can be used in the domain of Score Following and describes the construction and evaluation of a score following system that uses HMMs to implement score following. It explores the hypothesis that using an HMM to represent a musical score is an efficient and practical way to implement score following, and that in particular this method is suitable for providing real-time accompaniment to a human performer. The score followers developed during this project are tested and compared against other score following systems and against human musicians. The resulting performances support the project hypothesis to a large extent

    Pattern Matching Techniques for Replacing Missing Sections of Audio Streamed across Wireless Networks

    Get PDF
    Streaming media on the Internet can be unreliable. Services such as audio-on-demand drastically increase the loads on networks; therefore, new, robust, and highly efficient coding algorithms are necessary. One method overlooked to date, which can work alongside existing audio compression schemes, is that which takes into account the semantics and natural repetition of music. Similarity detection within polyphonic audio has presented problematic challenges within the field of music information retrieval. One approach to deal with bursty errors is to use self-similarity to replace missing segments. Many existing systems exist based on packet loss and replacement on a network level, but none attempt repairs of large dropouts of 5 seconds or more. Music exhibits standard structures that can be used as a forward error correction (FEC) mechanism. FEC is an area that addresses the issue of packet loss with the onus of repair placed as much as possible on the listener's device. We have developed a server--client-based framework (SoFI) for automatic detection and replacement of large packet losses on wireless networks when receiving time-dependent streamed audio. Whenever dropouts occur, SoFI swaps audio presented to the listener between a live stream and previous sections of the audio stored locally. Objective and subjective evaluations of SoFI where subjects were presented with other simulated approaches to audio repair together with simulations of replacements including varying lengths of time in the repair give positive results.</jats:p

    Analysis and application of hash-based similarity estimation techniques for biological sequence analysis

    Get PDF
    In Bioinformatics, a large group of problems requires the computation or estimation of sequence similarity. However, the analysis of biological sequence data has, among many others, three capital challenges: a large amount generated data which contains technology-specific errors (that can be mistaken for biological signals), and that might need to be analyzed without access to a reference genome. Through the use of locality sensitive hashing methods, both the efficient estimation of sequence similarity and tolerance against the errors specific to biological data can be achieved. We developed a variant of the winnowing algorithm for local minimizer computation, which is specifically geared to deal with repetitive regions within biological sequences. Through compressing redundant information, we can both reduce the size of the hash tables required to save minimizer sketches, as well as reduce the amount of redundant low quality alignment candidates. Analyzing the distribution of segment lengths generated by this approach, we can better judge the size of required data structures, as well as identify hash functions feasible for this technique. Our evaluation could verify that simple and fast hash functions, even when using small hash value spaces (hash functions with small codomain), are sufficient to compute compressed minimizers and perform comparable to uniformly randomly chosen hash values. We also outlined an index for a taxonomic protein database using multiple compressed winnowings to identify alignment candidates. To store MinHash values, we present a cache-optimized implementation of a hash table using Hopscotch hashing to resolve collisions. As a biological application of similarity based analysis, we describe the analysis of double digest restriction site associated DNA sequencing (ddRADseq). We implemented a simulation software able to model the biological and technological influences of this technology to allow better development and testing of ddRADseq analysis software. Using datasets generated by our software, as well as data obtained from population genetic experiments, we developed an analysis workflow for ddRADseq data, based on the Stacks software. Since the quality of results generated by Stacks strongly depends on how well the used parameters are adapted to the specific dataset, we developed a Snakemake workflow that automates preprocessing tasks while also allowing the automatic exploration of different parameter sets. As part of this workflow, we developed a PCR deduplication approach able to generate consensus reads incorporating the base quality values (as reported by the sequencing device), without performing an alignment first. As an outlook, we outline a MinHashing approach that can be used for a faster and more robust clustering, while addressing incomplete digestion and null alleles, two effects specific for ddRADseq that current analysis tools cannot reliably detect

    Semantic Models for Machine Learning

    No full text
    In this thesis we present approaches to the creation and usage of semantic models by the analysis of the data spread in the feature space. We aim to introduce the general notion of using feature selection techniques in machine learning applications. The applied approaches obtain new feature directions on data, such that machine learning applications would show an increase in performance. We review three principle methods that are used throughout the thesis. Firstly Canonical Correlation Analysis (CCA), which is a method of correlating linear relationships between two multidimensional variables. CCA can be seen as using complex labels as a way of guiding feature selection towards the underlying semantics. CCA makes use of two views of the same semantic object to extract a representation of the semantics. Secondly Partial Least Squares (PLS), a method similar to CCA. It selects feature directions that are useful for the task at hand, though PLS only uses one view of an object and the label as the corresponding pair. PLS could be thought of as a method that looks for directions that are good for distinguishing the different labels. The third method is the Fisher kernel. A method that aims to extract more information of a generative model than simply by their output probabilities. The aim is to analyse how the Fisher score depends on the model and which aspects of the model are important in determining the Fisher score. We focus our theoretical investigation primarily on CCA and its kernel variant. Providing a theoretical analysis of the method's stability using Rademacher complexity, hence deriving the error bound for new data. We conclude the thesis by applying the described approaches to problems in the various fields of image, text, music application and medical analysis, describing several novel applications on relevant real-world data. The aim of the thesis is to provide a theoretical understanding of semantic models, while also providing a good application foundation on how these models can be practically used

    Sounds of Science: Copyright Infringement in AI Music Generator Outputs

    Get PDF
    The music business is no stranger to disruptive technology. The industry’s apparent comeback from the devastating downturn caused by illegal file sharing seems to have arrived just in time for what may be an even more disruptive technological phenomenon: artificial intelligence (“AI”). Much has been said about the implications of AI-generated music, ranging from issues of ownership, to rights of publicity. However, there has been surprisingly little discussion of infringement in the AI systems’ outputs. By examining the functionality of AI music generators through the lens of de minimis use case law, this paper will explain how the outputs of AI music generators potentially infringe the exclusive reproduction right granted to musical work and sound recording copyright owners. Going forward, courts and policymakers must not ignore AI’s capacity to undermine our incentives for human authorship, and craft rules that promote a mutually beneficial AI music ecosystem for technology companies and copyright owners alike

    Comparison Structure Analysis

    Get PDF
    This study presents an automatic, computer-aided analytical method called Comparison Structure Analysis (CSA), which can be applied to different dimensions of music. The aim of CSA is first and foremost practical: to produce dynamic and understandable representations of musical properties by evaluating the prevalence of a chosen musical data structure through a musical piece. Such a comparison structure may refer to a mathematical vector, a set, a matrix or another type of data structure and even a combination of data structures. CSA depends on an abstract systematic segmentation that allows for a statistical or mathematical survey of the data. To choose a comparison structure is to tune the apparatus to be sensitive to an exclusive set of musical properties. CSA settles somewhere between traditional music analysis and computer aided music information retrieval (MIR). Theoretically defined musical entities, such as pitch-class sets, set-classes and particular rhythm patterns are detected in compositions using pattern extraction and pattern comparison algorithms that are typical within the field of MIR. In principle, the idea of comparison structure analysis can be applied to any time-series type data and, in the music analytical context, to polyphonic as well as homophonic music. Tonal trends, set-class similarities, invertible counterpoints, voice-leading similarities, short-term modulations, rhythmic similarities and multiparametric changes in musical texture were studied. Since CSA allows for a highly accurate classification of compositions, its methods may be applicable to symbolic music information retrieval as well. The strength of CSA relies especially on the possibility to make comparisons between the observations concerning different musical parameters and to combine it with statistical and perhaps other music analytical methods. The results of CSA are dependent on the competence of the similarity measure. New similarity measures for tonal stability, rhythmic and set-class similarity measurements were proposed. The most advanced results were attained by employing the automated function generation – comparable with the so-called genetic programming – to search for an optimal model for set-class similarity measurements. However, the results of CSA seem to agree strongly, independent of the type of similarity function employed in the analysis.Tämä tutkimus esittelee uuden musiikkianalyyttisen metodin, vertailurakenneanalyysin (VRA, engl. Comparison Structure Analysis, CSA), jonka avulla voidaan analysoida musiikin eri ulottuvuuksia, kuten harmoniaa tai rytmiä. VRA:n ideana on mitata tietyn ennalta valitun musiikillisen rakenteen, vaikkapa jonkin sävelasteikon, vallitsevuutta musiikin kullakin ajanhetkellä. Tämä edellyttää kolmea asiaa. Ensiksi, intuitiivisesti tai muulla tavoin valittu musiikillinen piirre, jota tässä kutsutaan yleisesti vertailurakenteeksi, on esitettävä matemaattisessa muodossa, esimerkiksi matemaattisen avaruuden vektorina. Vertailurakenne voidaan muodostaa myös useiden eri tyyppisten, musiikin eri ulottuvuuksiin liittyvien tietorakenteiden yhdistelmänä. Toiseksi, analysoitava musiikillinen data, esimerkiksi musiikista muodostetut sävelluokat (C:stä H:hon), on pystyttävä ryhmittelemään vastaavantyyppisiksi objekteiksi. Lisäksi tarvitaan vielä matemaattinen funktio, joka kykenee mittaamaan valitun vertailurakenteen ja musiikista ryhmiteltyjen segmenttien välistä samankaltaisuutta tai vastaavasti, etäisyyttä. Toisin sanoen, VRA:ssa verrataan valittua vertailurakennetta, esimerkiksi diatonista asteikkoa, kaikkiin musiikista segmentoituihin vastaavantyyppisiin objekteihin. Mittaustulokset saadaan lukuarvoina yleensä välillä 0–1, jossa arvo 1 voi – mittausfunktion luonteesta riippuen – tarkoittaa joko täydellistä samankaltaisuutta tai suurinta mahdollista etäisyyttä. Havainnollisena analyysin kohteena voisimme kuvitella länsimaista taidemusiikkia edustavan sävellyksen, jossa siirrytään keskiaikaisesta diatonisesta musiikista historiallisesti ja tyylillisesti kohti 1900-luvun atonaalista musiikkia. Mikäli tässä tapauksessa vertailurakenteena käytettäisiin mainittua diatonista asteikkoa, VRA paljastaisi musiikissa korvinkin havaittavan ei-diatonisoitumisen. Tulosten esittämisellä esimerkiksi ajallisia muutoksia esittävin mittauskäyrin tai luokittelua havainnollistavin keskiarvopistein on merkittävä asema analyysissa. VRA sijoittuu perinteisen musiikkianalyysin ja tietokonetta hyödyntävien musiikin sisältöhakuun (music information retrieval, MIR) keskittyvien tekniikoiden välimaastoon. Sen avulla voidaan tunnistaa ja mitata perinteiselle musiikkianalyysille tyypillisia kohteita kuten karakteristisia rytmejä, sävelluokkajoukkoja, joukkoluokkia, tonaliteetteja ja käänteiskontrapunkteja soveltamalla MIR:lle tyypillisiä segmentointi- ja vertailualgoritmeja. Vertailurakenneanalyysin suurimmaksi haasteeksi on osoittautunut musiikillisten segmenttien muodostamiseen tarvittavan automaattisen algoritmin kehittäminen. Voidaan näet osoittaa, että sama musiikillinen data on useimmiten mahdollista segmentoida – musiikillisesti mielekkäästi – monella eri tavalla. Silloin, kun kyse on harmoniaan liittyvistä objekteista, tehtävä on erityisen haastava, sillä tällöin musiikin säveltapahtumia joudutaan tarkastelemaan niin ajallisessa kuin vertikaalisessakin suunnassa. Musiikin tonaalisuudessa ja sävelluokkasisällössä tapahtuvien muutosten analysoimista varten tässä tutkimuksessa kehitettiinkin kaksi erilaista segmentointialgoritmia, jotka muodostavat musiikillisesta datasta osin limittäisiä sävelluokkajoukkoja. Metodien erilaisuudesta huolimatta ‘herkkyysanalyysillä’ voitiin osoittaa, että molemmat menetelmät ovat hyvin vähän riippuvaisia syötetyn datan luonteesta; niiden avulla saadut tulokset olivat hyvin samankaltaisia. VRA:lla saatuja tuloksia voidaan edelleen tarkastella myös tilastollisen merkitsevyyden näkökulmasta. Koska VRA:lla pystytään havaitsemaan musiikin eri dimensioissa tapahtuvia muutoksia, tämän johdannaisena voidaan tutkia myös sitä, missä määrin jokin sävellys on tyylillisesti koherentti verrattuna johonkin toiseen sävellykseen eli kummassa muutokset ovat tarkasteltavan ominaisuuden suhteen keskimäärin pienemmät ja kummassa suuremmat. Lisäksi VRA tarjoaa mahdollisuuden musiikin luokitteluun saatujen mittausarvojen perusteella: mitä enemmän musiikillisia parametrejä ja useampia vertailurakenteita analyysissa hyödynnetään, sitä tarkemmin sävellyksiä voidaan luokitella. Niinpä VRA:n keinoja voidaan tulevaisuudessa kuvitella käytettävän myös musiikin sisältöhakuun (MIR). Tällaisessa tapauksessa vertailurakenne tai -rakenteet voitaisiin ‘laskea’ musiikillisesta datasta suoraan jollakin matemaattisella menetelmällä – kuten pääkomponenttianalyysilla – etukäteen suoritettavan intuitiivisen valinnan sijaan. Tutkimuksen tuloksiin lukeutuvat myös useat VRA:n tarpeisiin kehitetyt samankaltaisuusmittarit. Näistä mielenkiintoisin lienee sävelluokkajoukkojen välisen samankaltaisuuden mittaamiseen kehitetty funktio expcos, joka löytyi ns. geneettisen ohjelmoinnin avulla. Mainitussa kokeessa tietokoneella generoitiin arviolta n. 800 000 samankaltaisuusmittaria, joiden tuottamia tuloksia verrattiin ihmisten tekemiin samankaltaisuusarvioihin. Niistä n. 450 osoittautui käyttökelpoiseksi. Sensitiivisyysanalyysi osoitti, että em. funktio paitsi korreloi voimakkaammin empiiristen samankaltaisuusarvioiden kanssa, on VRA:ssa myös robustimpi kuin kenties tunnetuin samaan tarkoitukseen kehitetty funktio, REL (David Lewin, 1980). Käytännössä tällä ei ole kuitenkaan merkitystä: REL toimii VRA:ssa aivan yhtä hyvin kuin expcos. VRA:n avulla musiikkia tarkastellaan ikään kuin jonkinlaisena tilastollisena sävelmassana, eikä se niin muodoin kykene kertomaan siitä, miten analysoitava musiikki on yksityiskohtien tasolla sävelletty; perinteiset musiikkianalyysimenetelmät pureutuvat tehtävään paremmin. Toisaalta, tämä ei ole VRA:n tarkoituskaan vaan päinvastoin, sen avulla sävellysten muodosta pystytään muodostamaan laajoja yleiskuvia, jotka ovat useimmiten havaintokykymme ulottumattomissa. Vertailurakenneanalyysi on hyvin joustava menetelmä. Mikään ei nimittäin estä tarkastelemasta musiikin eri dimensioista saatuja mittaustuloksia keskenään ja näin etsimästä niiden välisiä yhteyksiä. Lisäksi menetelmän periaatteita voitaisiin kuvitella käytettävän yleisemminkin, esimerkiksi linnunlaulun muodon tarkasteluun tai vaikkapa jokipuron solinasta löytyvien toistuvien jaksojen havainnointiin. VRA:n periaatteita voidaankin soveltaa mihin tahansa numeerisesti diskreettiin muotoon saatettuun aikasarjaan.Siirretty Doriast

    Population Structure and Frankish Ethnogenesis (AD 400-900)

    Get PDF
    abstract: The transition from Late Antiquity to Early Medieval Europe (ca. AD 400-900) is often characterized as a period of ethnogenesis for a number of peoples, such as the Franks. Arising during protracted contact with the Roman Empire, the Franks would eventually form an enduring kingdom in Western Europe. However, there is little consensus about the processes by which they formed an ethnic group. This study takes a fresh look at the question of Frankish ethnogenesis by employing a number of theoretical and methodological subdisciplines, including population genetics and ethnogenetic theory. The goals of this work were 1) to validate the continued use of biological data in questions of historical and archaeological significance; and 2) to elucidate how Frankish population structure changed over time. Toward this end, measurements from the human dentition and crania were subjected to rigorous analytical techniques and interpreted within a theoretical framework of ethnogenetic life cycles. Results validate existing interpretations of intra-regional biological continuity over time. However, they also reveal that 1) there are clear biological and geographical differences between communities, and 2) there are hints of diachronic shifts, whereby some communities became more similar to each other over time. These conclusions complement current ethnohistoric work arguing for the increasing struggle of the Frankish kingdom to unify itself when confronted by strong regionally-based politics.Dissertation/ThesisDoctoral Dissertation Anthropology 201
    corecore