7,349 research outputs found

    Computer Simulation of Musical Evolution: A Lesson from Whales

    Get PDF
    Simulating musical creativity using computers needs more than the ability to devise elegant computational implementations of sophisticated algorithms. It requires, firstly, an understanding of what phenomena might be regarded as music; and, secondly, an understanding of the nature of such phenomena — including their evolutionary history, their recursive-hierarchic structure, and the mechanisms by which they are transmitted within cultural groups. To understand these issues it is fruitful to compare human music, and indeed human language, with analogous phenomena in other areas of the animal kingdom. Whale song, specifically that of the humpback (Megaptera novaeangeliae), possesses many structural and functional similarities to human music (as do certain types of birdsong). Using a memetic perspective, this paper compares the “musilanguage” of humpbacks with the music of humans, and aims to identify a number of shared characteristics. A consequence of nature and nurture, these commonalities appear to arise partly from certain constraints of perception and cognition (and thus they determine an aspect of the environment within which the “musemes” (musical memes) constituting whale vocalizations and human music is replicated), and partly from the social-emotive-embodied and sexual-selective nature of musemic transmission. The paper argues that Universal-Darwinian forces give rise to uniformities of structure in phenomena we might regard as “music”, irrespective of the animal group — certain primates, cetaceans or birds - within which it occurs. It considers the extent to which whale song might be regarded as creative, by invoking certain criteria used to assess this attribute in human music. On the basis of these various comparisons, the paper concludes by attempting to draw conclusions applicable to those engaged in designing evolutionary music simulation/generation algorithms

    Speaker segmentation and clustering

    Get PDF
    This survey focuses on two challenging speech processing topics, namely: speaker segmentation and speaker clustering. Speaker segmentation aims at finding speaker change points in an audio stream, whereas speaker clustering aims at grouping speech segments based on speaker characteristics. Model-based, metric-based, and hybrid speaker segmentation algorithms are reviewed. Concerning speaker clustering, deterministic and probabilistic algorithms are examined. A comparative assessment of the reviewed algorithms is undertaken, the algorithm advantages and disadvantages are indicated, insight to the algorithms is offered, and deductions as well as recommendations are given. Rich transcription and movie analysis are candidate applications that benefit from combined speaker segmentation and clustering. © 2007 Elsevier B.V. All rights reserved

    On the design of an ECOC-compliant genetic algorithm

    Get PDF
    Genetic Algorithms (GA) have been previously applied to Error-Correcting Output Codes (ECOC) in state-of-the-art works in order to find a suitable coding matrix. Nevertheless, none of the presented techniques directly take into account the properties of the ECOC matrix. As a result the considered search space is unnecessarily large. In this paper, a novel Genetic strategy to optimize the ECOC coding step is presented. This novel strategy redefines the usual crossover and mutation operators in order to take into account the theoretical properties of the ECOC framework. Thus, it reduces the search space and lets the algorithm to converge faster. In addition, a novel operator that is able to enlarge the code in a smart way is introduced. The novel methodology is tested on several UCI datasets and four challenging computer vision problems. Furthermore, the analysis of the results done in terms of performance, code length and number of Support Vectors shows that the optimization process is able to find very efficient codes, in terms of the trade-off between classification performance and the number of classifiers. Finally, classification performance per dichotomizer results shows that the novel proposal is able to obtain similar or even better results while defining a more compact number of dichotomies and SVs compared to state-of-the-art approaches

    Methodological considerations concerning manual annotation of musical audio in function of algorithm development

    Get PDF
    In research on musical audio-mining, annotated music databases are needed which allow the development of computational tools that extract from the musical audiostream the kind of high-level content that users can deal with in Music Information Retrieval (MIR) contexts. The notion of musical content, and therefore the notion of annotation, is ill-defined, however, both in the syntactic and semantic sense. As a consequence, annotation has been approached from a variety of perspectives (but mainly linguistic-symbolic oriented), and a general methodology is lacking. This paper is a step towards the definition of a general framework for manual annotation of musical audio in function of a computational approach to musical audio-mining that is based on algorithms that learn from annotated data. 1

    Musical pattern extraction using genetic algorithms

    Get PDF
    This paper describes a research work in which we study the possibility of applying genetic algorithms to the extraction of musical patterns in monophonic musical pieces. Each individual in the population represents a possible segmentation of the piece being analysed. The goal is to find a segmentation that allows the identification of the most significant patterns of the piece. In order to calculate an individual’s fitness, all its segments are compared among each other. The bigger the area occupied by similar segments the better the quality of the segmentation
    corecore