Search CORE

38 research outputs found

Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors

Author: Kowalski Matthieu
Papadopoulos Hélène
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 02/04/2013

This paper investigates the use of musical priors for sparse expansion of music audio signals on an overcomplete dual-resolution dictionary taken from the union of two orthonormal bases that can describe both transient and tonal components of a music audio signal. More specifically, chord and metrical structure information are used to build a structured model that takes into account dependencies between coefficients of the decomposition, both for the tonal and for the transient layer. A denoising task is used to provide a proof of concept of the proposed musical priors. Several configurations of the model are analyzed. Evaluation on monophonic and complex polyphonic excerpts of real music signals shows that the proposed approach provides results whose quality, measured by the signal-to-noise ratio, is competitive with state-of-the-art approaches and more coherent with the semantic content of the signal. A detailed analysis of the model in terms of sparsity and interpretability of the representation is also provided, and shows that the model is capable of giving a relevant and legible representation of Western tonal music audio signals.

HAL-CentraleSupelec

20 Years of Automatic Chord Recognition from Audio

Author: Gómez E
O'Hanlon K
Pauwels J
Sandler M
Publication venue: Proceedings of the 20th Conference of the International Society for Music Information Retrieval (ISMIR)
Publication date: 07/11/2019

In 1999, Fujishima published "Realtime Chord Recognition of Musical Sound: a System using Common Lisp Music". This paper kickstarted an active research topic that has been popular in and around the ISMIR community. The field of Automatic Chord Recognition (ACR) has evolved considerably from early knowledge-based systems towards data-driven methods, with neural network approaches arguably being central to current ACR research. Nonetheless, many of its core issues were already addressed or referred to in the Fujishima paper. In this paper, we review those twenty years of ACR according to these issues. We furthermore attempt to frame current directions in the field in order to establish some perspective for future research.

Automated analysis and transcription of rhythm data and their use for composition

Author: Boenn Georg
Publication date: 01/02/2011

OPUS

Learning, Probability and Logic: Toward a Unified Approach for Content-Based Music Information Retrieval

Author: Al Farabi
Anglade
Arabi
Aucouturier
Bartsch
Bello
Bengio
Bergmann
Besold
Blockeel
Boulanger-Lewandowski
Burgoyne
Böck
Casey
Cella
Cho
Crane
d'Avila Garcez
Dannenberg
Davis
De Raedt
Deng
Dobrian
Domingos
Donadello
Dovey
Downie
Ellis
Flach
Foote
Friedman
Fujishima
Gaudefroy
Getoor
Grosche
Gurevych
Haack
Hamel
Harte
Herremans
Humphrey
Jain
Jernite
Kameoka
Kempf
Kernfeld
Kersting
Kim
Kimmig
Kindermann
Kok
Koller
Koops
Korzeniowski
Krumhansl
Kuzelka
Lafferty
Lee
Leivant
Lew
Lewin
Liu
Lostanlen
Malkin
Mallat
Mallory
Maresz
Marsík
Mauch
McFee
McVicar
Mihalkova
Minsky
Mishkin
Morales
Muggleton
Muller
Murphy
Müller
Ni
Nilsson
Ojima
Orio
Oudre
Pachet
Paiement
Pan
Papadopoulos
Papai
Paulus
Pauwels
Pawar
Pearl
Pereira
Poole
Poon
Prince
Pápai
Raedt
Rameau
Ramirez
Repetto
Richardson
Riedel
Riemann
Russell
Salamon
Sarkhel
Schedl
Schoenberg
Schuller
Serrà
Serrá
Sheh
Shenoy
Sigtia
Singla
Smith
Snidaro
Socher
Srinivasamurthy
Sutton
Sztyler
Thimm
Tsushima
Van Baelen
Van Haaren
Venugopal
Wang
Widmer
Wu
Zalkow
Zhou
Šourek
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2019

Within the last 15 years, the field of Music Information Retrieval (MIR) has made tremendous progress in the development of algorithms for organizing and analyzing the ever-growing and varied body of music and music-related data available digitally. However, the development of content-based methods to enable or improve multimedia retrieval remains a central challenge. In this perspective paper, we critically look at the problem of automatic chord estimation from audio recordings as a case study of content-based algorithms, and point out several bottlenecks in current approaches: expressiveness and flexibility are obtained at the expense of robustness and vice versa; available multimodal sources of information are underexploited; the modeling of multi-faceted and strongly interrelated musical information is limited by current architectures; models are typically restricted to short-term analysis that does not account for the hierarchical temporal structure of musical signals. Dealing with music data requires the ability to tackle both uncertainty and complex relational structure at multiple levels of representation. Traditional approaches have generally treated these two aspects separately, with probability and learning being the usual way to represent uncertainty in knowledge, and logical representation being the usual way to represent knowledge and complex relational information. We advocate that the identified hurdles of current approaches could be overcome by recent developments in the area of Statistical Relational Artificial Intelligence (StarAI), which unifies probability, logic and (deep) learning. We show that existing approaches used in MIR find powerful extensions and unifications in StarAI, and we explain why we think it is time to consider the new perspectives offered by this promising research field.

HAL-CentraleSupelec

Directory of Open Access Journals

HAL-Rennes 1