Search CORE

3 research outputs found

Automatic Transcription of Drum Strokes in Carnatic Music

Author: Chandramouli Kausthubh
Sethares William
Publication venue
Publication date: 28/11/2022
Field of study

The mridangam is a double-headed percussion instrument that plays a key role in Carnatic music concerts. This paper presents a novel automatic transcription algorithm to classify the strokes played on the mridangam. Onset detection is first performed to segment the audio signal into individual strokes, and feature vectors consisting of the DFT magnitude spectrum of the segmented signal are generated. A multi-layer feedforward neural network is trained using the feature vectors as inputs and the manual transcriptions as targets. Since the mridangam is a tonal instrument tuned to a given tonic, tonic invariance is an important feature of the classifier. Tonic invariance is achieved by augmenting the dataset with pitch-shifted copies of the audio. This algorithm consistently yields over 83% accuracy on a held-out test dataset.Comment: 7 pages, 9 figure

arXiv.org e-Print Archive

Akshara transcription of mrudangam strokes in carnatic music

Author: Kumar J Chaitanya
Kuriakose Jom
Murthy Hema A.
Sarala Padi
Sivaraman Umayalpuram K
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

Percussion instruments play a significant role in/nCarnatic music concerts. The percussion artist enjoys a great/ndegree of freedom in improvising within the defined tala/nstructure/nof a composition. The objective of this paper is to transcribe the/nimprovisations, treating the percussion strokes as syllables or/naksharas./nOnset detection is performed to segment the waveform at/neach/nakshara/n. Using the transcriptions from the training data,/na three-state Hidden Markov Model is built for each/nakshara/n./nThe language model is derived from the training data. Testing/nis also performed isolated style using onset detection to segment/nthe phrase, and the language model to correct the transcription./nTranscription is performed on both concert recordings and studio/nrecordings. This technique yields upto/n≈/n96%/naccuracy on studio/nrecordings and/n≈/n76%/naccuracy for concert recordings./nAs the mrudangam/n1/nis an instrument that is based on tonic;/ntonic normalised features, namely, Cent Filterbank Cepstral/ncoefficients are used. It is shown that tonic normalisation helps/nin transcription across different tonics.This research was partly funded by the European Research/nCouncil under the European Unions Seventh Framework Pro-/ngram, as part of the CompMusic project (ERC grant agreement/n267583)

Crossref

UPF Digital Repository