Search CORE

92 research outputs found

Towards building a Deep Learning based Automated Indian Classical Music Tutor for the Masses

Author: Kulkarni Chetan
Pendyala Vishnu S.
Vadlamudi Lokesh
Yadav Nupur
Publication venue: SJSU ScholarWorks
Publication date: 01/12/2022
Field of study

Music can play an important role in the well-being of the world. Indian classical music is unique in its requirement for rigorous, disciplined, expert-led training that typically goes on for years before the learner can reach a reasonable level of performance. This keeps many, including the first author of this paper, away from mastering the skill. The problem is particularly compounded in rural areas, where the available expertise may be limited and prohibitively expensive, but the interest in learning classical music still prevails, nevertheless. Machine Learning has been complementing, enhancing, and replacing many white-collar jobs and we believe it can help with this problem as well. This paper describes efforts at using Machine Learning techniques, particularly, Long Short-Term Memory for building a system that is a step toward provisioning an Indian Classical Music Tutor for the masses. The system is deployed in the cloud using orchestrated containerization for potential worldwide access, load balancing, and other robust features

Directory of Open Access Journals

SJSU ScholarWorks

DeepSRGM -- Sequence Classification and Ranking in Indian Classical Music with Deep Learning

Author: Chowdhary Girish
Madhusudhan Sathwik Tejaswi
Publication venue
Publication date: 15/02/2024
Field of study

A vital aspect of Indian Classical Music (ICM) is Raga, which serves as a melodic framework for compositions and improvisations alike. Raga Recognition is an important music information retrieval task in ICM as it can aid numerous downstream applications ranging from music recommendations to organizing huge music collections. In this work, we propose a deep learning based approach to Raga recognition. Our approach employs efficient pre possessing and learns temporal sequences in music data using Long Short Term Memory based Recurrent Neural Networks (LSTM-RNN). We train and test the network on smaller sequences sampled from the original audio while the final inference is performed on the audio as a whole. Our method achieves an accuracy of 88.1% and 97 % during inference on the Comp Music Carnatic dataset and its 10 Raga subset respectively making it the state-of-the-art for the Raga recognition task. Our approach also enables sequence ranking which aids us in retrieving melodic patterns from a given music data base that are closely related to the presented query sequence

arXiv.org e-Print Archive

Hindustani raga and singer classification using 2D and 3D pose estimation from video recordings

Author: Clarke Alison
Clayton Martin
Li Jin
Weinzierl Marion
Publication venue: Taylor and Francis Group
Publication date: 03/04/2024
Field of study

Using pose estimation with video recordings, we apply an action recognition machine learning algorithm to demonstrate the use of the movement information to classify singers and the ragas (melodic modes) they perform. Movement information is derived from a specially recorded video dataset of solo Hindustani (North Indian) raga recordings by three professional singers each performing the same nine ragas, a smaller duo dataset (one singer with tabla accompaniment) as well as recordings of concert performances by the same singers. Data is extracted using pose estimation algorithms, both 2D (OpenPose) and 3D. A two-pathway convolutional neural network structure is proposed for skeleton action recognition to train a model to classify 12-second clips by singer and raga. The model is capable of distinguishing the three singers on the basis of movement information alone. For each singer, it is capable of distinguishing between the nine ragas with a mean accuracy of 38.2% (with the most successful model). The model trained on solo recordings also proved effective at classifying duo and concert recordings. These findings are consistent with the view that while the gesturing of Indian singers is idiosyncratic, it remains tightly linked to patterns of melodic movement: indeed we show that in some cases different ragas are distinguishable on the basis of movement information alone. A series of technical challenges are identified and addressed, with code shared alongside audiovisual data to accompany the paper

Durham Research Online

Exploring deep learning based methods for information retrieval in Indian classical music

Author: Madhusudhan Sathwik Tejaswi
Publication venue
Publication date: 01/05/2019
Field of study

A vital aspect of Indian Classical Music (ICM) is Raga, which serves as a melodic framework for compositions and improvisations alike. Raga Recognition is an important music information retrieval task in ICM as it can aid numerous downstream applications ranging from music recommendations to organizing huge music collections. In this work, we propose a deep learning based approach to Raga recognition. Our approach employs efficient pre-possessing and learns temporal sequences in music data using Long Short Term Memory based Recurrent Neural Networks (LSTM-RNN). We train and test the network on smaller sequences sampled from the original audio while the final inference is performed on the audio as a whole. Our method achieves an accuracy of 88.1% and 97 % during inference on the Comp Music Carnatic dataset and its 10 Raga subset respectively making it the state of-the-art for the Raga recognition task. Our approach also enables sequence ranking which aids us in retrieving melodic patterns from a given music data base that are closely related to the presented query sequence

Illinois Digital Environment for Access to Learning and Scholarship Repository

Repertoire-Specific Vocal Pitch Data Generation for Improved Melodic Analysis of Carnatic Music

Author: Genís Plaja-Roglans
Lara Pearson
Marius Miron
Thomas Nuttall
Xavier Serra
Publication venue: 'Ubiquity Press, Ltd.'
Publication date: 01/06/2023
Field of study

Deep Learning methods achieve state-of-the-art in many tasks, including vocal pitch extraction. However, these methods rely on the availability of pitch track annotations without errors, which are scarce and expensive to obtain for Carnatic Music. Here we identify the tradition-related challenges and propose tailored solutions to generate a novel, large, and open dataset, the Saraga-Carnatic-Melody-Synth (SCMS), comprising audio mixtures and time-aligned vocal pitch annotations. Through a cross-cultural evaluation leveraging this novel dataset, we show improvements in the performance of Deep Learning vocal pitch extraction methods on Indian Art Music recordings. Additional experiments show that the trained models outperform the currently used heuristic-based pitch extraction solutions for the computational melodic analysis of Carnatic Music and that this improvement leads to better results in the musicologically relevant task of repeated melodic pattern discovery when evaluated using expert annotations. The code and annotations are made available for reproducibility. The novel dataset and trained models are also integrated into the Python package compIAM1 which allows them to be used out-of-the-box

Directory of Open Access Journals

Computational Analysis of Greek folk music of the Aegean islands

Author: Christodoulou Anna Maria
Χριστοδούλου Αννα Μαρία
Publication venue
Publication date: 01/01/2022
Field of study

Αν και έχουν αναπτυχθεί νεότερα και πιο ανεπτυγμένα μοντέλα υπολογιστικής μουσικής ανάλυσης με στόχο την αύξηση διαθέσιμης πληροφορίας στον κλάδο της μουσικολογίας, υπάρχει πολύ λίγη έρευνα στην υπολογιστική ανάλυση δημοτικής μουσικής γενικότερα και ελληνικής δημοτικής μουσικής ειδικότερα. Στόχος της παρούσας εργασίας είναι η διερεύνηση ποικίλων τύπων μουσικών χαρακτηριστικών και προτύπων στη δημοτική μουσική των νησιών του Αιγαίου και η παροχή χρήσιμης πληροφορίας σχετικά με τη δομή και το περιεχόμενο του εν λόγω είδους. Επιπρόσθετα, με στόχο τη σύγκριση μουσικών αποσπασμάτων χορών Συρτού και Μπάλου, αλλά και γεωγραφικών περιοχών από τις οποίες προέρχονται, 73 αποσπάσματα συγκεντρώθηκαν συνολικά σε μια βάση δεδομένων και αναλύθηκαν. Η εξαγωγή χαρακτηριστικών και η ανάλυση προτύπων ανέδειξαν μελωδικές και ρυθμικές διαφορές τόσο ανάμεσα στα δύο είδη χορών όσο και στις διάφορες νησιωτικές περιοχές, ενώ υπήρξαν επίσης ποικίλες ομοιότητες σε όλο το σύνολο των δεδομένων.While newer, advanced computational music analysis models have been developed with the intentions of increasing available information in this field, very little research exists on the computational analysis of folk music in general and Greek folk music in specific. The aim of this study was to examine various types of musical features and patterns in the folk music of the Aegean islands and provide useful information about the structure and the content of this music style. In addition, to compare the tunes of Syrtos and Mpalos dances, but also the various island regions from which they originate, a total of 73 tunes were included in the constructed dataset and the analyses. Feature extraction and pattern analysis revealed that there are indeed melodic and temporal differences both between the two dance types and between the island regions, while there were also various important similarities throughout the whole dataset

Pergamos : Unified Institutional Repository / Digital Library Platform of the National and Kapodistrian University of Athens

Listening for the Cosmic Other: Postcolonial Approaches to Music in the Space Age

Author: Zalman Paige
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2019
Field of study

As government programs such as NASA and SETI seek signs of intelligent life in space and privately-funded programs such as SpaceX finalize plans to colonize Mars in the coming decades, representations of space and extraterrestrial life in American culture have become increasingly relevant. Focusing on Jóhann Jóhannsson’s musical score for Denis Villeneuve’s science-fiction film Arrival (2016), Terry Riley’s Sun Rings (2002) for string quartet, chorus, and recorded space sounds, and former International Space Station Commander Chris Hadfield’s “Songs about Space” Spotify playlist, my research problematizes the ways in which composers, musicians, and even astronauts depict alterity through music and reinforce colonial narratives about outer space. Drawing upon the work of postcolonial theorists and musicologists such as Charles Forsdick (2003), Olivia A. Bloechl (2008), Ania Loomba (2015), and others, this thesis argues that musical depictions of extraterrestrials and space exploration, more generally, reveal the potential for discrimination, misrepresentation, and abuses of power to emerge from space colonization. But, back on Earth, this study also suggests that such fraught theoretical relationships between human colonists and extraterrestrials echo the real-life suffering of colonized indigenous groups and the necessity of decolonization

The Research Repository @ WVU (West Virginia University)

Computational analysis of world music corpora

Author: Panteli Maria
Publication venue: 'Queen Mary University of London'
Publication date: 02/05/2018
Field of study

PhDThe comparison of world music cultures has been considered in musicological research since the end of the 19th century. Traditional methods from the field of comparative musicology typically involve the process of manual music annotation. While this provides expert knowledge, the manual input is timeconsuming and limits the potential for large-scale research. This thesis considers computational methods for the analysis and comparison of world music cultures. In particular, Music Information Retrieval (MIR) tools are developed for processing sound recordings, and data mining methods are considered to study similarity relationships in world music corpora. MIR tools have been widely used for the study of (mainly) Western music. The first part of this thesis focuses on assessing the suitability of audio descriptors for the study of similarity in world music corpora. An evaluation strategy is designed to capture challenges in the automatic processing of world music recordings and different state-of-the-art descriptors are assessed. Following this evaluation, three approaches to audio feature extraction are considered, each addressing a different research question. First, a study of singing style similarity is presented. Singing is one of the most common forms of musical expression and it has played an important role in the oral transmission of world music. Hand-designed pitch descriptors are used to model aspects of the singing voice and clustering methods reveal singing style similarities in world music. Second, a study on music dissimilarity is performed. While musical exchange is evident in the history of world music it might be possible that some music cultures have resisted external musical influence. Low-level audio features are combined with machine learning methods to find music examples that stand out in a world music corpus, and geographical patterns are examined. The last study models music similarity using descriptors learned automatically with deep neural networks. It focuses on identifying music examples that appear to be similar in their audio content but share no (obvious) geographical or cultural links in their metadata. Unexpected similarities modelled in this way uncover possible hidden links between world music cultures. This research investigates whether automatic computational analysis can uncover meaningful similarities between recordings of world music. Applications derive musicological insights from one of the largest world music corpora studied so far. Computational analysis as proposed in this thesis advances the state-of-the-art in the study of world music and expands the knowledge and understanding of musical exchange in the world.Queen Mary Principal’s research studentship

Queen Mary Research Online