Search CORE

20 research outputs found

Automatic musical instrument recognition for multimedia indexing

Author: Malheiro Frederico Alberto Santos de Carteado
Publication venue: Faculdade de Ciências e Tecnologia
Publication date: 01/01/2011
Field of study

Trabalho apresentado no âmbito do Mestrado em Engenharia Informática, como requisito parcial para obtenção do grau de Mestre em Engenharia InformáticaThe subject of automatic indexing of multimedia has been a target of numerous discussion and study. This interest is due to the exponential growth of multimedia content and the subsequent need to create methods that automatically catalogue this data. To fulfil this idea, several projects and areas of study have emerged. The most relevant of these are the MPEG-7 standard, which defines a standardized system for the representation and automatic extraction of information present in the content, and Music Information Retrieval (MIR), which gathers several paradigms and areas of study relating to music. The main approach to this indexing problem relies on analysing data to obtain and identify descriptors that can help define what we intend to recognize (as, for instance,musical instruments, voice, facial expressions, and so on), this then provides us with information we can use to index the data. This dissertation will focus on audio indexing in music, specifically regarding the recognition of musical instruments from recorded musical notes. Moreover, the developed system and techniques will also be tested for the recognition of ambient sounds (such as the sound of running water, cars driving by, and so on). Our approach will use non-negative matrix factorization to extract features from various types of sounds, these will then be used to train a classification algorithm that will be then capable of identifying new sounds

Repositório da Universidade Nova de Lisboa

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors

Author: Priyanka S. Jadhav
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/07/2015
Field of study

Identification of the musical instrument from a music piece is becoming area of interest for researchers in recent years. The system for identification of musical instrument from monophonic audio recording is basically performs three tasks: i) Pre-processing of inputted music signal; ii) Feature extraction from the music signal; iii) Classification. There are many methods to extract the audio features from an audio recording like Mel-frequency Cepstral Coefficients (MFCC), Linear Predictive Codes (LPC), Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Predictive Coefficients (PLP), etc. The paper presents an idea to identify musical instruments from monophonic audio recordings by extracting MFCC features and timbre related audio descriptors. Further, three classifiers K-Nearest Neighbors (K-NN), Support Vector Machine (SVM) and Binary Tree Classifier (BT) are used to identify the musical instrument name by using feature vector generated in feature extraction process. The analysis is made by studying results obtained by all possible combinations of feature extraction methods and classifiers. Percentage accuracies for each combination are calculated to find out which combinations can give better musical instrument identification results. The system gives higher percentage accuracies of 90.00%, 77.00% and 75.33% for five, ten and fifteen musical instruments respectively if MFCC is used with K-NN classifier and for Timbral ADs higher percentage accuracies of 88.00%, 84.00% and 73.33% are obtained for five, ten and fifteen musical instruments respectively if BT classifier is used. DOI: 10.17762/ijritcc2321-8169.150713

International Journal on Recent and Innovation Trends in Computing and Communication

A Novel Techniques for Classification of Musical Instruments

Author: Bormane D.S.
Dusane Meenakshi
Publication venue: The International Institute for Science, Technology and Education (IISTE)
Publication date: 31/10/2013
Field of study

Musical instrument classification provides a framework for developing and evaluating features for any type of content-based analysis of musical signals. Signal is subjected to wavelet decomposition. A suitable wavelet is selected for decomposition. In our work for decomposition we used Wavelet Packet transform. After the wavelet decomposition, some sub band signals can be analyzed, particular band can be representing the particular characteristics of musical signal. Finally these wavelet features set were formed and then musical instrument will be classified by using suitable machine learning algorithm (classifier). In this paper, the problem of classifying of musical instruments is addressed. We propose a new musical instrument classification method based on wavelet represents both local and global information by computing wavelet coefficients at different frequency sub bands with different resolutions. Using wavelet packet transform (WPT) along with advanced machine learning techniques, accuracy of music instrument classification has been significantly improved. Keywords: Musical instrument classification, WPT, Feature Extraction Techniques, Machine learning techniques

International Institute for Science, Technology and Education (IISTE): E-Journals

A Comprehensive Review on Audio based Musical Instrument Recognition: Human-Machine Interaction towards Industry 4.0

Author: Chakraborty Soubhik
Dash Sukanta Kumar
Solanki S S
Publication venue: CSIR-National Institute of Science Communication and Policy Research (NIScPR)
Publication date: 19/01/2023
Field of study

Over the last two decades, the application of machine technology has shifted from industrial to residential use. Further, advances in hardware and software sectors have led machine technology to its utmost application, the human-machine interaction, a multimodal communication. Multimodal communication refers to the integration of various modalities of information like speech, image, music, gesture, and facial expressions. Music is the non-verbal type of communication that humans often use to express their minds. Thus, Music Information Retrieval (MIR) has become a booming field of research and has gained a lot of interest from the academic community, music industry, and vast multimedia users. The problem in MIR is accessing and retrieving a specific type of music as demanded from the extensive music data. The most inherent problem in MIR is music classification. The essential MIR tasks are artist identification, genre classification, mood classification, music annotation, and instrument recognition. Among these, instrument recognition is a vital sub-task in MIR for various reasons, including retrieval of music information, sound source separation, and automatic music transcription. In recent past years, many researchers have reported different machine learning techniques for musical instrument recognition and proved some of them to be good ones. This article provides a systematic, comprehensive review of the advanced machine learning techniques used for musical instrument recognition. We have stressed on different audio feature descriptors of common choices of classifier learning used for musical instrument recognition. This review article emphasizes on the recent developments in music classification techniques and discusses a few associated future research problems

Online Publishing @ NISCAIR

Recommended from our members

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

Author: Chang Edward F
Leonard Matthew K
Makin Joseph G
Moses David A
Publication venue: eScholarship, University of California
Publication date: 01/07/2019
Field of study

Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate

eScholarship - University of California

Music Transcription with ISA and HMM

Author: D. Hand
M. Ostendorf
W. Penny
Z. Ghahramani
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An review of automatic drum transcription

Author: Dittmar Christian
Hockman Jason
Lerch Alexander
Muller Meinard
Southall Carl
Vogl Richard
Widmer Gerhard
Wu Chih-Wei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/04/2018
Field of study

In Western popular music, drums and percussion are an important means to emphasize and shape the rhythm, often deﬁning the musical style. If computers were able to analyze the drum part in recorded music, it would enable a variety of rhythm-related music processing tasks. Especially the detection and classiﬁcation of drum sound events by computational methods is considered to be an important and challenging research problem in the broader ﬁeld of Music Information Retrieval. Over the last two decades, several authors have attempted to tackle this problem under the umbrella term Automatic Drum Transcription(ADT).This paper presents a comprehensive review of ADT research, including a thorough discussion of the task-speciﬁc challenges, categorization of existing techniques, and evaluation of several state-of-the-art systems. To provide more insights on the practice of ADT systems, we focus on two families of ADT techniques, namely methods based on Nonnegative Matrix Factorization and Recurrent Neural Networks. We explain the methods’ technical details and drum-speciﬁc variations and evaluate these approaches on publicly available datasets with a consistent experimental setup. Finally, the open issues and under-explored areas in ADT research are identiﬁed and discussed, providing future directions in this ﬁel

Birmingham City University Open Access Repository

BCU Open Access

Analysis and recognition of similar environmental sounds

Author: Rodeia José Pedro dos Santos
Publication venue: FCT - UNL
Publication date: 01/01/2009
Field of study

Dissertação apresentada na Faculdade de Ciências e Tecnologia da Universidade Nova de Lisboa para obtenção do grau de Mestre em Engenharia InformáticaHumans have the ability to identify sound sources just by hearing a sound. Adapting the same problem to computers is called (automatic) sound recognition. Several sound recognizers have been developed throughout the years. The accuracy provided by these recognizers is influenced by the features they use and the classification method implemented. While there are many approaches in sound feature extraction and in sound classification, most have been used to classify sounds with very different characteristics. Here, we implemented a similar sound recognizer. This recognizer uses sounds with very similar properties making the recognition process harder. Therefore, we will use both temporal and spectral properties of the sound. These properties will be extracted using the Intrinsic Structures Analysis (ISA) method, which uses Independent Component Analysis and Principal Component Analysis. We will implement the classification method based on k-Nearest Neighbor algorithm. Here we prove that the features extracted in this way are powerful in sound recognition. We tested our recognizer with several sets of features the ISA method retrieves, and achieved great results. We, finally, did a user study to compare human performance distinguishing similar sounds against our recognizer. The study allowed us to conclude the sounds are in fact really similar and difficult to distinguish and that our recognizer has much more ability than humans to identify them

Repositório da Universidade Nova de Lisboa