552 research outputs found
Multimodal music information processing and retrieval: survey and future challenges
Towards improving the performance in various music information processing
tasks, recent studies exploit different modalities able to capture diverse
aspects of music. Such modalities include audio recordings, symbolic music
scores, mid-level representations, motion, and gestural data, video recordings,
editorial or cultural tags, lyrics and album cover arts. This paper critically
reviews the various approaches adopted in Music Information Processing and
Retrieval and highlights how multimodal algorithms can help Music Computing
applications. First, we categorize the related literature based on the
application they address. Subsequently, we analyze existing information fusion
approaches, and we conclude with the set of challenges that Music Information
Retrieval and Sound and Music Computing research communities should focus in
the next years
Automatic musical instrument recognition for multimedia indexing
Trabalho apresentado no âmbito do Mestrado em
Engenharia Informática, como requisito parcial
para obtenção do grau de Mestre em Engenharia
InformáticaThe subject of automatic indexing of multimedia has been a target of numerous discussion and study. This interest is due to the exponential growth of multimedia content and the subsequent need to create methods that automatically catalogue this data. To fulfil this idea, several projects and areas of study have emerged. The most relevant of these are the MPEG-7 standard, which defines a standardized system for the representation and automatic extraction of information present in the content, and Music
Information Retrieval (MIR), which gathers several paradigms and areas of study relating
to music.
The main approach to this indexing problem relies on analysing data to obtain and
identify descriptors that can help define what we intend to recognize (as, for instance,musical instruments, voice, facial expressions, and so on), this then provides us with information we can use to index the data.
This dissertation will focus on audio indexing in music, specifically regarding the
recognition of musical instruments from recorded musical notes. Moreover, the developed system and techniques will also be tested for the recognition of ambient sounds
(such as the sound of running water, cars driving by, and so on).
Our approach will use non-negative matrix factorization to extract features from
various types of sounds, these will then be used to train a classification algorithm that
will be then capable of identifying new sounds
Proceedings of the 7th Sound and Music Computing Conference
Proceedings of the SMC2010 - 7th Sound and Music Computing Conference, July 21st - July 24th 2010
Lung Sounds Classification Based on Time Domain Features
Signal complexity in lung sounds is assumed to be able to differentiate and classify characteristic lung sound between normal and abnormal in most cases. Previous research has employed a variety of modification approaches to obtain lung sound features. In contrast to earlier research, time-domain features were used to extract features in lung sound classification. Electromyogram (EMG) signal analysis frequently employs this time-domain characteristic. Time-domain features are MAV, SSI, Var, RMS, LOG, WL, AAC, DASDV, and AFB. The benefit of this method is that it allows for direct feature extraction without the requirement for transformation. Several classifiers were used to examine five different types of lung sound data. The highest accuracy was 93.9 percent, obtained Using the decision tree with 9 types of time-domain features. The proposed method could extract features from lung sounds as an alternative
Clustering by compression
We present a new method for clustering based on compression. The method
doesn't use subject-specific features or background knowledge, and works as
follows: First, we determine a universal similarity distance, the normalized
compression distance or NCD, computed from the lengths of compressed data files
(singly and in pairwise concatenation). Second, we apply a hierarchical
clustering method. The NCD is universal in that it is not restricted to a
specific application area, and works across application area boundaries. A
theoretical precursor, the normalized information distance, co-developed by one
of the authors, is provably optimal but uses the non-computable notion of
Kolmogorov complexity. We propose precise notions of similarity metric, normal
compressor, and show that the NCD based on a normal compressor is a similarity
metric that approximates universality. To extract a hierarchy of clusters from
the distance matrix, we determine a dendrogram (binary tree) by a new quartet
method and a fast heuristic to implement it. The method is implemented and
available as public software, and is robust under choice of different
compressors. To substantiate our claims of universality and robustness, we
report evidence of successful application in areas as diverse as genomics,
virology, languages, literature, music, handwritten digits, astronomy, and
combinations of objects from completely different domains, using statistical,
dictionary, and block sorting compressors. In genomics we presented new
evidence for major questions in Mammalian evolution, based on
whole-mitochondrial genomic analysis: the Eutherian orders and the Marsupionta
hypothesis against the Theria hypothesis.Comment: LaTeX, 27 pages, 20 figure
A full order sliding mode tracking controller design for an electrohydraulic control system
Electrohydraulic control system are widely use in industry due to continuous
operation, higher speed of response with fast motion etc. However, there is a
drawback that it is difficult to control because of the highly nonlinear and
parameters uncertainties. In this project, a Full Order Sliding Mode Controller is
design to control the system. First, the mathematical model of the electrohydraulic
servo control system is developed. Then the mathematic model will be transformed
into state space representation for the purposed of designing the controller. The
system will be treated as an uncertain system with bounded uncertainties where the
bounded are assumed known. The proposed controller will be designed based on
deterministic approach, such that the overall system is practically stable and tracks
the desired trajectory in spite the uncertainties and nonlinearities present in the
system. The performance and reliability of the proposal controller will be determined
by performing extensive simulation using MATLAB/SIMULINK. Lastly, the
performance of the controller is to be compared with Independent Joint Linear
Control and advanced deterministic controller
Polyphonic music information retrieval based on multi-label cascade classification system
Recognition and separation of sounds played by various instruments is very useful in labeling audio files with semantic information. This is a non-trivial task requiring sound analysis, but the results can aid automatic indexing and browsing music data when searching for melodies played by user specified instruments. Melody match based on pitch detection technology has drawn much attention and a lot of MIR systems have been developed to fulfill this task. However, musical instrument recognition remains an unsolved problem in the domain. Numerous approaches on acoustic feature extraction have already been proposed for timbre recognition. Unfortunately, none of those monophonic timbre estimation algorithms can be successfully applied to polyphonic sounds, which are the more usual cases in the real music world. This has stimulated the research on multi-labeled instrument classification and new features development for content-based automatic music information retrieval. The original audio signals are the large volume of unstructured sequential values, which are not suitable for traditional data mining algorithms; while the acoustical features are sometime not sufficient for instrument recognition in polyphonic sounds because they are higher-level representatives of raw signal lacking details of original information. In order to capture the patterns which evolve on the time scale, new temporal features are introduced to supply more temporal information for the timbre recognition. We will introduce the multi-labeled classification system to estimate multiple timbre information from the polyphonic sound by classification based on acoustic features and short-term power spectrum matching. In order to achieve higher estimation rate, we introduced the hierarchically structured cascade classification system under the inspiration of the human perceptual process. This cascade classification system makes a first estimate on the higher level decision attribute, which stands for the musical instrument family. Then, the further estimation is done within that specific family range. Experiments showed better performance of a hierarchical system than the traditional flat classification method which directly estimates the instrument without higher level of family information analysis.
Traditional hierarchical structures were constructed in human semantics, which are meaningful from human perspective but not appropriate for the cascade system. We introduce the new hierarchical instrument schema according to the clustering results of the acoustic features. This new schema better describes the similarity among different instruments or among different playing techniques of the same instrument. The classification results show the higher accuracy of cascade system with the new schema compared to the traditional schemas. The query answering system is built based on the cascade classifier
- …