Search CORE

1,833 research outputs found

Analisis dan Implementasi Short Time Fourier Transform (STFT) dan Approximate String Matching pada Query-by-Humming

Author: Pandapotan Christian
Publication venue: Universitas Telkom
Publication date: 01/01/2010
Field of study

ABSTRAKSI: Music Information Retrieval (MIR) merupakan sebuah bidang ilmu untuk mengambil dan mengolah informasi dari file musik, dapat berupa metadata, ataupun konten, seperti nada, melodi dan tempo. Query by Humming merupakan bidang turunan dari MIR yang mencari suatu lagu di dalam database berdasarkan konten dari musik yang berupa melodi.Query by Humming terdiri dari dua tahap utama yaitu mengubah hasil humming dan file musik pada database menjadi representasi yang sesuai (menggunakan teknik Short Time Fourier Transform (STFT)) serta membandingkan hasil humming dan file musik pada database (menggunakan teknik Approximate String Matching (ASM)). STFT mengubah hasil humming dari domain waktu-amplitudo ke dalam domain waktu-frekuensi untuk memperoleh melodi yang dinyanyikan. Sedangkan ASM membandingkan dua buah string dan menghitung banyak operasi dasar yang harus dilakukan untuk mengubah string pertama menjadi string kedua. Semakin kecil hasil dari metode ini, maka semakin mirip kedua string tersebut.Pada tugas akhir ini dilakukan penelitian untuk mengetahui ukuran jendela yang sesuai pada STFT dan pembobotan operasi dasar dari ASM terhadap Query by Humming. Akurasi ditentukan oleh keberhasilan sistem untuk menebak lagu yang dinyanyikan. Untuk menghasilkan banyak kemungkinan akurasi, jendela dirancang dalam beberapa lebar segmentasi yang berbeda dan pembobotan operasi dasar juga akan dilakukan untuk beberapa nilai yang berbeda. Dari rancangan tersebut, hasil akurasi sistem mencapai 34.62%-61.54% untuk koleksi data penyanyi berpengalaman dan 9.38%-28.13% untuk koleksi data penyanyi dengan latar belakang paduan suara saja.Kata Kunci : music information retrieval (MIR), query by humming, short time fourier transform (STFT), approximate string matching (ASM), jendela, operasi dasarABSTRACT: Music Information Retrieval (MIR) is a field to retrieve and process information from music files, can be metadata, such as title and singer, or content, such as tone, melody and tempo. Query by Humming is derived from the MIR field which is looking for a song in the database based on the content of the music as melody. Query by Humming consists of two main stages, they are change the results of humming and music files in the database into the appropriate representation (using the technique of Short Time Fourier Transform (STFT)) and compare the results of humming and music files in the database (using the technique of Approximate String Matching (ASM) ). STFT change the humming of the time-amplitude domain into time-frequency domain to obtain a sung melody. While ASM compares two strings and calculates how many basic operations that must be done to change the first string to second string. The smaller the result of this method, the more similar the two strings.This final project will determine the proper window length in STFT and weighting the basic operation of the ASM to Query by Humming. Accuracy is determined by the success of the system to guess the song is sung. To generate many possible accuracy, the window is designed in several different segmentations wide and weighted base operations will also be conducted for several different values. From the draft, the accuracy of the system reaches 34.62%-61.54% for data set collected from experience singers and 9.38%-28.13% for data set collecter from singers experienced in choir only.Keyword: music information retrieval (MIR), query by humming, short time fourier transform (STFT), approximate string matching (ASM), window, basic operatio

Open Library

Multimodal music information processing and retrieval: survey and future challenges

Author: Avanzini Federico
Ntalampiras Stavros
Simonetta Federico
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/02/2019
Field of study

Towards improving the performance in various music information processing tasks, recent studies exploit different modalities able to capture diverse aspects of music. Such modalities include audio recordings, symbolic music scores, mid-level representations, motion, and gestural data, video recordings, editorial or cultural tags, lyrics and album cover arts. This paper critically reviews the various approaches adopted in Music Information Processing and Retrieval and highlights how multimodal algorithms can help Music Computing applications. First, we categorize the related literature based on the application they address. Subsequently, we analyze existing information fusion approaches, and we conclude with the set of challenges that Music Information Retrieval and Sound and Music Computing research communities should focus in the next years

arXiv.org e-Print Archive

Crossref

Towards an All-Purpose Content-Based Multimedia Information Retrieval System

Author: Gasser Ralph
Rossetto Luca
Schuldt Heiko
Publication venue
Publication date: 01/01/2019
Field of study

The growth of multimedia collections - in terms of size, heterogeneity, and variety of media types - necessitates systems that are able to conjointly deal with several forms of media, especially when it comes to searching for particular objects. However, existing retrieval systems are organized in silos and treat different media types separately. As a consequence, retrieval across media types is either not supported at all or subject to major limitations. In this paper, we present vitrivr, a content-based multimedia information retrieval stack. As opposed to the keyword search approach implemented by most media management systems, vitrivr makes direct use of the object's content to facilitate different types of similarity search, such as Query-by-Example or Query-by-Sketch, for and, most importantly, across different media types - namely, images, audio, videos, and 3D models. Furthermore, we introduce a new web-based user interface that enables easy-to-use, multimodal retrieval from and browsing in mixed media collections. The effectiveness of vitrivr is shown on the basis of a user study that involves different query and media types. To the best of our knowledge, the full vitrivr stack is unique in that it is the first multimedia retrieval system that seamlessly integrates support for four different types of media. As such, it paves the way towards an all-purpose, content-based multimedia information retrieval system

arXiv.org e-Print Archive

edoc

Musical audio-mining

Author: Leman Marc
Publication venue: STT Netherlands Study Centre for Technology Trends
Publication date: 01/01/2002
Field of study

Ghent University Academic Bibliography

Progressive Filtering Using Multiresolution Histograms for Query by Humming System

Author: AK Tripathy
G-P Bonneau
L Fu
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/01/2014
Field of study

The rising availability of digital music stipulates effective categorization and retrieval methods. Real world scenarios are characterized by mammoth music collections through pertinent and non-pertinent songs with reference to the user input. The primary goal of the research work is to counter balance the perilous impact of non-relevant songs through Progressive Filtering (PF) for Query by Humming (QBH) system. PF is a technique of problem solving through reduced space. This paper presents the concept of PF and its efficient design based on Multi-Resolution Histograms (MRH) to accomplish searching in manifolds. Initially the entire music database is searched to obtain high recall rate and narrowed search space. Later steps accomplish slow search in the reduced periphery and achieve additional accuracy. Experimentation on large music database using recursive programming substantiates the potential of the method. The outcome of proposed strategy glimpses that MRH effectively locate the patterns. Distances of MRH at lower level are the lower bounds of the distances at higher level, which guarantees evasion of false dismissals during PF. In due course, proposed method helps to strike a balance between efficiency and effectiveness. The system is scalable for large music retrieval systems and also data driven for performance optimization as an added advantage.Comment: 12 Pages, 6 Figures, Full version of the paper published at ICMCCA-2012 with the same title, Link:http://link.springer.com/chapter/10.1007/978-81-322-1143-3_2

arXiv.org e-Print Archive

Crossref

An Effective Music Information Retrieval Method Using Three-Dimensional Continuous DP

Author: Heo Sung-Phil
Ito Akinori
Makino Shozo
Suzuki Motoyuki
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/03/2010
Field of study

Tohoku University Repository (TOUR) / 東北大学機関リポジトリ

Institutional Repositories DataBase (IRDB)