Search CORE

7,977 research outputs found

Understanding Optical Music Recognition

Author: Calvo-Zaragoza Jorge
Hajič Jr. Jan
Pacha Alexander
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/07/2020
Field of study

For over 50 years, researchers have been trying to teach computers to read music notation, referred to as Optical Music Recognition (OMR). However, this field is still difficult to access for new researchers, especially those without a significant musical background: Few introductory materials are available, and, furthermore, the field has struggled with defining itself and building a shared terminology. In this work, we address these shortcomings by (1) providing a robust definition of OMR and its relationship to related fields, (2) analyzing how OMR inverts the music encoding process to recover the musical notation and the musical semantics from documents, and (3) proposing a taxonomy of OMR, with most notably a novel taxonomy of applications. Additionally, we discuss how deep learning affects modern OMR research, as opposed to the traditional pipeline. Based on this work, the reader should be able to attain a basic understanding of OMR: its objectives, its inherent structure, its relationship to other fields, the state of the art, and the research opportunities it affords

Repositorio Institucional de la Universidad de Alicante

arXiv.org e-Print Archive

Document recognition of printed scores and transformation into MIDI

Author: Baumann Stephan
Publication venue: Sonstige Einrichtungen. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1993
Field of study

The processing of printed music pieces on paper images is an interesting application to analyze printed information by a computer. The music notation presented on paper should be recognized and reproduced. Numerous methods of image processing and knowledge-based procedures are necessary. The DOREMIDI System allows the processing of simple piano music pieces for two hands characterized by the following steps: - Scanning paper images - Processing of binary image data into basic components - Knowledge-based analysis and symbolic representation of a musical score - Visual and acoustic reproduction of the results. DOREMIDI has been realized on a Macintosh II, using Common-Lisp (Clos) programming language. The user interface is equivalent to the common Macintosh-interface, which enables in an uncomplicated way to use windows and menus. A keyboard presents the results of the acoustical reproduction

Universaar

Acronym

Deep Neural Networks for Document Processing of Music Score Images

Author: Calvo-Zaragoza Jorge
Castellanos F.J.
Fujinaga I.
Vigliensoni G.
Publication venue: 'MDPI AG'
Publication date: 01/01/2018
Field of study

[EN] There is an increasing interest in the automatic digitization of medieval music documents. Despite efforts in this field, the detection of the different layers of information on these documents still poses difficulties. The use of Deep Neural Networks techniques has reported outstanding results in many areas related to computer vision. Consequently, in this paper, we study the so-called Convolutional Neural Networks (CNN) for performing the automatic document processing of music score images. This process is focused on layering the image into its constituent parts (namely, background, staff lines, music notes, and text) by training a classifier with examples of these parts. A comprehensive experimentation in terms of the configuration of the networks was carried out, which illustrates interesting results as regards to both the efficiency and effectiveness of these models. In addition, a cross-manuscript adaptation experiment was presented in which the networks are evaluated on a different manuscript from the one they were trained. The results suggest that the CNN is capable of adapting its knowledge, and so starting from a pre-trained CNN reduces (or eliminates) the need for new labeled data.This work was supported by the Social Sciences and Humanities Research Council of Canada, and Universidad de Alicante through grant GRE-16-04.Calvo-Zaragoza, J.; Castellanos, F.; Vigliensoni, G.; Fujinaga, I. (2018). Deep Neural Networks for Document Processing of Music Score Images. Applied Sciences. 8(5). https://doi.org/10.3390/app8050654S85Bainbridge, D., & Bell, T. (2001). Computers and the Humanities, 35(2), 95-121. doi:10.1023/a:1002485918032Byrd, D., & Simonsen, J. G. (2015). Towards a Standard Testbed for Optical Music Recognition: Definitions, Metrics, and Page Images. Journal of New Music Research, 44(3), 169-195. doi:10.1080/09298215.2015.1045424LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436-444. doi:10.1038/nature14539Rebelo, A., Fujinaga, I., Paszkiewicz, F., Marcal, A. R. S., Guedes, C., & Cardoso, J. S. (2012). Optical music recognition: state-of-the-art and open issues. International Journal of Multimedia Information Retrieval, 1(3), 173-190. doi:10.1007/s13735-012-0004-6Louloudis, G., Gatos, B., Pratikakis, I., & Halatsis, C. (2008). Text line detection in handwritten documents. Pattern Recognition, 41(12), 3758-3772. doi:10.1016/j.patcog.2008.05.011Montagner, I. S., Hirata, N. S. T., & Hirata, R. (2017). Staff removal using image operator learning. Pattern Recognition, 63, 310-320. doi:10.1016/j.patcog.2016.10.002Calvo-Zaragoza, J., Micó, L., & Oncina, J. (2016). Music staff removal with supervised pixel classification. International Journal on Document Analysis and Recognition (IJDAR), 19(3), 211-219. doi:10.1007/s10032-016-0266-2Calvo-Zaragoza, J., Pertusa, A., & Oncina, J. (2017). Staff-line detection and removal using a convolutional neural network. Machine Vision and Applications, 28(5-6), 665-674. doi:10.1007/s00138-017-0844-4Shelhamer, E., Long, J., & Darrell, T. (2017). Fully Convolutional Networks for Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(4), 640-651. doi:10.1109/tpami.2016.2572683Kato, Z. (2011). Markov Random Fields in Image Segmentation. Foundations and Trends® in Signal Processing, 5(1-2), 1-155. doi:10.1561/2000000035Lecun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278-2324. doi:10.1109/5.72679

Repositorio Institucional de la Universidad de Alicante

Multidisciplinary Digital Publishing Institute

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

RiuNet

Multimodal music information processing and retrieval: survey and future challenges

Author: Avanzini Federico
Ntalampiras Stavros
Simonetta Federico
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 14/02/2019
Field of study

Towards improving the performance in various music information processing tasks, recent studies exploit different modalities able to capture diverse aspects of music. Such modalities include audio recordings, symbolic music scores, mid-level representations, motion, and gestural data, video recordings, editorial or cultural tags, lyrics and album cover arts. This paper critically reviews the various approaches adopted in Music Information Processing and Retrieval and highlights how multimodal algorithms can help Music Computing applications. First, we categorize the related literature based on the application they address. Subsequently, we analyze existing information fusion approaches, and we conclude with the set of challenges that Music Information Retrieval and Sound and Music Computing research communities should focus in the next years

arXiv.org e-Print Archive

Crossref

Staff-line removal with selectional auto-encoders

Author: Calvo-Zaragoza Jorge
Gallego Antonio-Javier
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Staff-line removal is an important preprocessing stage as regards most Optical Music Recognition systems. The common procedures employed to carry out this task involve image processing techniques. In contrast to these traditional methods, which are based on hand-engineered transformations, the problem can also be approached from a machine learning point of view if representative examples of the task are provided. We propose doing this through the use of a new approach involving auto-encoders, which select the appropriate features of an input feature set (Selectional Auto-Encoders). Within the context of the problem at hand, the model is trained to select those pixels of a given image that belong to a musical symbol, thus removing the lines of the staves. Our results show that the proposed technique is quite competitive and significantly outperforms the other state-of-art strategies considered, particularly when dealing with grayscale input images.This work was partially supported by the Spanish Ministerio de Educación, Cultura y Deporte through a FPU fellowship (AP2012- 0939) and the Spanish Ministerio de Economía y Competitividad through Project TIMuL (No. TIN2013-48152-C2-1-R, supported by UE FEDER funds)

Repositorio Institucional de la Universidad de Alicante

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Robust optical recognition of handwritten musical scores based on domain knowledge

Author: Ana Maria Silva Rebelo
Publication venue
Publication date: 30/10/2012
Field of study

Repositório Aberto da Universidade do Porto

An Expert System for Guitar Sheet Music to Guitar Tablature

Author: He Chuanjun
Publication venue: TRACE: Tennessee Research and Creative Exchange
Publication date: 01/05/2013
Field of study

This project applies analysis, design and implementation of the Optical Music Recognition (OMR) to an expert system for transforming guitar sheet music to guitar tablature. The first part includes image processing and music semantic interpretation to interpret and transform sheet music or printed scores into editable and playable electronic form. Then after importing the electronic form of music into internal data structures, our application uses effective pruning to explore the entire search space to find the best guitar tablature. Also considered are alternate guitar tunings and transposition of the music to improve the resulting tablature

University of Tennessee, Knoxville: Trace

A survey of computer uses in music

Author: Russell Maria N
Publication venue: Digital Scholarship@UNLV
Publication date: 01/01/1997
Field of study

This thesis covers research into the mathematical basis inherent in music including review of projects related to optical character recognition (OCR) of musical symbols. Research was done about fractals creating new pieces by assigning pitches to numbers. Existing musical pieces can be taken apart and reassembled creating new ideas for composers. Musical notation understanding is covered and its requirement for the recognition of a music sheet by the computer for editing and reproduction purposes is explained. The first phase of a musical OCR was created in this thesis with the recognition of staff lines on a good quality image. Modifications will need to be made to take care of noise and tilted images that may result from scanning

University of Nevada, Las Vegas Repository

SPARC 2017 retrospect & prospects : Salford postgraduate annual research conference book of abstracts

Author: Abba MK
Abdalla K
Abubaker M
Abuzukhar M
Adeleke AH
Adwick S
Akangbou HN
Akarem A
Al Ruwaili M
Al Somali S
Al-Aaloosi AB
Al-ani A
Al-doori Q
Al-Falahy NFA
Al-ibadi INA
Al-Jazaairry A
Al-Mukaram NAR
Al-Murshedi S
Al-Shammari S
Al-Turki RA
Al-Wasity A
Alawfi B
Albelazi MS
Aletaiby A
Alhamaidah A
Ali A
Alkarawi KA
Allarakia B
Almania A
Almojahed A
Almuktar S
Alogla KD
Alshaeri A
Alshammari A
Ayadurai C
Bakare W
Barrow E
Bozzi N
Broadley R
Butler DE
Chadwell AEA
Chang C-C
Chuku KI
Corry E
Costamagna E
Dang B
Danjuma AD
Davis D
Demonte P
Dinu LI
Doroh RM
Enya O
Eshenshani A
Eshlak MS
Fagbola LO
Fajemidagba A
Fakhrudeen A
Farren W
Fawkes R
Frigenti PP
Gabr M
Gill N
Gou J
Hadgraft N
Hakeem M
Hall J
Hammood H
Hart AM
Harvey VM
Heale R
Higgins RN
Hornby S
Hudson A
Hussein A
Iliyasu U
Iredale S
Joel AA
Jones C
Jones LA
Jones M
Kevill J
Khalil A
Khan A
Khan IM
Langford R
Lawal O
Mansour F
Martin P
Mohammed R
Mollasalehi S
Monk A
Mtonga VJ
Muhammad M
Nimoh C
O'Regan TJ
Obeng JY
Odufuwa K
Ogunoye OF
Ohaeresaba PIG
Oko Uka JC
Okudolor AG
Oni O
Oseki PE
Owoeye A
Oyewale A
Pace A
Panwar P
Perrin LD
Qaisi A
Qasim A
Rajab M
Robertson S
Rufasha E
Scaletta KL
Stan DM
Stein M
Stevens P
Stoddart K
Takhtravanchi M
Tang J
Tate D
Temile SO
Tomar N
Tyler N
Uwagie-Ero O
Ward L
Willcock M
Xu D
Yaseen DA
Zaeim A
Publication venue: University of Salford
Publication date: 01/06/2017
Field of study

Welcome to the Book of Abstracts for the 2017 SPARC conference. This year we not only celebrate the work of our PGRs but also the 50th anniversary of Salford as a University, which makes this year’s conference extra special. Once again we have received a tremendous contribution from our postgraduate research community; with over 130 presenters, the conference truly showcases a vibrant PGR community at Salford. These abstracts provide a taster of the research strengths of their works, and provide delegates with a reference point for networking and initiating critical debate. With such wide-ranging topics being showcased, we encourage you to exploit this great opportunity to engage with researchers working in different subject areas to your own. To meet global challenges, high impact research inevitably requires interdisciplinary collaboration. This is recognised by all major research funders. Therefore engaging with the work of others and forging collaborations across subject areas is an essential skill for the next generation of researchers

University of Salford Institutional Repository