10 research outputs found
Converting images to music using their colour properties
Presented at the 12th International Conference on Auditory Display (ICAD), London, UK, June 20-23, 2006.Music is associated to colors since ancient years. Different mappings between attributes of sound and images allow the efficient conversion between the two types of media. The proposed method for converting images to music using the concept of chromaticism provides the area of computer music with a parameterized environment for audio-visual presentations. The auditory display of colour images may bring the different ways that a listener perceives a musical piece (because of colour transitions) to light. A design template for chromatic synthesis is described. A short example, based on a graphical digital icon, demonstrates the preliminary results
Axmedis 2005
The AXMEDIS conference aims to promote discussions and interactions among researchers, practitioners, developers and users of tools, technology transfer experts, and project managers, to bring together a variety of participants. The conference focuses on the challenges in the cross-media domain (which include production, protection, management, representation, formats, aggregation, workflow, distribution, business and transaction models), and the integration of content management systems and distribution chains, with particular emphasis on cost reduction and effective solutions for complex cross-domain problems
Rights and services interoperability for multimedia content management
The main goal of the work presented in this thesis is to describe the definition of interoperability mechanisms between rights expression languages and policy languages. Starting from languages interoperability, the intention is to go a step further and define how services for multimedia content management can interoperate by means of service-oriented generic and standardised architectures.
In order to achieve this goal, several standards and existing initiatives will be analysed and taken into account. Regarding rights expression languages and policy languages, standards like MPEG-21 Rights Expression Language (REL), Open Digital Rights Language (ODRL) and eXtensible Access Control Markup Language (XACML) are considered. Regarding services for content management, the Multimedia Information Protection And Management System (MIPAMS), a standards-based implemented architecture, and the Multimedia Service Platform Technologies (MSPT), also known as MPEG-M standard, are considered.
The contribution of this thesis is divided into two parts, one devoted to languages interoperability and the other one devoted to services interoperability, both addressed to multimedia content management. They are briefly described next.
The first part of the contribution describes how MPEG-21 REL, ODRL and XACML can interoperate, defining the mapping mechanisms to translate expressions from language to language. The mappings provided have different levels of granularity, starting from a mapping based on a programmatic approach coming from high-level modelling diagrams done using Unified Modelling Language (UML) and Entity-Relationship (ER). The next level of mappings includes specific mappings between MPEG-21 REL and XACML and ODRL and XACML. Finally, a more general solution is proposed by using a broker. Part of this work was done in the context of the VISNET-II Network of Excellence and the AXMEDIS Integrated Project. The findings done prove the validity of the interoperability methods described.
The second part of the contribution describes how to describe standards based building blocks to provide interoperable services for multimedia content management. This definition is based on the analysis of existing content management use cases, from the ones involving less security over multimedia content managed to the ones providing full-featured digital rights management (DRM) (including access control and ciphering techniques) to support secure content management. In this section it is also presented the work done in the research projects AXMEDIS, Musiteca and Culturalive. It is also shown the standardisation work done for MPEG-M, particularly on elementary services and service aggregation. To demonstrate the usage of both technologies a mobile application integrating both MPEG-M and MIPAMS is presented.
Furthermore, some conclusions and future work is presented in the corresponding section, together with the refereed publications, which are briefly described in the document. In summary, the work presented can follow different research lines. On the one hand, further study on rights expression languages and policy languages is required as new versions of them have recently appeared. It is worth noting the standardisation of a contract expression language, MPEG-21 CEL, which has also to be further analysed in order to evaluate its interoperability with rights and policy languages. On the other hand, standard initiatives must be followed in order to complete the map of SB3's, considering MPEG standards and also other standards not only related to multimedia but also other application scenarios, like e-health or e-government
Designing instruments towards networked music practices
It is commonly noted in New Interfaces for Musical Expression (NIME) research that
few of these make it to the mainstream and are adopted by the general public. Some
research in Sound and Music Computing (SMC) suggests that the lack of humanistic
research guiding technological development may be one of the causes. Many new
technologies are invented, however without real aim else than for technical
innovation, great products however emphasize the user-friendliness, user involvement
in the design process or User-Centred Design (UCD), that seek to guarantee that
innovation address real, existing needs among users. Such an approach includes not
only traditionally quantifiable usability goals, but also qualitative, psychological,
philosophical and musical such. The latter approach has come to be called experience
design, while the former is referred to as interaction design. Although the Human
Computer Interaction (HCI) community in general has recognized the significance of
qualitative needs and experience design, NIME has been slower to adopt this new
paradigm. This thesis therefore attempts to investigate its relevance in NIME, and
specifically Computer Supported Cooperative Work (CSCW) for music applications
by devising a prototype for group music action based on needs defined from pianists
engaging in piano duets, one of the more common forms of group creation seen in the
western musical tradition. These needs, some which are socio-emotional in nature, are
addressed through our prototype although in the context of computers and global
networks by allowing for composers from all over the world to submit music to a
group concert on a Yamaha Disklavier in location in Porto, Portugal. Although this
prototype is not a new gestural controller per se, and therefore not a traditional NIME,
but rather a platform that interfaces groups of composers with a remote audience, the
aim of this research is on investigating how contextual parameters like venue, audience, joint concert and technologies impact the overall user experience of such a
system. The results of this research has been important not only in understanding the
processes, services, events or environments in which NIME’s operate, but also
understanding reciprocity, creativity, experience design in Networked Music
practices.É de conhecimento generalizado que na área de investigação em novos interfaces para
expressão musical (NIME - New Interfaces for Musical Expression), poucos dos
resultantes dispositivos acabam por ser popularizados e adoptados pelo grande
público. Algum do trabalho em computação sonora e musical (SMC- Sound and
Music Computing) sugere que uma das causas para esta dificuldade, reside
numalacuna ao nível da investigação dos comportamentos humanos como linha
orientadora para os desenvolvimentos tecnológicos. Muitos dos desenvolvimentos
tecnológicos são conduzidos sem um real objectivo, para além da inovação
tecnológica, resultando em excelentes produtos, mas sem qualquer enfâse na
usabilidade humana ou envolvimento do utilizador no processo de Design (UCDUser
Centered Design), no sentido de garantir que a inovação atende a necessidades
reais dos utilizadores finais. Esta estratégia implica, não só objectivos quantitativos
tradicionais de usabilidade, mas também princípios qualitativos, fisiológicos,
psicológicos e musicológicos. Esta ultima abordagem é atualmente reconhecida como
Design de Experiência (Experience Design) enquanto a abordagem tradicional é
vulgarmente reconhecida apenas como Design de Interação (Interaction Design).
Apesar de na área Interação Homem-Computador (HCI – Human Computer
Interaction) as necessidades qualitativas no design de experiência ser amplamente
reconhecido em termos do seu significado e aplicabilidade, a comunidade NIME tem
sido mais lenta em adoptar este novo paradigma. Neste sentido, esta Tese procura
investigar a relevância em NIME, especificamente nu subtópico do trabalho
cooperativo suportado por Computadores (CSCW – Computer Supported Cooperative
Work), para aplicações musicais, através do desenvolvimento de um protótipo de um
sistema que suporta ações musicais coletivas, baseado nas necessidades especificas de Pianistas em duetos de Piano, uma das formas mais comuns de criação musical em
grupo popularizada na tradição musical ocidental. Estes requisitos, alguns sócioemocionais
na sua natureza, são atendidos através do protótipo, neste caso aplicado ao
contexto informático e da rede de comunicações global, permitindo a compositores de
todo o mundo submeterem a sua música para um concerto de piano em grupo num
piano acústico Yamaha Disklavier, localizado fisicamente na cidade do Porto,
Portugal. Este protótipo não introduz um novo controlador em si mesmo, e
consequentemente não está alinhado com as típicas propostas de NIME. Trata-se sim,
de uma nova plataforma de interface em grupo para compositores com uma audiência
remota, enquadrado com objectivos de experimentação e investigação sobre o
impacto de diversos parâmetros, tais como o espaço performativo, as audiências,
concertos colaborativos e tecnologias em termos do sistema global. O resultado deste
processo de investigação foi relevante, não só para compreender os processos,
serviços, eventos ou ambiente em que os NIME podem operar, mas também para
melhor perceber a reciprocidade, criatividade e design de experiencia nas práticas
musicais em rede
Language-independent pre-processing of large document bases for text classification
Text classification is a well-known topic in the research of knowledge discovery in
databases. Algorithms for text classification generally involve two stages. The first
is concerned with identification of textual features (i.e. words andlor phrases) that
may be relevant to the classification process. The second is concerned with
classification rule mining and categorisation of "unseen" textual data. The first
stage is the subject of this thesis and often involves an analysis of text that is both
language-specific (and possibly domain-specific), and that may also be
computationally costly especially when dealing with large datasets. Existing
approaches to this stage are not, therefore, generally applicable to all languages. In
this thesis, we examine a number of alternative keyword selection methods and
phrase generation strategies, coupled with two potential significant word list
construction mechanisms and two final significant word selection mechanisms, to
identify such words andlor phrases in a given textual dataset that are expected to
serve to distinguish between classes, by simple, language-independent statistical
properties. We present experimental results, using common (large) textual datasets
presented in two distinct languages, to show that the proposed approaches can
produce good performance with respect to both classification accuracy and
processing efficiency. In other words, the study presented in this thesis
demonstrates the possibility of efficiently solving the traditional text classification
problem in a language-independent (also domain-independent) manner
Music Encoding Conference Proceedings 2021, 19–22 July, 2021 University of Alicante (Spain): Onsite & Online
Este documento incluye los artículos y pósters presentados en el Music Encoding Conference 2021 realizado en Alicante entre el 19 y el 22 de julio de 2022.Funded by project Multiscore, MCIN/AEI/10.13039/50110001103
Third International Conference on Technologies for Music Notation and Representation TENOR 2017
The third International Conference on Technologies for Music Notation and Representation seeks to focus on a set of specific research issues associated with Music Notation that were elaborated at the first two editions of TENOR in Paris and Cambridge. The theme of the conference is vocal music, whereas the pre-conference workshops focus on innovative technological approaches to music notation
Music Encoding Conference Proceedings 2021. 19–22 July, 2021 University of Alicante (Spain): Onsite & Online. Edited by Stefan Münnich and David Rizo
Conference proceedings of the Music Encoding Conference 2021 with Foreword by Stefan Münnich and David Rizo
On Improving Generalization of CNN-Based Image Classification with Delineation Maps Using the CORF Push-Pull Inhibition Operator
Deployed image classification pipelines are typically dependent on the images captured in real-world environments. This means that images might be affected by different sources of perturbations (e.g. sensor noise in low-light environments). The main challenge arises by the fact that image quality directly impacts the reliability and consistency of classification tasks. This challenge has, hence, attracted wide interest within the computer vision communities. We propose a transformation step that attempts to enhance the generalization ability of CNN models in the presence of unseen noise in the test set. Concretely, the delineation maps of given images are determined using the CORF push-pull inhibition operator. Such an operation transforms an input image into a space that is more robust to noise before being processed by a CNN. We evaluated our approach on the Fashion MNIST data set with an AlexNet model. It turned out that the proposed CORF-augmented pipeline achieved comparable results on noise-free images to those of a conventional AlexNet classification model without CORF delineation maps, but it consistently achieved significantly superior performance on test images perturbed with different levels of Gaussian and uniform noise