25 research outputs found

    Immersivity in Music Performance with Original Compositions

    The aim of this study is to critically investigate immersivity in music performance. It will evaluate how the combination of performance space, musical material and delivery methods can produce unique and valuable sonic experiences for an audience. The investigation seeks to identify which compositional and spatial characteristics define this performance paradigm, with the goal of providing a taxonomy of key characteristics that creators must consider when devising an immersive music performance.

    Axmedis 2005

    The AXMEDIS conference aims to promote discussion and interaction among researchers, practitioners, developers and users of tools, technology transfer experts, and project managers. The conference focuses on the challenges in the cross-media domain (including production, protection, management, representation, formats, aggregation, workflow, distribution, and business and transaction models) and on the integration of content management systems and distribution chains, with particular emphasis on cost reduction and effective solutions for complex cross-domain problems.

    L’individualità del parlante nelle scienze fonetiche: applicazioni tecnologiche e forensi [Speaker individuality in the phonetic sciences: technological and forensic applications]


    The Effect Of Acoustic Variability On Automatic Speaker Recognition Systems

    This thesis examines the influence of acoustic variability on automatic speaker recognition systems (ASRs) with three aims: (i) to measure ASR performance under five commonly encountered acoustic conditions; (ii) to contribute to ASR system development by providing new research data; and (iii) to assess ASR suitability for forensic speaker comparison (FSC) and for investigative/pre-forensic use. The thesis begins with a literature review and an explanation of relevant technical terms. Five categories of experiments then examine ASR performance under conditions that affect speech quantity (inhibitors) and speech quality (contaminants), acknowledging that quality often influences quantity. The experiments cover net speech duration, signal-to-noise ratio (SNR), reverberation, frequency bandwidth and transcoding (codecs). The ASR system itself is scrutinised through examination of its settings and optimum conditions (e.g. matched/unmatched test audio and speaker models). Output is compared with baseline performance, and the resulting metrics help to inform whether ASRs should be applied to suboptimal audio recordings. Results indicate that modern ASRs are relatively resilient to low and moderate levels of the acoustic contaminants and inhibitors examined, whilst remaining sensitive to higher levels. The thesis discusses issues such as the complexity and fragility of the speech signal path, speaker variability, the difficulty of measuring conditions, and mitigation (thresholds and settings). The application of ASRs to casework is discussed with recommendations, acknowledging the different modes of operation (e.g. investigative usage) and current UK limitations on presenting ASR output as evidence in criminal trials.
In summary, and in the context of acoustic variability, the thesis recommends that ASRs could be applied to pre-forensic cases, accepting that extraneous issues remain which require governance, such as validation of method (ASR standardisation) and population data selection. However, ASRs remain unsuitable for broad forensic application, as many acoustic conditions cause irrecoverable speech data loss that contributes to high error rates.
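
For readers unfamiliar with the SNR condition named in this abstract, the minimal Python sketch below shows the standard decibel definition, SNR = 10 log10(P_signal / P_noise), and one way a clean signal could be degraded to a chosen SNR. It is an illustration only: the function names, the synthetic tone and the Gaussian noise are assumptions, and the abstract does not describe how the thesis prepared its test audio.

```python
import math
import random


def power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def snr_db(signal, noise):
    """Signal-to-noise ratio in decibels."""
    return 10.0 * math.log10(power(signal) / power(noise))


def mix_at_snr(signal, noise, target_db):
    """Scale the noise so the SNR equals target_db, then add it to the signal.

    Returns the degraded signal and the scaled noise that was added.
    """
    gain = math.sqrt(power(signal) / (power(noise) * 10 ** (target_db / 10.0)))
    scaled = [gain * n for n in noise]
    return [s + n for s, n in zip(signal, scaled)], scaled


# Hypothetical stand-ins for real recordings: a 200 Hz tone and Gaussian noise.
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 200 * n / 8000) for n in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]

degraded, scaled_noise = mix_at_snr(speech, noise, target_db=10.0)
print(round(snr_db(speech, scaled_noise), 6))
```

Lower target values produce noisier audio, which is the kind of graded condition the abstract refers to as low, moderate and higher levels of a contaminant.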

    Model konsep reka bentuk antaramuka koswer berbantu mahasiswa Tunakerna [Conceptual design model of assistive courseware interface for hearing-impaired undergraduates]

    Global education today needs educational media, especially media designed for people with disabilities, because effective media can improve the quality of education for this group. However, previous studies have shown that learning media for this group are very limited and focus mainly on conventional learning, and that the lack of learning technology support has made the learning process difficult. The main purpose of this study is to propose a conceptual design model of an assistive courseware interface for hearing-impaired undergraduates. Students with hearing problems are referred to as "tunakerna" according to the Fourth Edition of Kamus Dewan Bahasa dan Pustaka (2008). Scientific research design was chosen as the methodological framework for this study. An expert validation approach was applied to the Conceptual Design Model of Assistive Courseware Interface for Hearing Impaired Undergraduates (KOSMAT). The model comprises seven components: a generic component structure, multimedia design elements, multiple intelligences (visual, interpersonal and intrapersonal), an instructional design model, learning theory, object-oriented learning styles, and the development process. A prototype courseware was developed by applying the KOSMAT model in order to test its usability among the target users, hearing-impaired undergraduates. Usability testing showed satisfactory performance on three dimensions: ease of use, ease of learning and content. The KOSMAT model was improved on the basis of the expert validation, carried out through consultation with experts. Hence, it should serve as a guideline or reference for developing learning courseware for hearing-impaired undergraduates, especially in special education.

    MediaSync: Handbook on Multimedia Synchronization

    This book provides an approachable overview of the most recent advances in the fascinating field of media synchronization (mediasync), gathering contributions from the most representative and influential experts. Understanding the challenges of this field in the current multi-sensory, multi-device, and multi-protocol world is not an easy task. The book revisits the foundations of mediasync, including theoretical frameworks and models, highlights ongoing research efforts, like hybrid broadband broadcast (HBB) delivery and users' perception modeling (i.e., Quality of Experience or QoE), and paves the way for the future (e.g., towards the deployment of multi-sensory and ultra-realistic experiences). Although many advances around mediasync have been devised and deployed, this area of research is getting renewed attention to overcome remaining challenges in the next-generation (heterogeneous and ubiquitous) media ecosystem. Given the significant advances in this research area, its current relevance and the multiple disciplines it involves, a reference book on mediasync has become necessary, and this book fills that gap. In particular, it addresses key aspects and reviews the most relevant contributions within the mediasync research space from different perspectives. MediaSync: Handbook on Multimedia Synchronization is the perfect companion for scholars and practitioners who want to acquire strong knowledge of this research area and to approach the challenges behind ensuring the best mediated experiences by providing adequate synchronization between the media elements that constitute these experiences.

    Design editorial na tradução de português para libras [Editorial design in translation from Portuguese into Libras]

    Thesis (doctorate) - Universidade Federal de Santa Catarina, Centro de Comunicação e Expressão, Programa de Pós-Graduação em Estudos da Tradução, Florianópolis, 2017. Abstract: Translating written texts into sign language videos changes the composition, the medium and the manner of reading. Design is an important component of translation between texts of different modalities, as when a written text in an oral/auditory language is translated into a video text in a visual/spatial language. Although the literature on typography, editorial design and the reading of written texts is extensive, few studies address reading in sign language. To identify graphic resources and computer tools that may contribute to fluid and pleasant reading, we carried out applied research on the translation from Portuguese into Libras of didactic texts used in courses at the IFSC Palhoça Bilíngue campus. We adopted a functionalist approach to translation, based on Nord (2005), which allows the design work to be developed in an integrated manner with the translation project, and we sought theoretical grounding to understand what contributes to the reader's performance and satisfaction when reading texts in both Portuguese and Libras. By analyzing the elements of editorial design in written texts and the functions they perform in reading, we sought to establish relationships with the resources of video interfaces for sign language texts, identifying which elements perform these functions in the translated text. A diagnostic usability evaluation was undertaken to better understand how deaf students (and hearing students who use sign language) read in Libras, and to identify which graphic resources and computer tools help or, on the contrary, hinder reading.
Following the guidance of ABNT NBR ISO 9241, Ergonomic requirements for office work with visual display terminals, Part 11: Guidance on usability (2011), we selected the usability requirements to be assessed, in line with the readers' main goals, and defined the criteria for evaluating reader effectiveness, efficiency and satisfaction. To analyze reading in Libras, three texts were translated and prototypes were produced in different design versions. The reading tests used common video players and a player designed specifically for reading in sign language (vbook). Two questionnaires and a specific task were also devised to observe text comprehension, how students look for information in the text, and their opinions on the interface resources. Students' actions during reading and task completion were recorded with Camtasia and analyzed with Elan. The research showed the importance of planning specific technological resources to facilitate the reading of texts in sign language videos, such as an easy-to-use tool for changing the video playback speed, interactive resources for navigating the content easily, a clear and broad view of the structure and hierarchy of the information, tools for marking the text and returning to the marks when desired, and graphic resources that relate the different verbal and visual contents and guide the reading pleasantly.

    Digital Watermarking for Verification of Perception-based Integrity of Audio Data

    In certain application fields digital audio recordings contain sensitive content. Examples are historical archival material in public archives that preserve our cultural heritage, or digital evidence in the context of law enforcement and civil proceedings. Because of the powerful capabilities of modern multimedia editing tools, such material is vulnerable to doctoring of the content and forgery of its origin with malicious intent. Inadvertent data modification and mistaken origin can also be caused by human error. Hence, the credibility and provenance of such audio content, in terms of an unadulterated and genuine state, and the confidence about its origin are critical factors. To address this issue, this PhD thesis proposes a mechanism for verifying the integrity and authenticity of digital sound recordings. It is designed and implemented to be insensitive to common post-processing operations on the audio data that influence the subjective acoustic perception only marginally (if at all). Examples of such operations include lossy compression that maintains a high sound quality of the audio media, or lossless format conversions. The objective is to avoid the de facto false alarms that would be expected from standard crypto-based authentication protocols in the presence of such legitimate post-processing. To achieve this, a feasible combination of digital watermarking and audio-specific hashing techniques is investigated. First, a suitable secret-key dependent audio hashing algorithm is developed. It incorporates and enhances so-called audio fingerprinting technology from the state of the art in content-based audio identification. The presented algorithm (denoted as the "rMAC" message authentication code) allows "perception-based" verification of integrity. That is, integrity breaches are classified as such only once they become audible.
As another objective, this rMAC is embedded and stored silently inside the audio media by means of audio watermarking technology. This approach allows the authentication code to be maintained across the above-mentioned admissible post-processing operations and made available for integrity verification at a later date. For this, an existing secret-key dependent audio watermarking algorithm is used and enhanced in this thesis work. To some extent, the dependency of the rMAC and of the watermarking processing on a secret key also allows authenticating the origin of a protected audio file. To elaborate on this security aspect, this work also estimates the brute-force effort required of an adversary attacking this combined rMAC-watermarking approach. The experimental results show that the proposed method provides good distinction and classification performance for authentic versus doctored audio content. It also allows the temporal localization of audible data modification within a protected audio file. Finally, the experimental evaluation provides recommendations about technical configuration settings of the combined watermarking-hashing approach. Beyond the main topic of perception-based data integrity and data authenticity for audio, this PhD work provides new general findings in the fields of audio fingerprinting and digital watermarking. The main contributions of this PhD were published and presented mainly at conferences about multimedia security. These publications were cited by a number of other authors and hence had some impact on their work.
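
The contrast this abstract draws between standard crypto-based authentication and perception-based verification can be illustrated with a small Python sketch. It is not the thesis's rMAC algorithm, whose details the abstract does not give. As an assumed stand-in, it computes a keyed hash (HMAC-SHA256 from the Python standard library) over coarse frame-energy features rather than over the raw sample bytes. The frame size, the feature choice, the synthetic signal and all function names are hypothetical.

```python
import hashlib
import hmac
import math

FRAME = 256  # samples per frame (arbitrary choice for this sketch)


def to_pcm16(samples):
    """Serialise floats in [-1, 1] as 16-bit little-endian PCM bytes."""
    return b"".join(
        int(round(x * 32767)).to_bytes(2, "little", signed=True) for x in samples
    )


def raw_mac(key, samples):
    """Conventional MAC over the exact sample bytes."""
    return hmac.new(key, to_pcm16(samples), hashlib.sha256).hexdigest()


def coarse_features(samples):
    """One byte per adjacent frame pair: 1 if the frame energy rises, else 0."""
    energies = [
        sum(x * x for x in samples[i:i + FRAME]) / FRAME
        for i in range(0, len(samples) - FRAME + 1, FRAME)
    ]
    return bytes(1 if b > a else 0 for a, b in zip(energies, energies[1:]))


def robust_mac(key, samples):
    """Keyed hash over coarse features instead of raw bytes (assumed stand-in)."""
    return hmac.new(key, coarse_features(samples), hashlib.sha256).hexdigest()


# Synthetic signal: a 440 Hz tone at 8 kHz whose loudness changes per frame.
levels = [0.2, 0.8, 0.4, 0.9, 0.1, 0.6, 0.3, 0.7]
original = [
    levels[n // FRAME] * math.sin(2 * math.pi * 440 * n / 8000)
    for n in range(FRAME * len(levels))
]
# Very small perturbation, standing in for benign post-processing.
processed = [x + 0.001 * math.cos(0.37 * n) for n, x in enumerate(original)]
# Clearly audible tampering: the fourth frame is replaced by silence.
doctored = original[:3 * FRAME] + [0.0] * FRAME + original[4 * FRAME:]

key = b"demo-secret-key"
print("raw MAC survives processing:   ", raw_mac(key, original) == raw_mac(key, processed))
print("robust MAC survives processing:", robust_mac(key, original) == robust_mac(key, processed))
print("robust MAC survives doctoring: ", robust_mac(key, original) == robust_mac(key, doctored))
```

In this toy setting the raw MAC changes as soon as any sample changes, which corresponds to the false alarm the abstract wants to avoid. The feature-based MAC stays the same under the small perturbation but changes once a frame is silenced. A real scheme would need far more discriminative features, and the watermark embedding step is not modelled here.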