Search CORE

7,641 research outputs found

DolphinAtack: Inaudible Voice Commands

Author: Aviv Adam J.
Backes Michael
Carlini Nicholas
Castro Simon
Dey Sanorita
Francillon Aurélien
Ishtiaq Roufa Rob Millerb
Ittichaichareon Chadawan
Michalevsky Yan
Schlegel Roman
Shin Hocheol
Son Yunmok
Vaidya Tavish
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/08/2017
Field of study

Speech recognition (SR) systems such as Siri or Google Now have become an increasingly popular human-computer interaction method, and have turned various systems into voice controllable systems(VCS). Prior work on attacking VCS shows that the hidden voice commands that are incomprehensible to people can control the systems. Hidden voice commands, though hidden, are nonetheless audible. In this work, we design a completely inaudible attack, DolphinAttack, that modulates voice commands on ultrasonic carriers (e.g., f > 20 kHz) to achieve inaudibility. By leveraging the nonlinearity of the microphone circuits, the modulated low frequency audio commands can be successfully demodulated, recovered, and more importantly interpreted by the speech recognition systems. We validate DolphinAttack on popular speech recognition systems, including Siri, Google Now, Samsung S Voice, Huawei HiVoice, Cortana and Alexa. By injecting a sequence of inaudible voice commands, we show a few proof-of-concept attacks, which include activating Siri to initiate a FaceTime call on iPhone, activating Google Now to switch the phone to the airplane mode, and even manipulating the navigation system in an Audi automobile. We propose hardware and software defense solutions. We validate that it is feasible to detect DolphinAttack by classifying the audios using supported vector machine (SVM), and suggest to re-design voice controllable systems to be resilient to inaudible voice command attacks.Comment: 15 pages, 17 figure

arXiv.org e-Print Archive

Crossref

Simple4All proposals for the Albayzin Evaluations in Speech Synthesis

Author: Barra-Chicote Roberto
King Simon
Lorenzo-Trueba Jaime
Montero Juan M
Watts Oliver
Yamagishi Junichi
Publication venue
Publication date: 01/01/2012
Field of study

Edinburgh Research Explorer

Museums as disseminators of niche knowledge: Universality in accessibility for all

Author: Rizzo A
Publication venue: country:GB
Publication date: 01/01/2019
Field of study

Accessibility has faced several challenges within audiovisual translation Studies and gained great opportunities for its establishment as a methodologically and theoretically well-founded discipline. Initially conceived as a set of services and practices that provides access to audiovisual media content for persons with sensory impairment, today accessibility can be viewed as a concept involving more and more universality thanks to its contribution to the dissemination of audiovisual products on the topic of marginalisation. Against this theoretical backdrop, accessibility is scrutinised from the perspective of aesthetics of migration and minorities within the field of the visual arts in museum settings. These aesthetic narrative forms act as modalities that encourage the diffusion of ‘niche’ knowledge, where processes of translation and interpretation provide access to all knowledge as counter discourse. Within this framework, the ways in which language is used can be considered the beginning of a type of local grammar in English as lingua franca for interlingual translation and subtitling, both of which ensure access to knowledge for all citizens as a human rights principle and regardless of cultural and social differences. Accessibility is thus gaining momentum as an agent for the democratisation and transparency of information against media discourse distortions and oversimplifications

Archivio istituzionale della ricerca - Università di Palermo

Synthetic voices in the foreign language context

Author: Bione Tiago
Cardoso Walcir
Publication venue: (co-sponsored by Center for Open Educational Resources and Language Learning, University of Texas at Austin)
Publication date: 01/02/2020
Field of study

This study evaluated the voice of a modern English text-to-speech (TTS) system in an English as a foreign language (EFL) context in terms of its speech quality, ability to be understood by L2 users, and potential for focus on specific language forms. Twenty-nine Brazilian EFL learners listened to stories and sentences, produced by a TTS voice and a human voice, and rated them on a 6-point Likert scale according to holistic criteria for evaluating pronunciation: Comprehensibility, naturalness, and accuracy. In addition, they were asked to answer a set of comprehension questions (to assess understanding), to complete a dictation/transcription task to measure intelligibility, and to identify whether the target past -ed form was present or not in decontextualized sentences. Results indicate that the performance of both the TTS and human voices were perceived similarly in terms of comprehensibility, while ratings for naturalness were unfavorable for the synthesized voice. For text comprehension, dictation, and aural identification tasks, participants performed relatively similarly in response to both voices. These findings suggest that TTS systems have the potential to be used as pedagogical tools for L2 learning, particularly in EFL settings, where natural occurrence of the target language is limited or non-existent

ScholarSpace at University of Hawai'i at Manoa

Delivery approaches in audio description for the scenic arts

Author: Hermosa Ramírez Irene
Publication venue
Publication date: 01/01/2020
Field of study

Altres ajuts: Esta investigación forma parte del proyecto RAD (Researching Audio Description: Translation, Delivery and New Scenarios), código de referencia PGC2018-096566-B-I00.Audio description (AD) is becoming an increasingly mature modality within Audiovisual Translation Studies (AVTS) and Media Accessibility Studies. Concurrently, technological advances are steadily being put at the forefront of its practice. The aim of this article is to define the current status and development of AD for the scenic arts from a technical perspective. First, an overview of guidelines that specifically include recommendations on delivering AD for the scenic arts is presented. The emphasis is then placed on the implications of the delivery approaches currently applied to this modality. In this context, theatre venues can offer AD - along with other access services - in a live, semi-live or automated manner. The advantages and challenges for each approach are thus analysed and compared by presenting examples and applications in practice. Ultimately, the present descriptive study concludes that the live, on-site delivery approach is no longer the default in Spanish venues. This conclusion opens up new research paths on the reception of innovative practices and software solutions. It is tentatively suggested that involving the creative team and the blind and visually impaired patrons would be key to choosing the most suitable delivery approach for each production

Diposit Digital de Documents de la UAB

Growing grassroots innovations: exploring the role of community-based initiatives in governing sustainable energy transitions

Author: Alex Haxeltine
Brangwyn B
Church C
Curry A
Defra
Douthwaite R
FEASTA
Gill Seyfang
Government H M
Hargreaves T
Heinberg R
Hess D J
Hielscher S
Hopkins R
Hopkins R
Jackson T
Loorbach D
ONS
ONS
Rip A
Seyfang G
Smith A
Sorrell S
Spratt S
Transition Network
UKERC
WSSD
Publication venue: 'Pion Ltd'
Publication date: 21/06/2012
Field of study

Crossref

University of East Anglia digital repository

Assistive technology : guidance for teaching practitioners to support learners with specific learning difficulties

Author
Publication venue: Welsh Government
Publication date: 01/01/2015
Field of study

Digital Education Resource Archive