24 research outputs found

    Constructing a low-cost, open-source, VoiceXML

    Get PDF
    Voice-enabled applications, applications that interact with a user via an audio channel, are used extensively today. Their use is growing as speech related technologies improve, as speech is one of the most natural methods of interaction. They can provide customer support as IVRs, can be used as an assistive technology, or can become an aural interface to the Internet. Given that the telephone is used extensively throughout the globe, the number of potential users of voice-enabled applications is very high. VoiceXML is a popular, open, high-level, standard means of creating voice-enabled applications which was designed to bring the benefits of web based development to services. While VoiceXML is an ideal language for creating these applications, VoiceXML gateways, the hardware and software responsible for interpreting VoiceXML applications and interfacing with the PSTN, are still expensive and so there is a need for a low-cost gateway. Asterisk, and open-source, TDM/VoIP telephony platform, can be used as a low-cost PSTN interface. This thesis investigates adding a VoiceXML service to Asterisk, creating a low-cost VoiceXML prototype gateway which is able to render voice-enabled applications. Following the Component-Based Software Engineering (CBSE) paradigm, the VoiceXML gateway is divided into a set of components which are sourced from the open-source community, and integrated to create the gateway. The browser requires a VoiceXML interpreter (OpenVXI), a Text-To-Speech engine (Festival) and a speech recognition engine (Sphinx 4). The integration of the components results in a low-cost, open-source VoiceXML gateway. System tests show that the integration of the components was successful, and that the system can handle concurrent calls. A fully compliant version of the gateway can be used in the real world to render voice-enabled applications at a low cost.KMBT_363Adobe Acrobat 9.55 Paper Capture Plug-i

    Flexible context aware interface for ambient assisted living

    Get PDF
    A Multi Agent System that provides a (cared for) person, the subject, with assistance and support through an Ambient Assisted Living Flexible Interface (AALFI) during the day while complementing the night time assistance offered by NOCTURNAL with feedback assistance, is presented. It has been tailored to the subject’s requirements profile and takes into account factors associated with the time of day; hence it attempts to overcome shortcomings of current Ambient Assisted Living Systems. The subject is provided with feedback that highlights important criteria such as quality of sleep during the night and possible breeches of safety during the day. This may help the subject carry out corrective measures and/or seek further assistance. AALFI provides tailored interaction that is either visual or auditory so that the subject is able to understand the interactions and this process is driven by a Multi-Agent System. User feedback gathered from a relevant user group through a workshop validated the ideas underpinning the research, the Multi-agent system and the adaptable interface

    Flexible context aware interface for ambient assisted living

    Get PDF
    A Multi Agent System that provides a (cared for) person, the subject, with assistance and support through an Ambient Assisted Living Flexible Interface (AALFI) during the day while complementing the night time assistance offered by NOCTURNAL with feedback assistance, is presented. It has been tailored to the subject’s requirements profile and takes into account factors associated with the time of day; hence it attempts to overcome shortcomings of current Ambient Assisted Living Systems. The subject is provided with feedback that highlights important criteria such as quality of sleep during the night and possible breeches of safety during the day. This may help the subject carry out corrective measures and/or seek further assistance. AALFI provides tailored interaction that is either visual or auditory so that the subject is able to understand the interactions and this process is driven by a Multi-Agent System. User feedback gathered from a relevant user group through a workshop validated the ideas underpinning the research, the Multi-agent system and the adaptable interface

    FRAMEWORK AND IMPLEMENTATION FOR DIALOG BASED ARABIC SPEECH RECOGNITION

    Get PDF

    Integration of accessibility requirements in the design of multimedia user agents interfaces

    Get PDF
    Mención Internacional en el título de doctorThe continuous increase of multimedia content in the Web, especially video content, is not accompanied by a similar increase of accessibility; there is a lack of synchronized alternatives for the content such as captions, audio description, etc. that allow anyone with or without disability to access such content. This lack of accessibility in video content access is not only due to the lack of alternatives, but also because of the fact that user agents which deliver this content do not provide the necessary means to present them. This fact leads to the noncompliance of the current regulations and legislation in terms of accessibility. This noncompliance could be due to the lack of knowledge, or because of the fact that applying these regulations from an engineering point of view is not trivial. There is a lack of authoring tools and methodological approaches which assist in the development of an accessible product in the Engineering scope as it is the case of the development of a quality user agent which includes accessibility requirements. All these facts, multimedia content’s progressive increase on the Web, accessibility barriers both in the content and in the user agent together with current regulations and legislation regarding accessibility is what has motivated the accomplishment of this Doctoral Thesis. With this Doctoral Thesis, a set of accessibility requirements that a user agent which delivers multimedia content must fulfil is provided. Besides, a workspace is provided following a methodological approach which assists in the design and development of the interface of an accessible user agent which delivers accessible multimedia content. This workspace is composed of an architecture and models following a Model-Based User Interface Development (MBUID) approach and is oriented to be used by designers with knowledge in modeling. Finally, as a support to any professional regardless of their knowledge in modeling and in accessibility, an authoring tool based on models is offered in order to create user agents with accessibility requirements.El continuo incremento del contenido multimedia en la Web, especialmente del contenido vídeo, no va acompañado de un incremento similar de accesibilidad, hay una falta de alternativas sincronizadas al contenido como subtitulado, audiodescripción, etc., que permitan acceder a cualquier persona con y sin discapacidad a dicho contenido. Esta falta de accesibilidad en el acceso al contenido vídeo no solo se debe a la ausencia de alternativas, también es debido a que los agentes de usuario que entregan dicho contenido no proporcionan los medios necesarios para presentarlas. Este hecho da lugar a que no se cumpla la normativa y la legislación vigente en materia de accesibilidad. Dicho incumplimiento, puede ser debido al desconocimiento, o a que aplicar esa normativa desde el punto de vista de la ingeniería no es trivial. Hay una falta de herramientas de autor y de enfoques metodológicos que asistan en el desarrollo de un producto accesible en el ámbito de la Ingeniería, como es el caso del desarrollo de un agente de usuario con calidad que incluya requisitos de accesibilidad. Todos estos hechos, el incremento progresivo del contenido multimedia en la Web, las barreras de accesibilidad tanto en el contenido como en el agente de usuario junto con la normativa y legislación vigente en materia de accesibilidad es lo que ha motivado la realización de esta Tesis Doctoral. Con esta Tesis Doctoral se proporciona el conjunto de requisitos de accesibilidad que debe cumplir un agente de usuario que sirva contenido multimedia accesible. Además se proporciona un espacio de trabajo siguiendo un enfoque metodológico que asista en el diseño y desarrollo de la interfaz de un agente de usuario accesible que sirve contenido multimedia accesible. Este espacio de trabajo está compuesto de una arquitectura y modelos siguiendo el enfoque de Model-Based User Interface Development (MBUID) y está orientado a ser utilizado por diseñadores con conocimientos en modelado. Por último, como recurso de ayuda a cualquier profesional, independientemente de sus conocimientos en modelado y accesibilidad, se ofrece una herramienta de autor basada en modelos para crear agentes de usuario con requisitos de accesibilidad.Programa Oficial de Doctorado en Ciencia y Tecnología InformáticaPresidente: José Antonio Macías Iglesias.- Vocal: Hugo Alexandre Paredes Guede

    WebVoice: Speech Access to Traditional Web Content for Blind Users

    Get PDF
    Traditional web content and navigation features are made available to blind users by converting a webpage into a speech enabled X+V application, which allows blind users to follow the links present in a web page via speech commands. Also the application can read the different paragraphs and search for a word. This X+V application runs on the Opera browser

    Evaluation of mobile and communication technologies for language learning

    Get PDF
    Results from a study by the Ministry of Higher Education in Malaysia indicate that the English language performance of Malaysian university students and graduates is a cause of concern. The National Higher Education Strategic Plan was launched by the Malaysian government in 2007 as a response to the challenges of the education sector that needs to be more internationalised and industry driven. In the strategic plan, the English language is identified as a crucial element in the effort to achieve a developed country status by the year 2020. Therefore, academicians and researchers are actively finding ways to improve students English skills in reading, listening, writing and speaking. Mobile Learning (or m-learning) is a new approach to enhance the learning experience utilising mobile technologies. For example, in order to learn new words the brain requires repeated reminders. The use of mobile devices can help to reinforce the learning process. The use of mobile devices to deliver learning in chunks or nugget sizes, on the move, at any time and anywhere, have shown to engage the learners very effectively in some research projects. Communication technologies such as blogs and Wikis also hold promises for enhancing learning. For instance, writing for a wider audience encourages students' ownership and responsibility. Moreover, comments and feedback from peers can motivate and encourage students. This, in turn, will lead to more active participation. Recognising the potential of these technologies for language learning, the aim of this study is to evaluate the effects of using mobile phones and communication technologies for English language learning with Malaysian students. Two experiments were carried out in this study. The initial pilot experiment was carried out with a small group of students to determine the feasibility of using mobile and communication technologies for language learning for Malaysian students in higher education. The main experiment was conducted after addressing the lessons learned from the initial experiment. An experimental group and a control group from a public higher education institution in Malaysia took part in the study. Quantitative and qualitative data were gathered and analysed. The quantitative results show that the experimental group performed significantly better than the control group in the post written test. The experimental group is in favour of receiving lesson reminders and quizzes that were sent to their mobile phones. However, they did not like receiving messages about web resources. They also did not like reading learning material on a wiki and updating wiki entries. Three themes are derived from the interviews and questionnaires: 1) access, 2) communication, and 3) usability. Access to learning focuses on the ease of use to access learning materials. Students agreed that mobile phones and wikis allowed them to access learning material easily. However, the use of wiki did not engage the students. In terms of communication, lecturers and students can use mobile phone and wiki platforms for communication. However, students were not keen to communicate with the lecturer. As for usability, the students have no problems using a mobile phone but the problem is with the small screen size and it is difficult to type long replies. The students did not want to invest time in learning how to use a wiki as they see it as being irrelevant because they did not want to publish and share their ideas with others. In conclusion, the use of a mobile phone and wiki for language learning is feasible, but further investigation is required regarding student engagement. The lessons learned from this study can help practitioners, in particular those in Malaysia, to adapt their language learning processes when integrating mobile and communication technologies
    corecore