1,023 research outputs found

    An assisting model for the visually challenged to detect bus door accurately

    Get PDF
    Visually impaired individuals are increasing and as per global statistics, around 39 million are blind, and 246 million are affected by low vision. Even in India, as per the recent reviews, over 5 million visually challenged people are present. Authors performed a survey of some critical problems the visually challenged people faced in India from the centre for visually challenged (CVC) School established by UVSM Hospitals. Among the major problems identified through survey, most of these persons prefer carrying out their tasks independently, and depend on public transport buses for migration. However, critical sub-problems being faced include; bus door identification and identifying the bus route number accurately. This article aims to provide solutions in helping visually challenged individuals to identify exact bus that drives them to their destination, its door, bus number, and the path for boarding bus. A video sequence of current scenario would be sent to mobile, in which the actual processing of image is carried out. After the video sequence processing, generated output is a voice message that specifies the bus's location, door, and exact information of the bus number along the road path directly to the user using a wireless device aiming foa a low-cost solution

    Using Sonic Enhancement to Augment Non-Visual Tabular Navigation

    Get PDF
    More information is now readily available to computer users than at any time in human history; however, much of this information is often inaccessible to people with blindness or low-vision, for whom information must be presented non-visually. Currently, screen readers are able to verbalize on-screen text using text-to-speech (TTS) synthesis; however, much of this vocalization is inadequate for browsing the Internet. An auditory interface that incorporates auditory-spatial orientation was created and tested. For information that can be structured as a two-dimensional table, links can be semantically grouped as cells in a row within an auditory table, which provides a consistent structure for auditory navigation. An auditory display prototype was tested. Sixteen legally blind subjects participated in this research study. Results demonstrated that stereo panning was an effective technique for audio-spatially orienting non-visual navigation in a five-row, six-column HTML table as compared to a centered, stationary synthesized voice. These results were based on measuring the time- to-target (TTT), or the amount of time elapsed from the first prompting to the selection of each tabular link. Preliminary analysis of the TTT values recorded during the experiment showed that the populations did not conform to the ANOVA requirements of normality and equality of variances. Therefore, the data were transformed using the natural logarithm. The repeated-measures two-factor ANOVA results show that the logarithmically-transformed TTTs were significantly affected by the tonal variation method, F(1,15) = 6.194, p= 0.025. Similarly, the results show that the logarithmically transformed TTTs were marginally affected by the stereo spatialization method, F(1,15) = 4.240, p=0.057. The results show that the logarithmically transformed TTTs were not significantly affected by the interaction of both methods, F(1,15) = 1.381, p=0.258. These results suggest that some confusion may be caused in the subject when employing both of these methods simultaneously. The significant effect of tonal variation indicates that the effect is actually increasing the average TTT. In other words, the presence of preceding tones increases task completion time on average. The marginally-significant effect of stereo spatialization decreases the average log(TTT) from 2.405 to 2.264

    Multimodal Based Audio-Visual Speech Recognition for Hard-of-Hearing: State of the Art Techniques and Challenges

    Get PDF
    Multimodal Integration (MI) is the study of merging the knowledge acquired by the nervous system using sensory modalities such as speech, vision, touch, and gesture. The applications of MI expand over the areas of Audio-Visual Speech Recognition (AVSR), Sign Language Recognition (SLR), Emotion Recognition (ER), Bio Metrics Applications (BMA), Affect Recognition (AR), Multimedia Retrieval (MR), etc. The fusion of modalities such as hand gestures- facial, lip- hand position, etc., are mainly used sensory modalities for the development of hearing-impaired multimodal systems. This paper encapsulates an overview of multimodal systems available within literature towards hearing impaired studies. This paper also discusses some of the studies related to hearing-impaired acoustic analysis. It is observed that very less algorithms have been developed for hearing impaired AVSR as compared to normal hearing. Thus, the study of audio-visual based speech recognition systems for the hearing impaired is highly demanded for the people who are trying to communicate with natively speaking languages.  This paper also highlights the state-of-the-art techniques in AVSR and the challenges faced by the researchers for the development of AVSR systems

    Design of an Automated Book Reader as an Assistive Technology for Blind Persons

    Get PDF
    This dissertation introduces a novel automated book reader as an assistive technology tool for persons with blindness. The literature shows extensive work in the area of optical character recognition, but the current methodologies available for the automated reading of books or bound volumes remain inadequate and are severely constrained during document scanning or image acquisition processes. The goal of the book reader design is to automate and simplify the task of reading a book while providing a user-friendly environment with a realistic but affordable system design. This design responds to the main concerns of (a) providing a method of image acquisition that maintains the integrity of the source (b) overcoming optical character recognition errors created by inherent imaging issues such as curvature effects and barrel distortion, and (c) determining a suitable method for accurate recognition of characters that yields an interface with the ability to read from any open book with a high reading accuracy nearing 98%. This research endeavor focuses in its initial aim on the development of an assistive technology tool to help persons with blindness in the reading of books and other bound volumes. But its secondary and broader aim is to also find in this design the perfect platform for the digitization process of bound documentation in line with the mission of the Open Content Alliance (OCA), a nonprofit Alliance at making reading materials available in digital form. The theoretical perspective of this research relates to the mathematical developments that are made in order to resolve both the inherent distortions due to the properties of the camera lens and the anticipated distortions of the changing page curvature as one leafs through the book. This is evidenced by the significant increase of the recognition rate of characters and a high accuracy read-out through text to speech processing. This reasonably priced interface with its high performance results and its compatibility to any computer or laptop through universal serial bus connectors extends greatly the prospects for universal accessibility to documentation

    Southwest Research Institute assistance to NASA in biomedical areas of the technology utilization program

    Get PDF
    The activities are reported of the NASA Biomedical Applications Team at Southwest Research Institute between 25 August, 1972 and 15 November, 1973. The program background and methodology are discussed along with the technology applications, and biomedical community impacts

    Fast Speech in Unit Selection Speech Synthesis

    Get PDF
    Moers-Prinz D. Fast Speech in Unit Selection Speech Synthesis. Bielefeld: Universität Bielefeld; 2020.Speech synthesis is part of the everyday life of many people with severe visual disabilities. For those who are reliant on assistive speech technology the possibility to choose a fast speaking rate is reported to be essential. But also expressive speech synthesis and other spoken language interfaces may require an integration of fast speech. Architectures like formant or diphone synthesis are able to produce synthetic speech at fast speech rates, but the generated speech does not sound very natural. Unit selection synthesis systems, however, are capable of delivering more natural output. Nevertheless, fast speech has not been adequately implemented into such systems to date. Thus, the goal of the work presented here was to determine an optimal strategy for modeling fast speech in unit selection speech synthesis to provide potential users with a more natural sounding alternative for fast speech output

    Designing Accessible Nonvisual Maps

    Get PDF
    Access to nonvisual maps has long required special equipment and training to use; Google Maps, ESRI, and other commonly used digital maps are completely visual and thus inaccessible to people with visual impairments. This project presents the design and evaluation of an easy to use digital auditory map and 3D model interactive map. A co-design was also undertaken to discover tools for an ideal nonvisual navigational experience. Baseline results of both studies are presented so future work can improve on the designs. The user evaluation revealed that both prototypes were moderately easy to use. An ideal nonvisual navigational experience, according to these participants, consists of both an accurate turn by turn navigational system, and an interactive map. Future work needs to focus on the development of appropriate tools to enable this ideal experience

    Deconstructing Disability, Assistive Technology: Secondary Orality, The Path To Universal Access

    Get PDF
    When Thomas Edison applied for a patent for his phonograph, he listed the talking books for the blind as one of the benefits of his invention. Edison was correct in his claim about talking books or audio books. Audio books have immensely helped the blind to achieve their academic and professional goals. Blind and visually impaired people have also been using audio books for pleasure reading. But several studies have demonstrated the benefits of audio books for people who are not defined as disabled. Many nondisabled people listen to audio books and take advantage of speech based technology, such as text-to-speech programs, in their daily activities. Speech-based technology, however, has remained on the margins of the academic environments, where hegemony of the sense of vision is palpable. Dominance of the sense of sight can be seen in school curricula, class rooms, libraries, academic conferences, books and journals, and virtually everywhere else. This dissertation analyzes the reason behind such an apathy towards technology based on speech. Jacques Derrida\u27s concept of \u27metaphysics of presence\u27 helps us understand the arbitrary privileging of one side of a binary at the expense of the other side. I demonstrate in this dissertation that both, the \u27disabled\u27 and technology used by them, are on the less privileged side of the binary formation they are part of. I use Derrida\u27s method of \u27deconstruction\u27 to deconstruct the binaries of \u27assistive\u27 and \u27main stream technology\u27 on one hand, and that of the \u27disabled\u27 and \u27nondisabled\u27 on the other. Donna Haraway and Katherine Hayles present an alternative reading of body to conceive of a post-gendered posthuman identity, I borrow from their work on cyborgism and iii posthumanism to conceive of a technology driven post-disabled world. Cyberspace is a good and tested example of an identity without body and a space without disability. The opposition between mainstream and speech-based assistive technology can be deconstructed with the example of what Walter Ong calls \u27secondary orality.\u27 Both disabled and non-disabled use the speech-based technology in their daily activities. Sighted people are increasingly listening to audio books and podcasts. Secondary Orality is also manifest on their GPS devices. Thus, Secondary Orality is a common element in assistive and mainstream technologies, hitherto segregated by designers. The way Derrida uses the concept of \u27incest\u27 to deconstruct binary opposition between Nature and Culture, I employ \u27secondary orality\u27 as a deconstructing tool in the context of mainstream and assistive technology. Mainstream electronic devices, smart phones, mp3 players, computers, for instance, can now be controlled with speech and they also can read the screen aloud. With Siri assistant, the new application on iPhone that allows the device to be controlled with speech, we seem to be very close to the age of talking computers that William Crossman foretells. As a result of such a progress in speech technology, I argue, we don\u27t need the concept of speech based assistive technology any more

    A better life through information technology? The techno-theological eschatology of posthuman speculative science

    Get PDF
    This is the pre-peer reviewed version of the article, published in Zygon 41(2) pp.267-288, which has been published in final form at http://www3.interscience.wiley.com/journal/118588124/issueThe depiction of human identity in the pop-science futurology of engineer/inventor Ray Kurzweil, the speculative-robotics of Carnegie Mellon roboticist Hans Moravec and the physics of Tulane University mathematics professor Frank Tipler elevate technology, especially information technology, to a point of ultimate significance. For these three figures, information technology offers the potential means by which the problem of human and cosmic finitude can be rectified. Although Moravec’s vision of intelligent robots, Kurzweil’s hope for immanent human immorality, and Tipler’s description of human-like von Neumann probe colonising the very material fabric of the universe, may all appear to be nothing more than science fictional musings, they raise genuine questions as to the relationship between science, technology, and religion as regards issues of personal and cosmic eschatology. In an attempt to correct what I see as the ‘cybernetic-totalism’ inherent in these ‘techno-theologies’, I will argue for a theology of technology, which seeks to interpret technology hermeneutically and grounds human creativity in the broader context of divine creative activity
    • …
    corecore