2,192 research outputs found

    A practice-led approach to facial animation research

    Get PDF
    In facial expression research, it is well established that certain emotional expressions are universally recognized. Studies into the observer perception of dynamic expressions have built upon this research by highlighting the importance of particular facial regions, timings, and temporal configurations to perception and interpretation. In many studies, the stimuli for such studies have been generated through posing by non-experts or performances by trained actors. However, skilled character animators are capable of crafting recognizable, believable emotional facial expressions as a part of their professional practice. ‘Emotional Avatars’ was conceived as an interdisciplinary research project which would draw upon the knowledge of animation practice and emotional psychology. The aim of the project was to jointly investigate the artistic generation and observer perception of emotional expression animation to determine whether the nuances of emotional facial expression could be artistically choreographed to enhance audience interpretation

    Development of a Deep Neural Network for Speeding Up a Model of Loudness for Time-Varying Sounds

    Get PDF
    The “time-varying loudness” (TVL) model of Glasberg and Moore calculates “instantaneous loudness” every 1 ms, and this is used to generate predictions of short-term loudness, the loudness of a short segment of sound, such as a word in a sentence, and of long-term loudness, the loudness of a longer segment of sound, such as a whole sentence. The calculation of instantaneous loudness is computationally intensive and real-time implementation of the TVL model is difficult. To speed up the computation, a deep neural network (DNN) was trained to predict instantaneous loudness using a large database of speech sounds and artificial sounds (tones alone and tones in white or pink noise), with the predictions of the TVL model as a reference (providing the “correct” answer, specifically the loudness level in phons). A multilayer perceptron with three hidden layers was found to be sufficient, with more complex DNN architecture not yielding higher accuracy. After training, the deviations between the predictions of the TVL model and the predictions of the DNN were typically less than 0.5 phons, even for types of sounds that were not used for training (music, rain, animal sounds, and washing machine). The DNN calculates instantaneous loudness over 100 times more quickly than the TVL model. Possible applications of the DNN are discussed

    Age-group differences in speech identification despite matched audiometrically normal hearing: contributions from auditory temporal processing and cognition

    Get PDF
    Hearing loss with increasing age adversely affects the ability to understand speech, an effect that results partly from reduced audibility. The aims of this study were to establish whether aging reduces speech intelligibility for listeners with normal audiograms, and, if so, to assess the relative contributions of auditory temporal and cognitive processing. Twenty-one older normal-hearing (ONH; 60-79 years) participants with bilateral audiometric thresholds = 20 dB HL at 0.125-6 kHz were matched to nine young (YNH; 18-27 years) participants in terms of mean audiograms, years of education, and performance IQ. Measures included: (1) identification of consonants in quiet and in noise that was unmodulated or modulated at 5 or 80 Hz; (2) identification of sentences in quiet and in co-located or spatially separated two-talker babble; (3) detection of modulation of the temporal envelope (TE) at frequencies 5-180 Hz; (4) monaural and binaural sensitivity to temporal fine structure (TFS); (5) various cognitive tests. Speech identification was worse for ONH than YNH participants in all types of background. This deficit was not reflected in self-ratings of hearing ability. Modulation masking release (the improvement in speech identification obtained by amplitude modulating a noise background) and spatial masking release (the benefit obtained from spatially separating masker and target speech) were not affected by age. Sensitivity to TE and TFS was lower for ONH than YNH participants, and was correlated positively with speech-in-noise (SiN) identification. Many cognitive abilities were lower for ONH than YNH participants, and generally were correlated positively with SiN identification scores. The best predictors of the intelligibility of SiN were composite measures of cognition and TFS sensitivity. These results suggest that declines in speech perception in older persons are partly caused by cognitive and perceptual changes separate from age-related changes in audiometric sensitivity

    The Infrared Imaging Spectrograph (IRIS) for TMT: the atmospheric dispersion corrector

    Get PDF
    We present a conceptual design for the atmospheric dispersion corrector (ADC) for TMT's Infrared Imaging Spectrograph (IRIS). The severe requirements of this ADC are reviewed, as are limitations to observing caused by uncorrectable atmospheric effects. The requirement of residual dispersion less than 1 milliarcsecond can be met with certain glass combinations. The design decisions are discussed and the performance of the design ADC is described. Alternative options and their performance tradeoffs are also presented.Comment: SPIE Astronomical Instrumentation 201

    Echoic Sensory Substitution Information in a Single Obstacle Circumvention Task.

    Get PDF
    Accurate motor control is required when walking around obstacles in order to avoid collisions. When vision is unavailable, sensory substitution can be used to improve locomotion through the environment. Tactile sensory substitution devices (SSDs) are electronic travel aids, some of which indicate the distance of an obstacle using the rate of vibration of a transducer on the skin. We investigated how accurately such an SSD guided navigation in an obstacle circumvention task. Using an SSD, 12 blindfolded participants navigated around a single flat 0.6 x 2 m obstacle. A 3-dimensional Vicon motion capture system was used to quantify various kinematic indices of human movement. Navigation performance under full vision was used as a baseline for comparison. The obstacle position was varied from trial to trial relative to the participant, being placed at two distances 25 cm to the left, right or directly ahead. Under SSD guidance, participants navigated without collision in 93% of trials. No collisions occurred under visual guidance. Buffer space (clearance between the obstacle and shoulder) was larger by a factor of 2.1 with SSD guidance than with visual guidance, movement times were longer by a factor of 9.4, and numbers of velocity corrections were larger by a factor of 5 (all p<0.05). Participants passed the obstacle on the side affording the most space in the majority of trials for both SSD and visual guidance conditions. The results are consistent with the idea that SSD information can be used to generate a protective envelope during locomotion in order to avoid collisions when navigating around obstacles, and to pass on the side of the obstacle affording the most space in the majority of trials.Vision and Eye Research Unit, Postgraduate Medical Institute at Anglia Ruskin University; Medical Research Council (Grant ID: G0701870)This is the final version of the article. It first appeared from the Public Library of Science via http://dx.doi.org/10.1371/journal.pone.016087

    Effects of spectral smearing on the intelligibility of sentences in the presence of interfering speech,’’

    Get PDF
    In a previous study IT. Baer and B.C. J. Moore, J. Acoust. Soc. Am. 94, 1229-1241 (1993)], a spectral smearing technique was used to simulate some of the effects of impaired frequency selectivity so as to assess its influence on speech intelligibility. Results showed that spectral smearing to simulate broadening of the auditory filters by a factor of 3 or 6 had little effect on the intelligibility of speech in quiet but had a large effect on the intelligibility of speech in noise. The present study examines the effect of spectral smearing on the intelligibility of speech in the presence of a single interfering talker. The results were generally consistent with those of the previous study, suggesting that impaired frequency selectivity contributes significantly to the problems experienced by people with cochlear hearing loss when they listen to speech in the presence of interfering sounds

    Differential expression of type X collagen in a mechanically active 3-D chondrocyte culture system: a quantitative study

    Get PDF
    OBJECTIVE: Mechanical loading of cartilage influences chondrocyte metabolism and gene expression. The gene encoding type X collagen is expressed specifically by hypertrophic chondrocytes and up regulated during osteoarthritis. In this study we tested the hypothesis that the mechanical microenvironment resulting from higher levels of local strain in a three dimensional cell culture construct would lead to an increase in the expression of type X collagen mRNA by chondrocytes in those areas. METHODS: Hypertrophic chondrocytes were isolated from embryonic chick sterna and seeded onto rectangular Gelfoam sponges. Seeded sponges were subjected to various levels of cyclic uniaxial tensile strains at 1 Hz with the computer-controlled Bio-Stretch system. Strain distribution across the sponge was quantified by digital image analysis. After mechanical loading, sponges were cut and the end and center regions were separated according to construct strain distribution. Total RNA was extracted from the cells harvested from these regions, and real-time quantitative RT-PCR was performed to quantify mRNA levels for type X collagen and a housing-keeping gene 18S RNA. RESULTS: Chondrocytes distributed in high (9%) local strain areas produced more than two times type X collagen mRNA compared to the those under no load conditions, while chondrocytes located in low (2.5%) local strain areas had no appreciable difference in type X collagen mRNA production in comparison to non-loaded samples. Increasing local strains above 2.5%, either in the center or end regions of the sponge, resulted in increased expression of Col X mRNA by chondrocytes in that region. CONCLUSION: These findings suggest that the threshold of chondrocyte sensitivity to inducing type X collagen mRNA production is more than 2.5% local strain, and that increased local strains above the threshold results in an increase of Col X mRNA expression. Such quantitative analysis has important implications for our understanding of mechanosensitivity of cartilage and mechanical regulation of chondrocyte gene expression

    A Framework to Account for the Effects of Visual Loss on Human Auditory Abilities

    Get PDF
    Until recently, a commonly held view was that blindness resulted in enhanced auditory abilities, underpinned by the beneficial effects of cross-modal neuroplasticity. This viewpoint has been challenged by studies showing that blindness results in poorer performance for some auditory spatial tasks. It is now clear that visual loss does not result in a general increase or decrease in all auditory abilities. Although several hypotheses have been proposed to explain why certain auditory abilities are enhanced while others are degraded, these are often limited to a specific subset of tasks. A comprehensive explanation encompassing auditory abilities assessed in fully blind and partially sighted populations and spanning spatial and non-spatial cognition has not so far been proposed. The current article proposes a framework comprising a set of nine principles that can be used to predict whether auditory abilities are enhanced or degraded. The validity of these principles is assessed by comparing their predictions with a wide range of empirical evidence concerning the effects of visual loss on spatial and non-spatial auditory abilities. Developmental findings and the effects of early- versus late-onset visual loss are discussed. Ways of improving auditory abilities for individuals with visual loss and reducing auditory spatial deficits are summarized. A new Perceptual Restructuring Hypothesis is proposed within the framework, positing that the auditory system is restructured to provide the most accurate information possible given the loss of the visual signal and utilizing available cortical resources, resulting in different auditory abilities getting better or worse according to the nine principles
    • 

    corecore