12 research outputs found

    Vocal imitation for query by vocalisation

    Get PDF
    PhD ThesisThe human voice presents a rich and powerful medium for expressing sonic ideas such as musical sounds. This capability extends beyond the sounds used in speech, evidenced for example in the art form of beatboxing, and recent studies highlighting the utility of vocal imitation for communicating sonic concepts. Meanwhile, the advance of digital audio has resulted in huge libraries of sounds at the disposal of music producers and sound designers. This presents a compelling search problem: with larger search spaces, the task of navigating sound libraries has become increasingly difficult. The versatility and expressive nature of the voice provides a seemingly ideal medium for querying sound libraries, raising the question of how well humans are able to vocally imitate musical sounds, and how we might use the voice as a tool for search. In this thesis we address these questions by investigating the ability of musicians to vocalise synthesised and percussive sounds, and evaluate the suitability of different audio features for predicting the perceptual similarity between vocal imitations and imitated sounds. In the first experiment, musicians were tasked with imitating synthesised sounds with one or two time–varying feature envelopes applied. The results show that participants were able to imitate pitch, loudness, and spectral centroid features accurately, and that imitation accuracy was generally preserved when the imitated stimuli combined two, non-necessarily congruent features. This demonstrates the viability of using the voice as a natural means of expressing time series of two features simultaneously. The second experiment consisted of two parts. In a vocal production task, musicians were asked to imitate drum sounds. Listeners were then asked to rate the similarity between the imitations and sounds from the same category (e.g. kick, snare etc.). The results show that drum sounds received the highest similarity ratings when rated against their imitations (as opposed to imitations of another sound), and overall more than half the imitated sounds were correctly identified with above chance accuracy from the imitations, although this varied considerably between drum categories. The findings from the vocal imitation experiments highlight the capacity of musicians to vocally imitate musical sounds, and some limitations of non– verbal vocal expression. Finally, we investigated the performance of different audio features as predictors of perceptual similarity between the imitations and imitated sounds from the second experiment. We show that features learned using convolutional auto–encoders outperform a number of popular heuristic features for this task, and that preservation of temporal information is more important than spectral resolution for differentiating between the vocal imitations and same–category drum sounds

    Vocal imitation for query by vocalisation

    Get PDF
    PhDThe human voice presents a rich and powerful medium for expressing sonic ideas such as musical sounds. This capability extends beyond the sounds used in speech, evidenced for example in the art form of beatboxing, and recent studies highlighting the utility of vocal imitation for communicating sonic concepts. Meanwhile, the advance of digital audio has resulted in huge libraries of sounds at the disposal of music producers and sound designers. This presents a compelling search problem: with larger search spaces, the task of navigating sound libraries has become increasingly difficult. The versatility and expressive nature of the voice provides a seemingly ideal medium for querying sound libraries, raising the question of how well humans are able to vocally imitate musical sounds, and how we might use the voice as a tool for search. In this thesis we address these questions by investigating the ability of musicians to vocalise synthesised and percussive sounds, and evaluate the suitability of different audio features for predicting the perceptual similarity between vocal imitations and imitated sounds. In the fi rst experiment, musicians were tasked with imitating synthesised sounds with one or two time{varying feature envelopes applied. The results show that participants were able to imitate pitch, loudness, and spectral centroid features accurately, and that imitation accuracy was generally preserved when the imitated stimuli combined two, non-necessarily congruent features. This demonstrates the viability of using the voice as a natural means of expressing time series of two features simultaneously. The second experiment consisted of two parts. In a vocal production task, musicians were asked to imitate drum sounds. Listeners were then asked to rate the similarity between the imitations and sounds from the same category (e.g. kick, snare etc.). The results show that drum sounds received the highest similarity ratings when rated against their imitations (as opposed to imitations of another sound), and overall more than half the imitated sounds were correctly identi ed with above chance accuracy from the imitations, although this varied considerably between drum categories. The fi ndings from the vocal imitation experiments highlight the capacity of musicians to vocally imitate musical sounds, and some limitations of non- verbal vocal expression. Finally, we investigated the performance of different audio features as predictors of perceptual similarity between the imitations and imitated sounds from the second experiment. We show that features learned using convolutional auto-encoders outperform a number of popular heuristic features for this task, and that preservation of temporal information is more important than spectral resolution for differentiating between the vocal imitations and same-category drum sounds.Engineering and Physical Sciences Research Council (EP/G03723X/1)

    Acoustic Room Compensation Using Local PCA-based Room Average Power Response Estimation

    Full text link
    Acoustic room compensation techniques, which allow a sound reproduction system to counteract undesired alteration to the sound scene due to excessive room resonances, have been widely studied. Extensive efforts have been reported to enlarge the region over which room equalization is effective and to contrast variations of room transfer functions in space. A speaker-tuning technology "Trueplay" allows users to compensate for undesired room effects over an extended listening area based on a spatially averaged power response of the room, which is conventionally measured using microphones on portable devices when users move around the room. In this work, we propose a novel system that leverages measured speaker echo path self-responses to predict the room average power responses using a local PCA based approach. Experimental results confirm the effectiveness of the proposed estimation method, which further leads to a room compensation filter design that achieves a good sound similarity compared to the reference system with the ground-truth room average power response while outperforming other systems that do not leverage the proposed estimator.Comment: 5 pages, 7 figures, to appear in IWAENC 202

    Vocal imitation of percussion sounds: on the perceptual similarity between imitations and imitated sounds

    No full text
    <p>Dataset of the drum sounds and vocal imitations used in the listening study. There are 30 drum sounds, indexed 0-29. The imitations are indexed by imitator (0-13), with imitations of each drum sound in the respective directories. Included is a csv file containing the participant responses from the listening test.</p> <p>NOTE: The BFD drum samples have been made available with the permission of FXpansion Audio UK. Permission is granted for their use in further academic research. Contact SKoT McDonald for further information."</p> <p> </p> <p> </p

    Activation and Regulation of NLRP3 Inflammasome by Intrathecal Application of SDF-1a in a Spinal Cord Injury Model

    No full text
    Stromal cell-derived factor-1 alpha (SDF-1a) or CXCL12 is an important cytokine with multiple functions in the brain during development and in adulthood. The inflammatory response initiated by spinal cord injury (SCI) involves the processing of interleukin-1beta (IL-1�) and IL-18 mediated by caspase-1 which is under the control of an intracellular multiprotein complex termed inflammasome. Using an SCI rat model, we found improved functional long-term recovery which is paralleled by a reduction of apoptosis after intrathecal treatment with SDF-1a. An intriguing aspect is that SDF-1a changed the number of neuroinflammatory cells in the damaged area. We further examined the cellular localization and sequential expression of several inflammasomes during SCI at 6 h, 24 h, 3 days, and 7 days as well as the role of SDF-1a as a regulatory factor for inflammasomes. Using 14-week old male Wistar rats, spinal cord contusion was applied at the thoracic segment 9, and animals were subsequently treated with SDF-1a via intrathecal application through an osmotic pump. SCI temporally increased the expression of the inflammasomes NLRP3, ASC, the inflammatory marker tumor necrosis factor-a (TNF-a), interleukin-1� (IL-1β) and IL-18. SDF-1a significantly reduced the levels of IL-18, IL-1b, TNF-a, NLRP3, ASC, and caspase-1. Immunofluorescence double-labeling demonstrated that microglia and neurons are major sources of the ASC and NLRP3 respectivley. Our data provide clear evidence that SCI stimulates a complex scenario of inflammasome activation at the injured site and that SDF-1a-mediated neuroprotection presumably depends on the attenuation of the inflammasome complex. © 2015, Springer Science+Business Media New York

    Cell Therapy in Spinal Cord Injury: a Mini- Reivew

    No full text
    Spinal cord injury (SCI) is a debilitating disease which leads to progressive functional damages. Because of limited axonal regeneration in the central nervous system, there is no or little recovery expected in the patients. Different cellular and molecular approaches were investigated in SCI animal models. Cellular transplantation of stem cells can potentially replace damaged tissue and provide a suitable microenvironment for axons to regenerate. Here, we reviewed the last approaches applied by our colleagues and others in order to improve axonal regeneration following SCI. We used different types of stem cells via different methods. First, fetal olfactory mucosa, schwann, and bone marrow stromal cells were transplanted into the injury sites in SCI models. In later studies, was applied simultaneous transplantation of stem cells with chondroitinase ABC in SCI models with the aid of nanoparticles. Using these approaches, considerable functional recovery was observed. However, considering some challenges in stem cell therapy such as rejection, infection, and development of a new cancer, our more recent strategy was application of cytokines. We observed a significant improvement in motor function of rats when stromal derived factor-1 was used to attract innate stem cells to the injury site. In conclusion, it seems that co-transplantation of different cells accompanies with other factors like enzymes and growth factors via new delivery systems may yield better results in SCI
    corecore