30 research outputs found
Creating a space for young women's voices: Using participatory 'video drama' in Uganda
This article draws upon research that explored the experiences of young women in relation to sexual health in Uganda with a view to enhancing gender-sensitive strategies. We have coined the phrase ‘participatory video drama’ to describe the exploratory methodology that the young women participants in our research used to present stories about their lives. The aim of this article is to suggest that ‘participatory video’ (PV) and ‘participatory video drama’ (PVD) are innovative methodological tools to utilise when working with participants who experience voicelessness in their everyday lives. We contribute to an emerging body of work around this methodology by suggesting that the process of PV provides a novel and engaging platform for participants to express their experiences. PVD further creates spaces for the performative exploration of embedded power relations and is therefore informative and has the potential to be transformatory and empowering
Recommended from our members
Arizona Child Acoustic Database: Task List
The Arizona Child Acoustic Database consists of longitudinal audio recordings from a group of children over a critical period of growth and development (ages 2-7 years). The goal of this database is to 1) document acoustic changes in speech production that may be related to physical growth 2) inform development of a model of speech production for child talkers. This work was funded by NSF BSC-1145011 awarded to Kate Bunton, Ph.D. and Brad Story, Ph.D, Principal Investigators.
This database contains longitudinal audio recordings of 55 American English speaking children between the ages of 2-7 at 3-month intervals. Since children began the study at different ages, some children have fewer recording sessions than others. The database can also be used to provide cross-sectional data for children of a specific age. Please refer to the subject data table for information on specific sessions available here http://arizona.openrepository.com/arizona/handle/10150/316065.
All children were recorded using the same protocol; therefore, task numbers are consistent across children and sessions. A calibration tone is included as Record 1 for all sessions. The speech protocol focused on production of English monopthong and diphthong vowels in isolation, sVd, hVd, and monosyllabic real words. In addition, the protocol includes several nonsense vowel-to-vowel transitions. Speakers were prompted either verbally by investigators or by graphical prompts. Details of the protocol with reference to task numbers can be found in the protocol spreadsheet available here http://arizona.openrepository.com/arizona/handle/10150/316065.
Details on data recording:
All samples were recorded digitally using an AKG SE 300B microphone with a mouth to mic distance of approximately 10 inches. Signals were recorded digitally using a Marantz PMD671, 16 bit PCM (uncompressed) at 44.1KHz. Recordings are made available in .wav format. Individual zip files contain all recordings from a single session.NSF BSC-114501
Recommended from our members
Arizona Child Acoustic Database: Participant Table
The Arizona Child Acoustic Database consists of longitudinal audio recordings from a group of children over a critical period of growth and development (ages 2-7 years). The goal of this database is to 1) document acoustic changes in speech production that may be related to physical growth 2) inform development of a model of speech production for child talkers. This work was funded by NSF BSC-1145011 awarded to Kate Bunton, Ph.D. and Brad Story, Ph.D, Principal Investigators.
This database contains longitudinal audio recordings of 55 American English speaking children between the ages of 2-7 at 3-month intervals. Since children began the study at different ages, some children have fewer recording sessions than others. The database can also be used to provide cross-sectional data for children of a specific age. Please refer to the subject data table for information on specific sessions available here http://arizona.openrepository.com/arizona/handle/10150/316065.
All children were recorded using the same protocol; therefore, task numbers are consistent across children and sessions. A calibration tone is included as Record 1 for all sessions. The speech protocol focused on production of English monopthong and diphthong vowels in isolation, sVd, hVd, and monosyllabic real words. In addition, the protocol includes several nonsense vowel-to-vowel transitions. Speakers were prompted either verbally by investigators or by graphical prompts. Details of the protocol with reference to task numbers can be found in the protocol spreadsheet available here http://arizona.openrepository.com/arizona/handle/10150/316065.
Details on data recording:
All samples were recorded digitally using an AKG SE 300B microphone with a mouth to mic distance of approximately 10 inches. Signals were recorded digitally using a Marantz PMD671, 16 bit PCM (uncompressed) at 44.1KHz. Recordings are made available in .wav format. Individual zip files contain all recordings from a single session.NSF BSC-114501
Recommended from our members
The relation of velopharyngeal coupling area to the identification of stop versus nasal consonants in North American English based on speech generated by acoustically driven vocal tract modulations
The purpose of this study was to determine the threshold of velopharyngeal coupling area at which listeners switch from identifying a consonant as a stop to a nasal in North American English, based on V1CV2 stimuli generated with a speech production model that encodes phonetic segments as relative acoustic targets. Each V1CV2 was synthesized with a set of velopharyngeal coupling functions whose area ranged from 0 to 0.1 cm2. Results show that consonants were identified by listeners as a stop when the coupling area was less than 0.035-0.057 cm2, depending on place of articulation and final vowel. The smallest coupling area (0.035 cm2) at which the stop-to-nasal switch occurred was found for an alveolar consonant in the /aCi/ context, whereas the largest (0.057 cm2) was for a bilabial in /aCa/. For each stimulus, the balance of oral versus nasal acoustic energy was characterized by the peak nasalance during the consonant. Stimuli with peak nasalance below 40% were mostly identified by listeners as stops, whereas those above 40% were identified as nasals. This study was intended to be a precursor to further investigations using the same model but scaled to represent the developing speech production system of male and female talkers.6 month embargo; published online: 16 November 2021This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]
An acoustically-driven vocal tract model for stop consonant production
The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these "resonance deflection patterns" are transformed into time-varying deformations of the vocal tract shape without any direct specification of location or extent of the consonant constriction along the vocal tract. The configuration of the constrictions and expansions that are generated by this process were shown to be physiologically-realistic and produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production. (C) 2016 Elsevier B.V. All rights reserved.NIH [R01-DC011275]; NSF [BCS-1145011]24 month embargo; Available online 9 December 2016This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]
A model of speech production based on the acoustic relativity of the vocal tract
A model is described in which the effects of articulatory movements to produce speech are generated by specifying relative acoustic events along a time axis. These events consist of directional changes of the vocal tract resonance frequencies that, when associated with a temporal event function, are transformed via acoustic sensitivity functions, into time-varying modulations of the vocal tract shape. Because the time course of the events may be considerably overlapped in time, coarticulatory effects are automatically generated. Production of sentence-level speech with the model is demonstrated with audio samples and vocal tract animations. (C) 2019 Acoustical Society of America.6 month embargo; published online: 17 October 2019This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]
Recommended from our members
The relation of velopharyngeal coupling area and vocal tract scaling to identification of stop-nasal cognates
The purpose of this study was to determine whether the threshold of velopharyngeal (VP) coupling area at which listeners switch from identifying a consonant as a stop to a nasal in North American English was different for speech produced by a model based on an adult male, an adult female, and a 4-year-old child. V1CV2 stimuli were generated with a speech production model that encodes phonetic segments as relative acoustic targets imposed on an underlying vocal tract and laryngeal structure that can be scaled according to sex and age. Each V1CV2 was synthesized with a set of VP coupling functions whose maximum area ranged from 0 to 0.1 cm2. Results showed that scaling the vocal tract and vocal folds had essentially no effect on the VP coupling area at which listener identification shifted from stop to nasal. The range of coupling areas at which the crossover occurred was 0.037-0.049 cm2 for the male model, 0.040-0.055 cm2 for the female model, and 0.039-0.052 cm2 for the 4-year-old child model, and overall mean was 0.044 cm2. Calculations of band limited peak nasalance indicated that 85% peak nasalance during the consonant was well aligned with listener responses.6 month embargo; first published 15 December 2023This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]