8,779 research outputs found
Acoustic modeling using the digital waveguide mesh
The digital waveguide mesh has been an active area of music acoustics research for over ten years. Although founded in 1-D digital waveguide modeling, the principles on which it is based are not new to researchers grounded in numerical simulation, FDTD methods, electromagnetic simulation, etc. This article has attempted to provide a considerable review of how the DWM has been applied to acoustic modeling and sound synthesis problems, including new 2-D object synthesis and an overview of recent research activities in articulatory vocal tract modeling, RIR synthesis, and reverberation simulation. The extensive, although not by any means exhaustive, list of references indicates that though the DWM may have parallels in other disciplines, it still offers something new in the field of acoustic simulation and sound synth
Sex-specific fundamental and formant frequency patterns in a cross-sectional study
An extensive developmental acoustic study of the speech patterns of children and adults was reported by Lee and colleagues [Lee et al., J. Acoust. Soc. Am. 105, 1455-1468 (1999)]. This paper presents a reexamination of selected fundamental frequency and formant frequency data presented in their report for 10 monophthongs by investigating sex-specific and developmental patterns using two different approaches. The first of these includes the investigation of age- and sex-specific formant frequency patterns in the monophthongs. The second, the investigation of fundamental frequency and formant frequency data using the critical band rate (bark) scale and a number of acoustic-phonetic dimensions of the monophthongs from an age- and sex-specific perspective. These acoustic-phonetic dimensions include: vowel spaces and distances from speaker centroids; frequency differences between the formant frequencies of males and females; vowel openness/closeness and frontness/backness; the degree of vocal effort; and formant frequency ranges. Both approaches reveal both age- and sex-specific development patterns which also appear to be dependent on whether vowels are peripheral or non-peripheral. The developmental emergence of these sex-specific differences are discussed with reference to anatomical, physiological, sociophonetic and culturally determined factors. Some directions for further investigation into the age-linked sex differences in speech across the lifespan are also proposed
Modeling the production of VCV sequences via the inversion of a biomechanical model of the tongue
A control model of the production of VCV sequences is presented, which
consists in three main parts: a static forward model of the relations between
motor commands and acoustic properties; the specification of targets in the
perceptual space; a planning procedure based on optimization principles.
Examples of simulations generated with this model illustrate how it can be used
to assess theories and models of coarticulation in speech
The Effects of Humming and Pitch on Craniofacial and Craniocervical Morphology Measured Using MRI
Peer reviewedPreprin
Singing synthesis with an evolved physical model
A two-dimensional physical model of the human vocal tract is described. Such a system promises increased realism and control in the synthesis. of both speech and singing. However, the parameters describing the shape of the vocal tract while in use are not easily obtained, even using medical imaging techniques, so instead a genetic algorithm (GA) is applied to the model to find an appropriate configuration. Realistic sounds are produced by this method. Analysis of these, and the reliability of the technique (convergence properties) is provided
Real-Time Vocal Tract Modelling
To date, most speech synthesis techniques have relied upon the representation of the vocal tract by some form of filter, a typical example being linear predictive coding (LPC). This paper describes the development of a physiologically realistic model of the vocal tract using the well-established technique of transmission line modelling (TLM). This technique is based on the principle of wave scattering at transmission line segment boundaries and may be used in one, two, or three dimensions. This work uses this technique to model the vocal tract using a one-dimensional transmission line. A six-port scattering node is applied in the region separating the pharyngeal, oral, and the nasal parts of the vocal tract
A stabilized finite element method for the mixed wave equation in an ALE framework with application to diphthong production
The archived file is not the final published version of the article.
© (2016) S. Hirzel Verlag/European Acoustics Association
The definitive publisher-authenticated version is available online at http://www.ingentaconnect.com/contentone/dav/aaua/2016/00000102/00000001/art00012
Readers must contact the publisher for reprint or permission to use the material in any form.Working with the wave equation in mixed rather than irreducible form allows one to directly account for both, the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems, such as those involving waves propagating in moving domains, because the equations can easily be set in an arbitrary Lagrangian-Eulerian (ALE) frame of reference. Yet, when attempting a standard Galerkin finite element solution (FEM) for them, it turns out that an inf-sup compatibility constraint has to be satisfied, which prevents from using equal interpolations for the approximated acoustic pressure and velocity fields. In this work it is proposed to resort to a subgrid scale stabilization strategy to circumvent this condition and thus facilitate code implementation. As a possible application, we address the generation of diphthongs in voice production.Peer ReviewedPostprint (author's final draft
Using Active Shape Modeling Based on MRI to Study Morphologic and Pitch-Related Functional Changes Affecting Vocal Structures and the Airway
Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.Peer reviewedPostprin
Computational and Robotic Models of Early Language Development: A Review
We review computational and robotics models of early language learning and
development. We first explain why and how these models are used to understand
better how children learn language. We argue that they provide concrete
theories of language learning as a complex dynamic system, complementing
traditional methods in psychology and linguistics. We review different modeling
formalisms, grounded in techniques from machine learning and artificial
intelligence such as Bayesian and neural network approaches. We then discuss
their role in understanding several key mechanisms of language development:
cross-situational statistical learning, embodiment, situated social
interaction, intrinsically motivated learning, and cultural evolution. We
conclude by discussing future challenges for research, including modeling of
large-scale empirical data about language acquisition in real-world
environments.
Keywords: Early language learning, Computational and robotic models, machine
learning, development, embodiment, social interaction, intrinsic motivation,
self-organization, dynamical systems, complexity.Comment: to appear in International Handbook on Language Development, ed. J.
Horst and J. von Koss Torkildsen, Routledg
- …