Search CORE

8,779 research outputs found

Acoustic modeling using the digital waveguide mesh

Author: Kelloniemi Antti
Mullen Jack
Murphy Damian
Shelley Simon
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2007
Field of study

The digital waveguide mesh has been an active area of music acoustics research for over ten years. Although founded in 1-D digital waveguide modeling, the principles on which it is based are not new to researchers grounded in numerical simulation, FDTD methods, electromagnetic simulation, etc. This article has attempted to provide a considerable review of how the DWM has been applied to acoustic modeling and sound synthesis problems, including new 2-D object synthesis and an overview of recent research activities in articulatory vocal tract modeling, RIR synthesis, and reverberation simulation. The extensive, although not by any means exhaustive, list of references indicates that though the DWM may have parallels in other disciplines, it still offers something new in the field of acoustic simulation and sound synth

CiteSeerX

Crossref

White Rose Research Online

Sex-specific fundamental and formant frequency patterns in a cross-sectional study

Author: Childers D. G.
Dalston R. M.
Decoster W.
Decoster W.
Deterding D.
Eguchi S.
Henton C.
Ladefoged P.
Sandra P. Whiteside
Smith B. L.
Traunmüller H.
Traunmüller H.
White P.
Whiteside S. P.
Wu K.
Xue A.
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/07/2001
Field of study

An extensive developmental acoustic study of the speech patterns of children and adults was reported by Lee and colleagues [Lee et al., J. Acoust. Soc. Am. 105, 1455-1468 (1999)]. This paper presents a reexamination of selected fundamental frequency and formant frequency data presented in their report for 10 monophthongs by investigating sex-specific and developmental patterns using two different approaches. The first of these includes the investigation of age- and sex-specific formant frequency patterns in the monophthongs. The second, the investigation of fundamental frequency and formant frequency data using the critical band rate (bark) scale and a number of acoustic-phonetic dimensions of the monophthongs from an age- and sex-specific perspective. These acoustic-phonetic dimensions include: vowel spaces and distances from speaker centroids; frequency differences between the formant frequencies of males and females; vowel openness/closeness and frontness/backness; the degree of vocal effort; and formant frequency ranges. Both approaches reveal both age- and sex-specific development patterns which also appear to be dependent on whether vowels are peripheral or non-peripheral. The developmental emergence of these sex-specific differences are discussed with reference to anatomical, physiological, sociophonetic and culturally determined factors. Some directions for further investigation into the age-linked sex differences in speech across the lifespan are also proposed

Crossref

White Rose Research Online

Modeling the production of VCV sequences via the inversion of a biomechanical model of the tongue

Author: Ma Liang
Payan Yohan
Perrier Pascal
Publication venue
Publication date: 01/01/2005
Field of study

A control model of the production of VCV sequences is presented, which consists in three main parts: a static forward model of the relations between motor commands and acoustic properties; the specification of targets in the perceptual space; a planning procedure based on optimization principles. Examples of simulations generated with this model illustrate how it can be used to assess theories and models of coarticulation in speech

arXiv.org e-Print Archive

CiteSeerX

Hal - Université Grenoble Alpes

CERN Document Server

The Effects of Humming and Pitch on Craniofacial and Craniocervical Morphology Measured Using MRI

Author: Aspden Richard Malcolm
Gilbert Fiona Jane
Gregory Jenny
Miller Nicola Anne
Semple Scott
Stollery Pete
Publication venue: 'Elsevier BV'
Publication date: 25/03/2011
Field of study

Peer reviewedPreprin

Aberdeen University Research

Crossref

Singing synthesis with an evolved physical model

Author: Cooper Crispin
Howard D.
Murphy D.T.
Tyrrell A.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

A two-dimensional physical model of the human vocal tract is described. Such a system promises increased realism and control in the synthesis. of both speech and singing. However, the parameters describing the shape of the vocal tract while in use are not easily obtained, even using medical imaging techniques, so instead a genetic algorithm (GA) is applied to the model to find an appropriate configuration. Realistic sounds are produced by this method. Analysis of these, and the reliability of the technique (convergence properties) is provided

Online Research @ Cardiff

White Rose Research Online

Real-Time Vocal Tract Modelling

Author: Benallal A.
Benkrid A.
Benkrid K.
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2007
Field of study

To date, most speech synthesis techniques have relied upon the representation of the vocal tract by some form of filter, a typical example being linear predictive coding (LPC). This paper describes the development of a physiologically realistic model of the vocal tract using the well-established technique of transmission line modelling (TLM). This technique is based on the principle of wave scattering at transmission line segment boundaries and may be used in one, two, or three dimensions. This work uses this technique to model the vocal tract using a one-dimensional transmission line. A six-port scattering node is applied in the region separating the pharyngeal, oral, and the nasal parts of the vocal tract

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer

A stabilized finite element method for the mixed wave equation in an ALE framework with application to diphthong production

Author: Arnela Marc
Codina Ramon
Espinoza Román Héctor Gabriel
Guasch Fortuny Oriol
Publication venue: 'S. Hirzel Verlag'
Publication date: 01/01/2016
Field of study

The archived file is not the final published version of the article. © (2016) S. Hirzel Verlag/European Acoustics Association The definitive publisher-authenticated version is available online at http://www.ingentaconnect.com/contentone/dav/aaua/2016/00000102/00000001/art00012 Readers must contact the publisher for reprint or permission to use the material in any form.Working with the wave equation in mixed rather than irreducible form allows one to directly account for both, the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems, such as those involving waves propagating in moving domains, because the equations can easily be set in an arbitrary Lagrangian-Eulerian (ALE) frame of reference. Yet, when attempting a standard Galerkin finite element solution (FEM) for them, it turns out that an inf-sup compatibility constraint has to be satisfied, which prevents from using equal interpolations for the approximated acoustic pressure and velocity fields. In this work it is proposed to resort to a subgrid scale stabilization strategy to circumvent this condition and thus facilitate code implementation. As a possible application, we address the generation of diphthongs in voice production.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

UPCommons. Portal del coneixement obert de la UPC

Scipedia

Using Active Shape Modeling Based on MRI to Study Morphologic and Pitch-Related Functional Changes Affecting Vocal Structures and the Airway

Author: Aspden Richard Malcolm
Gilbert Fiona Jane
Gregory Jenny
Miller Nicola
Stollery Pete
Publication venue: 'Elsevier BV'
Publication date: 15/03/2014
Field of study

Aberdeen University Research

Computational and Robotic Models of Early Language Development: A Review

Author: Kachergis George
Oudeyer Pierre-Yves
Schueller William
Publication venue
Publication date: 25/03/2019
Field of study

We review computational and robotics models of early language learning and development. We first explain why and how these models are used to understand better how children learn language. We argue that they provide concrete theories of language learning as a complex dynamic system, complementing traditional methods in psychology and linguistics. We review different modeling formalisms, grounded in techniques from machine learning and artificial intelligence such as Bayesian and neural network approaches. We then discuss their role in understanding several key mechanisms of language development: cross-situational statistical learning, embodiment, situated social interaction, intrinsically motivated learning, and cultural evolution. We conclude by discussing future challenges for research, including modeling of large-scale empirical data about language acquisition in real-world environments. Keywords: Early language learning, Computational and robotic models, machine learning, development, embodiment, social interaction, intrinsic motivation, self-organization, dynamical systems, complexity.Comment: to appear in International Handbook on Language Development, ed. J. Horst and J. von Koss Torkildsen, Routledg

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server