Search CORE

159 research outputs found

Structural Representation and Matching of Articulatory Speech Structures based on the Evolving Transformation System (ETS) Formalism [2]

Author: Gay David R
Gutkin Alexander
Publication venue
Publication date: 01/01/2005
Field of study

A formal structural representation of speech consistent with the principles of combinatorial structure theory is presented in this paper. The representation is developed within the Evolving Transformation System (ETS) formalism and encapsulates speech processes at the articulatory level. We show how the class structure of several consonantal phonemes of English can be expressed with the help of articulatory gestures-the atomic combinatorial units of speech. As a preliminary step towards the design of a speech recognition architecture based on the structural approaches to physiology and articulatory phonology, we present an algorithm for the structural detection of phonemic class elements inside gestural ETS structures derived from continuous speech. Experiments designed to verify the adequacy of the hypothesised gestural class structure conducted on the MOCHA articulatory corpus are then described. Our experimental results support the hypothesis that the articulatory representation captures sufficient information for the accurate structural identification of the phonemic classes in question

CiteSeerX

Edinburgh Research Archive

Towards Formal Structural Representation of Spoken Language: An Evolving Transformation System (ETS) Approach

Author: Alexander Gutkin,
Publication venue: The University of Edinburgh
Publication date: 01/01/2005
Field of study

Speech recognition has been a very active area of research over the past twenty years. Despite an evident progress, it is generally agreed by the practitioners of the field that performance of the current speech recognition systems is rather suboptimal and new approaches are needed. The motivation behind the undertaken research is an observation that the notion of representation of objects and concepts that once was considered to be central in the early days of pattern recognition, has been largely marginalised by the advent of statistical approaches. As a consequence of a predominantly statistical approach to speech recognition problem, due to the numeric, feature vector-based, nature of representation, the classes inductively discovered from real data using decision-theoretic techniques have little meaning outside the statistical framework. This is because decision surfaces or probability distributions are difficult to analyse linguistically. Because of the later limitation it is doubtful that the gap between speech recognition and linguistic research can be bridged by the numeric representations. This thesis investigates an alternative, structural, approach to spoken language representation and categorisation. The approach pursued in this thesis is based on a consistent program, known as the Evolving Transformation System (ETS), motivated by the development and clarification of the concept of structural representation in pattern recognition and artificial intelligence from both theoretical and applied points of view. This thesis consists of two parts. In the first part of this thesis, a similarity-based approach to structural representation of speech is presented. First, a linguistically well-motivated structural representation of phones based on distinctive phonological features recovered from speech is proposed. The representation consists of string templates representing phones together with a similarity measure. The set of phonological templates together with a similarity measure defines a symbolic metric space. Representation and ETS-inspired categorisation in the symbolic metric spaces corresponding to the phonological structural representation are then investigated by constructing appropriate symbolic space classifiers and evaluating them on a standard corpus of read speech. In addition, similarity-based isometric transition from phonological symbolic metric spaces to the corresponding non-Euclidean vector spaces is investigated. Second part of this thesis deals with the formal approach to structural representation of spoken language. Unlike the approach adopted in the first part of this thesis, the representation developed in the second part is based on the mathematical language of the ETS formalism. This formalism has been specifically developed for structural modelling of dynamic processes. In particular, it allows the representation of both objects and classes in a uniform event-based hierarchical framework. In this thesis, the latter property of the formalism allows the adoption of a more physiologically-concreteapproach to structural representation. The proposed representation is based on gestural structures and encapsulates speech processes at the articulatory level. Algorithms for deriving the articulatory structures from the data are presented and evaluated

CiteSeerX

Edinburgh Research Archive

Coded excitation and sub-band processing for blood velocity estmation in medical ultrasound

Author: Gran Fredrik
Jensen Jørgen Arendt
Udesen Jesper
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/2007
Field of study

Online Research Database In Technology

Underwater noise due to precipitation

Author: Crum Lawrence A.
Jensen Leif Bjørnø
Prosperetti Andrea
Pumphrey Hugh C.
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1989
Field of study

Crossref

Online Research Database In Technology

Modeling and experiments with low-frequency pressure wave propagation in liquid-filled, flexible tubes

Author: Bjarnø Leif
Bjelland C
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1992
Field of study

Online Research Database In Technology

Predicting room acoustical behavior with the ODEON computer model

Author: Naylor Graham
Rindel Jens Holger
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1992
Field of study

Crossref

Online Research Database In Technology

Effects of measurement procedure and equipment on average room acoustic measurements

Author: Bradley J S
Gade Anders Christian
Siebein G W
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1993
Field of study

Crossref

Online Research Database In Technology

Temporal integration of loudness as a function of level

Author: Buus Søren
Florentine Mary
Poulsen Torben
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1996
Field of study

Crossref

Online Research Database In Technology

Influence of statistical surface models on dynamic scattering of high-frequency signals from the ocean surface (A)

Author: Bjerrum-Niese Christian
Jensen Leif Bjørnø
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1994
Field of study

Crossref

Online Research Database In Technology

Neural network modeling of a dolphin's sonar discrimination capabilities

Author: Andersen Lars Nonboe
Au WWL
Nachtigall PE
René Rasmussen A
Roitblat H.
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1994
Field of study

Crossref

Online Research Database In Technology