
    Tracking Articulator Movements Using Orientation Measurements

    This paper introduces a new method to track articulator movements, specifically jaw position and angle, using five-degree-of-freedom (5-DOF) orientation data. The approach uses a quaternion rotation method to track the jaw during speech with a single sensor on the mandibular incisor. Data were collected using the NDI Wave Speech Research System for one pilot subject performing various speech tasks. The degree of jaw rotation from the proposed approach is compared with a traditional geometric calculation. Results show that the quaternion-based method is able to describe the jaw angle trajectory and gives a more accurate and smoother estimate of jaw kinematics.
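As a rough illustration of the quaternion idea in the abstract above, the sketch below computes the unit quaternion that rotates a reference sensor direction onto a current sensor direction and reads off the rotation angle. It is a minimal sketch, not the paper's method: the function names, the use of a single direction vector per sample, and the choice of reference direction are all assumptions made for illustration.

```python
import math

def normalize(v):
    """Return v scaled to unit length."""
    n = math.sqrt(sum(c * c for c in v))
    if n == 0.0:
        raise ValueError("zero-length vector")
    return tuple(c / n for c in v)

def quat_between(u, v):
    """Unit quaternion (w, x, y, z) rotating direction u onto direction v.

    u and v are hypothetical sensor direction vectors (e.g. reference vs. current).
    """
    u, v = normalize(u), normalize(v)
    dot = sum(a * b for a, b in zip(u, v))
    cross = (u[1] * v[2] - u[2] * v[1],
             u[2] * v[0] - u[0] * v[2],
             u[0] * v[1] - u[1] * v[0])
    w = 1.0 + dot
    if w < 1e-12:
        # Antiparallel vectors: rotate 180 degrees about any axis perpendicular to u.
        if abs(u[0]) + abs(u[1]) > 1e-12:
            axis = normalize((-u[1], u[0], 0.0))
        else:
            axis = (1.0, 0.0, 0.0)
        return (0.0,) + axis
    q = (w,) + cross
    n = math.sqrt(sum(c * c for c in q))
    return tuple(c / n for c in q)

def rotation_angle_deg(q):
    """Rotation angle in degrees encoded by unit quaternion q."""
    w = max(-1.0, min(1.0, q[0]))
    return math.degrees(2.0 * math.acos(w))

# Example: a direction tilted 10 degrees from the reference in the x-y plane.
ref = (1.0, 0.0, 0.0)
cur = (math.cos(math.radians(10.0)), math.sin(math.radians(10.0)), 0.0)
angle = rotation_angle_deg(quat_between(ref, cur))
```

Applied sample by sample to an orientation time series, this kind of calculation yields an angle trajectory relative to the chosen reference.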

    Vowel Production in Mandarin Accented English and American English: Kinematic and Acoustic Data from the Marquette University Mandarin Accented English Corpus

    Few electromagnetic articulography (EMA) datasets are publicly available, and none have focused systematically on non-native accented speech. We introduce a kinematic-acoustic database of speech from 40 (gender and dialect balanced) participants producing upper-Midwestern American English (AE) L1 or Mandarin Accented English (MAE) L2 (Beijing or Shanghai dialect base). The Marquette University EMA-MAE corpus will be released publicly to help advance research in areas such as pronunciation modeling, acoustic-articulatory inversion, L1-L2 comparisons, pronunciation error detection, and accent modification training. EMA data were collected at a 400 Hz sampling rate with synchronous audio using the NDI Wave System. Articulatory sensors were placed on the midsagittal lips, lower incisors, and tongue blade and dorsum, as well as on the lip corner and lateral tongue body. Sensors provide five degree-of-freedom measurements including three-dimensional sensor position and two-dimensional orientation (pitch and roll). In the current work we analyze kinematic and acoustic variability between L1 and L2 vowels. We address the hypothesis that MAE is characterized by larger differences in the articulation of back vowels than front vowels and smaller vowel spaces compared to AE. The current results provide a seminal comparison of the kinematics and acoustics of vowel production between MAE and AE speakers

    The Electromagnetic Articulography Mandarin Accented English (EMA-MAE) Corpus of Acoustic and 3D Articulatory Kinematic Data

    There is a significant need for more comprehensive electromagnetic articulography (EMA) datasets that can provide matched acoustics and articulatory kinematic data with good spatial and temporal resolution. The Marquette University Electromagnetic Articulography Mandarin Accented English (EMA-MAE) corpus provides kinematic and acoustic data from 40 gender- and dialect-balanced speakers: 20 Midwestern standard American English L1 speakers and 20 Mandarin Accented English (MAE) L2 speakers, half with a Beijing region dialect and half with a Shanghai region dialect. Three-dimensional EMA data were collected at a 400 Hz sampling rate using the NDI Wave system, with articulatory sensors on the midsagittal lips, lower incisors, tongue blade and dorsum, plus lateral lip corner and tongue body. Sensors provide three-dimensional position data as well as two-dimensional orientation data representing the orientation of the sensor plane. Data have been corrected for head movement relative to a fixed reference sensor and also adjusted using a biteplate calibration system to place the data in an articulatory working space relative to each subject's individual midsagittal and maxillary occlusal planes. Speech materials include isolated words chosen to focus on specific contrasts between the English and Mandarin languages, as well as sentences and paragraphs for continuous speech, totaling approximately 45 minutes of data per subject. A beta version of the EMA-MAE corpus is now available, and the full corpus is in preparation for public release to help advance research in areas such as pronunciation modeling, acoustic-articulatory inversion, L1-L2 comparisons, pronunciation error detection, and accent modification training.

    Parallel Reference Speaker Weighting for Kinematic-Independent Acoustic-to-Articulatory Inversion

    Acoustic-to-articulatory inversion, the estimation of articulatory kinematics from an acoustic waveform, is a challenging but important problem. Accurate estimation of articulatory movements has the potential for significant impact on our understanding of speech production, on our capacity to assess and treat pathologies in a clinical setting, and on speech technologies such as computer aided pronunciation assessment and audio-video synthesis. However, because of the complex and speaker-specific relationship between articulation and acoustics, existing approaches for inversion do not generalize well across speakers. As acquiring speaker-specific kinematic data for training is not feasible in many practical applications, this remains an important and open problem. This paper proposes a novel approach to acoustic-to-articulatory inversion, Parallel Reference Speaker Weighting (PRSW), which requires no kinematic data for the target speaker and a small amount of acoustic adaptation data. PRSW hypothesizes that acoustic and kinematic similarities are correlated and uses speaker-adapted articulatory models derived from acoustically derived weights. The system was assessed using a 20-speaker data set of synchronous acoustic and Electromagnetic Articulography (EMA) kinematic data. Results demonstrate that by restricting the reference group to a subset consisting of speakers with strong individual speaker-dependent inversion performance, the PRSW method is able to attain kinematic-independent acoustic-to-articulatory inversion performance nearly matching that of the speaker-dependent model, with an average correlation of 0.62 versus 0.63. This indicates that given a sufficiently complete and appropriately selected reference speaker set for adaptation, it is possible to create effective articulatory models without kinematic training data
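The abstract above describes building a target speaker's articulatory model from reference speakers using acoustically derived weights. The sketch below shows only the generic weighted-combination step, assuming the weights have already been obtained from acoustic adaptation; how PRSW actually derives the weights and which model parameters it combines are not stated in the abstract, so the function names and the use of simple mean vectors here are hypothetical.

```python
def normalize_weights(raw):
    """Scale non-negative raw weights so they sum to 1."""
    total = sum(raw)
    if total <= 0:
        raise ValueError("weights must have a positive sum")
    return [r / total for r in raw]

def weighted_articulatory_mean(weights, reference_means):
    """Combine per-speaker articulatory mean vectors with the given weights.

    weights:          one weight per reference speaker (assumed acoustically derived)
    reference_means:  one articulatory mean vector per reference speaker
    """
    if len(weights) != len(reference_means):
        raise ValueError("need one weight per reference speaker")
    dim = len(reference_means[0])
    return [sum(w * m[d] for w, m in zip(weights, reference_means))
            for d in range(dim)]

# Example with two hypothetical reference speakers and 2-D articulatory means.
w = normalize_weights([1.0, 3.0])
target_mean = weighted_articulatory_mean(w, [[0.0, 0.0], [4.0, 8.0]])
```

The point of the design is that the weights come only from acoustics, so no kinematic data from the target speaker is needed for this step.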

    Palate-referenced Articulatory Features for Acoustic-to-Articulator Inversion

    The selection of effective articulatory features is an important component of tasks such as acoustic-to-articulator inversion and articulatory synthesis. Although it is common to use direct articulatory sensor measurements as feature variables, this approach fails to incorporate important physiological information such as palate height and shape and thus is not as representative of vocal tract cross section as desired. We introduce a set of articulator feature variables that are palate-referenced and normalized with respect to the articulatory working space in order to improve the quality of the vocal tract representation. These features include normalized horizontal positions plus the normalized palatal height of two midsagittal tongue sensors and one lateral tongue sensor, as well as normalized lip separation and lip protrusion. The quality of the feature representation is evaluated subjectively by comparing the variances and vowel separation in the working space and quantitatively through measurement of acoustic-to-articulator inversion error. Results indicate that the palate-referenced features have reduced variance, increased separation between vowel spaces, and substantially lower inversion error than direct sensor measures.

    What we can learn from magnetic Compton scattering : application to the determination of spin polarization

    Studies of spin-resolved electron momentum densities involve the measurement of the so-called magnetic Compton profile. This is a one-dimensional projection of the electron momentum distribution of only those electrons that contribute to the spin moment of a sample. The technique is applicable to ferri- and ferromagnetic materials. The profile is obtained via the inelastic 'Compton' scattering of high energy X-rays. Since electrons originating from different atomic orbitals have specific momentum densities, it is often possible to determine the origin of the magnetism present. Typically, interpretation requires the use of electronic structure calculations using molecular orbital and band structure approaches. Here, we highlight the application of the technique to the determination of the Fermi level spin polarization, the knowledge of which is important to the development of novel spintronic materials

    Discrimination of Individual Tigers (Panthera tigris) from Long Distance Roars

    This paper investigates the extent of tiger (Panthera tigris) vocal individuality through both qualitative and quantitative approaches using long distance roars from six individual tigers at Omaha's Henry Doorly Zoo in Omaha, NE. The framework for comparison across individuals includes statistical and discriminant function analysis across whole vocalization measures and statistical pattern classification using a hidden Markov model (HMM) with frame-based spectral features composed of Greenwood frequency cepstral coefficients. Individual discrimination accuracy is evaluated as a function of spectral model complexity, represented by the number of mixtures in the underlying Gaussian mixture model (GMM), and temporal model complexity, represented by the number of sequential states in the HMM. Results indicate that the temporal pattern of the vocalization is the most significant factor in accurate discrimination. Overall baseline discrimination accuracy for this data set is about 70% using high level features without complex spectral or temporal models. Accuracy increases to about 80% when more complex spectral models (multiple mixture GMMs) are incorporated, and increases to a final accuracy of 90% when more detailed temporal models (10-state HMMs) are used. Classification accuracy is stable across a relatively wide range of configurations in terms of spectral and temporal model resolution.
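The Greenwood frequency cepstral coefficients mentioned in the abstract above replace the usual mel scale with a warping based on the Greenwood function, f = A(10^(a x) - k). The sketch below shows that warping and its inverse on a normalized position x in [0, 1]. It is an illustration only: the constant k = 0.88 and the frequency limits in the example are assumed values, not figures taken from the paper, and the function names are hypothetical.

```python
import math

def greenwood_constants(f_min, f_max, k=0.88):
    """Constants A and a so that x = 0 maps to f_min and x = 1 maps to f_max.

    k = 0.88 is an assumed value chosen for illustration.
    """
    if not 0.0 < f_min < f_max:
        raise ValueError("need 0 < f_min < f_max")
    A = f_min / (1.0 - k)
    a = math.log10(f_max / A + k)
    return A, a

def position_to_hz(x, A, a, k=0.88):
    """Greenwood function: normalized position x to frequency in Hz."""
    return A * (10.0 ** (a * x) - k)

def hz_to_position(f, A, a, k=0.88):
    """Inverse Greenwood function: frequency in Hz to normalized position."""
    return math.log10(f / A + k) / a

# Example: filter centre frequencies equally spaced on the warped axis.
# The 10 Hz to 1000 Hz range is a hypothetical analysis band.
A, a = greenwood_constants(10.0, 1000.0)
centres = [position_to_hz(i / 4.0, A, a) for i in range(5)]
```

Spacing filterbank centres evenly in x, rather than in Hz, is what adapts the cepstral front end to a species-specific hearing range.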

    Formation of Two Glass Phases in Binary Cu-Ag Liquid

    The glass transition is alternatively described as either a dynamic transition in which there is a dramatic slowing down of the kinetics, or as a thermodynamic phase transition. To examine the physical origin of the glass transition in fragile Cu-Ag liquids, we employed molecular dynamics (MD) simulations on systems in the range of 32,000 to 2,048,000 atoms. Surprisingly, we identified a first-order freezing transition from liquid (L) to a metastable heterogeneous solid-like phase, denoted as the G-glass, when a supercooled liquid evolves isothermally below its melting temperature at deep undercooling. In contrast, a more homogeneous liquid-like glass, denoted as the L-glass, is achieved when the liquid is quenched continuously to room temperature with a fast cooling rate of ∼10¹¹ K/s. We report a thermodynamic description of the L-G transition and characterize the correlation length of the heterogeneous structure in the G-glass. The shear modulus of the G-glass is significantly higher than that of the L-glass, suggesting that the first-order L-G transition is linked fundamentally to long-range elasticity involving elementary configurational excitations in the G-glass.

    Kepler Exoplanet Candidate Host Stars are Preferentially Metal Rich

    We find that Kepler exoplanet candidate (EC) host stars are preferentially metal-rich, including the low-mass stellar hosts of small-radius ECs. The last observation confirms a tentative hint that there is a correlation between the metallicity of low-mass stars and the presence of low-mass and small-radius exoplanets. In particular, we compare the (J-H, g-r) color-color distribution of Kepler EC host stars with a control sample of dwarf stars selected from the ~150,000 stars observed during Q1 and Q2 of the Kepler mission but with no detected planets. We find that at J-H = 0.30, characteristic of solar-type stars, the average g-r color of stars that host giant ECs is 4-sigma redder than the average color of the stars in the control sample. At the same time, the average g-r color of solar-type stars that host small-radius ECs is indistinguishable from the average color of the stars in the control sample. In addition, we find that at J-H = 0.62, indicative of late K dwarfs, the average g-r color of stars that host small-radius ECs is 4-sigma redder than the average color of the stars in the control sample. These offsets are unlikely to be caused by differential reddening, age differences between the two populations, or the presence of giant stars in the control sample. Stellar models suggest that the first color offset is due to a 0.2 dex enhancement in [Fe/H] of the giant EC host population at M_star = 1 M_Sun, while Sloan photometry of M 67 and NGC 6791 suggests that the second color offset is due to a similar [Fe/H] enhancement of the small-radius EC host population at M_star = 0.7 M_Sun. These correlations are a natural consequence of the core-accretion model of planet formation. Comment: 15 pages, 8 figures, and 1 table in emulateapj format; accepted for publication in Ap