479 research outputs found

    I hear you eat and speak: automatic recognition of eating condition and food type, use-cases, and impact on ASR performance

    We propose a new recognition task in the area of computational paralinguistics: automatic recognition of eating conditions in speech, i.e., whether people are eating while speaking, and what they are eating. To this end, we introduce the audio-visual iHEARu-EAT database featuring 1.6k utterances of 30 subjects (mean age: 26.1 years, standard deviation: 2.66 years, gender balanced, German speakers), six types of food (Apple, Nectarine, Banana, Haribo Smurfs, Biscuit, and Crisps), and read as well as spontaneous speech, which is made publicly available for research purposes. We start by demonstrating that for automatic speech recognition (ASR), it pays off to know whether speakers are eating or not. We also propose automatic classification both by brute-forcing of low-level acoustic features and by higher-level features related to intelligibility, obtained from an automatic speech recogniser. Prediction of the eating condition was performed with a Support Vector Machine (SVM) classifier employed in a leave-one-speaker-out evaluation framework. Results show that the binary prediction of eating condition (i.e., eating or not eating) can be easily solved independently of the speaking condition; the obtained average recalls are all above 90%. Low-level acoustic features provide the best performance on spontaneous speech, which reaches up to 62.3% average recall for multi-way classification of the eating condition, i.e., discriminating the six types of food, as well as not eating. The early fusion of features related to intelligibility with the brute-forced acoustic feature set improves the performance on read speech, reaching a 66.4% average recall for the multi-way classification task. Analysing features and classifier errors leads to a suitable ordinal scale for eating conditions, on which automatic regression can be performed with up to 56.2% determination coefficient.

    Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets

    In this paper, we describe a new database with audio recordings of non-native (L2) speakers of English, and the perceptual evaluation experiment conducted with native English speakers for assessing the prosody of each recording. These annotations are then used to compute the gold standard using different methods, and a series of regression experiments is conducted to evaluate their impact on the performance of a regression model predicting the degree of naturalness of L2 speech. Further, we compare the relevance of different feature groups modelling prosody in general (without speech tempo), speech rate and pauses modelling speech tempo (fluency), voice quality, and a variety of spectral features. We also discuss the impact of various fusion strategies on performance. Overall, our results demonstrate that the prosody of non-native speakers of English as L2 can be reliably assessed using suprasegmental audio features; prosodic features seem to be the most important ones.

    Rapid evolution of the primate larynx?

    Tissue vibrations in the larynx produce most sounds that comprise vocal communication in mammals. Larynx morphology is thus predicted to be a key target for selection, particularly in species with highly developed vocal communication systems. Here, we present a novel database of digitally modeled scanned larynges from 55 different mammalian species, representing a wide range of body sizes in the primate and carnivoran orders. Using phylogenetic comparative methods, we demonstrate that the primate larynx has evolved more rapidly than the carnivoran larynx, resulting in a pattern of larger size and increased deviation from expected allometry with body size. These results imply fundamental differences between primates and carnivorans in the balance of selective forces that constrain larynx size and highlight an evolutionary flexibility in primates that may help explain why we have developed complex and diverse uses of the vocal organ for communication.

    Imaging single cells in a beam of live cyanobacteria with an X-ray laser

    Citation: van der Schot, G., Svenda, M., Maia, F., Hantke, M., DePonte, D. P., Seibert, M. M., . . . Ekeberg, T. (2015). Imaging single cells in a beam of live cyanobacteria with an X-ray laser. Nature Communications, 6, 9. doi:10.1038/ncomms6704
    There exists a conspicuous gap of knowledge about the organization of life at mesoscopic levels. Ultra-fast coherent diffractive imaging with X-ray free-electron lasers can probe structures at the relevant length scales and may reach sub-nanometer resolution on micron-sized living cells. Here we show that we can introduce a beam of aerosolised cyanobacteria into the focus of the Linac Coherent Light Source and record diffraction patterns from individual living cells at very low noise levels and at high hit ratios. We obtain two-dimensional projection images directly from the diffraction patterns, and present the results as synthetic X-ray Nomarski images calculated from the complex-valued reconstructions. We further demonstrate that it is possible to record diffraction data to nanometer resolution on live cells with X-ray lasers. Extension to sub-nanometer resolution is within reach, although improvements in pulse parameters and X-ray area detectors will be necessary to unlock this potential.
    Additional Authors: Almeida, N. F.; Odic, D.; Hasse, D.; Carlsson, G. H.; Larsson, D. S. D.; Barty, A.; Martin, A. V.; Schorb, S.; Bostedt, C.; Bozek, J. D.; Rolles, D.; Rudenko, A.; Epp, S.; Foucar, L.; Rudek, B.; Hartmann, R.; Kimmel, N.; Holl, P.; Englert, L.; Loh, N. T. D.; Chapman, H. N.; Andersson, I.; Hajdu, J.; Ekeberg, T.

    Three-Dimensional Reconstruction of the Giant Mimivirus Particle with an X-Ray Free-Electron Laser

    Citation: Ekeberg, T., Svenda, M., Abergel, C., Maia, F., Seltzer, V., Claverie, J. M., . . . Hajdu, J. (2015). Three-Dimensional Reconstruction of the Giant Mimivirus Particle with an X-Ray Free-Electron Laser. Physical Review Letters, 114(9), 6. doi:10.1103/PhysRevLett.114.098102
    We present a proof-of-concept three-dimensional reconstruction of the giant mimivirus particle from experimentally measured diffraction patterns from an x-ray free-electron laser. Three-dimensional imaging requires the assembly of many two-dimensional patterns into an internally consistent Fourier volume. Since each particle is randomly oriented when exposed to the x-ray pulse, relative orientations have to be retrieved from the diffraction data alone. We achieve this with a modified version of the expand, maximize and compress algorithm and validate our result using new methods.
    Additional Authors: Andersson, I.; Loh, N. D.; Martin, A. V.; Chapman, H.; Bostedt, C.; Bozek, J. D.; Ferguson, K. R.; Krzywinski, J.; Epp, S. W.; Rolles, D.; Rudenko, A.; Hartmann, R.; Kimmel, N.; Hajdu, J.

    Gangs and guilt: Towards a new theory of horror film

    The most basic and unanimous statement made in scholarship on horror films is that horror films are ‘about’ fear: the primary purpose of horror films is to scare viewers. Based on horror films from the 1970s until the present in which child gangs play a significant part, this essay advances a new theory of horror film, namely that horror films primarily seek to elicit not fear but guilt. The analysis focuses on four topics: themes, camera angles, horror’s cinematic casting of ‘abnormality,’ and the rift, unique to the horror genre, between audience ‘alignment’ and ‘allegiance.’

    Regulatory coiled-coil domains promote head-to-head assemblies of AAA+ chaperones essential for tunable activity control

    Ring-forming AAA+ chaperones exert ATP-fueled substrate unfolding by threading through a central pore. This activity is potentially harmful, requiring mechanisms for tight repression and substrate-specific activation. The AAA+ chaperone ClpC with the peptidase ClpP forms a bacterial protease essential to virulence and stress resistance. The adaptor MecA activates ClpC by targeting substrates and stimulating ClpC ATPase activity. We show how ClpC is repressed in its ground state by determining ClpC cryo-EM structures with and without MecA. ClpC forms large two-helical assemblies that associate via head-to-head contacts between coiled-coil middle domains (MDs). MecA converts this resting state to an active planar ring structure by binding to MD interaction sites. Loss of ClpC repression in MD mutants causes constitutive activation and severe cellular toxicity. These findings unravel an unexpected regulatory concept executed by coiled-coil MDs to tightly control AAA+ chaperone activity.

    Lambda and Antilambda polarization from deep inelastic muon scattering

    We report results of the first measurements of Lambda and Antilambda polarization produced in deep inelastic polarized muon scattering on the nucleon. The results are consistent with an expected trend towards positive polarization with increasing x_F. The polarizations of Lambda and Antilambda appear to have opposite signs. A large negative polarization for Lambda at low positive x_F is observed and is not explained by existing models. A possible interpretation is presented. Comment: 9 pages, 2 figures

    Shadowing in Inelastic Scattering of Muons on Carbon, Calcium and Lead at Low XBj

    Nuclear shadowing is observed in the per-nucleon cross-sections of positive muons on carbon, calcium and lead as compared to deuterium. The data were taken by Fermilab experiment E665 using inelastically scattered muons of mean incident momentum 470 GeV/c. Cross-section ratios are presented in the kinematic region 0.0001 < XBj < 0.56 and 0.1 < Q**2 < 80 (GeV/c)**2. The data are consistent with no significant nu or Q**2 dependence at fixed XBj. As XBj decreases, the size of the shadowing effect, as well as its A dependence, are found to approach the corresponding measurements in photoproduction. Comment: 22 pages, incl. 6 figures, to be published in Z. Phys.