627 research outputs found
I hear you eat and speak: automatic recognition of eating condition and food type, use-cases, and impact on ASR performance
We propose a new recognition task in the area of computational paralinguistics: automatic recognition of eating conditions in speech, i. e., whether people are eating while speaking, and what they are eating. To this end, we introduce the audio-visual iHEARu-EAT database featuring 1.6 k utterances of 30 subjects (mean age: 26.1 years, standard deviation: 2.66 years, gender balanced, German speakers), six types of food (Apple, Nectarine, Banana, Haribo Smurfs, Biscuit, and Crisps), and read as well as spontaneous speech, which is made publicly available for research purposes. We start with demonstrating that for automatic speech recognition (ASR), it pays off to know whether speakers are eating or not. We also propose automatic classification both by brute-forcing of low-level acoustic features as well as higher-level features related to intelligibility, obtained from an Automatic Speech Recogniser. Prediction of the eating condition was performed with a Support Vector Machine (SVM) classifier employed in a leave-one-speaker-out evaluation framework. Results show that the binary prediction of eating condition (i. e., eating or not eating) can be easily solved independently of the speaking condition; the obtained average recalls are all above 90%. Low-level acoustic features provide the best performance on spontaneous speech, which reaches up to 62.3% average recall for multi-way classification of the eating condition, i. e., discriminating the six types of food, as well as not eating. The early fusion of features related to intelligibility with the brute-forced acoustic feature set improves the performance on read speech, reaching a 66.4% average recall for the multi-way classification task. Analysing features and classifier errors leads to a suitable ordinal scale for eating conditions, on which automatic regression can be performed with up to 56.2% determination coefficient
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets
In this paper, we describe a new database with audio recordings of non-native (L2) speakers of English, and the perceptual evaluation experiment conducted with native English speakers for assessing the prosody of each recording. These annotations are then used to compute the gold standard using different methods, and a series of regression experiments is conducted to evaluate their impact on the performance of a regression model predicting the degree of Abstract naturalness of L2 speech. Further, we compare the relevance of different feature groups modelling prosody in general (without speech tempo), speech rate and pauses modelling speech tempo (fluency), voice quality, and a variety of spectral features. We also discuss the impact of various fusion strategies on performance.Overall, our results demonstrate that the prosody of non-native speakers of English as L2 can be reliably assessed using supra- segmental audio features; prosodic features seem to be the most important ones
Renal accumulation of prooxidant mineral elements and CKD in domestic cats
Felids have a high incidence of chronic kidney disease (CKD), for which the most common renal lesion is chronic interstitial nephritis (CIN). CIN can be induced by tissue oxidative stress, which is determined by the cellular balance of pro- and anti-oxidant metabolites. Fish-flavoured foods are more often fed to cats than dogs, and such foods tend to have higher arsenic content. Arsenic is a pro-oxidant metallic element. We propose that renal accumulation of pro-oxidant elements such as arsenic and depletion of anti-oxidant elements such as zinc, underpin the high incidence of CIN in domestic cats. Total arsenic and other redox-reactive metal elements were measured in kidneys (after acid-digestion) and urine (both by inductively-coupled plasma-mass spectrometry) of domestic cats (kidneys, n = 56; urine, n = 21), domestic dogs (kidneys, n = 54; urine, n = 28) and non-domesticated Scottish Wildcats (kidneys, n = 17). Renal lesions were graded by severity of CIN. In our randomly sampled population, CIN was more prevalent in domestic cat versus domestic dog (51%, n = 32 of 62 cats; 15%, 11 of 70 dogs were positive for CIN, respectively). CIN was absent from all Scottish wildcats. Tissue and urinary (corrected for creatinine) arsenic content was higher in domestic cats, relative to domestic dogs and wildcats. Urine arsenic was higher in domestic cats and dogs with CIN. Arsenobetaine, an organic and relatively harmless species of arsenic, was the primary form of arsenic found in pet foods. In summary, the kidneys of domestic cats appear to have greater levels of pro-oxidant trace elements, as compared to dogs and wildcats. Since there was no difference in renal arsenic levels in cats with or without CIN, renal arsenic accumulation does not appear a primary driver of excess CIN in cats. Given clear differences in renal handling of pro vs. anti-oxidant minerals between cats and dogs, further in vivo balance studies are warranted. These may then inform species-specific guidelines for trace element incorporation into commercial diets
Gangs and guilt: Towards a new theory of horror film
The most basic and unanimous statement made in scholarship on horror films is that horror films are ‘about’ fear: the primary purpose of horror films is to scare viewers. Based on horror films from the 1970s until the present in which child gangs play a significant part, this essay advances a new theory of horror film, namely that horror films primarily seek to elicit not fear but guilt. The analysis focuses on four topics: themes, camera angles, horror’s cinematic casting of ‘abnormality,’ and the rift, unique to the horror genre, between audience ‘alignment’ and ‘allegiance.
A statistical approach to detect protein complexes at X-ray free electron laser facilities
The Flash X-ray Imaging (FXI) technique, under development at X-ray free electron lasers (XFEL), aims to achieve structure determination based on diffraction from individual macromolecular complexes. We report an FXI study on the first protein complex-RNA polymerase II-ever injected at an XFEL. A successful 3D reconstruction requires a high number of observations of the sample in various orientations. The measured diffraction signal for many shots can be comparable to background. Here we present a robust and highly sensitive hit-identification method based on automated modeling of beamline background through photon statistics. It can operate at controlled false positive hit-rate of 3 x10(-5). We demonstrate its power in determining particle hits and validate our findings against an independent hit-identification approach based on ion time-of-flight spectra. We also validate the advantages of our method over simpler hit-identification schemes via tests on other samples and using computer simulations, showing a doubled hit-identification power
Dynamics of Uptake and Metabolism of Small Molecules in Cellular Response Systems
BACKGROUND: Proper cellular function requires uptake of small molecules from the environment. In response to changes in extracellular conditions cells alter the import and utilization of small molecules. For a wide variety of small molecules the cellular response is regulated by a network motif that combines two feedback loops, one which regulates the transport and the other which regulates the subsequent metabolism. RESULTS: We analyze the dynamic behavior of two widespread but logically distinct two-loop motifs. These motifs differ in the logic of the feedback loop regulating the uptake of the small molecule. Our aim is to examine the qualitative features of the dynamics of these two classes of feedback motifs. We find that the negative feedback to transport is accompanied by overshoot in the intracellular amount of small molecules, whereas a positive feedback to transport removes overshoot by boosting the final steady state level. On the other hand, the negative feedback allows for a rapid initial response, whereas the positive feedback is slower. We also illustrate how the dynamical deficiencies of one feedback motif can be mitigated by an additional loop, while maintaining the original steady-state properties. CONCLUSIONS: Our analysis emphasizes the core of the regulation found in many motifs at the interface between the metabolic network and the environment of the cell. By simplifying the regulation into uptake and the first metabolic step, we provide a basis for elaborate studies of more realistic network structures. Particularly, this theoretical analysis predicts that FeS cluster formation plays an important role in the dynamics of iron homeostasis
Lambda and Antilambda polarization from deep inelastic muon scattering
We report results of the first measurements of Lambda and Antilambda
polarization produced in deep inelastic polarized muon scattering on the
nucleon. The results are consistent with an expected trend towards positive
polarization with increasing x_F. The polarizations of Lambda and Antilambda
appear to have opposite signs. A large negative polarization for Lambda at low
positive x_F is observed and is not explained by existing models.A possible
interpretation is presented.Comment: 9 pages, 2 figure
Shadowing in Inelastic Scattering of Muons on Carbon, Calcium and Lead at Low XBj
Nuclear shadowing is observed in the per-nucleon cross-sections of positive
muons on carbon, calcium and lead as compared to deuterium. The data were taken
by Fermilab experiment E665 using inelastically scattered muons of mean
incident momentum 470 GeV/c. Cross-section ratios are presented in the
kinematic region 0.0001 < XBj <0.56 and 0.1 < Q**2 < 80 GeVc. The data are
consistent with no significant nu or Q**2 dependence at fixed XBj. As XBj
decreases, the size of the shadowing effect, as well as its A dependence, are
found to approach the corresponding measurements in photoproduction.Comment: 22 pages, incl. 6 figures, to be published in Z. Phys.
- …