1,637 research outputs found
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech
On the non-persistence of Hamiltonian identity cycles
We study the leading term of the holonomy map of a perturbed plane polynomial
Hamiltonian foliation. The non-vanishing of this term implies the
non-persistence of the corresponding Hamiltonian identity cycle. We prove that
this does happen for generic perturbations and cycles, as well for cycles which
are commutators in Hamiltonian foliations of degree two. Our approach relies on
the Chen's theory of iterated path integrals which we briefly resume.Comment: 17 page
Spatial patterns in timing of the diurnal temperature cycle
This paper investigates the structural difference in timing of the diurnal temperature cycle (DTC) over land resulting from choice of measuring device or model framework. It is shown that the timing can be reliably estimated from temporally sparse observations acquired from a constellation of low Earth-orbiting satellites given record lengths of at least three months. Based on a year of data, the spatial patterns of mean DTC timing are compared between temperature estimates from microwave Ka-band, geostationary thermal infrared (TIR), and numerical weather prediction model output from the Global Modeling and Assimilation Office (GMAO). It is found that the spatial patterns can be explained by vegetation effects, sensing depth differences and more speculatively the orientation of orographic relief features. In absolute terms, the GMAO model puts the peak of the DTC on average at 12:50 local solar time, 23 min before TIR with a peak temperature at 13:13 (both averaged over Africa and Europe). Since TIR is the shallowest observation of the land surface, this small difference represents a structural error that possibly affects the model's ability to assimilate observations that are closely tied to the DTC. The equivalent average timing for Ka-band is 13:44, which is influenced by the effect of increased sensing depth in desert areas. For non-desert areas, the Ka-band observations lag the TIR observations by only 15 min, which is in agreement with their respective theoretical sensing depth. The results of this comparison provide insights into the structural differences between temperature measurements and models, and can be used as a first step to account for these differences in a coherent way
Automatic Speech Recognition Errors Detection Using Supervised Learning Techniques
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR). However, the persistent presence of ASR errors is limiting the widespread adoption of speech technology in real life applications. This motivates the attempts to find alternative techniques to automatically detect and correct ASR errors, which can be very effective and especially when the user does not have access to tune the features, the models or the decoder of the ASR system or when the transcription serves as input to downstream systems like machine translation, information retrieval, and question answering. In this paper, we present an ASR errors detection system targeted towards substitution and insertion errors. The proposed system is based on supervised learning techniques and uses input features deducted only from the ASR output words and hence should be usable with any ASR system. Applying this system on TV program transcription data leads to identify 40.30% of the recognition errors generated by the ASR system
A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus
This paper introduces a new British English speech database, named the homeService corpus, which has been gathered as part of the
homeService project. This project aims to help users with speech and motor disabilities to operate their home appliances using voice
commands. The audio recorded during such interactions consists of realistic data of speakers with severe dysarthria. The majority of the
homeService corpus is recorded in real home environments where voice control is often the normal means by which users interact with
their devices. The collection of the corpus is motivated by the shortage of realistic dysarthric speech corpora available to the scientific
community. Along with the details on how the data is organised and how it can be accessed, a brief description of the framework used
to make the recordings is provided. Finally, the performance of the homeService automatic recogniser for dysarthric speech trained
with single-speaker data from the corpus is provided as an initial baseline. Access to the homeService corpus is provided through the
dedicated web page at http://mini.dcs.shef.ac.uk/resources/homeservice-corpus/. This will also have the most
updated description of the data. At the time of writing the collection process is still ongoing
Summary report of neptun investigations into the transient thermal hydraulics of the passive decay heat removal
Separation of nitrogen and methane via periodic adsorption
The periodic process utilizes a rapid pressure swing cycle in an adsorbent bed to effect the separation of gas mixtures. During the first portion of a cycle the compressed gas mixture flows into the adsorbent-filled column. Next, while the feed gas is restrained, an exhaust orifice is opened at the feed end of the column providing depressurization. The product stream is enriched in the component exhibiting the lowest coefficient of adsorption. A mathematical model based upon the assumption of instantaneous equilibrium between the gas phase and the adsorbed gas was formulated and solved to simulate the periodic, adsorption process. The measured nitrogen content of the product gas stream was found to correlate with the ratio of the product gas rate to the feed gas rate. At 24°C. the calculated pressure response, feed gas flow rate, and product gas composition correspond favorably with related experimental measurements for all values of the feed gas pressure, cycling frequency, and product gas flow rate within the ranges investigated.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/37359/1/690170219_ftp.pd
Recommended from our members
Bioinspired Autonomous Visual Vertical Control of a Quadrotor Unmanned Aerial Vehicle
Near-ground maneuvers, such as hover, approach, and landing, are key elements of autonomy in unmanned aerial vehicles. Such maneuvers have been tackled conventionally by measuring or estimating the velocity and the height above the ground, often using ultrasonic or laser range finders. Near-ground maneuvers are naturally mastered by flying birds and insects because objects below may be of interest for food or shelter. These animals perform such maneuvers efficiently using only the available vision and vestibular sensory information. In this paper, the time-to-contact (tau) theory, which conceptualizes the visual strategy with which many species are believed to approach objects, is presented as a solution for relative ground distance control for unmanned aerial vehicles. The paper shows how such an approach can be visually guided without knowledge of height and velocity relative to the ground. A control scheme that implements the tau strategy is developed employing only visual information from a monocular camera and an inertial measurement unit. To achieve reliable visual information at a high rate, a novel filtering system is proposed to complement the control system. The proposed system is implemented onboard an experimental quadrotor unmanned aerial vehicle and is shown to not only successfully land and approach ground, but also to enable the user to choose the dynamic characteristics of the approach. The methods presented in this paper are applicable to both aerial and space autonomous vehicles
- …