2,045 research outputs found
Recommended from our members
A Pulse Model in Log-domain for a Uniform Synthesizer
The quality of the vocoder plays a crucial role in the performance of parametric speech synthesis systems. In order to improve the vocoder quality, it is necessary to reconstruct as much of the perceived components of the speech signal as possible. In this paper, we first show that the noise component is currently not accurately modelled in the widely used STRAIGHT vocoder, thus, limiting the voice range that can be covered and also limiting the overall quality. In order to motivate a new, alternative, approach to this issue, we present a new synthesizer, which uses a uniform representation for voiced and unvoiced segments. This synthesizer has also the advantage of using a simple signal model compared to other approaches, thus offering a convenient and controlled alternative for future developments. Experiments analysing the synthesis quality of the noise component shows improved speech reconstruction using the suggested synthesizer compared to STRAIGHT. Additionally an experiment about analysis/resynthesis shows that the suggested synthesizer solves some of the issues of another uniform vocoder, Harmonic Model plus Phase Distortion (HMPD). In text-to-speech synthesis, it outperforms HMPD and exhibits a similar, or only slightly worse, quality to STRAIGHT’s quality, which is encouraging for a new vocoding approach.This project has received funding from the European Union’s
Horizon 2020 research and innovation programme under the
Marie Sklodowska-Curie grant agreement No 655764. The research
for this paper was also partly supported by EPSRC grant
EP/I031022/1 (Natural Speech Technology)
Recommended from our members
A Log Domain Pulse Model for Parametric Speech Synthesis
Most of the degradation in current Statistical Parametric Speech Synthesis (SPSS) results from the form of the vocoder. One of the main causes of degradation is the reconstruction of the noise. In this article, a new signal model is proposed that leads to a simple synthesizer, without the need for ad-hoc tuning of model parameters. The model is not based on the traditional additive linear source-filter model, it adopts a combination of speech components that are additive in the log domain. Also, the same representation for voiced and unvoiced segments is used, rather than relying on binary voicing decisions. This avoids voicing error discontinuities that can occur in many current vocoders. A simple binary mask is used to denote the presence of noise in the time-frequency domain, which is less sensitive to classification errors. Four experiments have been carried out to evaluate this new model. The first experiment examines the noise reconstruction issue. Three listening tests have also been carried out that demonstrate the advantages of this model: comparison with the STRAIGHT vocoder; the direct prediction of the binary noise mask by using a mixed output configuration; and partial improvements of creakiness using a mask correction mechanism.European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie; 10.13039/501100000266-EPSR
Recommended from our members
Noise shaping Asynchronous SAR ADC based time to digital converter
Time-to-digital converters (TDCs) are key elements for the digitization of timing information in modern mixed-signal circuits such as digital PLLs, DLLs, ADCs, and on-chip jitter-monitoring circuits. Especially, high-resolution TDCs are increasingly employed in on-chip timing tests, such as jitter and clock skew measurements, as advanced fabrication technologies allow fine on-chip time resolutions. Its main purpose is to quantize the time interval of a pulse signal or the time interval between the rising edges of two clock signals. Similarly to ADCs, the performance of TDCs are also primarily characterized by Resolution, Sampling Rate, FOM, SNDR, Dynamic Range and DNL/INL. This work proposes and demonstrates 2nd order noise shaping Asynchronous SAR ADC based TDC architecture with highest resolution of 0.25 ps among current state of art designs with respect to post-layout simulation results. This circuit is a combination of low power/High Resolution 2nd Order Noise Shaped Asynchronous SAR ADC backend with simple Time to Amplitude converter (TAC) front-end and is implemented in 40nm CMOS technology. Additionally, special emphasis is given on the discussion on various current state of art TDC architectures.Electrical and Computer Engineerin
High resolution angular sensor
Specifications for the pointing stabilization system of the large space telescope were used in an investigation of the feasibility of reducing ring laser gyro output quantization to the sub-arc-second level by the use of phase locked loops and associated electronics. Systems analysis procedures are discussed and a multioscillator laser gyro model is presented along with data on the oscillator noise. It is shown that a second order closed loop can meet the measurement noise requirements when the loop gain and time constant of the loop filter are appropriately chosen. The preliminary electrical design is discussed from the standpoint of circuit tradeoff considerations. Analog, digital, and hybrid designs are given and their applicability to the high resolution sensor is examined. the electrical design choice of a system configuration is detailed. The design and operation of the various modules is considered and system block diagrams are included. Phase 1 and 2 test results using the multioscillator laser gyro are included
Development of wideband radio channel measurement and modeling techniques for future radio systems
This thesis discusses the development of micro- and millimeterwave wideband radio channel measurement and modeling techniques for future radio networks. Characterization of the radio channel is needed for radio system, wireless network, and antenna design. A radio channel measurement system was designed for 2.154, 5.3 GHz and 60 GHz center frequencies, and completed at the two lower frequencies. The sounder uses a pseudonoise code in the transmitter. In the receiver, first a sliding correlator, and later direct digital sampling, where the impulse response is detected by digital post processing, were realized. Certain implementation questions, like link budget, effects of phase noise on impulse response and direction of arrival estimation, and achievable performance using the designed concept, are discussed.
Measurement campaigns included in this thesis were realized at 5.3 GHz frequency in micro- and picocells. A comprehensive measurement campaign performed inside different buildings was thoroughly analyzed. Propagation mechanisms were studied and empirical models for both large scale fading and multipath propagation were developed. Propagation through walls, diffraction through doorways, and propagation paths outside the building were observed. Pathloss in LOS was lower than the free space pathloss, due to wave guiding effects. In NLOS situation difference in the pathloss models in different buildings was significant. Behavior of the spatial diversity was estimated on the basis of spatial correlation functions extracted from the measurement data; an antenna separation of a fraction of a wavelength gives sufficient de-correlation for significant diversity gain in indoor environments at 5.3 GHz in NLOS.reviewe
DEVELOPMENT OF AN UWB RADAR SYSTEM
An ultra-wideband radar system is built at the University of Tennessee with the goal to develop a ground penetrating radar (GPR). The radar is required to transmit and receive a very narrow pulse signal in the time domain. The bistatic radar transmits a pulse through an ultrawide spiral antenna and receives the pulse by a similar antenna. Direct sampling is used to improve the performance of the impulse radar allowing up to 1.5 GHz of bandwidth to be used for signal processing and target detection with high resolution. Using direct sampling offers a less complex system design than traditional lower sample rate, super-heterodyne systems using continuous wave or step frequency methods while offering faster results than conventional equivalent time sampling techniques that require multiple data sets and significant post-processing. These two points are particularly important for a system that may be used in the field in potentially dangerous environments. Direct sampling radar systems, while still frequency limited, are continually improving their upper frequencies boundaries due to more power efficient, higher sampling rate analog to digital converters (ADCs) which relates directly to better subsurface resolution for potential target detection
- …