
    HMM-based speech synthesiser using the LF-model of the glottal source

    A major factor which causes a deterioration in speech quality in HMM-based speech synthesis is the use of a simple delta pulse signal to generate the excitation of voiced speech. This paper sets out a new approach to using an acoustic glottal source model in HMM-based synthesisers instead of the traditional pulse signal. The goal is to improve speech quality and to better model and transform voice characteristics. We have found that the new method decreases buzziness and also improves prosodic modelling. A perceptual evaluation has supported this finding by showing a 55.6% preference for the new system over the baseline. This improvement, while not as significant as we had initially expected, does encourage us to develop the proposed speech synthesiser further.

    Identification of New Drug Candidates Against Borrelia burgdorferi Using High-Throughput Screening

    Lyme disease is the most common zoonotic bacterial disease in North America. It is estimated that >300,000 cases per annum are reported in the USA alone. A total of 10%–20% of patients who have been treated with antibiotic therapy report the recrudescence of symptoms, such as muscle and joint pain, psychosocial and cognitive difficulties, and generalized fatigue. This condition is referred to as posttreatment Lyme disease syndrome. While there is no evidence for the presence of viable infectious organisms in individuals with posttreatment Lyme disease syndrome, some researchers have found surviving Borrelia burgdorferi populations in rodents and primates even after antibiotic treatment. Although such observations need further confirmation, there is an unmet need to develop therapeutic agents that focus on removing the persisting bacterial form of B. burgdorferi in rodents and nonhuman primates. For this purpose, high-throughput screening was done using the BacTiter-Glo assay on four compound libraries to identify candidates that stop the growth of B. burgdorferi in vitro. The four chemical libraries, containing 4,366 compounds (80% Food and Drug Administration [FDA] approved), that were screened are the Library of Pharmacologically Active Compounds (LOPAC1280), the National Institutes of Health Clinical Collection, the Microsource Spectrum, and the Biomol FDA. We subsequently identified 150 unique compounds that inhibited >90% of B. burgdorferi growth at a concentration of <25 µM. These 150 unique compounds comprise many safe antibiotics, chemical compounds, and also small molecules from plant sources. Of the 150 unique compounds, 101 are FDA approved. We selected the top 20 FDA-approved molecules based on safety and potency and studied their minimum inhibitory concentration and minimum bactericidal concentration. The promising safe FDA-approved candidates that show low minimum inhibitory concentration and minimum bactericidal concentration values can be chosen as lead molecules for further advanced studies.

    A Cosmic Microwave Background Radiation Polarimeter Using Superconducting Bearings

    Measurements of the polarization of the cosmic microwave background (CMB) radiation are expected to significantly increase our understanding of the early universe. We present a design for a CMB polarimeter in which a cryogenically cooled half-wave plate rotates by means of a high-temperature superconducting (HTS) bearing. The design is optimized for implementation in MAXIPOL, a balloon-borne CMB polarimeter. A prototype bearing, consisting of a commercially available ring-shaped permanent magnet and an array of YBCO bulk HTS material, has been constructed. We measured the coefficient of friction as a function of several parameters, including temperature between 15 and 80 K, rotation frequency between 0.3 and 3.5 Hz, levitation distance between 6 and 10 mm, and ambient pressure between 10^{-7} and 1 torr. The low rotational drag of the HTS bearing allows rotations for long periods of time with minimal input power and negligible wear and tear, thus making this technology suitable for a future satellite mission. Comment: 6 pages, IEEE-Transactions of Applied Superconductivity, 2003, Vol. 13, in press

    Combining vocal tract length normalization with hierarchical linear transformations

    Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a rapid adaptation technique for statistical parametric speech synthesis. VTLN produces speech with naturalness preferable to that of MLLR-based adaptation techniques, being much closer in quality to that generated by the original average voice model. However, with only a single parameter, VTLN captures very few speaker-specific characteristics when compared to linear transform based adaptation techniques. This paper proposes that the merits of VTLN can be combined with those of linear transform based adaptation in a hierarchical Bayesian framework, where VTLN is used as the prior information. A novel technique for propagating the gender information from the VTLN prior through constrained structural maximum a posteriori linear regression (CSMAPLR) adaptation is presented. Experiments show that the resulting transformation has improved speech quality with better naturalness, intelligibility and improved speaker similarity. Index Terms — Statistical parametric speech synthesis, hidden Markov models, speaker adaptation, vocal tract length normalization, constrained structural maximum a posteriori linear regression

    Mage - Reactive articulatory feature control of HMM-based parametric speech synthesis

    In this paper, we present the integration of articulatory control into MAGE, a framework for real-time and interactive (reactive) parametric speech synthesis using hidden Markov models (HMMs). MAGE is based on the speech synthesis engine from HTS and uses acoustic features (spectrum and f0) to model and synthesize speech. In this work, we replace the standard acoustic models with models combining acoustic and articulatory features, such as tongue, lips and jaw positions. We then use feature-space-switched articulatory-to-acoustic regression matrices to enable us to control the spectral acoustic features by manipulating the articulatory features. Combining this synthesis model with MAGE allows us to interactively and intuitively modify phones synthesized in real time, for example transforming one phone into another, by controlling the configuration of the articulators in a visual display. Index Terms: speech synthesis, reactive, articulators

    The Blizzard Challenge 2009

    The Blizzard Challenge 2009 was the fifth annual Blizzard Challenge. As in 2008, UK English and Mandarin Chinese were the chosen languages for the 2009 Challenge. The English corpus was the same one used in 2008. The Mandarin corpus was provided by iFLYTEK. As usual, participants with limited resources or limited experience in these languages had the option of using unaligned labels that were provided for both corpora and for the test sentences. An accent-specific pronunciation dictionary was also available for the English speaker. This year, the tasks were organised in the form of ‘hubs’ and ‘spokes’, where each hub task involved building a general-purpose voice and each spoke task involved building a voice for a specific application. A set of test sentences was released to participants, who were given a limited time in which to synthesise them and submit the synthetic speech. An online listening test was conducted to evaluate naturalness, intelligibility, degree of similarity to the original speaker and, for one of the spoke tasks, “appropriateness.”

    Speech Synthesis Based on Hidden Markov Models
