    Particle Filter Inference in an Articulatory-Based Speech Model

    Improving GNSS-R sea level determination through inverse modeling of SNR data

    This paper presents a new method for retrieving sea surface heights from Global Navigation Satellite Systems reflectometry (GNSS-R) data by inverse modeling of SNR observations from a single geodetic receiver. The method represents the temporal sea level variations with B-splines in order to account for their continuity. The corresponding B-spline coefficients are determined through a nonlinear least squares fit to the SNR data, and a consistent choice of model parameters enables the combination of multiple GNSS in a single inversion process. This leads to a clear increase in the precision of the sea level retrievals, which can be attributed to better spatial and temporal sampling of the reflecting surface. Tests with data from two different coastal GNSS sites and comparison with colocated tide gauges show a significant increase in precision compared to previously used methods, reaching standard deviations of 1.4 cm at Onsala, Sweden, and 3.1 cm at Spring Bay, Tasmania.
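
The inverse-modeling idea in the abstract above can be illustrated with a much simpler case. The following is a minimal sketch, not the paper's method: it assumes a constant reflector height instead of a B-spline time series, a single GPS L1 signal, and the commonly used interferometric model in which the detrended SNR oscillates as a cosine of the sine of the satellite elevation angle. The function names `snr_model` and `fit_height` and the wavelength constant are illustrative assumptions.

```python
import numpy as np

# Assumption: GPS L1 carrier wavelength in metres (c / 1575.42 MHz, rounded).
WAVELENGTH_L1 = 0.1903


def snr_model(elev_rad, height, amp, phase, wavelength=WAVELENGTH_L1):
    """Detrended SNR oscillation for a flat reflector `height` metres below the antenna.

    Simplified model: amp * cos(4*pi*height/wavelength * sin(elevation) + phase).
    """
    return amp * np.cos(4.0 * np.pi * height / wavelength * np.sin(elev_rad) + phase)


def fit_height(elev_rad, dsnr, heights, wavelength=WAVELENGTH_L1):
    """Least squares fit of a constant reflector height by grid search.

    For each candidate height the amplitude and phase enter linearly
    (as cosine and sine coefficients), so they are solved with ordinary
    least squares; the height with the smallest residual sum of squares wins.
    """
    x = np.sin(elev_rad)
    best_height, best_rss = None, np.inf
    for h in heights:
        arg = 4.0 * np.pi * h / wavelength * x
        design = np.column_stack([np.cos(arg), np.sin(arg)])
        coef, *_ = np.linalg.lstsq(design, dsnr, rcond=None)
        rss = float(np.sum((dsnr - design @ coef) ** 2))
        if rss < best_rss:
            best_height, best_rss = float(h), rss
    return best_height
```

Calling `fit_height` on noise-free values produced by `snr_model` should recover the simulated height to within the grid spacing. A time-varying sea level, as in the abstract, would replace the single height with B-spline coefficients and the grid search with an iterative nonlinear solver.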

    Features and Measures for Speaker Recognition

    Electrical Engineerin

    SATELLITE MICROWAVE MEASUREMENT OF LAND SURFACE PHENOLOGY: CLARIFYING VEGETATION PHENOLOGY RESPONSE TO CLIMATIC DRIVERS AND EXTREME EVENTS

    The seasonality of terrestrial vegetation controls feedbacks to the climate system, including land-atmosphere water, energy and carbon (CO2) exchanges, with cascading effects on regional-to-global weather and circulation patterns. Proper characterization of vegetation phenology is necessary to understand and quantify changes in the Earth's ecosystems and biogeochemical cycles and is a key component in tracking ecological species response to climate change. However, the response of both functional and structural vegetation phenology to climatic drivers on a global scale is still poorly understood, which has hindered the development of robust vegetation phenology models. In this dissertation I use satellite microwave vegetation optical depth (VOD) in conjunction with an array of satellite measures, Global Positioning System (GPS) reflectometry, field observations and flux tower data to 1) clarify vegetation phenology response to water, temperature and solar irradiance constraints, 2) demonstrate the asynchrony between changes in vegetation water content and biomass and changes in greenness and leaf area in relation to land cover type and climate constraints, 3) provide enhanced assessment of seasonal recovery of vegetation biomass following wildfire and 4) present a method to more accurately model tropical vegetation phenology. This research will establish VOD as a useful and informative parameter for regional-to-global vegetation phenology modeling, more accurately define the drivers of both structural and functional vegetation phenology, and help minimize errors in phenology simulations within earth system models. This dissertation also includes the development of Gross Primary Productivity (GPP) and Net Primary Productivity (NPP) vegetation health climate indicators as part of a NASA-funded project entitled "Development and Testing of Potential Indicators for the National Climate Assessment; Translating EOS datasets into National Ecosystem Biophysical Indicators".

    Perceptual Basis of Cepstral Distance Measures for Speech Processing Applications (Percepcijska utemeljenost kepstralnih mjera udaljenosti za primjene u obradi govora)

    Currently, one of the most widely used distance measures in speech and speaker recognition is the Euclidean distance between mel frequency cepstral coefficients (MFCC). MFCCs are based on a filter bank algorithm whose filters are equally spaced on a perceptually motivated mel frequency scale. The value of the mel cepstral vector, as well as the properties of the corresponding cepstral distance, are determined by several parameters used in mel cepstral analysis. The aim of this work is to examine the compatibility of the MFCC measure with human perception for different values of the analysis parameters. By analysing the mel filter bank parameters, we found that a filter bank with 24 bands, a bandwidth of 220 mels and a band overlap coefficient equal to or greater than one gives optimal spectral distortion (SD) distance measures, which agree best with perception. For this kind of mel filter bank, the difference between vowels becomes perceptible when the full-length mel cepstral SD RMS measure exceeds 0.4-0.5 dB. We further show that the use of a truncated mel cepstral vector (12 coefficients) is justified for speech recognition, but may be questionable for speaker recognition. We also analysed the impact of aliasing in the cepstral domain on cepstral distortion measures. The results showed a high correlation between SD distances calculated from the aperiodic and periodic mel cepstrum, leading to the conclusion that the impact of aliasing is generally minor. There are rare exceptions where aliasing is present, and these were analysed separately.
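
The two ingredients named in the abstract above, a mel-spaced filter bank and a Euclidean distance between cepstral vectors, can be sketched as follows. This is an illustrative sketch under stated assumptions: the mel conversion uses the common `2595 * log10(1 + f / 700)` formula, which may differ from the one used in the paper, and the upper frequency of 8000 Hz and all function names are hypothetical choices made for this example.

```python
import numpy as np


def hz_to_mel(freq_hz):
    """Convert frequency in Hz to mels (assumed formula: 2595 * log10(1 + f / 700))."""
    return 2595.0 * np.log10(1.0 + np.asarray(freq_hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def mel_band_centers(n_bands=24, f_max_hz=8000.0):
    """Centre frequencies (Hz) of n_bands filters equally spaced on the mel scale.

    The band count of 24 follows the abstract; f_max_hz is an assumption.
    """
    edges_mel = np.linspace(0.0, hz_to_mel(f_max_hz), n_bands + 2)
    return mel_to_hz(edges_mel[1:-1])


def cepstral_distance(c_a, c_b):
    """Euclidean distance between two cepstral coefficient vectors of equal length."""
    diff = np.asarray(c_a, dtype=float) - np.asarray(c_b, dtype=float)
    return float(np.sqrt(np.sum(diff ** 2)))
```

Truncating both vectors to their first 12 coefficients before calling `cepstral_distance` would correspond to the truncated measure that the abstract discusses for speech recognition.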

    An investigation into glottal waveform based speech coding

    Coding of voiced speech by extraction of the glottal waveform has shown promise in improving the efficiency of speech coding systems. This thesis describes an investigation into the performance of such a system. The effect of reverberation on the radiation impedance at the lips is shown to be negligible under normal conditions. Also, the accuracy of the Image Method for adding artificial reverberation to anechoic speech recordings is established. A new algorithm, Pre-emphasised Maximum Likelihood Epoch Detection (PMLED), for Glottal Closure Instant detection is proposed. The algorithm is tested on natural speech and is shown to be both accurate and robust. Two techniques for glottal waveform estimation, Closed Phase Inverse Filtering (CPIF) and Iterative Adaptive Inverse Filtering (IAIF), are compared. In tandem with an LF model fitting procedure, both techniques display a high degree of accuracy. However, IAIF is found to be slightly more robust. Based on these results, a Glottal Excited Linear Predictive (GELP) coding system for voiced speech is proposed and tested. Using a differential LF parameter quantisation scheme, the system achieves speech quality similar to that of U.S. Federal Standard 1016 CELP at a lower mean bit rate while incurring no extra delay.